Jiashen Hua (华佳燊)

Email: jiashen.hjs@alibaba-inc.com / jiashenhzju@gmail.com

Senior Algorithm Expert | Tongyi Lab

I obtained my M.S. and B.S. from Zhejiang University in 2019 and 2016, in the Machine Vision and Navigation Lab with Prof. Zhiyu Xiang and Assoc. Prof. Xiaojin Gong.

My current work focuses on multimodal large language models, spanning frontier research and large-scale industrial deployment.

multimodal representation: establishing a unified, omni-modal representation space across text, image, video, and audio.
multimodal reasoning & agent: advancing agentic vision by integrating multimodal chain-of-thought with active visual understanding, driving the transition from passive perception to goal-oriented action.

GitHub / Google Scholar

WeChat / Xiaohongshu / Twitter

🔥🔥🔥 I am always looking for self-motivated interns tailored to these topics. Feel free to reach out!

News

[2026/02] One papers accepted by CVPR 2026.
[2026/01] One papers accepted by ICLR 2026 oral, top 1.18%.
[2025/08] One papers accepted by TMM 2025.
[2024/10] One papers accepted by ACM MM 2024.
[2022/03] One papers accepted by CVPR 2022.
[2021/12] One papers accepted by CVPR 2021 RVSU Workshop.
[2019/04] Joined Alibaba Damo Academy as a Research Scientist.

Selected Publications

(*Corresponding Author, †Project Lead)

Through the Lens of Contrast: Self-Improving Visual Reasoning in VLMs

Zhiyu Pan, Yizheng Wu, Jiashen Hua*†, Junyi Feng, Shaotian Yan, Bing Deng, Zhiguo Cao and Jieping Ye.
International Conference on Learning Representations (ICLR, oral presentation, 223/18949), 2026

Paper | Code | 🤗Dataset

Illuminating Visual Identity in Universal Multimodal Embeddings

Jiawei Cao, Junyi Feng, Jiashen Hua*†, Ziheng Huang, Bing Deng, Kaijie Wu, Chaochen Gu, Jieping Ye.
The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026

Paper | Code

Online convolutional re-parameterization

Mu Hu, Junyi Feng, Jiashen Hua†, Baisheng Lai, Jianqiang Huang, Xiaojin Gong and Xian-Sheng Hua.
The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022

Paper (153 citations) | Code (184⭐)

A normalized convolutional neural network for guided sparse depth upsampling

Jiashen Hua and Xiaojing Gong.
International Joint Conference on Artificial Intelligence (IJCAI), 2018

Paper (58 citations)

Honors & Awards

🏅 Future City Awards, "Novel Perception and Simulation Platform Driven by Urban Digital Twins". 2023

🏆 Winner, CVPR 2021 Robust Video Scene Understanding (RVSU) Challenge. 2021

🏆 Winner, CVPR 2019 The 1st Learning from Imperfect Data (LID) Challenge. 2019

Outstanding Graduate Student, Zhejiang University. 2017

This webpage template is stolen from Jon Barron