Atlas / Reports / Detail
Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond
Vision-Language Models
Connected researchers
Junyang Lin
Qwen
Junyang Lin (Justin Lin) is a researcher and open-source maintainer known for the Qwen family of models. His public profiles list interests in LLMs, AI agents, multimodal learning, long-horizon reasoning, world models, and reinforcement learning; multiple March 2026 news reports said he stepped down from the Qwen tech lead role.
Shuai Bai
Qwen
Senior algorithm expert at Alibaba Group working on large language models, multimodal large language models, and diffusion models.
Jinze Bai
Qwen
PhD student at The Hong Kong University of Science and Technology (Guangzhou) whose research interests include large language models, vision-language models, AI agents, and multimodal retrieval.
Yunfei Chu
Qwen
Algorithm expert at Alibaba Group working on computer vision, multimodal learning, and large language models.
Yang Fan
Qwen
Yang Fan is a research scientist at Alibaba Group. His homepage says he works on large language model post-training and deployment.
Zeyu Cui
Qwen
Research scientist at Meta in New York City and research advisor at the UCLA NLP group; previously completed a PhD in computer science at UCLA.
Jingren Zhou
MiniMax / Moonshot AI / Qwen / Z.ai
Alibaba senior technology leader and researcher associated with Qwen. Public profiles list him with Alibaba Group, and official Alibaba Cloud coverage identifies him as a chief technology officer leading large-model work.
Kai Dang
Qwen
Researcher on Alibaba's Qwen team focused on large language models and NLP, with public research profiles listing a Nankai University background.
Fei Huang
Qwen
Researcher at Alibaba Group working on natural language processing and multimodal AI.
Xiaodong Deng
Qwen
Research scientist in Tongyi Lab whose official profile highlights post-training and multimodal large language models.
Wenbin Ge
Qwen
Research scientist in Tongyi Lab whose official profile highlights work on efficient reinforcement learning, generalization, inference-time scaling, and reasoning for large language models.
Chang Zhou
Qwen
Qwen researcher and co-lead whose work focuses on pretraining and post-training, multimodal models, agent systems, and large-scale model infrastructure.
Yu Han
Qwen
Researcher affiliated with Alibaba Group on Google Scholar and coauthor of the Qwen technical report.
Hao Li
Qwen
Alibaba Group-affiliated co-author of the Qwen-VL technical report, a multimodal model for visual understanding, localization, and text reading.
Keyan Chen
Qwen
Alibaba Group-affiliated co-author of the Qwen-VL technical report, a multimodal model for visual understanding, localization, and text reading.
Long Lan
Qwen
Alibaba Group-affiliated co-author of the Qwen-VL technical report, a multimodal model for visual understanding, localization, and text reading.
Wenyu Liu
Qwen
Alibaba Group-affiliated co-author of the Qwen-VL technical report, a multimodal model for visual understanding, localization, and text reading.
Xiaoyu Wang
Qwen
Alibaba Group-affiliated co-author of the Qwen-VL technical report, a multimodal model for visual understanding, localization, and text reading.
Yinpei Ma
Qwen
Alibaba Group-affiliated co-author of the Qwen-VL technical report, a multimodal model for visual understanding, localization, and text reading.