LLMpeople
Home People Organizations Reports Fields Schools
Public Atlas People first, reports as evidence, organizations as context.

Atlas / Reports / Detail

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Vision-Language Models

Qwen2024-09-1826 researchers
Field
Vision-Language Models
Organization
Qwen
arXiv
2409.12191

Canonical link

https://arxiv.org/abs/2409.12191

Connected researchers

Profile Reports

Yiheng Xu

Qwen

Yiheng Xu is a research scientist focused on multimodal AI, coding agents, and reasoning systems. His public profiles link him to Qwen research and later work at OpenAI, with publications spanning vision-language models and code generation.

Qwen
Unknown 1
Profile Reports

Shuai Bai

Qwen

Senior algorithm expert at Alibaba Group working on large language models, multimodal large language models, and diffusion models.

Qwen
Unknown 5
Profile Reports

Jiabo Ye

Qwen

Research scientist in Tongyi Lab whose public homepage and OpenReview profile describe work on large language models, multimodal learning, and visual grounding. His public profiles also list affiliations with Alibaba Group and East China Normal University.

Qwen
Unknown 3
Profile Reports

Wei Ding

Qwen

Research scientist at Alibaba working on multimodal learning and generation; previously a postdoctoral researcher at Carnegie Mellon University.

Qwen
Unknown 2
Profile Reports

Jun Tang

Qwen

Jun Tang works on multimodal foundation models, open-source language models, and agent systems. His personal site highlights work on Qwen and Qwen3-VL alongside related multimodal research.

Qwen
Unknown 2
Profile Reports

Keqin Chen

Qwen

Researcher focused on large language models and multimodal learning, with public profiles linking Keqin Chen to Beihang University and to Qwen vision-language model work.

Qwen
Unknown 2
Profile Reports

Zesen Cheng

Qwen

Qwen researcher and author on the Qwen2-VL and Qwen2.5-VL technical reports, with public profiles linking his work to multimodal and vision-language systems.

Qwen
Unknown 2
Profile Reports

Xi Zhang

Qwen

Xi Zhang works on multimodal and vision-language model research. Public profiles connect him to Qwen2-VL and related open research projects.

Qwen
Unknown 1
Profile Reports

Mingkun Yang

Qwen

Mingkun Yang works on multimodal large language models, embodied AI, and robotics. His public profile says he is a postdoc at Zhejiang University and a research scientist at Qwen.

Qwen
Unknown 3
Profile Reports

Jianqiang Wan

Qwen

Research scientist in Alibaba DAMO Academy's Tongyi Lab working on multimodal learning, vision-language models, and embodied AI; author on the Qwen2-VL and Qwen2.5-VL technical reports.

Qwen
Unknown 2
Profile Reports

Zhibo Yang

Qwen

Zhibo Yang works on multimodal and vision-language systems. Public profiles connect him to the Qwen2.5-VL technical report and to an individual GitHub account that links back to his personal site.

Qwen
Unknown 2
Profile Reports

Zheren Fu

Qwen

Tongyi Lab researcher working on large language models, vision-language models, and reinforcement learning; public profiles connect Zheren Fu to the Qwen2-VL technical report.

Qwen
Unknown 1
Profile Reports

Tianbao Xie

Qwen

Research scientist on the Qwen team at Alibaba Group, focusing on foundation models and language agents. He received a PhD in computer science from the University of Illinois Urbana-Champaign.

Qwen
Unknown 2
Profile Reports

Kai Dang

Qwen

Researcher on Alibaba's Qwen team focused on large language models and NLP, with public research profiles listing a Nankai University background.

Qwen
Unknown 7
Profile Reports

Yuanzhi Zhu

Qwen

Yuanzhi Zhu is a Qwen researcher whose public work includes multimodal and audio-language models.

Qwen
Unknown 3
Profile Reports

Peng Wang

Qwen

Researcher affiliated with the Qwen team at Alibaba Group on Google Scholar and coauthor of the Qwen and Qwen3 technical reports.

Qwen
Unknown 5
Profile Reports

Wenbin Ge

Qwen

Research scientist in Tongyi Lab whose official profile highlights work on efficient reinforcement learning, generalization, inference-time scaling, and reasoning for large language models.

Qwen
Unknown 5
Profile Reports

Shijie Wang

Qwen

Senior research scientist in Tongyi Lab whose official profile highlights post-training, AI for science, evaluation and alignment, multimodal reasoning, and large language model reasoning.

Qwen
Unknown 3
Profile Reports

Haiyang Xu

Qwen

Independent researcher focused on multimodal learning, document intelligence, and efficient training; coauthor of Qwen2.5-VL and mPLUG-related vision-language systems.

Qwen
Unknown 2
Profile Reports

Hang Zhang

Qwen

Researcher at Alibaba Group working on multimodal large language models; public profile and publication context connect Hang Zhang to the Qwen2-VL technical report.

Qwen
Unknown 2
Profile Reports

Jialin Wang

Qwen

Research scientist in Tongyi Lab and contributor to Qwen2-VL, with public work on multimodal large language models.

Qwen
Unknown 2
Profile Reports

Pengfei Wang

Qwen

Research scientist in Alibaba DAMO Academy's Tongyi Lab working on machine learning, computer vision, and multimodal large language models; author on the Qwen2-VL and Qwen2.5-VL technical reports.

Qwen
Unknown 2
Profile Reports

Sibo Song

Qwen

Research scientist in Tongyi Lab and maintainer of Qwen-VL, with public work on vision-language models.

Qwen
Unknown 2
Profile Reports

Xuejing Liu

Qwen

Xuejing Liu is a researcher whose public OpenReview profile includes the Qwen2-VL and Qwen2.5-VL technical report papers.

Qwen
Unknown 2

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.