DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Vision-language model report from DeepSeek, with 13 connected researchers in the LLMpeople atlas.
Connected researchers
Yanxia Cui
DeepSeek
Researcher working on multimodal and vision-language models, including DeepSeek-VL2 and related model optimization work.
Zhengyang Wang
DeepSeek
Research intern at DeepSeek and master's student at Renmin University of China working on multimodal large language models and AI agents.
Yao Lu
DeepSeek / Google Gemini
Yao Lu is listed as an author of the Google technical report Gemini Robotics: Bringing AI into the Physical World.
Xinyu Chen
DeepSeek
Research intern at NUS and Nanjing University working on machine learning and multimodal large language models; coauthor of DeepSeek-VL2.
Shujie Wang
DeepSeek
First-year PhD student at Shanghai Jiao Tong University focused on multimodal large language models, text-to-image generation, and image/video generation; coauthor of DeepSeek-VL2.
Yonggang Zhang
DeepSeek
Yonggang Zhang is a researcher whose public OpenReview profile includes the DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding paper.
Yufei Zhang
DeepSeek
Researcher at the University of Illinois Urbana-Champaign focused on vision-language models, multimodal large language models, and physical AI.
Wei Xiong
DeepSeek
Wei Xiong is listed as an author of the DeepSeek technical report DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding.
Zihan Liu
DeepSeek
Zihan Liu is a research scientist at DeepSeek. His public homepage highlights work in multimodal learning, vision-language models, and large-scale machine learning.
Sifan Zhou
DeepSeek
Listed as an author of the DeepSeek-VL2 report, which covers mixture-of-experts vision-language models and multimodal understanding.
Wenhai Wang
DeepSeek
Wenhai Wang is a researcher working on visual perception foundation models, efficient learning, and multimodal large models. Public profiles list him with OpenGVLab and Shanghai AI Laboratory, and he is a coauthor of DeepSeek-VL2.
Yue Cao
DeepSeek
CEO of Sand AI. His homepage describes prior work leading multimodal and vision research at BAAI and serving as a senior researcher at Microsoft Research Asia.
Xinlong Wang
DeepSeek
Xinlong Wang is a researcher working across computer vision, embodied AI, robotics, and machine learning. Public profiles link him to OpenGVLab and Shanghai AI Laboratory, and he is a coauthor of DeepSeek-VL2.