DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Vision-Language Models report from DeepSeek with 13 connected researchers in the LLMpeople atlas.

DeepSeek · 2024-12-12 · 13 researchers
Field: Vision-Language Models
Organization: DeepSeek
arXiv: 2412.10302

Canonical link

https://arxiv.org/abs/2412.10302

Connected researchers

Researcher · 1 report

Yanxia Cui

DeepSeek

Researcher working on multimodal and vision-language models, including DeepSeek-VL2 and related model optimization work.

Researcher · 1 report

Zhengyang Wang

DeepSeek

Research intern at DeepSeek and master's student at Renmin University of China working on multimodal large language models and AI agents.

Researcher · 3 reports

Yao Lu

DeepSeek / Google Gemini

Yao Lu is listed as an author of the Google technical report Gemini Robotics: Bringing AI into the Physical World.

Researcher · 1 report

Xinyu Chen

DeepSeek

Research intern at NUS and Nanjing University working on machine learning and multimodal large language models; coauthor of DeepSeek-VL2.

Researcher · 3 reports

Shujie Wang

DeepSeek

First-year PhD student at Shanghai Jiao Tong University focused on multimodal large language models, text-to-image generation, and image/video generation; coauthor of DeepSeek-VL2.

Researcher · 3 reports

Yonggang Zhang

DeepSeek

Yonggang Zhang is a researcher whose public OpenReview profile includes the DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding paper.

Researcher · 1 report

Yufei Zhang

DeepSeek

Researcher at the University of Illinois Urbana-Champaign focused on vision-language models, multimodal large language models, and physical AI.

Researcher · 1 report

Wei Xiong

DeepSeek

Wei Xiong is listed as an author of the DeepSeek technical report DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding.

Researcher · 1 report

Zihan Liu

DeepSeek

Zihan Liu is a research scientist at DeepSeek. His public homepage highlights work in multimodal learning, vision-language models, and large-scale machine learning.

Researcher · 1 report

Sifan Zhou

DeepSeek

Listed as an author of the DeepSeek-VL2 report, which covers mixture-of-experts vision-language models and multimodal understanding.

Researcher · 1 report

Wenhai Wang

DeepSeek

Wenhai Wang is a researcher working on visual perception foundation models, efficient learning, and multimodal large models. Public profiles list him with OpenGVLab and Shanghai AI Laboratory, and he is a coauthor of DeepSeek-VL2.

Researcher · 2 reports

Yue Cao

DeepSeek

CEO of Sand AI. His homepage describes prior work leading multimodal and vision research at BAAI and serving as a senior researcher at Microsoft Research Asia.

Researcher · 3 reports

Xinlong Wang

DeepSeek

Xinlong Wang is a researcher working across computer vision, embodied AI, robotics, and machine learning. Public profiles link him to OpenGVLab and Shanghai AI Laboratory, and he is a coauthor of DeepSeek-VL2.
