LLMpeople
Public Atlas: People first, reports as evidence, organizations as context.


DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Vision-Language Models

DeepSeek, 2024-12-12, 13 researchers
Field: Vision-Language Models
Organization: DeepSeek
arXiv: 2412.10302

Canonical link

https://arxiv.org/abs/2412.10302

Connected researchers


Sifan Zhou

DeepSeek

Researcher at DeepSeek AI interested in generative models, large language models, multimodal learning, and computer vision. He is pursuing a PhD in electrical engineering at Stanford University after earning a bachelor's degree from Tsinghua University, and has also worked at Meta AI and Google.


Yue Cao

DeepSeek

Yue Cao is a researcher working on multimodal large language models and computer vision. His public homepage lists prior roles at DeepSeek and Apple and links to work including DeepSeek-VL2.


Wei Xiong

DeepSeek

Research scientist at DeepSeek working on large language models, multimodal learning, and machine learning systems. He was previously an applied scientist at AWS AI Labs and earned a PhD in computer science from Johns Hopkins University.


Yufei Zhang

DeepSeek

Researcher at the University of Illinois Urbana-Champaign focused on vision-language models, multimodal large language models, and physical AI.


Zhengyang Wang

DeepSeek

Research intern at DeepSeek and master's student at Renmin University of China working on multimodal large language models and AI agents.


Xinlong Wang

DeepSeek

Xinlong Wang is a researcher working across computer vision, embodied AI, robotics, and machine learning. Public profiles link him to OpenGVLab and Shanghai AI Laboratory, and he is a coauthor of DeepSeek-VL2.


Wenhai Wang

DeepSeek

Wenhai Wang is a researcher working on visual perception foundation models, efficient learning, and multimodal large models. Public profiles list him with OpenGVLab and Shanghai AI Laboratory, and he is a coauthor of DeepSeek-VL2.


Shujie Wang

DeepSeek

First-year PhD student at Shanghai Jiao Tong University focused on multimodal large language models, text-to-image generation, and image/video generation; coauthor of DeepSeek-VL2.


Yonggang Zhang

DeepSeek

Yonggang Zhang is a researcher whose public OpenReview profile lists the paper "DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding".


Xinyu Chen

DeepSeek

Research intern at NUS and Nanjing University working on machine learning and multimodal large language models; coauthor of DeepSeek-VL2.


Yanxia Cui

DeepSeek

Researcher working on multimodal and vision-language models, including DeepSeek-VL2 and related model optimization work.


Zihan Liu

DeepSeek

Zihan Liu is a research scientist at DeepSeek. His public homepage highlights work in multimodal learning, vision-language models, and large-scale machine learning.


Yao Lu

DeepSeek / Google

Profile still being enriched.


LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.