Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling
Multimodal Large Language Models
Connected researchers
Jifeng Dai
DeepSeek / MiniMax
Researcher focused on computer vision, multimodal learning, and generative AI. His public homepage says he is currently with Stepfun, after serving as a principal scientist at SenseTime Research and a researcher at Microsoft Research Asia, and that he earned a PhD in computer science from Tsinghua University.
Huazuo Gao
DeepSeek
Researcher at DeepSeek AI working on decision-making and post-training for large language models.
Binyuan Hui
DeepSeek / MiniMax / Qwen
AI researcher whose public work includes large language models, vision-language models, and multimodal systems. His public profile notes prior work as a senior algorithm expert at Alibaba and co-authorship of Qwen technical reports.
Xiangkun Wang
DeepSeek
Research intern at DeepSeek and undergraduate student at Tsinghua University focusing on multimodal large language models, agents, and embodied AI.
Xiaoze Liu
DeepSeek
Research intern at DeepSeek and PhD student at Carnegie Mellon University interested in machine learning, agents, language, vision, robotics, and healthcare.
Xinyu Li
DeepSeek
Research intern at DeepSeek and undergraduate student at Tsinghua University working on vision-language models, inference-time scaling, and reinforcement learning.
Zezhou Wang
DeepSeek
Research intern at DeepSeek and master's student at Tsinghua University working on large language models, reinforcement learning, and multimodal understanding and generation.
Zhihuan Liu
DeepSeek
Research intern at DeepSeek and PhD student at Shanghai Jiao Tong University working on large language models, reasoning, agents, and reinforcement learning.
Jinghong Yuan
DeepSeek
PhD student at UC San Diego researching reasoning, planning, and multimodal foundation models. Connected to Janus-Pro through its publication context.
Jiaxuan Fan
DeepSeek
Machine learning researcher at DeepSeek whose interests include data-centric AI, model efficiency, and multimodal learning.
Hongxia Yang
DeepSeek
External advisor at DeepSeek and former Corporate Vice President and Chief Scientist at Microsoft Research Asia.
Jie Zhou
DeepSeek / MiniMax / Moonshot AI / NVIDIA
Moonshot AI contributor and co-author of Kimi k1.5: Scaling Reinforcement Learning with LLMs.
Shang Yang
DeepSeek / MiniMax
Researcher focused on reinforcement learning, large language model reasoning, and multimodal foundation models; co-author of Janus-Pro and MiniMax-M1.