Atlas / Reports / Detail
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Mixture-of-Experts Language Models report from DeepSeek with 17 connected researchers in the LLMpeople atlas.
Connected researchers
Damai Dai
DeepSeek
Research scientist at DeepSeek focused on LLM post-training, self-improvement, and reasoning.
Chengqi Deng
DeepSeek
Research scientist at DeepSeek with public GitHub projects spanning language models and AI systems.
Chenggang Zhao
DeepSeek
Research engineer at DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
R. X. Xu
DeepSeek
R. X. Xu is a research scientist at DeepSeek AI. His homepage says he works on trustworthy and efficient large language models, open-ended reasoning, and AI for healthcare.
Huazuo Gao
DeepSeek
Researcher at DeepSeek AI working on decision-making and post-training for large language models.
Deli Chen
DeepSeek
DeepSeek researcher focused on large language models and reasoning who previously worked in Peking University's LANCO group.
Jiashi Li
DeepSeek
DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.
Wangding Zeng
DeepSeek
Wangding Zeng is a researcher at DeepSeek. His OpenReview profile also lists graduate and undergraduate study at Beijing University of Posts and Telecommunications.
Y. Wu
DeepSeek
Yu Wu's public homepage says he is a technical staff member at DeepSeek AI who leads the LLM Alignment Team and works on writing, QA, AI search, reasoning, and safety.
Zhenda Xie
DeepSeek
Assistant professor of computer science at Brown University. Previously completed a PhD in computer science and engineering at UC Santa Cruz.
Panpan Huang
DeepSeek
DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.
Fuli Luo
DeepSeek
Research scientist working on large language models and retrieval-augmented generation; creator of the open-source project tiny-universe.
Chong Ruan
DeepSeek
Researcher at DeepSeek and former master's student in EECS at Peking University. Public profiles list work on LLMs, multimodal systems, transformers, BERT, and machine translation.