LLMpeople
Home People Organizations Reports Fields Schools
Public Atlas People first, reports as evidence, organizations as context.

Atlas / Reports / Detail

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Mixture-of-Experts Language Models report from DeepSeek with 17 connected researchers in the LLMpeople atlas.

DeepSeek2024-01-1117 researchers
Field
Mixture-of-Experts Language Models
Organization
DeepSeek
arXiv
2401.06066

Canonical link

https://arxiv.org/abs/2401.06066

Connected researchers

Damai Dai portrait
Researcher 4 reports

Damai Dai

DeepSeek

Research scientist at DeepSeek focused on LLM post-training, self-improvement, and reasoning.

DeepSeek
Chengqi Deng portrait
Researcher 3 reports

Chengqi Deng

DeepSeek

Research scientist at DeepSeek with public GitHub projects spanning language models and AI systems.

DeepSeek
China
Chenggang Zhao portrait
Researcher 4 reports

Chenggang Zhao

DeepSeek

Research engineer at DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

DeepSeek
China
R. X. Xu portrait
Researcher 2 reports

R. X. Xu

DeepSeek

R. X. Xu is a research scientist at DeepSeek AI. His homepage says he works on trustworthy and efficient large language models, open-ended reasoning, and AI for healthcare.

DeepSeek
Huazuo Gao portrait
Researcher 7 reports

Huazuo Gao

DeepSeek

Researcher at DeepSeek AI working on decision-making and post-training for large language models.

DeepSeek
Deli Chen portrait
Researcher 4 reports

Deli Chen

DeepSeek

DeepSeek researcher focused on large language models and reasoning who previously worked in Peking University's LANCO group.

DeepSeek
Jiashi Li portrait
Researcher 4 reports

Jiashi Li

DeepSeek

DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

DeepSeek
Wangding Zeng portrait
Researcher 3 reports

Wangding Zeng

DeepSeek

Wangding Zeng is a researcher at DeepSeek. His OpenReview profile also lists graduate and undergraduate study at Beijing University of Posts and Telecommunications.

DeepSeek
Y. Wu portrait
Researcher 8 reports

Y. Wu

DeepSeek

Yu Wu's public homepage says he is a technical staff member at DeepSeek AI who leads the LLM Alignment Team and works on writing, QA, AI search, reasoning, and safety.

DeepSeek
Zhenda Xie portrait
Researcher 5 reports

Zhenda Xie

DeepSeek

Assistant professor of computer science at Brown University. Previously completed a PhD in computer science and engineering at UC Santa Cruz.

DeepSeek
United States
Panpan Huang portrait
Researcher 4 reports

Panpan Huang

DeepSeek

DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

DeepSeek
Fuli Luo portrait
Researcher 5 reports

Fuli Luo

DeepSeek

Research scientist working on large language models and retrieval-augmented generation; creator of the open-source project tiny-universe.

DeepSeek
Chong Ruan portrait
Researcher 6 reports

Chong Ruan

DeepSeek

Researcher at DeepSeek and former master's student in EECS at Peking University. Public profiles list work on LLMs, multimodal systems, transformers, BERT, and machine translation.

DeepSeek
China

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.

Privacy ยท Terms