LLMpeople
Home People Organizations Reports Fields Schools
Public Atlas People first, reports as evidence, organizations as context.

Atlas / Reports / Detail

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Mathematical Reasoning Models report from DeepSeek with 8 connected researchers in the LLMpeople atlas.

DeepSeek2024-08-148 researchers
Field
Mathematical Reasoning Models
Organization
DeepSeek
arXiv
2408.08152

Canonical link

https://arxiv.org/abs/2408.08152

Connected researchers

Chong Ruan portrait
Researcher 6 reports

Chong Ruan

DeepSeek

Researcher at DeepSeek and former master's student in EECS at Peking University. Public profiles list work on LLMs, multimodal systems, transformers, BERT, and machine translation.

DeepSeek
China
Daya Guo portrait
Researcher 13 reports

Daya Guo

DeepSeek / Moonshot AI

AI researcher at DeepSeek working on natural language processing, code intelligence, and large language model reasoning.

DeepSeekMoonshot AI
Qihao Zhu portrait
Researcher 10 reports

Qihao Zhu

DeepSeek

Research scientist focused on foundation models and multimodal large language models; his homepage notes earlier work at DeepSeek AI and current research at the University of Southern California.

DeepSeek
Runxin Xu portrait
Researcher 6 reports

Runxin Xu

DeepSeek

Researcher at DeepSeek whose public homepage describes work on DeepSeek R1, V1, V2, V3, Math, Coder, and mixture-of-experts systems.

DeepSeek
Y. Wu portrait
Researcher 8 reports

Y. Wu

DeepSeek

Yu Wu's public homepage says he is a technical staff member at DeepSeek AI who leads the LLM Alignment Team and works on writing, QA, AI search, reasoning, and safety.

DeepSeek
Xinyu Zheng portrait
Researcher 1 reports

Xinyu Zheng

DeepSeek

PhD student at Tsinghua University working on formal theorem proving, machine learning, formal methods, and programming languages.

DeepSeek
Zhiyuan Gou portrait
Researcher 1 reports

Zhiyuan Gou

DeepSeek

Researcher working on theorem proving and reinforcement learning.

DeepSeek
Renqi Xu portrait
Researcher 1 reports

Renqi Xu

DeepSeek

PhD student at Tsinghua University and visiting student at MIT, focused on theorem proving and formal verification in Lean.

DeepSeek

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.

Privacy ยท Terms