Atlas / Reports / Detail
Kimi k1.5: Scaling Reinforcement Learning with LLMs
Large Language Models
Connected researchers
Mingxiao Li
Moonshot AI
Mingxiao Li is a machine learning researcher focused on vision-language models, generative models, and multimodal learning. His public homepage lists a PhD at KU Leuven, and public research outputs also list him with Moonshot AI Ltd. affiliation.
Daya Guo
DeepSeek / Moonshot AI
DeepSeek researcher focused on NLP, code intelligence, and LLM reasoning, with public work spanning DeepSeek-Coder, DeepSeekMath, DeepSeek-V2, DeepSeek-V3, and DeepSeek-R1.
Fangzhou Su
Moonshot AI
Researcher at Moonshot AI focused on reinforcement learning and robotics. He earned a Ph.D. in robotics and machine learning from Carnegie Mellon University.
Yuxi Xie
Moonshot AI
Ph.D. candidate in computer science at Princeton University working on LLM agents and machine learning. He is a coauthor of Moonshot AI's Kimi k1.5 report.
Jingren Zhou
MiniMax / Moonshot AI / Qwen / Z.ai
Alibaba senior technology leader and researcher associated with Qwen. Public profiles list him with Alibaba Group, and official Alibaba Cloud coverage identifies him as a chief technology officer leading large-model work.
Jie Zhou
DeepSeek / MiniMax / Moonshot AI / NVIDIA
Jie Zhou is a Moonshot AI contributor and co-author of Kimi k1.5: Scaling Reinforcement Learning with LLMs.
Aimin Yang
Moonshot AI
Aimin Yang is a Moonshot AI contributor and co-author of Kimi k1.5: Scaling Reinforcement Learning with LLMs.
Xiao Wang
Moonshot AI
Xiao Wang is a Moonshot AI contributor and co-author of Kimi k1.5: Scaling Reinforcement Learning with LLMs.
Xixuan Song
Moonshot AI
Xixuan Song is a Moonshot AI contributor and co-author of Kimi k1.5: Scaling Reinforcement Learning with LLMs.
Yingtao Zhang
Moonshot AI
Yingtao Zhang is a Moonshot AI contributor and co-author of Kimi k1.5: Scaling Reinforcement Learning with LLMs.
Yutao Zhou
Moonshot AI
Applied research scientist at Moonshot AI Labs working on large language model reasoning and multimodal systems.