Large Language Models
Researchers connected to this field in the public atlas.
Luca Soldaini
Ai2
Lead research scientist at Ai2 and a founding member of the OLMo team. He works on open language models, language agents, evaluation, and reasoning, and helps build open-model research infrastructure.
Daya Guo
DeepSeek / Moonshot AI
AI researcher at DeepSeek working on natural language processing, code intelligence, and large language model reasoning.
Amanda Askell
Anthropic / OpenAI
Amanda Askell is a philosopher and AI alignment researcher at Anthropic. Her personal site says she previously worked as a research scientist on the policy team at OpenAI.
Baosong Yang
Alibaba Qwen
Senior Algorithm Expert in Alibaba Tongyi Lab's Language Technology Lab and Qwen Team member whose work focuses on multilingual large language models and machine translation.
Mona Jalal
NVIDIA
Mona Jalal is a computer vision research engineer at Toyota Material Handling. Her homepage says she leads 3D computer vision work such as 6D object pose estimation and synthetic data generation, previously worked as an R&D engineer at UC Berkeley's FHL VIVE Center, completed doctoral study in computer vision at Boston University, and earlier earned master's degrees in computer sciences and electrical engineering from the University of Wisconsin-Madison.
Yuxiang Zheng
Z.ai
Yuxiang Zheng is a Ph.D. student in electrical and computer engineering at Memorial University of Newfoundland. The homepage says Yuxiang Zheng received double B.Eng. degrees in electronics and electrical engineering from the University of Glasgow and the University of Electronic Science and Technology of China in 2021, an M.Sc. with distinction in communications and signal processing from Imperial College London in 2022, and studies reinforcement learning and quantum machine learning for 6G vehicular networks.
Ming-Yu Liu
NVIDIA
Ming-Yu Liu is a vice president of research at NVIDIA and an IEEE Fellow. He leads the Deep Imagination Research group, which focuses on deep generative models for content creation and foundation models for physical AI. He previously held research leadership roles at Mercedes-Benz Research and Development North America and National Tsing Hua University, and he earned degrees from National Taiwan University, National Chiao Tung University, and the University of Illinois Urbana-Champaign.
Xinyun Chen
Google Gemini / Meta AI
Xinyun Chen's homepage identifies her as an AI research scientist at Meta Superintelligence Labs, previously a staff research scientist at Google DeepMind. It also lists a PhD in Computer Science from UC Berkeley and a BS in Computer Science from Shanghai Jiao Tong University.
Xueyan Zou
Z.ai
Xueyan Zou is an assistant professor in the College of AI at Tsinghua University. Her official Tsinghua page says she works on interactive embodied intelligence powered by world models, dexterous control and sensing, and embodied foundation models, after receiving a Ph.D. in computer science from the University of Wisconsin-Madison and then becoming a postdoctoral fellow at UC San Diego's Contextual Robotics Institute.
Arash Vahdat
NVIDIA
Arash Vahdat is a research director at NVIDIA Research, where he leads the GenAIR team on fundamental generative AI research. Before joining NVIDIA, he was a research scientist at D-Wave Systems and a research faculty member at Simon Fraser University, where he led video analysis research and taught machine learning for big data. He earned a PhD in computing science from Simon Fraser University.
Haoyu Lu
DeepSeek / Moonshot AI
Haoyu Lu is a Ph.D. student at Renmin University of China working on multimodal foundation models and video understanding. His homepage highlights papers and code including DeepSeek-VL, UniAdapter, and VDT.
Hyung Won Chung
Google Gemini / OpenAI
Research Scientist at OpenAI focused on reasoning and agents. Previously at Google Brain, he worked on T5X, PaLM, Flan-PaLM, and Flan-T5, and he earned a PhD at MIT.
Dilek Hakkani-Tur
NVIDIA
Dilek Hakkani-Tur is a Professor of Computer Science at the University of Illinois Urbana-Champaign and an Amazon Scholar at Amazon Health Science. Her UIUC faculty profile says her research interests include conversational AI, natural language and speech processing, spoken dialogue systems, and machine learning for language processing.
Jialong Tang
Z.ai
Jialong Tang works at Alibaba Tongyi in Hangzhou. The homepage says Jialong Tang received a Ph.D. from the Chinese Information Processing Laboratory at the Institute of Software, Chinese Academy of Sciences in 2023, a bachelor's degree from Xiamen University in 2018, and studies natural language processing, event understanding, multilingual systems, and Qwen large language models.
Xinyu Zhang
Alibaba Qwen
Research scientist at Tongyi Lab, Alibaba Group, working on multimodal large language models, machine reasoning, and efficient learning.
Alexandre Ramé
Ai2
Alexandre Ramé is a research scientist at Google DeepMind and an adjunct professor at Ecole Polytechnique. His homepage says he previously held research roles at NYU and SCAI / Sorbonne Université, completed a PhD in machine learning at Ecole Polytechnique and ENS Paris-Saclay, and works on post-training and alignment for Gemma LLMs.
Benyou Wang
MiniMax
Benyou Wang is an assistant professor at the Chinese University of Hong Kong, Shenzhen. His official school profile lists academic areas in artificial intelligence, computer science, and speech and natural language processing, and his personal homepage says he leads the CUHKSZ LLM group after earning a Ph.D. in information engineering from the University of Padua plus earlier degrees from Tianjin University and Hubei University of Automotive Technology.
Brian Lester
Google Gemini / Cohere
Brian Lester is a Senior Research Engineer at Google Brain and a PhD student at the University of Toronto advised by Colin Raffel. His public work focuses on efficient adaptation of large language models, including prompt tuning and related methods.
Kartikeya Mangalam
NVIDIA
Kartikeya Mangalam is a PhD student in computer vision at UC Berkeley advised by Jitendra Malik. His public homepage says he previously held a visiting researcher role at Meta AI, studied in Stanford CS, graduated summa cum laude in electrical engineering with a machine learning minor from IIT Kanpur, and works on computer vision, video understanding, and human motion analysis.
Wanli Ouyang
MiniMax
Wanli Ouyang is a professor at Shanghai AI Laboratory. His homepage says he is also with MMlab and the SIGMA lab, obtained a PhD from the Chinese University of Hong Kong, and works on AI4Science, computer vision, and pattern recognition.
Ashvin Nair
OpenAI
Researcher at OpenAI. He recently completed a PhD at UC Berkeley's BAIR lab, where he worked on machine learning for robotics, offline reinforcement learning, and self-supervised reinforcement learning.
Kyle Richardson
Ai2 / Cohere
Senior research scientist at the Allen Institute for Artificial Intelligence working on natural language processing, machine learning, and reasoning on the Aristo project; previously a researcher at the University of Stuttgart.
Matei Zaharia
NVIDIA
Computer scientist, UC Berkeley professor, and Databricks co-founder and CTO. He created Apache Spark and works on systems for large-scale data, machine learning, and compound AI applications.
Tulsee Doshi
Google Gemini
Tulsee Doshi is a Senior Director of Product Management at Google DeepMind and currently leads product for Gemini models. She previously served as Head of Product for Responsible AI at Google and holds both an M.S. and a Ph.D. in Symbolic Systems from Stanford.
Jason Phang
Cohere
Jason Phang is a researcher at OpenAI working on the Foundations team. His homepage highlights transfer learning and natural language understanding with large-scale language models.
Juntang Zhuang
OpenAI
Juntang Zhuang leads pretraining at xAI and previously worked at OpenAI. His homepage highlights Grok pretraining work and contributions to GPT-4o, GPT-4 Turbo long-context, DALL-E 3, and OpenAI embeddings.
Minhao Cheng
Google Gemini
Minhao Cheng is an assistant professor in the College of Information Sciences and Technology at Penn State. His homepage says he earned a PhD in computer science from UCLA under Cho-Jui Hsieh, served as an assistant professor at HKUST before joining Penn State in Spring 2024, and broadly studies machine learning with a focus on trustworthy machine learning and AutoML.
Sang Michael Xie
NVIDIA
Researcher at OpenAI focused on data-centric methods for foundation models, including synthetic data and reinforcement learning. Previously a research scientist at Meta GenAI; earned BS, MS, and PhD degrees in computer science at Stanford.
Sarath Chandar
Google Gemini
Associate professor at Polytechnique Montreal, core academic member at Mila, and CIFAR AI Chair whose research spans continual learning, reasoning, meta-learning, and reinforcement learning.
Tim Brooks
OpenAI
AI researcher who co-created Sora at OpenAI and invented InstructPix2Pix during his PhD at Berkeley AI Research.
Tong Wu
DeepSeek
Tong Wu is a researcher at BIGAI in Beijing. His homepage says he works on diffusion language models, long context, and long sequence generation, and that he holds a master's from Tsinghua University and a bachelor's from Beijing Institute of Technology.
Utku Evci
NVIDIA
Utku Evci is a researcher at Google. His public Google Research profile describes work on making neural network training faster and more efficient, with public materials centered on sparsity, transfer learning, and machine intelligence.
David Mély
OpenAI
Researcher and engineer working on artificial general intelligence; his public experience page lists him as a Member of Technical Staff at OpenAI, after roles at Google/X and Vicarious.
Joost Huizinga
OpenAI
OpenAI researcher whose work spans alignment, machine learning, complexity science, open-ended evolution, and artificial life.
David Brandfonbrener
Google Gemini
Member of Technical Staff at Anthropic working on reinforcement learning for LLMs; previously at Meta FAIR and the Kempner Institute at Harvard, with a Ph.D. from NYU and undergraduate study at Yale.
Junyang Lin
Alibaba Qwen
Junyang Lin (Justin Lin) is a researcher and open-source maintainer known for the Qwen family of models. His public profiles list interests in LLMs, AI agents, multimodal learning, long-horizon reasoning, world models, and reinforcement learning; multiple March 2026 news reports said he stepped down from the Qwen tech lead role.
Chinmay Hegde
NVIDIA
Chinmay Hegde is an associate professor at the NYU Tandon School of Engineering. His public homepage and NYU faculty profile say he works on machine learning, optimization, and inverse problems, joined NYU in 2021 after three years at Iowa State University, previously completed a postdoc at Stanford University, and earned a PhD in electrical and computer engineering from Georgia Tech in 2016.
Jason Yosinski
Google Gemini
Scientist at OpenAI and president of the ML Collective research group, focused on understanding and controlling large neural networks. He completed a PhD at Cornell and previously helped start Uber AI and later founded Windscape AI.
Kelvin Guu
Cohere
Research leader at Google DeepMind working on language models, retrieval-augmented generation, post-training, and factuality. His homepage highlights foundational contributions including REALM, FLAN, and Gemini factuality work.
Jean-Baptiste Alayrac
Google Gemini / Meta AI
DeepMind researcher working on machine learning, computer vision, and structured learning from video and language.
Le Yu
Alibaba Qwen
Algorithm Engineer at the Qwen Team, Alibaba Group. He received his Ph.D. in computer science from Beihang University in 2024 and works on post-training alignment for Qwen models, with research interests in large language models, graph neural networks, and recommender systems.
Hao Zhang
Moonshot AI / NVIDIA
Researcher at NVIDIA Research. Previously a PhD student in Computer Science and Engineering at HKUST, with earlier internships at International Digital Economy Academy and Microsoft Research.
Luke Zettlemoyer
Ai2
Luke Zettlemoyer works on empirical methods for natural language semantics, machine learning, new tasks and datasets, and self-supervision for pre-training.
Shyamal Anadkat
OpenAI
Engineer and product leader who worked on OpenAI's Applied AI team and now advises startups on AI products. He writes publicly about agents, retrieval, and evaluation on his personal site.
Yang Song
OpenAI / Alibaba Qwen
Research Principal at Meta Superintelligence Labs. He previously led the strategic explorations team at OpenAI and is known for foundational work on score-based diffusion models.
Rowan Zellers
Google Gemini / OpenAI
Researcher at Thinking Machines, formerly at OpenAI, working on real-time multimodal systems across language, vision, and voice.
Adrià Garriga-Alonso
Google Gemini
Adrià Garriga-Alonso is a research scientist at Google DeepMind focused on machine learning and AI safety. His public profile notes a PhD in computer science from the University of Cambridge in 2020.
Adrien Ecoffet
OpenAI
Research scientist at OpenAI working on AI agents, large models, and reinforcement learning. Previously worked as an applied scientist at Uber and completed a PhD in computer science at ENS Paris-Saclay under Marc Schoenauer and Olivier Teytaud.
Claire Cui
Google Gemini
Claire Cui is a Google Fellow in the Google Brain team, where she leads research on deep generalist learning, large language models, multimodal learning, and interpretable AI. During her tenure at Google she helped found AdSense for Content and co-founded Google Health Research and Medical Brain; she holds a Ph.D. in computer science from Stanford and a B.S. in computer science from Tsinghua University.
Hamish Ivison
Ai2
PhD student at the University of Washington advised by Luke Zettlemoyer and researcher at Ai2. His work focuses on open language models, instruction tuning, data curation, and evaluation.
Valentina Pyatkin
Ai2
Postdoctoral researcher at the Allen Institute for AI and the University of Washington whose work focuses on open language model post-training, steerability, and evaluation.
William Merrill
Ai2
TTIC and William Merrill's homepage describe him as currently at the Allen Institute for AI and incoming to TTIC in Fall 2026 after a PhD from New York University. Public profile pages describe research spanning transformer expressivity, formal language theory, computational linguistics, and language models.
Lester James V. Miranda
Ai2
Lester James V. Miranda previously worked as a Predoctoral Young Investigator at Ai2 and as a machine learning engineer on the spaCy team at ExplosionAI.
Yonggan Fu
NVIDIA
Public profiles say he completed a Georgia Tech PhD in 2025 after earlier study at Rice and USTC, and his current work focuses on bringing frontier AI to everyday devices.
Xiaohuan Zhou
Alibaba Qwen
Alibaba Qwen report author with public bibliography links for work on large language models, multimodal foundation models, and AI agents.
Xin Hu
OpenAI
Xin Hu is a Member of Technical Staff at OpenAI focused on privacy and security. Xin previously held cybersecurity and machine learning roles at Robinhood, Pinterest, IBM Research, and Intel, and earned a PhD in Computer Science and Engineering from the University of Michigan.
Binyuan Hui
DeepSeek / MiniMax
Staff research scientist at Alibaba's Qwen Team and initiator of OpenDevin, focused on foundation models, reasoning models, coding agents, and computer-use agents.
Bowen Yu
Alibaba Qwen
Bowen Yu is an Algorithm Expert at Qwen (Alibaba Group) working on LLM post-training and automated alignment.
Sanjiv Kumar
Google Gemini / NVIDIA
Sanjiv Kumar is a Google Fellow and vice president at Google Research. His public homepage says he leads teams working on large machine learning foundation models and generative AI, has spent more than 25 years building machine learning systems and products, and received a PhD in computer science from Carnegie Mellon University in 2005.
Shantanu Jain
OpenAI
Shantanu Jain has worked on language models at OpenAI since 2020, focusing on training data and infrastructure. His homepage also highlights work in Python's open-source and static type-checking ecosystems.
Josh Achiam
OpenAI
Joshua (Josh) Achiam is Chief Futurist at OpenAI. He joined OpenAI in 2017 as a research scientist and has worked on AI safety research and operations, AI impacts research, and educational resources (including Spinning Up in Deep RL).
Isaac MacMillan
Ai2
Research scientist at Ai2; previously a machine learning engineer at Koko, where he worked on safety systems for language models.
Diederik P. Kingma
NVIDIA
Machine learning researcher focused on scalable learning and generative modeling, known for work on variational autoencoders, Adam, Glow, and variational diffusion models; his homepage says he is currently at Anthropic.
Tao Xu
OpenAI
Research scientist at OpenAI working on intelligent agents, computer vision, natural language processing, and robotics; he previously completed a PhD in computer science at Penn State.
Tomer Kaftan
OpenAI
Researcher working on machine learning systems and infrastructure at OpenAI. He earned his PhD from UCLA.
Ming Lin
NVIDIA
Ming Lin is a professor of computer science at the University of Maryland, College Park and the chief executive officer and co-founder of Aitrios.ai. Her public homepage says she was the founding director of the GAMMA lab, previously served as assistant dean of the University of Maryland's Institute for Advanced Computer Studies, and earned her BS, MS, and PhD in EECS from the University of California, Berkeley.
Louis Martin
Meta AI / Mistral AI
Research scientist at Meta AI working on natural language processing and AI safety. His homepage says he completed a PhD at Facebook AI Research and Inria focused on text simplification and accessibility.
Yuning Mao
Meta AI
Research scientist at Meta GenAI working on efficient and safe language and multimodal systems. She completed a Ph.D. in computer science at Carnegie Mellon University.
Hannaneh Hajishirzi
Ai2
Hannaneh Hajishirzi is Senior Director at Ai2 and a professor of computer science at the University of Washington. Her public materials focus on generative AI, reasoning, open language models, pre-training, and evaluation.
Junxian He
NVIDIA
Junxian He is an assistant professor in computer science and engineering at the Hong Kong University of Science and Technology. His public homepage and Princeton profile say he works at the intersection of natural language processing and machine learning, focusing on scalable methods for language model training and reasoning, completed a PhD in computer science at Princeton University in 2024, and previously earned bachelor's degrees in economics and computer science from Peking University.
Tyler Murray
Ai2
Tyler Murray is a research scientist on Ai2's PRIOR team working on vision and multimodal models. His OpenReview profile shows previous research engineering roles at Google and Facebook and degrees from Stanford University and the University of California, Berkeley.
Yukun Zha
DeepSeek
Yukun Zha is a research scientist at DeepSeek. His public profile lists research interests in machine learning, reinforcement learning, and efficient inference, alongside a PhD in computer science from Dartmouth College.
Arvind Neelakantan
OpenAI
Research scientist at Google DeepMind working on Gemini. He previously worked at Meta, OpenAI, and Google Brain, and earned a PhD from UMass Amherst.
Hugo Touvron
Meta AI
Research scientist at Meta FAIR focused on computer vision and multimodal understanding, including image and video models, visual reasoning, large-scale training, and evaluation.
Pang Wei Koh
Ai2
The Allen School lists Pang Wei Koh as an assistant professor at the University of Washington and a visiting research scientist at the Allen Institute for AI. The same page lists expertise in explainable AI, generative AI, machine learning, and natural language processing.
Daogao Liu
Ai2
Daogao Liu's homepage says he is currently a visiting postdoctoral researcher at Google. It says his research focuses on machine learning and theoretical computer science, especially differential privacy, stochastic optimization, and algorithm design.
Rulin Shao
Ai2
Rulin Shao's homepage says he is a third-year PhD student at the University of Washington advised by Pang Wei Koh and Luke Zettlemoyer, and also a visiting researcher at Meta.
Wonmin Byeon
NVIDIA
NVIDIA Research and Wonmin Byeon's personal site identify him as a researcher at NVIDIA Research in California. Public site materials describe interests in computer vision, robotics, recurrent and state-space models, sequence learning, and spatio-temporal learning.
Mohammad Norouzi
Cohere
Co-founder and CEO of Ideogram whose public homepage highlights work on self-supervised representation learning, generative modeling, and multimodal learning; previously a senior staff research scientist at Google Brain Toronto.
Trevor Gale
NVIDIA
Research scientist at NVIDIA working on compression, efficient inference, post-training, multimodality, and evaluation for large language models. He completed a PhD in machine learning at Stanford and a master's in artificial intelligence at the University of Edinburgh.
Hao Yang
DeepSeek / Moonshot AI
Hao Yang works on multimodal data infrastructure at Moonshot.ai. He previously worked at ByteDance ICVG and Microsoft Research Asia, and received BS and PhD degrees from Tsinghua University.
Jerry Tworek
OpenAI
Research scientist at OpenAI whose public profiles highlight multi-agent alignment, scalable training, and reinforcement learning.
Leon Derczynski
NVIDIA
Leon Derczynski is a professor in natural language processing at DTU Compute whose research focuses on information extraction, online harms, and the dynamics of abusive language and misinformation.
Jin Xu
Alibaba Qwen
Jin Xu's homepage says he leads the audio group at Qwen Team, Alibaba, working on audio understanding, real-time multimodal interaction, speech synthesis, general audio synthesis, and audio-centered chat models. He previously completed a Ph.D. at IIIS, Tsinghua University and received a BSc in 2018 from Beijing University of Posts and Telecommunications.
Noah A. Smith
Ai2
Noah A. Smith is the University of Washington's inaugural Vice Provost for Artificial Intelligence and a professor in the Paul G. Allen School of Computer Science & Engineering. He is also Senior Director of NLP Research at the Allen Institute for AI.
Shengjia Zhao
OpenAI / Meta AI
AI researcher and former OpenAI scientist who became chief scientist of Meta Superintelligence Labs in July 2025; public academic sources also show earlier Stanford and Tsinghua training.
Jan Kautz
NVIDIA
NVIDIA's research page describes Jan Kautz as vice president of Learning and Perception Research, working across computer vision, machine learning, computational photography, and geometric vision.
Valentina Pyatkin
Ai2
Valentina Pyatkin works on open language models, post-training, steerability, instruction following, and evaluation as a postdoctoral researcher and Young Investigator at Ai2 and the University of Washington.
Yufei Guo
OpenAI
Yufei Guo is an assistant professor in the UCLA Computer Science Department. His public profiles note prior work as a research scientist at OpenAI and research interests in AI agents, reinforcement learning, and large language models.
Tom Henighan
OpenAI
Works on large language model interpretability at Anthropic; previously worked on scaling laws at OpenAI and ML engineering at Beehive AI, and completed a Ph.D. in physics at Stanford.
Dan Roth
Cohere
Head of AI at Cohere and professor at the University of Pennsylvania whose public profiles focus on natural language understanding, reasoning, and grounding.
David Dohan
Google Gemini / OpenAI
David Dohan is a computer scientist at OpenAI studying scalable alignment of language models and generally intelligent reasoning systems. His personal site also notes prior work at Google Brain on foundation model programs, code generation, protein engineering, and scientific reasoning.
Evgenia Rusak
Google Gemini
Research scientist at Google DeepMind working on trustworthy deep learning, multimodal modeling, and generalization in AI systems.
Ahmad Al-Dahle
Meta AI
Ahmad Al-Dahle joined Airbnb as Chief Technology Officer on January 14, 2026. Airbnb's announcement says he previously led Generative AI at Meta, founded Meta's Generative AI group in early 2023, and earlier studied engineering at the University of Waterloo and held senior technical leadership roles at Apple.
Hyung Won Chung
Cohere
Hyung Won Chung is a Research Scientist at OpenAI working on reasoning and agents. His public site says he previously worked at Google Brain and completed his PhD at MIT.
Mitesh M. Khapra
NVIDIA
Mitesh M. Khapra is a professor at IIT Madras and co-founder of AI4Bharat whose research spans natural language processing, machine learning, transfer learning, and deep learning.
Y. Wu
DeepSeek
Yu Wu's public homepage says he is a technical staff member at DeepSeek AI who leads the LLM Alignment Team and works on writing, QA, AI search, reasoning, and safety.
Trung H. Bui
Cohere
Trung H. Bui is a staff researcher at Cohere working on generative AI, large language models, natural language processing, machine learning, and computer vision.
Alec Radford
OpenAI
Alec Radford is a researcher and a co-author of the GPT-4 Technical Report. His GitHub profile links to his personal website at newmu.github.io.
Seungyoun Hong
Cohere
Seungyoun Hong is a research scientist and postdoctoral scholar at Stanford University working on machine unlearning, data attribution, hallucination, and generalization.
Chuanqi Tan
Alibaba Qwen / Z.ai
Chuanqi Tan's homepage says he received a PhD from Tsinghua University in July 2019, is currently focused on LLM research and applications, and is also a postdoctoral fellow at the University of Hong Kong.
Vahid Noroozi
Google Gemini / NVIDIA
Vahid Noroozi is an applied research scientist at NVIDIA. His NVIDIA author profile says his work focuses on deep learning for speech and natural language processing and that he received a PhD in computer science from the University of Illinois Chicago. His homepage says he previously worked on post-training large language models at Google DeepMind after earlier multimedia and neuroscience research at TU Delft and the Max Planck Institute for Biological Cybernetics.
Jan Leike
OpenAI
Jan Leike is a machine learning and alignment researcher who leads the Alignment Science team at Anthropic. He previously co-led the Superalignment team at OpenAI and earlier worked on alignment research at DeepMind.
Shuyue Stella Li
Ai2
Shuyue Stella Li is a Ph.D. student in the Allen School at the University of Washington advised by Yulia Tsvetkov, with research interests including proactive learning, social reasoning, and AI for health.
Logan Kilpatrick
OpenAI
Member of the Technical Staff at Google DeepMind; his homepage says he previously led developer relations at OpenAI from 2022 through early 2024 and earlier worked at NASA and Apple.
Pengcheng Yin
Meta AI
Pengcheng Yin is a research scientist at Google DeepMind working on Gemini Code and natural-language-to-code generation. His homepage also lists earlier experience as a part-time research collaborator at Facebook AI Research and research internships at Microsoft research groups.
Runxin Xu
DeepSeek
Researcher at DeepSeek whose public homepage describes work on DeepSeek R1, V1, V2, V3, Math, Coder, and mixture-of-experts systems.
Yizhong Wang
Ai2
Yizhong Wang is a research scientist at the Allen Institute for AI and incoming assistant professor at the University of Washington whose work focuses on language models, agents, reasoning, and open-source AI.
Vasu Singhal
Meta AI
Vasu Singhal is an AI researcher focused on natural language processing, generative AI, and machine learning. He studied computer science at Stanford University and has worked on NLP research at FAIR.
Chris Alberti
Google Gemini / Ai2
Chris Alberti's Google Research profile lists natural language processing and speech processing as his research areas, and his OpenReview profile lists him as a researcher at Google.
Barret Zoph
OpenAI / Meta AI
Co-founder and CTO at Thinking Machines. Previously VP of Research (Post-Training) at OpenAI, Staff Research Scientist at Google Brain, and a researcher at the Information Sciences Institute.
Amjad Almahairi
Meta AI / Mistral AI
Amjad Almahairi is a researcher at Anyscale. His OpenReview profile lists work spanning LLMs, VLLMs, generative models, and deep learning, with earlier roles at Facebook and Element AI.
Prasanna Parthasarathi
NVIDIA
Prasanna Parthasarathi is a research scientist at Huawei Noah's Ark Lab in Montreal. His public speaker and lab-profile pages say he collaborates with Mila and McGill University, works on natural language processing, dialogue systems, and social simulation, and completed a PhD at McGill University in 2022 under Joelle Pineau.
Sebastian Gehrmann
Google Gemini / Mistral AI
Sebastian Gehrmann leads Responsible AI in the office of the CTO at Bloomberg and works on natural language generation, model evaluation, and interpretability.
Can Cao
Z.ai
Can Cao is a Ph.D. student at Tsinghua University advised by Ming Zhou and Yuxuan Wang. The homepage says Can Cao received a bachelor's degree from Tsinghua University in 2022, previously interned at Microsoft Research Asia in summer 2022 and Meta AI in summer 2025, and studies large language models with a focus on pre-training, reasoning, agents, post-training, and multimodal learning.
Azade Nova
Google Gemini
Staff Research Scientist at Google DeepMind. Public Google profiles describe earlier work at Google Brain and Microsoft Research and research spanning machine learning, graph mining, and unstructured data analytics.
Hanrong Ye
NVIDIA
Hanrong Ye is a research scientist at NVIDIA Research in Santa Clara working on multi-task, multi-media, and multimodality models for machine understanding and generation. He earned a Ph.D. from HKUST, a master's degree from Peking University, and a B.S. from Sun Yat-sen University.
Matt Jordan
Ai2
Public profiles identify Matt Jordan as an Allen Institute for Artificial Intelligence researcher and a former University of Texas at Austin PhD student advised by Alex Dimakis.
Shashank Gupta
Ai2
Shashank Gupta's public homepage describes him as an applied scientist at Ai2 focused on agentic LLMs, test-time compute, post-training, and reasoning, after earlier work in Microsoft AI.
Jiatao Gu
Google Gemini / Meta AI
Assistant Professor in Penn CIS, a primary GRASP faculty member, and a member of the ASSET Center; he also works part-time as a Staff Research Scientist at Apple MLR on multimodal generative models, reasoning, and embodied AI.
Sydney Z. Li
Cohere
Research scientist at Cohere Labs and PhD candidate at Stanford University focused on language models, machine learning systems, and AI safety.
Jiahui Yu
Google Gemini
Jiahui Yu is a Research Lead at OpenAI leading the Perception team. His homepage notes prior co-leadership on Gemini Multimodal at Google DeepMind and work on deep learning and high-performance computing.
Ching-Yao Chuang
NVIDIA
Ching-Yao Chuang is a researcher at OpenAI working on multimodal and generative AI systems. His homepage lists prior roles at xAI and Meta GenAI and a PhD from MIT.
Zihan Wang
DeepSeek / Moonshot AI
Northwestern PhD student working on agentic RL; previously interned at DeepSeek, Microsoft, and Yutori.
Pradeep Dasigi
NVIDIA
Research scientist on the AllenNLP team at the Allen Institute for AI, where his homepage highlights work on open language models such as OLMo and Tulu and a focus on post-training language models.
Shuai Bai
Alibaba Qwen
Senior algorithm expert at Alibaba Group working on large language models, multimodal large language models, and diffusion models.
Ben Wang
Google Gemini / OpenAI
OpenAI's GPT-4 contributions page credits Ben Wang as attention architecture lead for long context. Public profiles identify him as a University of Pennsylvania undergraduate and an OpenAI researcher from 2021 to 2022.
Wenchao Zhou
MiniMax
Public profiles describe Wenchao Zhou as Director of Data Product and Data Analytics at Alibaba Cloud Intelligence and a former tenured computer science faculty member at Georgetown University. His work centers on databases and distributed systems.
Costa Huang
Ai2
Costa Huang is a reinforcement learning researcher and the creator of CleanRL; his public GitHub profile lists Periodic Labs as his current affiliation and AllenAI and Hugging Face as previous affiliations.
Jan Hendrik Kirchner
OpenAI
Jan Hendrik Kirchner is a researcher at Anthropic. His public OpenReview profile lists prior research work at OpenAI and a Brain Science PhD track through Max Planck Schools.
Maarten Sap
Ai2
Carnegie Mellon says Maarten Sap is an assistant professor in the Language Technologies Institute with a courtesy appointment in HCII, and also a part-time research scientist and AI safety lead at the Allen Institute for AI. His CMU profile says he received his PhD from the University of Washington.
Matt Jordan
Ai2
Matt Jordan is a researcher at Ai2 focused on making language models safe and reliable. His public profile says he previously spent more than four years at Google and is currently a PhD student in computer science at the University of Texas at Austin.
Jiatao Gu
Amazon
Jiatao Gu is an assistant professor in the University of Pennsylvania Department of CIS and a primary faculty member of GRASP Lab and the ASSET Center. His homepage also states that he works part-time as a staff research scientist at Apple ML Research after earlier work at Meta AI.
Shannon Zejiang Shen
Ai2
Shannon Zejiang Shen is a PhD student at MIT CSAIL researching collaborative AI systems that augment human intelligence.
Will Merrill
Ai2
William Merrill is a Young Investigator at the Allen Institute for AI and an incoming assistant professor at TTIC. His public faculty profile describes research on the computational power and limits of transformers, computational linguistics, and deep learning theory.
Yuling Gu
Ai2
Yuling Gu's NYU and personal pages describe her as a PhD student at NYU Center for Data Science who previously worked at Ai2 and has contributed to OLMo and OLMES.
Carl Ble
Google Gemini
Research scientist at Google DeepMind interested in efficient and reliable language models, reasoning, and model security. His personal site also highlights prior work on planning, geometry, and representation learning.
Boris Ginsburg
NVIDIA
Boris Ginsburg is a principal engineer and research scientist at NVIDIA whose work focuses on efficient machine learning and deep learning for speech recognition, language processing, and computer vision.
Tom B. Brown
OpenAI
His GitHub profile says he works on robust and aligned AI at Anthropic and previously worked at OpenAI and Google Brain. He is the first author of the GPT-3 paper 'Language Models are Few-Shot Learners'.
Amelie Haghighat
NVIDIA
Amelie Haghighat is an incoming assistant professor at the University of Illinois Chicago whose research focuses on trustworthy and resource-efficient machine learning systems.
Ana Klimovic
NVIDIA
Ana Klimovic is an assistant professor of computer science at ETH Zurich whose research focuses on machine learning systems, efficient hardware acceleration, and sustainable AI.
Uday Bondhugula
NVIDIA
Compiler researcher and entrepreneur, creator of the PolyMage DSL, and Professor at the Indian Institute of Science whose work spans compilers, AI systems, and programming languages.
Rogerio Feris
NVIDIA
Rogerio Feris is a principal scientist and manager at the MIT-IBM Watson AI Lab within IBM Research. His public bios emphasize multimodal perception, data and model efficiency, and large language models with memory.
Jacob Morrison
Ai2
Jacob Morrison's homepage says he is a PhD student at UW and a student researcher on the AllenNLP team at Ai2, where he mainly works on language modeling.
Jiacheng Liu
Ai2
Jiacheng Liu is a researcher at Ai2 whose work focuses on improving the capabilities and understanding of language models. His public homepage says he is currently a PhD student at New York University and has previously spent time at Princeton and Google Research.
Mayee Chen
Ai2
Mayee Chen's public pages describe her as a Stanford computer science Ph.D. student and an Ai2 research intern working on data-centric machine learning and natural language processing.
Yashaswi Karnati
NVIDIA
OpenReview identifies Yashaswi Karnati as a researcher at NVIDIA. His personal homepage describes prior work across intelligent transportation, climate science, data compression, and healthcare, and records completed degrees from the University of Florida and IIT (ISM) Dhanbad.
Ashwin Paranjape
Meta AI
Founding AI Lead at Samaya AI with research interests in reasoning, retrieval, agentic systems, and reinforcement learning; previously completed a Stanford PhD in computer science under Christopher Manning.
Jingren Zhou
MiniMax / Moonshot AI
Jingren Zhou is Chief Technology Officer of Alibaba Cloud. Public speaker biographies describe him as a computer scientist and entrepreneur whose work includes large-scale AI and cloud systems.
Jian Yang
Alibaba Qwen
Jian Yang is an Associate Professor at Beihang University whose research focuses on code intelligence, large language models, and AI agents. He worked with Alibaba Qwen from 2023 to July 2025.
Aston Zhang
Meta AI
Aston Zhang is a member of technical staff at OpenAI, author of Dive into Deep Learning, and a core contributor to Llama 3. His homepage says he earned his Ph.D. from the University of Illinois Urbana-Champaign.
Thomas B. Brown
Google Gemini
Thomas B. Brown is a research scientist at Google DeepMind whose work focuses on language models, multimodality, and reasoning.
Boris Ivanovic
NVIDIA
Boris Ivanovic is a postdoctoral scholar in Stanford's Autonomous Systems Lab whose research focuses on human-centered machine learning and data-driven prediction and planning for autonomous systems.
Geri Skenderi
Cohere
Geri Skenderi is a research scientist at Cohere focusing on multilingual language technology, evaluation of large language models, and natural language processing for low-resource settings.
Hanlin Tang
Cohere / NVIDIA
Hanlin Tang is a researcher at Cohere. His public page says his work focuses on foundation models, large language model post-training, reinforcement learning, and vision-language or language-model agents, and that he previously held research internships at NVIDIA and the Vector Institute.
Saumya Malik
Ai2
Saumya Malik is a predoctoral young investigator at the Allen Institute for Artificial Intelligence working on open language modeling, reward models, and instruction following.
Yifei Qu
Z.ai
Yifei Qu is a technical staff member at JD Explore Academy in Beijing and a Ph.D. student at the University of Texas at Austin. The homepage lists a master's degree from Carnegie Mellon University (2023), a bachelor's degree from Sun Yat-sen University (2021), and an expected graduation in 2027, and describes work under Peter Stone on learning and optimization, reinforcement learning, reasoning, and alignment for large language models.
Yutao Sun
Z.ai
Yutao Sun is a fifth-year Ph.D. student at Zhejiang University supervised by Minlie Huang. The homepage says Yutao Sun has been a visiting student researcher at Stanford University since September 2023 under Tatsunori Hashimoto, earned a bachelor's degree from Tsinghua University in 2019, and studies code generation, code agents, execution feedback, and reasoning in large language models.
Arka Dhar
OpenAI
Public bios identify Arka Dhar as Product Management Director at Google DeepMind and former Head of Data at OpenAI. OpenAI contribution pages list him among contributors to GPT-4o-era model work.
Michael Noukhovitch
Ai2
Michael Noukhovitch is a PhD candidate in artificial intelligence at Mila, affiliated with Université de Montréal. His public homepage lists interests including reinforcement learning, multi-agent systems, natural language processing, and agents with web and computer use.
Chunting Zhou
Meta AI
Research scientist at Meta AI working on natural language processing, dialogue systems, and large language models.
Dayiheng Liu
Alibaba Qwen
Algorithm expert at Alibaba DAMO Academy focused on natural language generation and large language models.
Huazuo Gao
DeepSeek
Researcher at DeepSeek AI working on decision-making and post-training for large language models.
Fuli Luo
DeepSeek
Research scientist working on large language models and retrieval-augmented generation; creator of the open-source project tiny-universe.
Hannaneh Hajishirzi
Ai2
Professor at the University of Washington and Senior Director of NLP at Ai2. Her work spans natural language processing, computer vision, responsible AI, and applications to science and medicine.
Mingchuan Zhang
DeepSeek
Research scientist at DeepSeek interested in large language models, reinforcement learning, robot learning, and machine learning.
Yuling Gu
Ai2
Yuling Gu is a PhD student at the NYU Center for Data Science studying large language models, machine reasoning, and robust evaluation. She was previously a predoctoral researcher at the Allen Institute for AI, where she contributed to OLMo, OLMo 2, OLMo 3, Tulu 3, OLMoE, and OLMES.
Aakanksha Chowdhery
Google Gemini
Aakanksha Chowdhery is a machine learning researcher based in New York City. She works on large-scale machine learning across pre-training, post-training, inference, and system efficiency, and is known for contributions such as PaLM, Pathways, and Gemini.
Jinze Bai
Alibaba Qwen
PhD student at The Hong Kong University of Science and Technology (Guangzhou) whose research interests include large language models, vision-language models, AI agents, and multimodal retrieval.
Liyue Zhang
DeepSeek
Liyue Zhang is a research scientist at DeepSeek whose public profiles highlight multimodal models, reasoning, and AI agents.
Shanghao Lu
DeepSeek
Researcher at DeepSeek AI focusing on multimodal large language models and multimodal understanding.
Yuxiang You
DeepSeek
Research scientist at DeepSeek AI. Previously completed a PhD at Carnegie Mellon University.
Steven Adler
OpenAI
Steven Adler is a former OpenAI safety researcher and technical program manager (2020-2024). He writes Clear-Eyed AI, analyzing AI news, policy, and research with a focus on risk reduction.
Luca Soldaini
Cohere
Luca Soldaini is a lead research scientist at Ai2 on the Olmo team. He previously worked at Amazon Alexa and completed a PhD in computer science at Georgetown University in 2018.
Wangding Zeng
DeepSeek
Wangding Zeng is a researcher at DeepSeek. His OpenReview profile also lists graduate and undergraduate study at Beijing University of Posts and Telecommunications.
Jesse Dodge
Google Gemini / Ai2
Jesse Dodge is a research scientist at Meta Superintelligence Labs. His public work spans evaluation, reproducibility, efficiency, and open large language models, and his homepage says he was at the Allen Institute for AI from 2019 to 2025.
Siyuan Li
Google Gemini / NVIDIA
Siyuan Li is a research scientist at NVIDIA working on large language models, multimodal foundation models, and reinforcement learning. His homepage says he received a PhD in computer science from the University of Toronto in 2024 and previously worked at Meta AI, Microsoft Research, and Mila.
Anej Svete
Ai2
Anej Svete is a PhD Fellow at the ETH AI Center. His homepage says he studies language models with formal language theory and completed a 9-month research internship at Ai2 in 2025.
Carlos Riquelme
Google Gemini
Carlos Riquelme's public website describes him as a senior research scientist at Google Brain who previously completed a PhD in statistical machine learning at Stanford. His public materials emphasize machine intelligence, efficient and sparse models, and reinforcement learning.
Giambattista Parascandolo
OpenAI
Research Fellow at OpenAI focused on fundamental research on reasoning in neural networks. Previously completed a PhD in machine learning through the Center for Learning Systems at ETH Zurich and the Max Planck Institute for Intelligent Systems.
Jiacheng Liu
Ai2
Stanford Data Science lists Jiacheng Liu as a PhD candidate in the Paul G. Allen School of Computer Science and Engineering at the University of Washington. Public sources also link this profile to OLMo 3 authorship and a past Allen Institute for Artificial Intelligence internship.
Shizhe Diao
NVIDIA
Shizhe Diao develops methods to scale post-training and reinforcement learning for large language models and AI agents.
Biao Zhang
Amazon
Public sources identify Biao Zhang as a researcher at Google DeepMind whose recent work covers multilingual machine translation, speech translation, and fine-tuning methods for large language models.
Matthijs Van keirsbilck
NVIDIA
Matthijs Van keirsbilck is a Senior Research Scientist at NVIDIA working on neural network architecture design, structural sparsity, quantization, and training dynamics.
Shane Arora
Ai2
OpenReview lists Shane Arora as a researcher at the Allen Institute for Artificial Intelligence from 2023 to 2025, an MS student at UT Austin from 2021 to 2024, and an undergraduate student at the University of Sydney from 2016 to 2019. Ai2's public directory and papers pages also list Shane Arora.
Tim Green
Google Gemini
Tim Green is a staff research scientist on Google's ML Foundations team. His homepage says he leads research at the interface of compression and memory to unlock more efficient language models, with interests spanning compression, memorization, and efficient model architectures. The same page says he previously completed a PhD in neuroscience at Stanford and a bachelor's degree in computer science and mathematics at Vanderbilt.
Biao Zhang
Google Gemini
Researcher at Google DeepMind working on multilingual machine translation, speech-to-text translation, and efficient neural machine translation.
Kaustubh Jha
NVIDIA
Research scientist at NVIDIA and University of Maryland PhD student whose public site highlights speech, audio, and video generation plus alignment for large language and vision-language models.
Siddhartha Brahma
Google Gemini
Research scientist at Google Research in New York City. He earned a PhD in computer science from NYU Tandon and works on language models, multimodal learning, NLP, and deep learning.
Jack Clark
Anthropic / OpenAI
Co-founder and Head of Policy at Anthropic. His public biography also notes earlier work as Policy Director at OpenAI and as a technical journalist, and identifies him as the author of the Import AI newsletter.
Akshita Bhagia
Ai2
Research scientist at Ai2 and PhD candidate at the University of Washington. Her work focuses on language models, evaluation, and building practical open-source AI systems.
Ameet Talwalkar
NVIDIA
Ameet Talwalkar is an associate professor in the Machine Learning Department at Carnegie Mellon University and Chief Scientist at Datadog. His public research spans AI for science, human-AI interaction, and specialized models and agents.
Keqin Bao
Alibaba Qwen
Ph.D. candidate at the University of Science and Technology of China whose research focuses on improving the general reasoning abilities of foundational large language models, LLMs for recommendation, and scaling reinforcement reasoning; he lists himself as a core contributor to the Qwen2.5 and Qwen3 technical reports.
Mujeen Sung
LG AI Research
Mujeen Sung is an assistant professor at Kyung Hee University and formerly served as a technical lead at LG AI Research. His research focuses on natural language processing, controllable and efficient large language models, and language generation and interpretation.
Ruyi Chen
DeepSeek
Researcher at DeepSeek focused on large language models, reinforcement learning, multimodal large language models, and generative AI.
Slava Petrov
Google Gemini
Research leader at Google DeepMind and Google Research focused on natural language processing and machine learning, including large language models such as PaLM and Gemini.
Yuntian Deng
Google Gemini
Yuntian Deng is a machine learning researcher whose public work spans language modeling, reasoning, and large multimodal systems.
Bo Zheng
Alibaba Qwen
Senior Research Manager and Senior Director at Tongyi Lab, Alibaba Group, leading agents and AI infrastructure research.
Chunlin Li
MiniMax
Chunlin Li is a research scientist at MiniMax.
Clémentine Fourrier
Mistral AI
AI researcher focused on evaluating language models and agents, open NLP research, and historical linguistics. She led evaluation efforts at Hugging Face between 2023 and 2025 and helped build LightEval and the Open LLM Leaderboard.
Dirk Groeneveld
Ai2
Ai2 engineer whose public GitHub and Hugging Face profiles describe him as building LLMs at Ai2; he is also listed on OpenReview as an OLMo author.
Katarina Slama
OpenAI
Research scientist at the UK AI Security Institute and former OpenAI member of technical staff who worked on model behavior and post-training research. Previously conducted computational neuroscience research at UC Berkeley.
Nouha Dziri
Ai2
Research scientist at Ai2 focused on language models, evaluation, and AI safety. Her work studies reasoning, truthfulness, and how to make large models more reliable.
Scott Gray
OpenAI
OpenAI engineer focused on GPU systems and large-model infrastructure, with public contributions to GPT-4 and earlier OpenAI research on efficient GPU kernels and code models.
Tianyi Tang
Alibaba Qwen
Research Scientist on the Qwen team at Alibaba Cloud.
Stanislas Polu
Cohere
Stanislas Polu is Co-Founder and Engineer at Dust. His public site also documents earlier work at OpenAI, Stripe, and projects in theorem proving, browsers, and Bitcoin-related systems.
Angela Fan
Meta AI / Mistral AI
Recent public bios describe Angela Fan as a researcher at Meta working on large language models, machine translation, multilingual generation, and story generation.
Kai Dong
DeepSeek
DeepSeek engineer and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.
Sandhini Agarwal
OpenAI
Sandhini Agarwal is a researcher at OpenAI. Her OpenReview profile dates that role from 2020 to the present and lists undergraduate study at Stanford University (2015–2019).
Shane Arora
Ai2
Shane Arora is a research scientist at Ai2. His profile on The Org identifies that current role, and his OpenReview profile records earlier work as a senior applied scientist at Amazon as well as bachelor's and master's degrees in computer science from the University of Texas at Austin.
Allyson Ettinger
Ai2
Allyson Ettinger is a senior research scientist at Ai2 working on the OLMo pre-training team. Her public materials focus on natural language understanding, language model behavior, and learning from text and experience.
Finbarr Timbers
Ai2
Finbarr Timbers is an AI researcher and investor. His homepage says he works at Ai2 on RL for LLMs and previously worked at Midjourney and DeepMind.
Nathan Lambert
Ai2
Nathan Lambert's homepage says he is the post-training lead at Ai2, where he works on open language models and responsible autonomous systems. His public CV lists a PhD in Computer Science from UC Berkeley.
Dan Busbridge
Meta AI
Dan Busbridge's public OpenReview profile lists him as a researcher at Apple, and his Hugging Face profile highlights interests including deep learning, optimization, and large language modeling.
Joshua Clymer
NVIDIA
Joshua Clymer works on AI safety and theoretical questions around superposition and representation learning. Current public mentor material places him at Redwood Research and previously at METR.
Samy Bengio
Meta AI
Senior Director, AI and Machine Learning Research at Apple and adjunct professor at EPFL. His homepage highlights work on reasoning limits for autoregressive models, deep sequence architectures, and ML systems.
Victoria Graf
Ai2
Victoria Graf studies at the University of Washington and is publicly credited on open-model work including Tulu 3 and OLMo 3.
Daniel Khashabi
Cohere
Assistant Professor of Computer Science at Johns Hopkins University working on helpful, reliable, and efficient language-driven AI systems. Previously a postdoctoral researcher at the Allen Institute for AI.
Samyam Rajbhandari
NVIDIA
Samyam Rajbhandari is a principal AI research scientist at Snowflake AI Research. His public profile describes work on large-scale deep learning systems, distributed training, and efficient optimization, and lists prior Microsoft research experience.
Saurabh Shah
Ai2
Saurabh Shah's public site says he works at humans& and previously worked on post-training the OLMo language model at Ai2.
Wen-tau Yih
Ai2
Scott Wen-tau Yih is a research scientist at Meta FAIR whose recent work focuses on neural retrieval and retrieval-augmented generation; his homepage lists prior roles at Ai2 and Microsoft Research.
Stephen James
NVIDIA
Assistant professor at the University of Oxford and research scientist at NVIDIA. His public work focuses on robot learning, robotics, reinforcement learning, computer vision, and multimodal models.
Furu Wei
Microsoft
Furu Wei is a Distinguished Scientist and Chief Scientist of Microsoft Research Asia. He is listed on Microsoft Research, and LLMpeople connects him to Microsoft technical reports including Kosmos, VALL-E, BitNet, and Multilingual E5.
Carroll Wainwright
OpenAI
Founder and AI Advisor at Metaculus and a named contributor to OpenAI's GPT-4 Technical Report.
Chengyuan Li
Alibaba Qwen
Chengyuan Li is a research intern with Alibaba's Qwen team. His homepage highlights work on reasoning and post-training for large language models.
R. X. Xu
DeepSeek
R. X. Xu is a research scientist at DeepSeek AI. His homepage says he works on trustworthy and efficient large language models, open-ended reasoning, and AI for healthcare.
Shengding Hu
Shanghai AI Laboratory
Shengding Hu is a researcher at Shanghai AI Laboratory and a PhD candidate at Tsinghua University working on large language models, multimodal models, and AI agents.
Carl Ross
OpenAI
Research scientist at OpenAI focused on neural scaling, autonomous learning systems, language models, and reinforcement learning.
David Schnurr
OpenAI
Software engineer focused on developer tools, infrastructure, and AI products. He has worked at OpenAI in San Francisco and publishes engineering projects and writing on his website and GitHub.
Junlong Li
DeepSeek
MPhil student in the Department of Computer Science and Engineering at HKUST interested in LLM agents and multimodal models; previously interned with the DeepSeek LLM Alignment Team.
Justin Jay Wang
OpenAI
Designer focused on brands, websites, and creative tools. He helped grow design at OpenAI from 0 to 1 and later joined Cursor.
Madelaine Boyd
OpenAI
Madelaine Boyd is a staff engineer at OpenAI. Her public GitHub profile links to her personal website, and official OpenAI contributions pages list her work on GPT-4 and GPT-4.5.
Michelle Pokrass
OpenAI
Michelle Pokrass is an OpenAI researcher working on multimodal AI, especially speech and audio. Her public profiles also show work on ChatGPT and model releases including GPT-4.1.
Qiancheng Wang
DeepSeek
Researcher at DeepSeek focused on multimodal large language models, world models, reinforcement learning, and embodied AI.
Quinten Anthony
Cohere
Research scientist at Cohere focused on scaling machine learning systems and improving training efficiency.
Shengguang Wu
Alibaba Qwen
Ph.D. student in Computer Science at Stanford University who previously worked on LLMs and agents with Alibaba's Qwen team.
Alejandro Lopez-Lira
Cohere
Alejandro Lopez-Lira is an assistant professor of finance at the University of Florida whose research interests include investments, machine learning, and empirical asset pricing.
Alethea Power
OpenAI
Alethea Power is a research scientist at OpenAI. Her personal website notes earlier PhD work at Georgia Tech and research interests spanning language, reasoning, cognitive science, and brain-inspired approaches to intelligence.
Andrew Cann
OpenAI
Andrew Cann is a software engineer at OpenAI, as identified on his public website and GitHub profile. OpenAI's contributions pages list his work on GPT-4 and GPT-4.5.
Benfeng Xu
Alibaba Qwen
Benfeng Xu is an LLM post-training researcher at Alibaba Group and an incoming assistant professor at Carnegie Mellon University. His homepage says he works on language models, efficient and trustworthy machine learning, AI for science, and biomedicine.
Bill Yuchen Lin
Ai2
Researcher working on language models, agents, and retrieval-augmented generation; currently at xAI and incoming assistant professor at the University of Washington, previously a research scientist at the Allen Institute for AI.
Bingchen Feng
Amazon
Applied scientist at Amazon focused on natural language and speech processing, machine learning, and multimodal learning.
Bing Xue
DeepSeek
Research scientist at DeepSeek interested in reinforcement learning, large language models, and reasoning.
Chang Gao
Alibaba Qwen
Researcher at Alibaba's Qwen team, previously a PhD student in the CUHK Text Mining Group, working on RL for LLM reasoning and alignment.
Christina Kim
OpenAI
Researcher at OpenAI on the mid-training team, previously working on post-training and reinforcement learning including WebGPT, ChatGPT with Browsing, and GPT-4.
Chujie Zheng
Alibaba Qwen
Chujie Zheng is an incoming senior researcher at Alibaba Group. His academic homepage says he is finishing a Ph.D. at the University of Wisconsin-Madison and works on large language model reasoning, alignment, and reinforcement learning.
Deepak Narayanan
NVIDIA
Research scientist at NVIDIA working on AI systems and large-scale machine learning, with research spanning distributed systems, deep learning, and networking.
Dibya Ghosh
Cohere
Dibya Ghosh is a machine learning researcher at Cohere and a PhD student in computer science at UC Berkeley advised by Sergey Levine. His work spans reinforcement learning and large language models, with a focus on how foundation models can improve learning agents.
Diyi Yang
Cohere
Diyi Yang is an assistant professor of computer science at Stanford University. Her research focuses on natural language processing and machine learning, especially human-centered AI, social computing, and computational social science. She earned her PhD in language technologies from Carnegie Mellon University.
Emilie Grave
Meta AI
Emilie Grave is a research scientist at FAIR. Her research interests include representation learning, natural language processing, and machine learning.
Gaurav Dhuliawala
Google Gemini
Postdoctoral fellow at Johns Hopkins University and incoming assistant professor at UNC Chapel Hill whose work focuses on multilingual NLP, grounded language understanding, and societal impacts of language technologies.
Han Bao
DeepSeek
Han Bao is a research scientist at DeepSeek AI. His homepage says he works on machine learning theory, including out-of-distribution generalization, robust machine learning, self-supervised learning, and learning-based control.
Hao Fei
Tencent Hunyuan
Researcher currently at Tencent Hunyuan with public work in natural language processing, structured information extraction, and large language models.
Hongyi Yuan
Alibaba Qwen
Engineer on the Qwen team at Alibaba Cloud.
Isabella Fulford
OpenAI
Member of Technical Staff at OpenAI and co-instructor of DeepLearning.AI's ChatGPT Prompt Engineering for Developers and Building Systems with the ChatGPT API courses; listed as a co-author of the GPT-4 Technical Report.
Iz Beltagy
Ai2
Iz Beltagy is an NLP researcher known for work on OLMo, Longformer, SciBERT, and other open language technologies. His public profile highlights research on open language models, scientific NLP, and long-context modeling.
Jessica Shieh
OpenAI
Research scientist at OpenAI interested in general intelligence; previously studied computer science and cognitive science at Stanford and UC Berkeley.
Jingxiang Sun
DeepSeek
PhD student at Tsinghua University researching multimodal large language models, digital avatars, and 2D/3D content generation; previously an AGI student researcher at DeepSeek AI who led DreamCraft3D and contributed to DeepSeek-VL.
Kexin Huang
DeepSeek
PhD student at Shanghai Jiao Tong University advised by Prof. Xueqi Cheng. Her research interests include retrieval and agents, multimodal large language models, and biomedical large language models, and she is currently an intern at DeepSeek.
Leo Gao
OpenAI
Research scientist at OpenAI working on core post-training and previously on pre-training, with interests in personalized and safe AI.
Lewis Tunstall
Cohere
Lewis Tunstall is a principal scientist at Cohere Labs who works on open-source language models, evaluation, and multilingual NLP.
Litong Wang
DeepSeek
Researcher at DeepSeek AI and incoming Ph.D. student at the University of Hong Kong working on efficient and trustworthy LLMs, multimodal large language models, and reasoning.
Machel Reid
Meta AI
Research scientist at Meta FAIR focused on multilinguality, machine translation, speech, and web-scale processing; previously a postdoctoral scholar at Stanford's Center for Research on Foundation Models.
Michael Petrov
OpenAI
Technical staff at OpenAI interested in the theoretical foundations and algorithms for large language models and reinforcement learning.
Mingxiao Li
Moonshot AI
Mingxiao Li is a machine learning researcher focused on vision-language models, generative models, and multimodal learning. His public homepage lists a PhD at KU Leuven, and public research outputs also list him with Moonshot AI Ltd. affiliation.
Moin Nadeem
Ai2
Empirical AI researcher at the Allen Institute for AI whose work focuses on post-training, synthetic data, and evaluation.
Paul Baltescu
OpenAI
Machine learning researcher and engineer; contributed to the GPT-4 Technical Report; previously an MSc by Research student at the University of Oxford Department of Computer Science.
Peter Welinder
OpenAI
Founding member of OpenAI and vice president of product who writes publicly about building things and machine learning.
Saurabh Pan
Meta AI
Saurabh Pan is an AI researcher at Meta FAIR and a PhD student at Carnegie Mellon University. His work focuses on self-supervised learning, generative modeling, multimodal learning, and representation learning.
Sebastian M. R. Arnold
Google Gemini
Research software engineer at Google DeepMind working on machine learning for code and formal reasoning; completed a PhD at TU Darmstadt on uncertainty quantification in deep learning.
Ted Sanders
OpenAI
Technical staff at OpenAI working on machine learning research, large language models, multimodal models, and robotics.
Todor Markov
OpenAI
Research scientist at OpenAI working on pretraining and reasoning; previously studied computer science at Stanford University.
Tyna Eloundou
OpenAI
Member of Technical Staff at OpenAI focused on safety evaluations, economic impact evaluations, and democratic inputs to AI.
Ujjwal Malladi
NVIDIA
Software engineer at NVIDIA with public work spanning machine learning systems and large language models, including Nemotron-4.
Vijay Janapa Reddi
NVIDIA
Vijay Janapa Reddi is a professor of electrical engineering and computer science at Harvard University whose research spans computer architecture, systems, compilers, and machine learning.
Vinnie Monaco
OpenAI
Associate Professor in the Department of Computer Science at the University of Alabama at Birmingham whose research spans computer security, machine learning, pattern recognition, biometrics, and formal methods.
Wanjia Zhao
DeepSeek
Research scientist at DeepSeek interested in post-training, tool use, and in-context learning.
Wen Ouyang
Tencent Hunyuan
Wen Ouyang is a Vice President of AI at Tencent and Director of the Visual Computing Center, where he leads the Hunyuan multimodal foundation model team. His research covers computer vision, multimodal models, and visual understanding.
William Merrill
Ai2
Assistant Professor at the NYU Courant Institute of Mathematical Sciences. He studies formal models of intelligence, including language, learning, and efficient reasoning, and previously worked as a research scientist at Ai2.
Xiang Lisa Li
Cohere
Xiang Lisa Li is a researcher focused on controllable and steerable language models. Her public profile highlights work including Diffusion-LM, Prefix-Tuning, Contrastive Decoding, and evaluation methods such as AutoBencher.
Yiyang Ma
DeepSeek
Research scientist at DeepSeek AI focused on multimodal large language models, reinforcement learning, and efficient post-training.
Yunxing Dai
OpenAI
Yunxing Dai is a researcher at OpenAI and a co-author of the GPT-4 Technical Report. His public homepage and Google Scholar profile provide verified research identity links.
Zhengyan Zhang
DeepSeek
Researcher in natural language processing and pre-trained language models, with public academic work from Tsinghua University and related NLP publications.
Zhen Qin
Cohere
Research scientist at Cohere working on large language models; previously a postdoctoral researcher in machine learning at Carnegie Mellon University.
Zhiniu Wen
DeepSeek
Zhiniu Wen is a research scientist at DeepSeek. His homepage says his research interests include natural language processing, machine learning, large language models, and their applications.
Adriana Hummel
NVIDIA
Adriana Hummel is a PhD student in computer science at Stanford University working on efficient and trustworthy AI systems.
Anjali Gupte
NVIDIA
Anjali Gupte is a PhD student in computer science at the University of Southern California whose research focuses on trustworthy AI, natural language processing, and information retrieval.
Caia Costello
Ai2
Public sources identify Caia Costello as a Stanford computer science MS student and quote her in 2026 as an Applied ML Researcher at Lambda; she is also a coauthor of OLMo Hybrid.
Ming-Wei Chang
Amazon
Research Scientist at Google DeepMind working on machine learning and artificial intelligence.
Zizheng Pan
DeepSeek
Zizheng Pan's homepage says he has worked on multimodal LLMs at DeepSeek since 2024. The same page says he previously completed a PhD in Computer Science at Monash University and a B.E. in Software Engineering at Harbin Institute of Technology, Weihai.
Size Zheng
DeepSeek
ByteDance researcher working on high-performance inference systems, AI compilers, and distributed systems; completed a PhD in computer science at Peking University in 2024.
Supriya Kalluri
Cohere
PhD candidate at the University of Washington and research scientist at Cohere working on natural language processing and machine learning.
Ilge Akkaya
OpenAI
Ilge Akkaya is a research scientist at OpenAI working on reinforcement learning and robotics. His personal website lists a PhD in EECS from UC Berkeley and a BS in Electrical and Electronics Engineering from Bilkent University.
Noah A. Smith
Ai2
Noah A. Smith is a computer scientist and professor at the University of Washington, where he serves as Vice Provost for Artificial Intelligence and co-directs the OLMo open language modeling effort with Ai2. His research focuses on natural language processing, machine learning, and evaluation methodology.
Arthur Mensch
Mistral AI
Co-founder and CEO of Mistral AI and a researcher on efficient large language models and mixture-of-experts systems.
David Atkinson
Ai2
David Atkinson is a senior research scientist at Ai2 working on multimodal learning, document understanding, visual question answering, and natural language processing.
Peter Clark
Ai2
Peter Clark is Senior Director for AI at the Allen Institute for Artificial Intelligence (Ai2), where he works on natural language understanding, reasoning, and question answering.
Guillaume Lample
Meta AI / Mistral AI
Mistral AI's about page lists Guillaume Lample as one of the company's three founders. His OpenReview profile lists expertise in machine translation and natural language processing and a PhD in computer science at Université Pierre et Marie Curie - Paris 6.
Li Dong
Microsoft
Li Dong is a Microsoft Research principal researcher focused on human language technologies and machine intelligence.
Yichang Zhang
Alibaba Qwen
Yichang Zhang is a research scientist in Alibaba's Tongyi Lab and part of the Qwen team. His public profiles describe work on machine reasoning, agent systems, post-training, synthetic data, evaluation, large language models, multimodal learning, and perception.
Zhihong Shao
DeepSeek
Research scientist at DeepSeek AI working on multimodal large language models and end-to-end autonomous driving. Earned a PhD in computer science from the Chinese University of Hong Kong.
Ashish Sabharwal
Ai2
Ashish Sabharwal is publicly listed as Principal Researcher at the Allen Institute for AI. Public sources also list him as a coauthor of the OLMo 3 technical report.
Alisa Liu
Ai2
Alisa Liu researches language models, including tokenization, data creation, and inference-time adaptation.
Aman Rangapur
Ai2
Aman Rangapur says he works on language models at Ai2 with a focus on modeling, scaling, and infrastructure, and lists prior research roles at Illinois Institute of Technology and VIT-AP.
Xin Dong
NVIDIA
Xin Dong's homepage says he leads a research team on LLM training at Seed at ByteDance. It also states that he earned a Harvard PhD in 2023 and previously worked at NVIDIA, Meta, and Tencent.
Yoram Bachrach
Google Gemini
Yoram Bachrach is a research scientist at Meta in London focused on LLM agents and multi-agent reasoning. He previously worked at Google DeepMind and Microsoft Research in Cambridge.
Peter J. Liu
Cohere
Co-founder of Twenty Labs. His homepage says he previously worked as a research scientist at Google DeepMind, after earlier production machine learning work on Gmail Spam.
Suchir Balaji
OpenAI
Suchir Balaji was a Member of Technical Staff at OpenAI (Nov 2020 to Aug 2024). His personal site states he studied Computer Science at UC Berkeley (2017 to 2021).
Natalie Summers
OpenAI
OpenAI staff member whose public speaker bio says she ran the Artist Access Program; earlier roles included Apple and journalism work at WIRED and USA TODAY.
Bryan Catanzaro
NVIDIA
Vice President of Applied Deep Learning Research at NVIDIA, leading work on conversational AI, generative AI, and accelerated deep learning software.
Caiming Xiong
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of the xLAM-2 Technical Report.
Dario Amodei
Anthropic / OpenAI
Co-founder and CEO of Anthropic.
Silvio Savarese
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of xLAM: A Family of Large Action Models to Empower AI Agent Systems.
Nathan Lambert
Ai2
Machine learning scientist at Ai2 working on reinforcement learning, language models, and online social systems.
Albert Q. Jiang
Mistral AI
Researcher and engineer at Mistral AI focused on distributed machine learning and efficient large language models.
Minghui Tang
DeepSeek
Research scientist at DeepSeek.
Zhewen Hao
DeepSeek
First-year Ph.D. student in Information Engineering at the Chinese University of Hong Kong advised by Dahua Lin and Jifeng Dai. His research interests include understanding and building intelligence in multimodal large language models.
Shaohan Huang
Microsoft
Shaohan Huang is a senior researcher in the General Artificial Intelligence Group at Microsoft Research Asia in Beijing. OpenReview lists him as a Microsoft researcher and a former master's student at Beihang University.
Diogo Almeida
OpenAI
Diogo Almeida is an AI researcher and a co-author of the InstructGPT paper (arXiv:2203.02155).
Alexander Keller
NVIDIA
NVIDIA Research identifies Alexander Keller as a senior director of research, formerly chief scientist at mental images and previously a professor at Ulm University. His research interests are at the intersection of graphics, communications, and machine learning.
Mary Phuong
Cohere
Mary Phuong's 2026 MATS mentor profile describes her as a research scientist at Google DeepMind working on frontier safety loss-of-control problems, including AGI control, security, and monitoring.
Yanhong Li
Ai2
Yanhong Li is a pre-doctoral researcher at the Allen Institute for AI working on data-efficient pretraining, language model efficiency, and hybrid model theory.
Jaron Lochner
Ai2
Publicly listed as a researcher at the Allen Institute for Artificial Intelligence and named on the OLMo 3 author list.
Pavlo Molchanov
NVIDIA
Pavlo Molchanov leads deep learning efficiency work at NVIDIA Research, with public profiles covering LLM and VLM efficiency, model compression, adaptive inference, and earlier computer vision research.
Scott Geng
Ai2
Scott Geng's homepage identifies him as a University of Washington PhD student and visiting researcher at FAIR; the OLMo 3 technical report lists him as a coauthor.
Teng Xiao
Ai2
Teng Xiao is a Young Investigator at the Allen Institute for AI and the University of Washington. His homepage says his research interests are in machine learning, natural language processing, and reinforcement learning.
Myle Ott
Meta AI
Myle Ott is a research scientist at Meta working on the Llama models. His interests include scalable optimization, distributed systems, and language model pretraining.
Panpan Huang
DeepSeek
DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.
Gretchen Krueger
OpenAI
Software engineer at OpenAI focused on inference. Previously studied computer science at Princeton University.
Lester James V. Miranda
Ai2
Machine learning researcher and PhD student at the University of Cambridge focused on multilingual and equitable NLP, especially for low-resource languages such as Filipino. He previously worked on open language model efforts at Ai2 including Tulu 3 and OLMo 2.
Long Ouyang
OpenAI
Researcher at OpenAI who led InstructGPT and GPT-4 post-training. He previously co-founded Merlyn Mind and was an engineering director at Quora.
Lucas Beyer
Google Gemini
Lucas Beyer is an ML researcher at Google DeepMind in Zurich. His public homepage highlights prior work at Google Brain and a PhD at ETH Zurich.
Maarten Sap
Ai2
Maarten Sap is an assistant professor at the University of Washington and a senior research scientist at the Allen Institute for AI. His work focuses on human-centered language technologies and social NLP.
Nick Ryder
OpenAI
Nick Ryder is VP of Research, Foundations at OpenAI and a contributor to GPT-4. He previously studied mathematics at UC Berkeley and Rice University.
Nikolay Savinov
Google Gemini
Research scientist at Google DeepMind on the Gemini team, working on multimodal AI.
Pranav Shyam
OpenAI
Engineer and founder who previously worked at OpenAI, Palantir, and Lyft and now builds the AI search startup Basalt. He publishes public writing on AI products and software on his personal site.
Xiaoxiang Wang
DeepSeek
Research scientist at DeepSeek working on post-training and reinforcement learning, with interests in systems and AI for science.
Yunfei Chu
Alibaba Qwen
Algorithm expert at Alibaba Group working on computer vision, multimodal learning, and large language models.
Junxiao Song
DeepSeek
DeepSeek report author whose DBLP record includes DeepSeek LLM, DeepSeekMath, DeepSeek-Coder-V2, DeepSeek-V3, DeepSeek-R1, Janus, and JanusFlow work.
Haowei Zhang
DeepSeek
DeepSeek report author whose DBLP-linked publication record includes DeepSeek LLM, DeepSeek-Coder-V2, Janus, DeepSeek-V3, and DeepSeek-R1 work.
Dejian Yang
DeepSeek
DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.
Peiyi Wang
DeepSeek
DeepSeek report author whose DBLP record includes DeepSeek LLM, DeepSeekMath, DeepSeek-Coder-V2, DeepSeek-V2, DeepSeek-V3, and DeepSeek-R1 work.
Jianwei Zhang
Alibaba Qwen
Alibaba Qwen report author whose DBLP profile identifies an Alibaba Group DAMO Academy affiliation and Qwen technical report authorship.
Peng Wang
Alibaba Qwen
Alibaba Qwen report author whose DBLP profile identifies an Alibaba Group affiliation and Qwen technical report authorship.
Wenbin Ge
Alibaba Qwen
Alibaba Qwen report author whose DBLP record includes Qwen2.5-VL and Qwen technical report work on multimodal and large language models.
Yonghui Wu
Google Gemini
Google researcher whose official profile says he joined Google in September 2008 and has been with Google Brain since January 2015, with research interests spanning information retrieval, machine learning, machine translation, and natural language processing.
Chenggang Zhao
DeepSeek
Research engineer at DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Jiashi Li
DeepSeek
DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.
Damai Dai
DeepSeek
DeepSeek report author whose DBLP publication record includes DeepSeek LLM, DeepSeekMoE, DeepSeek-V2, DeepSeek-V3, and DeepSeek-R1 work.
Deli Chen
DeepSeek
DeepSeek report author whose DBLP record includes DeepSeek LLM, DeepSeekMoE, DeepSeek-Coder-V2, DeepSeek-V3, and DeepSeek-R1 work.
Guanting Chen
DeepSeek
DeepSeek-AI researcher whose DBLP profile lists a DeepSeek-AI affiliation and DeepSeek-Coder, DeepSeek-V2, and DeepSeek-V3 authorship.
Jianhong Tu
Alibaba Qwen
Researcher on Alibaba's Qwen team focused on pre-training and post-training for multimodal large language models and reasoning.
Pei Zhang
Alibaba Qwen
Alibaba Qwen report author whose DBLP profile identifies an Alibaba Group affiliation and Qwen technical report authorship.
Xiao Bi
DeepSeek
DeepSeek report author whose DBLP record includes DeepSeek LLM, DeepSeek-Coder, DeepSeek-Coder-V2, DeepSeek-V2, DeepSeek-V3, and DeepSeek-R1 work.
Xin Liu
DeepSeek
DeepSeek-AI researcher whose DBLP profile lists a DeepSeek-AI affiliation and DeepSeek LLM, DeepSeek-V2, and DeepSeek-V3 authorship.
Shuming Ma
Microsoft
Co-author of the BitNet b1.58 2B4T Technical Report; the paper's author note states that S. Ma is with Microsoft Research.
Andrea Vallone
OpenAI
Andrea Vallone is publicly credited on OpenAI's GPT-4 safety and policy evaluations, and late-2025 to early-2026 reporting said she left OpenAI for Anthropic's alignment team.
David Heineman
Ai2
David Heineman works on the OLMo project at Ai2 and says he previously completed his undergraduate studies at Georgia Tech.
Weijia Shi
Ai2
Weijia Shi is a Ph.D. candidate at the University of Washington. Her research focuses on building augmented and modular architectures to improve the quality, efficiency, and capabilities of language models.
Yejin Choi
Ai2
Dieter Schwarz Foundation Professor and Senior Fellow in Stanford Computer Science and HAI. Her public homepage notes previous roles as professor at the University of Washington and senior director at Ai2.
Benjamin Mann
OpenAI
Public conference bios describe Benjamin Mann as a co-founder and member of the technical staff at Anthropic. They also describe earlier OpenAI work on infrastructure, efficiency, and safety for GPT-3.
Fucong Dai
DeepSeek
Research scientist at DeepSeek with public GitHub projects on large language models and AI systems.
Hamish Ivison
Ai2
Hamish Ivison's homepage says he is a PhD student in Computer Science and Engineering at the University of Washington working on natural language processing and post-training for language models. Public sources also list him as a coauthor of OLMo 3.
Ian Magnusson
Ai2
Ian Magnusson's public site says he does AI research at the University of Washington and the Allen Institute for AI, focused on language-model science and evaluation.
Niklas Muennighoff
Ai2
Niklas Muennighoff researches language models and evaluation at Stanford and has publicly led open-model work including OLMoE.
Akhil Mathur
Meta AI
Akhil Mathur is a postdoctoral research scientist at Meta FAIR whose public homepage highlights work in natural language processing and machine learning.
Alan Soldaini
Ai2
Research scientist at Ai2 working on language models and open research infrastructure. His interests include natural language processing, machine learning, and building reliable open-source systems.
Angela Jiang
OpenAI
Engineer at OpenAI. Formerly worked on generative AI at Disney and on data science projects with organizations including MIT and Tufts, and studied at MIT.
Atty Eleti
OpenAI
Research scientist at Thinking Machines Lab with prior language-model work at OpenAI and earlier quantitative research at Bridgewater. Her public profile highlights interests in scaling language models, new architectures, and adaptive inference.
Baptiste Rolland
Meta AI
Research scientist at Meta AI working on applied mathematics, machine learning, and artificial intelligence.
Billy Roziere
Meta AI
Research scientist at FAIR Meta and teacher affiliated with ENS, INRIA, and PSL.
Braden Hancock
Google Gemini
Braden Hancock is a researcher at Google DeepMind whose public homepage highlights work on large language models, weak supervision, and knowledge representation.
Cameron Raymond
OpenAI
Cameron Raymond is a software engineer at OpenAI. His personal site says he previously worked at Asana, led developer experience at Stripe, and is an author of Staff Engineer.
Chandra Bhagavatula
Ai2
Research scientist at Ai2 focused on natural language processing, commonsense reasoning, long-form generation, narrative intelligence, and text-based games.
Chelsea Voss
OpenAI
Research manager at OpenAI with a public site covering publications and projects.
Christine McLeavey
OpenAI
Audio research leader at OpenAI. She previously worked on MuseNet and Jukebox, and earlier studied physics at Princeton and neuroscience at Stanford before a professional career as a classical pianist.
Daniel Levy
OpenAI
Research Scientist at OpenAI working on deep learning theory, AI alignment, and large language models; he previously completed a PhD at Stanford advised by Percy Liang.
David Hall
NVIDIA
Principal research scientist at NVIDIA working on recursive intelligence, test-time reasoning, AI for biology, multimodal generation, and code generation.
David Karamcheti
Google Gemini
David Karamcheti is a research scientist at Google DeepMind whose public homepage highlights work on multimodal foundation models and embodied intelligence.
David Medina
OpenAI
Researcher at OpenAI working on multimodal and agentic systems. He previously worked on machine learning systems and video understanding, and is completing a PhD at Columbia University.
David Wadden
Ai2
David Wadden is an assistant professor at the University of Washington working on natural language processing and AI systems for science and high-stakes domains. His public profile highlights information extraction, scientific NLP, and trustworthy language model applications.
Derek Chen
OpenAI
Research scientist at OpenAI focused on multimodal reinforcement learning and natural language processing; PhD candidate at Stanford University.
Dmitriy Lepikhin
Google Gemini
Distinguished engineer on Google's Gemini LLM team working on machine learning systems, generative AI, and data center infrastructure.
Edward J. Hu
Cohere
Edward J. Hu is a machine learning researcher known for efficient adaptation methods for large language models, including LoRA and QLoRA. His public profile focuses on parameter-efficient fine-tuning, model editing, and practical LLM systems.
Evan Morikawa
OpenAI
Member of Technical Staff at OpenAI working on large language models. He has also worked on software at YouTube and Google and studied computer science at Stanford University.
Gustavo de Rosa
Google Gemini
Research scientist at Google DeepMind working on language models, multimodal learning, evaluation, and alignment.
Haozhun Jin
OpenAI
OpenAI team member in the San Francisco Bay Area and co-author of the GPT-4 Technical Report.
Heewoo Jun
OpenAI
Member of Technical Staff at OpenAI. Previously at New York University, where he completed a PhD in computer science advised by Kyunghyun Cho. His interests include natural language processing, machine learning, and deep learning.
Heng-Tze Cheng
Google Gemini
Senior Staff Software Engineer at Google working on machine intelligence, data management, and large-scale machine learning systems.
Hrant Khachatrian
Cerebras Systems
Hrant Khachatrian is a machine learning researcher and engineer at FAST and a co-founder of YerevaNN, with work spanning natural language processing, computer vision, and efficient language models.
Ikai Lan
OpenAI
Engineer at OpenAI credited in the GPT-4 technical report for reliability engineering. Previously worked on Google App Engine and at LinkedIn, according to his PyCon 2011 speaker profile.
Jake Kenevey
Meta AI
Software engineer at Meta building intelligence for wearables; he previously worked on developer infrastructure for language models and assistant products, as well as on-device NLP systems.
Jan Kautz
NVIDIA
Vice President of Learning and Perception Research at NVIDIA, leading work in computer vision, graphics, and machine learning.
Jianmo Ni
Cohere
Jianmo Ni is a researcher focused on information retrieval, question answering, and large language model systems. His public profile highlights retrieval-augmented generation, ranking, and efficient NLP methods.
Jiayi Weng
OpenAI
PhD student in Computer Science at Stanford University and affiliated with OpenAI. Her research focuses on natural language processing and machine learning, and she is a co-author of the GPT-4 Technical Report.
Karthik Pattabiraman
NVIDIA
Karthik Pattabiraman is a professor of computer science at the University of British Columbia whose research spans software engineering, dependable distributed systems, systems performance, and software security.
Keunhong Park
NVIDIA
Researcher focused on world models, 3D generation, and neural rendering. He is a founding member of World Labs, previously worked at Google on 3D asset generation, and received a PhD from the University of Washington in 2021.
Liam Fedus
OpenAI
Liam Fedus is a research scientist at OpenAI. OpenAI's staff page says his research interests include reinforcement learning and language models, with a focus on improving capabilities, safety, and robustness.
Lilian Weng
OpenAI
Lilian Weng is vice president of research at OpenAI. OpenAI's staff page says she works on AI systems that can mimic and reason about our world to improve intelligence.
Łukasz Kondraciuk
OpenAI
Member of Technical Staff at OpenAI focused on advancing the capabilities of large language models.
Lysa Dash
Mistral AI
Research scientist at Mistral AI and PhD candidate at Mila whose interests include causal representation learning, world models, and reinforcement learning.
Mike Heaton
OpenAI
Engineer and product builder who has worked on ChatGPT, Codex, the OpenAI API, and developer tools. He publishes writing and code on his personal site and GitHub.
Moyin Olorunnisola
Cohere
Machine learning researcher and engineer whose public work includes multimodal models, robotics, and reinforcement learning, including research at Cohere.
Peter West
Cohere
Peter West is a PhD candidate at the University of Washington studying aligned, grounded, and trustworthy language models. His work focuses on evaluation, human feedback, model behavior, and safer NLP systems.
Prajjwal Bhargava
Meta AI
Research scientist at FAIR Meta working on multimodal learning, image-text retrieval, and language technologies.
Pramod Kumbhare
NVIDIA
Member of technical staff at NVIDIA Research focused on language models, deep learning, and efficient training systems.
Ruize Gao
Alibaba Qwen
Research engineer at Alibaba Tongyi Lab focusing on pre-training, post-training, and efficient architecture for large language models.
Runji Wang
DeepSeek
Researcher at DeepSeek and incoming assistant professor at the University of Hong Kong specializing in language, vision, and speech.
Ruobing Xie
Z.ai
Researcher whose public work spans information retrieval, recommendation systems, and large language models.
Sarah Yoo
OpenAI
Researcher at OpenAI focused on large language models and embodied AI. Her public background includes AI and recommendation work at TikTok, computer vision and robotics research at Stanford, and an ML master's at Princeton.
Shivanshu Gupta
Ai2
Research scientist at the Allen Institute for AI working on large language models, multilingual NLP, machine reasoning, and model evaluation.
Shixiang Shane Gu
OpenAI
Research scientist at OpenAI.
Shusheng Yang
Alibaba Qwen
Shusheng Yang is a research scientist in Tongyi Lab at Alibaba Group. His work focuses on natural language processing and machine learning, especially pre-trained language models, text generation, and efficient NLP.
Taya Hartman
Cohere
Taya Hartman is a product and brand designer whose public portfolio spans digital, visual, and interactive work. Public sources tie her to Cohere as a senior designer, and she is credited as a contributor to the Command A technical report.
Thomas Degry
OpenAI
Thomas Degry is a design engineer at OpenAI based in San Francisco. Public profiles also describe prior work as a design engineer at Stripe and as tech lead for openai.com.
Tomi Pott
Meta AI
Tomi Pott is a research engineer at Meta Research whose public homepage highlights work in multi-agent reinforcement learning, deep learning, and AI systems.
Tongfei Chen
Cohere
Principal scientist at Cohere and adjunct professor at McGill University, with research spanning NLP, machine learning systems, speech, and biometrics.
Vitchyr H. Pong
OpenAI
Research scientist at OpenAI working on reinforcement learning and efficient machine learning; previously completed a PhD at UC Berkeley advised by Sergey Levine.
Xinyu Wang
Alibaba Qwen
Researcher in multimodal large language models at the Alibaba Qwen team and student at Johns Hopkins University.
Yaniv Markovski
OpenAI
Senior member of technical staff at OpenAI focused on machine learning systems and AGI. He previously co-founded DeepGPU and publishes public writing and code on his website and GitHub.
Yongjik Kim
OpenAI
Research scientist at OpenAI focused on applying AI to medicine and science; previously worked on AI for vision, language, healthcare, and robotics.
Yuyang Zhou
DeepSeek
Research intern at DeepSeek and a Tsinghua University undergraduate focused on machine learning systems and large language model inference.
Zhenru Zhang
Alibaba Qwen
Ph.D. student in Computer Science at Zhejiang University and researcher on the Qwen series at Alibaba Group.
Kyle Lo
Ai2
Kyle Lo's public homepage says he formerly served as a research scientist at the Allen Institute for AI, where he co-led OLMo. It describes current work on large-scale pretraining, data mixing, benchmark validity, and research-supporting data and software artifacts.
Yanping Huang
DeepSeek / Google Gemini
Yanping Huang's OpenReview profile says he has been an engineer at Google since 2015 and lists research interests in machine learning systems, machine translation, and image classification. Public report records also list him as a co-author on the DeepSeek-V3 technical report.
Jake Poznanski
Ai2
Jakub "Jake" Poznanski identifies himself on his personal site as a Carnegie Mellon computer science graduate and describes leading an OCR toolkit on GitHub.
Filipe de Avila Belbute Peres
OpenAI
Member of Technical Staff at OpenAI. His public homepage lists research and publications spanning GPT-4, differentiable physics, and scientific machine learning.
Michael Schmitz
Ai2
Michael Schmitz is a software engineer and manager in the Pacific Northwest. His personal site says his current position is Director of Engineering at the Allen Institute for AI.
Jared D. Kaplan
OpenAI
Jared D. Kaplan is the co-founder and chief science officer of Anthropic. Public biographies also describe his earlier work on GPT-3 and Codex at OpenAI and his background in theoretical physics.
Kevin Farhat
Ai2
Kevin Farhat's homepage says he is a research engineer at Ai2 working on language models, distributed AI, and privacy-preserving training. Public sources also list him as a coauthor of FlexOlmo.
Lucas Liebenwein
NVIDIA
Works on high-performance LLM inference and AutoDeploy at NVIDIA; previously led efficient-AI work at OmniML and earned graduate degrees at MIT CSAIL.
Jacob Devlin
Google Gemini
Jacob Devlin is a research scientist at Google DeepMind whose work focuses on language models, long-context modeling, and tool use.
Kim Malfacini
OpenAI
Public profiles list Kim Malfacini as head of product policy at OpenAI, with earlier work in public policy and trust and safety for AI systems.
Sam McCandlish
OpenAI
Anthropic co-founder and chief architect working on large-scale model training, research productivity, and reinforcement learning infrastructure.
Sebastian Nowozin
Google Gemini
Partner Research Manager at Microsoft Research Cambridge, with official Microsoft pages describing work in artificial intelligence, computer vision, and human language technologies.
Jason Baldridge
Google Gemini
Jason Baldridge is a Research Scientist at Google DeepMind working on natural language understanding, multimodal learning, and evaluation.
Ali Farhadi
Ai2
CEO of the Allen Institute for AI and professor of computer science at the University of Washington. His work spans computer vision, multimodal learning, reasoning, and embodied AI.
An Yang
Alibaba Qwen
Alibaba researcher working on large language models and multimodal pretraining; public research profiles connect An Yang to Qwen-related work and earlier study at Peking University.
Kai Dang
Alibaba Qwen
Researcher on Alibaba's Qwen team focused on large language models and NLP, with public research profiles listing a Nankai University background.
Zhifeng Chen
Google Gemini / Z.ai
Zhifeng Chen's public homepage describes him as a distinguished software engineer at Google Brain focused on large-scale computer systems and machine learning applications.
Timothée Lacroix
Meta AI / Mistral AI
Timothée Lacroix is a machine learning researcher and one of the founders of Mistral AI.
Jie Tang
OpenAI / Alibaba Qwen
Computer scientist and engineer credited on OpenAI's GPT-4 public contributions page; OpenAI's 2016 team update says he previously led Dropbox's core file sync team after earlier work in Pieter Abbeel's Berkeley robotics lab.
Koray Kavukcuoglu
Google Gemini
Chief Technology Officer at Google DeepMind, with work spanning machine learning and reinforcement learning.
Shuiping Yu
DeepSeek
Researcher at DeepSeek interested in efficient large language models and multimodal large language models.
Wen Liu
DeepSeek
Research scientist at DeepSeek.
Yifeng Lu
Google Gemini
Member of Technical Staff at Google DeepMind working on machine learning, natural language processing, and large language models.
Ziwei Xie
DeepSeek
ML researcher at DeepSeek.
Hongkun Yu
Google Gemini / Cohere
Hongkun Yu is a research scientist at Google Research. His official Google Research profile says his work lies at the intersection of machine learning, signal processing, and sequence modeling.
Pradeep Dasigi
Ai2
Pradeep Dasigi is a research scientist on the AllenNLP team at Ai2. His homepage says he has been actively involved in OLMo and Tulu and currently focuses on post-training language models.
Greg Brockman
OpenAI
OpenAI co-founder and president whose public work spans the OpenAI API, Codex, Gym, OpenAI Five, and broader large-scale model systems.
Raia Hadsell
Google Gemini
VP of Research at Google DeepMind working on robotics and embodied intelligence, with expertise in machine learning, reinforcement learning, neuroscience, and computer vision.
Rui Xin
Ai2
Rui Xin's public homepage identifies him as a PhD student at the University of Washington working on reinforcement learning, privacy-preserving machine learning, and memorization.
Sam Altman
OpenAI
Sam Altman is the CEO of OpenAI.
Shayne Longpre
Ai2
Shayne Longpre's public profiles identify him as a PhD candidate at MIT focused on data-centric AI, language models, and their societal impact.
Anna Makanju
OpenAI
Head of AI for Civil Society and Philanthropy at the OpenAI Foundation. An OpenAI update says she previously served as Vice President of Global Impact at OpenAI.
Hannah Wong
OpenAI
Former Chief Communications Officer at OpenAI. Public reporting says she joined OpenAI in 2021 after seven years at Apple and announced her departure in December 2025.
Mohammad Bavarian
OpenAI
Mohammad Bavarian is a researcher at OpenAI. His personal site states that he earned a PhD from MIT in 2017 and previously worked as a researcher at Rubrik.
Erhang Li
DeepSeek
DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.
Fangyun Lin
DeepSeek
DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.
Guangbo Hao
DeepSeek
DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.
Hanwei Xu
DeepSeek
DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.
Junjie Qiu
DeepSeek
DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.
Saumya Malik
Ai2
Saumya Malik is an incoming postdoctoral scholar at Johns Hopkins University and recently completed a PhD in the Machine Learning Department at Carnegie Mellon University. Her interests include interpretability, evaluation, reward learning, and language model agents.
Shanhuang Chen
DeepSeek
DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.
Zhe Fu
DeepSeek
DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.
Kexin Yang
Alibaba Qwen
Researcher at Qwen, Alibaba Group since 2021, with publications on large language model evaluation and pre-training data management.
Runji Lin
Alibaba Qwen
Machine learning researcher whose public profile highlights work across Berkeley AI Research, RISELab, Sky Computing, and the Ray team.
Faeze Brahman
Ai2
Research Scientist at Ai2 focused on natural language processing, reasoning, and grounded commonsense understanding.
Sandeep Subramanian
Mistral AI
Sandeep Subramanian is an Assistant Professor at MBZUAI whose research focuses on grounded language generation, multimodal learning, and large language models.
Yang Su
Alibaba Qwen
Research scientist at Alibaba DAMO Academy focusing on aligning and scaling superintelligence.
Chong Zhang
OpenAI
Member of Technical Staff at OpenAI working on machine learning, reinforcement learning, natural language processing, and large language models.
Girish Sastry
OpenAI
Research scientist at OpenAI interested in AI safety, alignment, governance, and technical safety.
Ilya Sutskever
OpenAI
Computer scientist and AI researcher who co-founded OpenAI and served as its chief scientist.
Jianxin Yang
Alibaba Qwen
Member of the Qwen Team at Alibaba Group and a Ph.D. candidate at the Institute of Automation, Chinese Academy of Sciences. His research interests include multimodal reasoning and agents.
Leyi Xia
DeepSeek
Research scientist at DeepSeek AI interested in language models, multimodal large language models, and deep generative models.
Nando de Freitas
Google Gemini
Vice president at Google DeepMind in London working on machine learning and reinforcement learning. His public profile notes work spanning policy-gradient methods, distributed and generative models, and robotics.
Xavier Martinet
Meta AI
Research scientist at Meta working on AI agents, language, and reinforcement learning.
Yuqiong Liu
Alibaba Qwen
Researcher at Alibaba Qwen; formerly a postdoctoral researcher at the University of Cambridge and a PhD student in NLP at the University of Edinburgh.
Akari Asai
Ai2
Akari Asai's public homepage identifies her as a research scientist working on OLMo at Ai2 for 2025-2026 and an incoming assistant professor at Carnegie Mellon University starting in fall 2026.
Joanne Jang
OpenAI
Leads OpenAI Labs, a research team prototyping new interfaces for people and AI. She studied computer science and applied math at Stanford.
Matt Gardner
Cohere
Senior research scientist at Ai2 on the AllenNLP team. His homepage says his work focuses on question answering, semantic parsing, and reasoning over open-domain text.
Nikolaus Binder
NVIDIA
Nikolaus Binder is a senior research scientist at NVIDIA whose public research profile focuses on quasi-Monte Carlo methods, photorealistic image synthesis, ray tracing, and rendering algorithms.
Zihan Qiu
Alibaba Qwen
Research scientist in Tongyi Lab whose official profile highlights multimodal large language models, efficient reasoning, and AI infrastructure.
Christopher Clark
Ai2
Christopher Clark is a researcher working on language models, efficient inference, and trustworthy NLP systems. His public profile highlights work at the intersection of NLP, efficiency, and model evaluation.
Douglas Eck
Google Gemini
Research director at Google working on music AI, multimodal generation, and human-AI interaction. He co-founded the Magenta project and has led widely used work on music generation with neural networks.
Phil Tillet
OpenAI
OpenAI researcher and software engineer known for creating Triton, an open-source GPU programming language, and co-authoring the GPT-4 Technical Report.
Qihao Zhu
DeepSeek
Research scientist focused on foundation models and multimodal large language models; his homepage notes earlier work at DeepSeek AI and current research at the University of Southern California.
Wenfeng Liang
DeepSeek
Wenfeng Liang, also known as Liang Wenfeng, is linked to DeepSeek technical reports in LLMpeople and is identified in public references as the founder and CEO of DeepSeek.
Fei Huang
Alibaba Qwen
Alibaba Qwen report author listed on Qwen, Qwen2.5, Qwen2.5-1M, Qwen3, Qwen3 Embedding, QwQ-32B, and Qwen-VL reports, with report-backed work on large language models, embeddings, reranking, and multimodal models.
Chong Ruan
DeepSeek
Researcher at DeepSeek and former master's student in EECS at Peking University. Public profiles list work on LLMs, multimodal systems, transformers, BERT, and machine translation.
Zeyu Cui
Alibaba Qwen
Zeyu Cui is listed as an author of the Qwen3 Technical Report.
Jie Zhou
DeepSeek / MiniMax
Public report authorship links Jie Zhou to the MiniMax report MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention.
Rui Men
Alibaba Qwen
Alibaba Qwen report author listed on Qwen, Qwen2.5, Qwen2.5-1M, Qwen3, and Qwen-Image reports, with report-backed work on large language models, long-context models, and image generation.
Yang Fan
Alibaba Qwen
Alibaba Qwen report author listed on Qwen, Qwen2.5, Qwen3, Qwen-VL, and Qwen-Image technical reports, with report-backed work on large language models, vision-language models, and image generation.
Zhenda Xie
DeepSeek
DeepSeek report author listed on DeepSeek LLM, DeepSeekMoE, DeepSeek-Coder, DeepSeek-V2, and DeepSeek-V3 reports, with report-backed work on large language models, mixture-of-experts systems, and code models.
Alexandre Sablayrolles
Google Gemini / Mistral AI
Alexandre Sablayrolles is listed as an author of Google's PaLM 2 Technical Report.
Amin Firooz
NVIDIA
Senior research scientist at NVIDIA focused on large language models, reinforcement learning, and inference-time scaling for AI agents. His public NVIDIA author page also notes prior work on robotic grasping, pose estimation, and language understanding.
Armand Joulin
Meta AI
Armand Joulin is listed as an author of the Meta AI technical report Llama 2: Open Foundation and Fine-Tuned Chat Models.
Brennan Saeta
Google Gemini
Public report authorship links Brennan Saeta to the Google report Gemma 2: Improving Open Language Models at a Practical Size.
Jason Wei
Google Gemini / OpenAI
Public report authorship links Jason Wei to the Gemma 3n Technical Report at Google.
Qinyu Chen
DeepSeek / Alibaba Qwen
Qinyu Chen is listed as an author of the DeepSeek-V3 Technical Report.
Xingkai Yu
DeepSeek
Xingkai Yu is a report-backed author in the LLMpeople atlas, connected through DeepSeek technical reports including DeepSeek-V3, DeepSeek-V2, DeepSeek LLM, and DeepSeekMoE; his public GitHub profile lists DeepSeek affiliation.
Xingzhang Ren
Alibaba Qwen
Researcher at Alibaba working on foundation models and agents for enterprise applications.
Donald W. McFadden
Google Gemini
Google Gemini report author listed on Gemini, Gemini 1.5, RecurrentGemma, and CodeGemma technical reports, with report-backed work on multimodal models, long-context models, efficient architectures, and code models.
Mei Li
Alibaba Qwen
Alibaba Qwen report author listed on Qwen, Qwen2.5, Qwen2.5-1M, and Qwen3 technical reports, with report-backed work on large language models, long-context modeling, and reasoning.
Wentao Zhang
DeepSeek
DeepSeek report author listed on DeepSeek LLM, DeepSeek-Coder, DeepSeek-V2, and DeepSeek-V3 technical reports, with report-backed work on code models and large language models.
Yipeng Wang
Z.ai
Z.ai report author listed on GLM-Z1, GLM-4.5, GLM-4.1V/4.5V, and GLM-5 materials, with report-backed work on reasoning, coding, agentic, and multimodal models.
Zihan Jiang
Z.ai
Z.ai report author listed on GLM-Z1, GLM-4.5, GLM-4.1V/4.5V, and GLM-5 materials, with report-backed work on reasoning, agentic, and multimodal models.
Clemens Winter
OpenAI / Meta AI
Clemens Winter writes publicly about reinforcement learning and machine learning systems, and official OpenAI materials credit him on GPT-4 and GPT-4.5 work. Accessible public sources do not clearly establish a current employer.
Xiaodong Deng
Alibaba Qwen
Research scientist in Tongyi Lab whose official profile highlights post-training and multimodal large language models.
Yang Yue
MiniMax / Moonshot AI
Researcher at Moonshot AI and co-author of the Kimi K2.5 report on visual agentic intelligence.
Yao Zhao
DeepSeek / Google Gemini
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Sharan Narang
Google Gemini / NVIDIA
Researcher at Meta whose public OpenReview profile lists previous research roles at Google and Baidu.
Tyler Romero
Ai2
Groundlight AI lists Tyler Romero as Lead ML Engineer. Public sources also list him as a coauthor of OLMo 3.
Aidan Gomez
Google Gemini
Co-founder and CEO of Cohere; previously a doctoral student at Oxford and a student researcher and intern at Google Brain.
Andrew Drozdov
Cohere
Researcher at Cohere working on machine learning, natural language processing, and computer vision.
Devendra Singh Chaplot
Mistral AI
Devendra Singh Chaplot is a research scientist at Mistral AI working on language models and multimodal AI.
Ingmar Kanitscheider
OpenAI
Research scientist at OpenAI with a background in theoretical physics, computational neuroscience, and theoretical AI.
Mikhail Pavlov
OpenAI
Mikhail Pavlov is publicly credited on OpenAI's GPT-4 contributions page as software correctness lead and on GPT-4V technical work as a scaling engineering lead.
Xuezhi Wang
Google Gemini
Xuezhi Wang is a Research Scientist at Google DeepMind whose public work focuses on reasoning and systematic generalization in language models, along with robustness and fairness in natural language processing. She received her PhD from Carnegie Mellon University in 2016.
Yejin Choi
Cerebras Systems
Professor at Stanford University and senior director at the Allen Institute for AI. Her research spans natural language processing, commonsense reasoning, and multimodal AI.
Yinger Zhang
Alibaba Qwen
Research scientist at Alibaba Qwen working on LLM post-training, RLHF, reasoning, and evaluation; incoming assistant professor at Renmin University of China.
Zijia Zhu
DeepSeek
Member of Technical Staff at DeepSeek focusing on large language models and reinforcement learning.
Abhinav Jauhri
Meta AI
Research scientist at Meta working on post-training and model personalization. He previously worked on retrieval systems, recommendations, and ranking, and earned his Ph.D. from Stanford University.
Aiesha Letman
Meta AI
Staff research scientist at Meta whose work focuses on democratizing natural language processing tools, especially for low-resource languages.
Alan Hickey
OpenAI
Member of the Cybersecurity Policy and Insights team at OpenAI covering the intersection of technology, international security, and geopolitical competition.
Alexander Wettig
Ai2
Alexander Wettig is a Princeton computer science PhD student advised by Danqi Chen. His homepage says he studies language models and their training data, previously interned at Ai2, and is currently working on training coding agents at Cursor.
Amanda Bertsch
Ai2
Amanda Bertsch is a Ph.D. student in the Language Technologies Institute at Carnegie Mellon University. Her research focuses on making language models faster and more memory efficient at test time and improving performance on very long contexts.
Benjamin Minixhofer
Ai2
Benjamin Minixhofer is a third-year PhD student at the University of Cambridge's Language Technology Lab. His homepage focuses on resource-efficient language models, modularity, and adaptive tokenization, and notes prior research experience at Ai2.
Damien Deville
OpenAI
AI researcher at OpenAI working on reinforcement learning and reasoning.
Fangzhou Su
Moonshot AI
Researcher at Moonshot AI focused on reinforcement learning and robotics. He earned a Ph.D. in robotics and machine learning from Carnegie Mellon University.
Janko Altenschmidt
OpenAI
OpenAI contributor with a personal homepage and GitHub profile covering NLP and language model research.
Jianxin Ma
Alibaba Qwen
Researcher at Alibaba DAMO Academy with a background in natural language processing, speech, and recommender systems.
Kevin Yu
OpenAI
Member of technical staff at OpenAI whose public profile notes a PhD in computational neuroscience from Princeton University.
Luke Metz
OpenAI
Researcher at OpenAI focused on improving the efficiency and reliability of AI systems.
Nima Pourdamghani
Amazon
Researcher whose public profiles include Amazon Science and USC ISI; his work spans dialogue systems, temporal reasoning, and document retrieval.
Red Avila
OpenAI
OpenAI contributor with a personal homepage and GitHub profile covering software and AI engineering projects.
Richard Ngo
OpenAI
Researcher and writer focused on safe and beneficial AI; listed as a contributor to GPT-4 by OpenAI.
Skye Wanderman-Milne
Google Gemini
Research scientist at Google working on multilingual generation, machine translation, and large language model systems. His public work spans translation quality, evaluation, and scalable language technology.
Stephanie C. Y. Chan
Cohere
Applied scientist at Cohere focused on natural language processing and machine learning.
Tingyu Xia
Alibaba Qwen
Researcher with public work on large language models. Official Qwen materials credit her on Qwen releases, and public profiles include DBLP and OpenReview.
Tong Mu
OpenAI
Member of technical staff at OpenAI. Previously pursued a PhD at Stanford University, focusing on practical reinforcement learning and bandits.
Vik Goel
OpenAI
Engineer at OpenAI working on the API; previously built products and customer experience systems at Stripe and Instacart.
Vishal Kuo
OpenAI
Vishal Kuo is a member of technical staff at OpenAI.
Xingchao Liu
DeepSeek
Member of technical staff at xAI; previously a research scientist at DeepSeek AI and a PhD student in Computer Science and Engineering at the University of Michigan.
Yuxi Xie
Moonshot AI
Ph.D. candidate in computer science at Princeton University working on LLM agents and machine learning. He is a coauthor of Moonshot AI's Kimi k1.5 report.
Zhipeng Zhou
Alibaba Qwen
Researcher with public work on large language models. Official Qwen materials credit him on Qwen releases, and public profiles include DBLP and OpenReview.
Zijun Liu
DeepSeek
Tsinghua University PhD student whose public OpenReview profile lists a 2024 DeepSeek-AI research internship and interests in scalable foundation models, multi-agent systems, and controllable AIGC.
Daniel Mossing
OpenAI
Daniel Mossing is a research scientist at OpenAI. His OpenReview profile also lists past affiliations with UC Berkeley and New York University.
Kai Xiao
OpenAI
Member of technical staff at OpenAI interested in empirical machine learning and exploration in reinforcement learning; previously a PhD candidate at MIT.
Aleksandar Botev
Google Gemini
Staff research scientist at Google DeepMind interested in deep learning theory and applications, with a focus on scaling and reinforcement learning.
Alex Paino
OpenAI
OpenAI team member and co-author of the GPT-4 Technical Report.
Aman Mathur
NVIDIA
Senior applied research scientist at NVIDIA focused on pretraining and post-training methods for large language models, multimodal learning, and synthetic data generation.
Carolyn Zhang
Meta AI
Carolyn Zhang is a researcher focused on large language models, natural language processing, and machine learning.
Cathy Jiao
Google Gemini
Research scientist at Google DeepMind.
Chenyu Zhang
DeepSeek
DeepSeek team member and co-author of the DeepSeek-V3 Technical Report.
Christopher Hesse
OpenAI
Research scientist and engineer working on large-scale machine learning. His personal site highlights contributions to GPT-3, GitHub Copilot, and Stable Diffusion.
Clément Gehring
Mistral AI
Clément Gehring is a research scientist at Mistral AI working on efficient, multilingual, and general-purpose language models. He is also known for influential early work on neural machine translation, including convolutional sequence-to-sequence models.
Deepak Ramachandran
Cohere
Member of technical staff at Cohere Labs, where he works on models for code, reasoning, and agents and contributes to post-training.
Dragomir Radev
NVIDIA
Dragomir Radev is an Eminent Professor of natural language processing at MBZUAI. His research spans NLP, information retrieval, question answering, and summarization, and he is also a coauthor of the Nemotron-4 15B technical report.
Henri Roussez
OpenAI
Research engineer at OpenAI.
Jake McNeil
OpenAI
OpenAI team member and co-author of the GPT-4 Technical Report.
Kestutis Patiejunas
NVIDIA
Senior research scientist at NVIDIA focused on large language models and reasoning systems. His public NVIDIA author page notes more than eight years of machine learning work spanning NLP, forecasting, recommender systems, and computer vision.
Kushal Kalra
NVIDIA
Senior research scientist at NVIDIA working on multimodal and embodied AI, including multilingual multimodal large language models and human-centered robot learning.
Laura Buesing
Google Gemini
Laura Buesing is a research scientist at Google DeepMind working on machine learning and computational neuroscience, with a focus on agents that remember, reason, and plan.
Manaal Faruqui
Cohere
Manaal Faruqui is an NLP and large language model researcher whose public profile covers factuality, instruction following, conversational systems, and language understanding. His work spans large language model post-training as well as earlier research on word representations, question quality, and dialogue systems.
Michael Lampe
OpenAI
Researcher at OpenAI and a co-author of the GPT-4 Technical Report.
Moein Khazraee
Google Gemini
Moein Khazraee is a postdoctoral researcher at LMU Munich and a visitor at Google DeepMind. His research focuses on reliable and trustworthy machine learning for language, knowledge, and reasoning.
Murali Emani
NVIDIA
Senior research scientist at NVIDIA focused on embodied agent AI, simulation and rendering, computer vision, robot learning, and AI in science.
Nima Rajani
Ai2
Nima Rajani is a research scientist at Ai2 whose work focuses on trustworthy, interpretable, and verifiable AI systems.
Prafulla Dhariwal
OpenAI
Research scientist at OpenAI focused on generative models and unsupervised learning. He studied computer science, mathematics, and physics at MIT and has contributed to models including GPT-3, Jukebox, and DALL·E.
Rapha Gontijo-Lopes
OpenAI
Researcher and engineer whose public homepage lists work on multimodal reasoning, robustness, and mechanistic interpretability; he was an OpenAI Resident and is credited on the GPT-4 technical report.
Ray Kurzweil
Google Gemini
Inventor, author, and futurist serving as Principal Researcher and AI Visionary at Google, known for long-running work on AI, language technology, and human-computer systems.
Shengli Hu
OpenAI
OpenAI team member and co-author of the GPT-4 Technical Report.
Siddharth Narayan
Meta AI
Siddharth Narayan is a researcher focused on natural language processing and AI, with applications that include healthcare.
Sumedh Palkar
NVIDIA
Applied research scientist at NVIDIA working on reinforcement learning for large language models, post-training, and data curation.
Tianhao Zheng
OpenAI
OpenAI team member and co-author of the GPT-4 Technical Report.
Wade Hickey
OpenAI
OpenAI team member and co-author of the GPT-4 Technical Report.
Yaron Lipman
Amazon
Research scientist at Amazon AGI and assistant professor at Yale School of Medicine whose work focuses on diffusion models, equivariant deep learning, and generative models for molecules and proteins.
Zhean Xu
DeepSeek
DeepSeek team member and co-author of the DeepSeek-V3 Technical Report.
Ziyi Gao
DeepSeek
DeepSeek team member and co-author of the DeepSeek-V3 Technical Report.
Pete Walsh
Ai2
OpenReview lists Evan Pete Walsh as a researcher at the Allen Institute for Artificial Intelligence, and recent public reports credit Pete Walsh on OLMo Hybrid and related open-model work.
Yu Wan
Alibaba Qwen
Yu Wan is publicly listed as an author on the Qwen2.5 and Qwen3 technical reports.
Huan Lin
Alibaba Qwen
Staff scientist at Tongyi Lab, Alibaba Group, and Qwen team member working on multimodal language models, AI agents, evaluation, reasoning, and information extraction.
Chloe Anastasiades
Ai2
Chloe Anastasiades is a software engineer at the Allen Institute for Artificial Intelligence and a co-author on Ai2 papers including OLMo 3 and Ai2 Scholar QA.
Zhifang Sui
Microsoft
Public Peking University faculty pages list Zhifang Sui as a professor in the Institute of Computational Linguistics with research interests in natural language processing and computational linguistics.
Alan Schelten
Meta AI
Research engineer at Meta with a verified OpenReview profile listing current affiliation at Facebook/Meta.
Ivo Danihelka
Google Gemini
Ivo Danihelka is a research scientist at Google DeepMind whose public profile highlights deep learning research and machine learning publications.
Tara N. Sainath
Google Gemini
Tara N. Sainath is a distinguished research scientist and director at Google DeepMind whose work focuses on speech recognition, spoken language understanding, and multimodal large language models.
Diego de las Casas
Mistral AI
Research scientist at Mistral AI working on machine reasoning and probabilistic deep learning.
Edouard Grave
Meta AI
Edouard Grave is a machine learning researcher whose public work spans representation learning, natural language processing, and large language models.
Jena D. Hwang
Ai2
Research scientist at the Allen Institute for AI (Ai2) whose work focuses on natural language understanding and commonsense reasoning.
Kang Guan
DeepSeek
DeepSeek team member focused on pretraining, post-training, and efficient scaling for large language models.
Yiliang Xiong
DeepSeek
Research scientist at DeepSeek AI. His homepage also describes him as a PhD candidate in computer sciences at the University of Wisconsin-Madison.
Chris Hallacy
OpenAI
Engineer at OpenAI working on inference infrastructure, product engineering, and prototyping for systems including GPT-4 and DALL-E 2.
Armand Joulin
Mistral AI
A 2024 dotAI speaker bio describes Armand Joulin as a Research Director at Google DeepMind working on Gemma, after leading FAIR EMEA at Meta.
Nawshad Farruque
Cohere
The University of Lethbridge Experts Database lists Nawshad Farruque in Maths and Computer Science, with research expertise in large-scale GIS database management, distributed spatial query processing and optimization, software engineering, and artificial intelligence.
Yingyan Celine Lin
NVIDIA
Official Georgia Tech and NVIDIA DLER pages list Yingyan Celine Lin as a Georgia Tech associate professor and a visiting professor collaborating with NVIDIA's deep learning research group.
Keming Lu
Alibaba Qwen
Keming Lu is a research scientist at Tongyi Lab, Alibaba Group in Beijing and a co-author of the Qwen and Qwen2.5 technical reports.
Pierre Stock
Mistral AI
AI Scientist & Observability Lead at Mistral AI.
Pradeep Dasigi
Ai2
Research scientist on the AllenNLP team at the Allen Institute for AI, focused on post-training language models.
Aäron van den Oord
Google Gemini
Aäron van den Oord is a Google DeepMind researcher known for generative and sequence-model research.
Aditya Ramesh
OpenAI
Researcher at OpenAI and co-creator of DALL-E who works on multimodal generative models and image generation.
Alek Andreev
Google Gemini
Research scientist at Google DeepMind working on multimodal and generative AI for healthcare.
Chris Bamford
Mistral AI
Chris Bamford is a researcher at Mistral AI working on foundation models and multimodal systems.
Eric Hambro
Meta AI
Eric Hambro is a machine learning researcher and entrepreneur whose prior work includes contributions at Meta AI and the LLaMA project.
Eric Sigler
OpenAI
Member of technical staff at OpenAI whose public profile notes leadership of the MT+MX team from 2020 to 2025 after earlier engineering work at Google.
John Schulman
OpenAI
Research scientist at OpenAI working on reinforcement learning and robotics, with a PhD from UC Berkeley.
Jonathan Bradley
NVIDIA
Senior research scientist at NVIDIA focused on language and vision models and applied multimodal AI.
Julian Schrittwieser
Google Gemini
Julian Schrittwieser is a Google DeepMind researcher known for reinforcement learning and game-playing systems.
Ling Chen
Z.ai
Z.ai researcher focused on multimodal large language models and computer vision, with interests in large-model training and post-training.
Matthieu Devin
Google Gemini
Research scientist at Google DeepMind based in Paris, focused on deep learning and computer vision.
Mingming Li
DeepSeek
DeepSeek team member focused on scaling large language models, multimodal models, and large-scale reinforcement learning.
Mohammad Ghassemi
Technology Innovation Institute
Mohammad Ghassemi is a distinguished researcher at TII, AI71, and MBZUAI in Abu Dhabi. His work spans machine learning, digital health, and trustworthy AI, and he previously held faculty roles at MIT and the University of Toronto.
Moutasem R. Alfarra
Technology Innovation Institute
Senior AI scientist at the Technology Innovation Institute and machine learning researcher at MBZUAI.
Ning Tian
DeepSeek
Developer and researcher with public GitHub activity on DeepSeek-AI repositories including DeepSeek-VL2, plus coauthor credit on the DeepSeek-V2 and DeepSeek-V3 technical reports.
Pang Wei Koh
Ai2
Assistant Professor at the University of Washington whose research focuses on machine learning, AI safety, and evaluation; he is a co-lead of the OLMo language model project.
Rohan Taori
NVIDIA
Researcher and engineer working on language models and reinforcement learning, with public work spanning Alpaca, emerging agents, and large-model evaluation.
Shengyi Huang
Ai2
Research scientist at Ai2 specializing in open language models, reasoning, and multi-agent systems.
Z. Z. Ren
DeepSeek
DeepSeek researcher. Public OpenReview and Hugging Face profiles identify Z. Z. Ren with DeepSeek and list work on theorem proving and synthetic training data for reasoning models.
Igor Babuschkin
OpenAI
Igor Babuschkin's homepage describes him as an AI researcher, builder, and investor, and highlights Babuschkin Ventures. Public reporting also describes him as a former DeepMind researcher and former OpenAI technical staff member.
Maksim Khadkevich
NVIDIA
NVIDIA's public author page identifies Maksim Khadkevich as a Senior Software Engineering Manager specializing in distributed inference systems and large language models. arXiv public sources also list him as a coauthor of Nemotron-Flash.
Ronan Le Bras
Ai2
Ai2's team directory lists Ronan Le Bras, and public report sources credit him as an author on open-language-model work.
Maxwell Horton
Apple
Co-author of the OpenELM paper with author affiliation shown as Apple on the arXiv HTML version.
Madeleine B. Thompson
OpenAI
Member of technical staff at OpenAI working on post-training. Previously worked on climate and sustainability software and on policy and product efforts related to the electric transition.
Sewon Min
Ai2
Assistant professor at the University of California, Berkeley and research scientist at the Allen Institute for AI. Her public homepage highlights work on language models, natural language processing, pretraining, post-training, and test-time computation.
Wenbiao Yin
Alibaba Qwen
Senior research scientist at Alibaba focused on large language models and agent learning.
Abhilasha Ravichander
Ai2
Assistant professor at Carnegie Mellon University and scientist at Ai2, working on natural language processing and machine learning with a focus on grounding, multilinguality, and robustness.
Abhishek Kadian
Meta AI
Machine learning engineer at FAIR at Meta AI and incoming PhD student at Carnegie Mellon University.
Abhishek Rao
Google Gemini
Abhishek Rao is a Research Scientist at Google DeepMind working on machine learning and natural language processing, with interests including structured prediction, reasoning, compositionality, and controllable generation. He has contributed to large language model research including PaLM.
Alex Maas
Salesforce AI Research
Senior research director at Salesforce AI Research and associate adjunct professor at UC Berkeley.
András Sárközy
Google Gemini
András Sárközy is a senior staff research scientist at Google DeepMind in London. His public profiles describe work on machine learning and natural language processing, with interests in code generation and reasoning in language models.
Ankur Harlalka
NVIDIA
Research scientist at NVIDIA working on large language model post-training and inference.
Anna-Luisa Brakman
OpenAI
OpenAI contributor with a personal website covering research and writing.
Ariel Herbert-Voss
OpenAI
Ariel Herbert-Voss is a computer scientist whose public homepage notes previous work on AI safety and special projects at OpenAI, along with later work on digital humans and universal basic income initiatives.
Aris Konstantinidis
OpenAI
Research engineer at OpenAI focused on reinforcement learning and human-computer interaction; previously co-founded the local-first AI startup Ted AI.
Ashley Pantuliano
OpenAI
Program manager at OpenAI working across finance, legal, operations, and communications, after earlier work in philanthropy, marketing, and project management.
Bianca Martin
OpenAI
Public profiles list Bianca Martin as chief of staff at OpenAI, with an Oxford background in philosophy, politics, and economics.
Carlos E. Jimenez
NVIDIA
Research scientist at NVIDIA with publications in machine learning and embodied AI.
Chengqiang Lu
Alibaba Qwen
Researcher with public work on large language models. Official Qwen materials credit him on Qwen releases, and public profiles include DBLP and OpenReview.
Chengqi Deng
DeepSeek
Researcher and engineer with public work on similarity search and large model systems. Public profiles list Zhejiang University, and DeepSeek open-source releases credit him on projects including DeepSeek-VL and DualPipe.
Chris Hesse
OpenAI
Research Scientist at OpenAI with public work in machine learning and generative models.
Colin Raffel
Google Gemini
Research scientist at Google DeepMind whose work focuses on machine learning, speech, and large language models.
Cullen O'Keefe
OpenAI
AI law and policy researcher and Director of Research at the Institute for Law & AI; previously held policy and legal roles at OpenAI.
Currey Garnier
Google Gemini
Google researcher working on machine learning, computer vision, and natural language, and a co-author of the PaLM 2 technical report.
Cyrus Hessel
Ai2
AI2 researcher working on multimodal language models, evaluations, instruction tuning, and preference alignment, and a co-author of Open Instruct.
Daniel Farrugia-Roberts
Google Gemini
AI researcher focused on epistemic safety for advanced AI systems, including uncertainty and out-of-distribution detection.
Dara Bahri
Cohere
Researcher at Cohere working on large language models. Her public homepage highlights interests in privacy-preserving machine learning, robust machine learning, natural language processing, and reinforcement learning.
Dave Cummings
OpenAI
Engineer at OpenAI who spoke publicly about fine-tuning GPT-4 at DevDay.
David R. So
Google Gemini
Research scientist at Google working on machine learning and language models. His public work includes efficient model architectures, language understanding, and large-scale training.
Dhruti Sreedhar
Databricks
Dhruti Sreedhar is a research scientist at Databricks whose interests include generative models, post-training, and long-context language models. Previously, she studied at MIT and IIT Madras.
Di He
01.AI
Di He is a machine learning researcher whose public homepage and Google Scholar profile highlight work on large language models, efficient training, and foundation models.
Dmitry Lepikhin
Google Gemini
Research scientist and engineer working on machine learning and distributed systems, with large-scale language model contributions including PaLM 2.
Ed Chi
Google Gemini
Vice President of Research at Google DeepMind. His work spans search, recommendation systems, human-computer interaction, and practical applications of large language models.
Elizabeth Tseng
OpenAI
Lead researcher at OpenAI whose official profile focuses on speech and audio.
Eric Hu
Cohere
Eric Hu is a designer and creative director whose public portfolio spans technology, fashion, music, and architecture. At Cohere he served as VP of Design and is credited as a contributor to the Command A technical report; his public work also includes earlier design leadership roles at Nike and SSENSE.
Fraser Green
Google Gemini
Research scientist at Google DeepMind working on language models and efficient sampling and serving.
Gaurav Kapadia
Meta AI
Research scientist at Meta working on large language models, large-scale reinforcement learning from human feedback, retrieval, recommendation systems, and machine learning systems.
Guillaume Ramé
Meta AI
AI researcher at Meta with public work on foundation models, optimization, and machine learning evaluation.
Gururaj Kasinathan
NVIDIA
Research scientist at NVIDIA working on language and vision models.
Hailey Schoelkopf
Ai2
Allen Institute for AI researcher working on NLP, machine learning, and AI, and a co-author of Open Instruct.
Haocheng Wang
DeepSeek
PhD student at KAUST and predoctoral researcher at DeepSeek focused on reasoning, alignment, post-training, and multimodal large language models.
Hyeonwoo Noh
OpenAI
Research scientist at OpenAI working on multimodal understanding, reasoning, and agents; formerly a Ph.D. student at Seoul National University.
Ian Sohl
OpenAI
Research engineer at OpenAI and contributor to GPT-4.
Ilia Kava
Databricks
Ilia Kava is a member of technical staff at Databricks who works on language models and machine learning systems. He contributed to Databricks' DBRX open large language model research.
Jacob Menick
OpenAI
Research scientist at OpenAI working on reinforcement learning and AI alignment. Previously a PhD student in computer science at Harvard University.
Jake Berdine
OpenAI
OpenAI contributor with a public GitHub profile covering engineering projects.
Jean-Marc Andreoli
Cohere
Professor emeritus of computer science at Jean Monnet University whose work includes natural language processing and large language model adaptation and evaluation.
Jesujoba Alabi
Ai2
Researcher in natural language processing, low-resource languages, machine translation, and responsible AI; publicly listed as a PhD candidate at UC Santa Barbara and a co-author of Tulu 2.
Jonathan Gordon
OpenAI
Works at OpenAI on reasoning and general intelligence, and co-authored the GPT-4 Technical Report.
Juan Felipe Cerón Uribe
OpenAI
Member of technical staff at OpenAI whose official profile highlights multimodal models and intelligence.
Julian Martin Eisenschlos
Ai2
Julian Martin Eisenschlos is a Research Scientist at Ai2. His work focuses on natural language processing, language models, and instruction tuning, including contributions to the Tulu 2 project.
Jyrki Alakuijala
Google Gemini
Research scientist at Google DeepMind working on data compression, efficient language models, and multilingual modeling.
Karina Nguyen
Amazon
Research scientist at Amazon AGI and former Stanford University PhD student.
Karthik Prasad
Meta AI
Staff research scientist at Meta whose work focuses on multilingual and multimodal AI, reasoning, and safety.
Kateryna Gashteovski
Google Gemini
Research scientist at Google DeepMind working on natural language understanding and machine learning.
Kensen Shi
Google Gemini
Research scientist at Google DeepMind working on machine learning and natural language processing.
Kewei Tu
Cohere
Associate professor at ShanghaiTech University working on natural language processing and machine learning.
Khalid Salama
Google Gemini
Professor of Electrical and Computer Engineering at KAUST whose research focuses on machine learning and AI, including edge intelligence, tinyML, efficient AI, and uncertainty-aware learning.
Kyla Sheppard
OpenAI
Program manager at OpenAI leading contracting, revenue, pricing, and finance operations, after earlier operations and finance roles at Tesla and Capchase.
Lenny Bogdonoff
OpenAI
OpenAI contributor with a personal website and public engineering projects.
Maarten Bosma
Google Gemini
Research scientist at Google DeepMind whose work focuses on pre-training, post-training, and reasoning for large language models.
Maneesh Bhadauria
NVIDIA
Research scientist at NVIDIA Research working on computer vision, machine learning, and deep learning, and a co-author of the Nemotron-4 340B technical report.
Mao Zheng
DeepSeek
Co-founder of DeepSeek who leads pre-training and post-training research, with a focus on language models, reasoning models, and reinforcement learning.
Marie-Therese C. Mercier
Mistral AI
Research scientist at Mistral AI with a public publication record on Google Scholar and DBLP.
Mark Sandler
Google Gemini
Research scientist at Google DeepMind focused on multimodal and embodied AI. His work covers computer vision, video understanding, and machine learning for interactive systems.
Marvin Zhang
OpenAI
OpenAI contributor to GPT-4 evaluation and analysis, including Evals open-sourcing, instruction-following evaluation, and API model evaluations.
Matej Balog
Google Gemini
Senior staff research scientist at Google DeepMind and assistant professor at the Czech Technical University in Prague.
Mehdi Rezagholizadeh
NVIDIA
Senior applied scientist at NVIDIA focusing on generative and agentic AI, language models, information extraction, multimodality, and reasoning.
Merry Mou
Ai2
Merry Mou is a researcher at Ai2 whose public homepage and Google Scholar profile highlight work on language model post-training, evaluation, and open-source foundation models.
Michael Mitzenmacher
NVIDIA
Harvard computer scientist and professor whose research spans algorithms, probability, information theory, and data streams.
Miles Brundage
OpenAI
Senior research fellow at the Institute for AI Policy and Strategy and former OpenAI research scientist focused on AI governance and policy.
Mira Murati
OpenAI
Founder and CEO of Thinking Machines Lab and former CTO of OpenAI.
Mustafa El-Kishky
Google Gemini
Research scientist at Google DeepMind working on natural language processing, speech, and multimodal machine learning.
Natalie Staudacher
OpenAI
Researcher at OpenAI working across multimodal systems, reasoning, and new product experiences, with earlier roles at Google and YouTube.
Nick Turley
OpenAI
Head of product for ChatGPT at OpenAI. He works on ChatGPT and related product experiences, including agent-style workflows and consumer AI tools.
Noah Deutsch
OpenAI
Research scientist at OpenAI.
Ofer Arazi
Cohere
Machine learning researcher at Cohere whose work focuses on large language models and post-training.
Olivier Delalleau
Cohere
Research scientist at Cohere whose work focuses on deep learning and large language models.
Patricio Lopez Macedo
Google Gemini
Research scientist at Google DeepMind focused on learning with reward feedback and reasoning in large language models.
Paul Michel
Cohere
Research scientist working on large language models, scaling, alignment, and language; public work includes machine translation robustness, data selection, CodeGemma, and reward overoptimization in LLMs.
Pavan Balaji
NVIDIA
Pavan Balaji is a computer scientist and research leader in high-performance computing. His public profiles include work at NVIDIA and leadership of the Argonne Leadership Computing Facility, with research spanning parallel programming models, communication libraries, and runtime systems.
Prasad Sattigeri
Snowflake
Prasad Sattigeri is a machine learning engineer at Snowflake working on agentic AI systems and machine learning.
Pratyusha Kamesetty
NVIDIA
Pratyusha Kamesetty is a deep learning research scientist at NVIDIA and a PhD candidate in computer science at Stanford University. Her research focuses on language models, recommendation systems, and generative AI for music and video.
Preston Tuggle
OpenAI
Preston Tuggle is a distributed systems engineer at OpenAI.
Rajarshi Das
NVIDIA
Research scientist at NVIDIA.
Reiichiro Nakano
OpenAI
Technical staff member at OpenAI. Previously completed a PhD at Stanford University and was a research fellow at the Simons Institute for the Theory of Computing.
Rishiraj Adhikary
NVIDIA
Rishiraj Adhikary is a senior applied research scientist at NVIDIA working on AI systems and generative AI.
Roger Waleffe
NVIDIA
Research scientist at NVIDIA.
Rory Carmichael
OpenAI
DPhil candidate in Engineering Science at the University of Oxford whose work spans cybersecurity, machine learning, and natural language semantics.
Ross Alberger
Meta AI
Research scientist at Meta working on language models and machine translation.
Ruisong Zhang
DeepSeek
Researcher at DeepSeek focused on multimodal models, AI agents, and efficient large language models; previously worked on large-scale recommender systems at Tencent and Kuaishou.
Sadanand Rajaram
Cohere
Research scientist at Cohere working on multimodal and embodied AI, robotics, and machine learning systems.
Sameer Singh
Snowflake
Associate professor at UC Irvine and research director at Snowflake AI Research.
Saurabh Kumar
Cohere
Research scientist at Cohere whose homepage highlights work on efficient and reliable natural language processing, following earlier research engineering work at Google.
Shawn Henry
NVIDIA
Principal applied scientist at NVIDIA whose public work focuses on applied machine learning and large language models.
Shibani Santurkar
OpenAI
Machine learning researcher focused on reliable ML systems. She was a postdoc in Computer Science at Stanford and earned her PhD from MIT.
Shino Jomoto
OpenAI
OpenAI contributor with an official OpenAI profile page; public contribution pages list work on GPT-4, GPT-4o, and DALL-E 2.
Sho Takase
Cohere
Research scientist at Cohere whose work focuses on natural language processing and language models.
Shruti Bhosale
Meta AI
Research engineer at Meta working on Llama models and generative AI. She previously completed an MS in computer science at New York University.
Shuyan Zhou
Ai2
Research scientist at the Allen Institute for AI focused on training, evaluating, and interpreting language models, with work spanning NLP and computational social science.
Simón Posada Fishman
OpenAI
OpenAI member building agentic products; previously founded the AI note-taking app Circleback and writes about AI, product, and startups.
Sina Samangooei
Google Gemini
Sina Samangooei is a machine learning researcher whose public work includes multimodal learning and open language models.
Sudharsan Kodikala
NVIDIA
Machine learning researcher at NVIDIA focused on large language models and reinforcement learning.
Sully Chen
OpenAI
Engineer and builder who co-founded comma.ai and previously led self-driving at OpenAI, with public work spanning autonomous driving, AGI, and music.
Toby Jia-Jun Li
Google Gemini
Assistant professor at Cornell Tech whose research focuses on human-AI interaction, AI copilots, and tools that help people learn, create, and make decisions with AI.
Trieu Trinh
Google Gemini
Mathematician and machine learning researcher whose NYU profile describes him as a PhD student in mathematics with advising at NYU Shanghai and as a research intern at Google DeepMind; he also co-authored AlphaGeometry and the Gemma report.
Tyler Scialom
Ai2
Research scientist at Ai2 working on personalized language models, instruction tuning, and reinforcement learning from human feedback.
Vedaant Madan
Cohere
Researcher at Cohere.
Venkatesh Akella
NVIDIA
Venkatesh Akella is a professor of electrical and computer engineering at the University of California, Davis and director of the Center for AI and Experimental Futures. His research includes machine learning, trustworthy AI, causal AI, computer vision, robotics, and intelligent systems.
Virginia Hofmann
Ai2
Research scientist at Ai2 and PhD student at UC Berkeley working on language models, multilingual NLP, and evaluation.
Wendi Gu
Google Gemini
Research scientist at Google DeepMind and incoming assistant professor at Stanford GSB working on AI and economics.
Wenjie Hu
DeepSeek
Developer and researcher with public GitHub activity on DeepSeek-AI repositories including DeepSeek-R1 and DeepSeek-VL2, plus coauthor credit on the DeepSeek LLM Technical Report.
Xipeng Qiu
Meta AI
Xipeng Qiu is a Fudan University professor whose public work focuses on natural language processing and large language models.
Yusuke Iwasawa
Cohere
Yusuke Iwasawa is an Assistant Professor in Computer Science at the University of Tokyo whose work spans human-computer interaction and machine learning.
Zenan Zhou
Baichuan
Researcher at Baichuan focused on large language models and multimodal systems. Public publication profiles include work on Baichuan 2, Baichuan-M1, and Baichuan-M2.
Zhicheng Ma
DeepSeek
Research scientist at DeepSeek and a Ph.D. student in the School of Software at Tsinghua University. His research interests include large language models and multimodal learning.
Tony Gracious
Ai2
Tony Gracious completed his PhD in the Department of Computer Science and Automation at IISc Bangalore. His work includes representation learning, temporal point processes, and higher-order interaction forecasting, and he later joined Dolby's Advanced Technology Group in Bangalore.
Jinliang Guo
NVIDIA
Jinliang Guo is listed as an author of the NVIDIA technical report Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models.
Pengxiang Cheng
Z.ai
Public report authorship links Pengxiang Cheng to the Z.ai report GLM-Z1-Rumination: An Open Frontier-Class Reasoning Model Through Test-Time Scaling.
Qingyang Ge
MiniMax
Public report authorship links Qingyang Ge to the MiniMax report MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention.
Rudra Murthy
NVIDIA
Rudra Murthy is listed as an author of the NVIDIA technical report Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models.
Wenhui Wang
Microsoft
Co-author of "The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits"; the paper's author notes list Wenhui Wang with Microsoft Research.
Xi Chen
Z.ai
Xi Chen is listed as an author of the Z.ai technical report GLM-4.1V-Thinking and GLM-4.5V: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning.
Xinhua Cheng
Z.ai
Public report authorship links Xinhua Cheng to the Z.ai report GLM-Z1-Rumination: An Open Frontier-Class Reasoning Model Through Test-Time Scaling.
Xuancheng Ren
Alibaba Qwen
Xuancheng Ren is listed as an author of the Qwen3 Technical Report.
Yizhe Yang
Z.ai
Public report authorship links Yizhe Yang to the Z.ai report GLM-Z1-Rumination: An Open Frontier-Class Reasoning Model Through Test-Time Scaling.
Zhongzheng Ren
Z.ai
Public report authorship links Zhongzheng Ren to the Z.ai report GLM-Z1-Rumination: An Open Frontier-Class Reasoning Model Through Test-Time Scaling.
Jeff Wu
OpenAI
Researcher at OpenAI working on language model training and evaluation, and co-author of the GPT-4 Technical Report.
Beichen Zhang
Alibaba Qwen
Research scientist in Tongyi Lab whose official profile highlights post-training, efficient reasoning, and coding large language models.
Chang Zhou
Alibaba Qwen
Qwen researcher and co-lead whose work focuses on pretraining and post-training, multimodal models, agent systems, and large-scale model infrastructure.
Ruoyu Zhang
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Shangyan Zhou
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Shijie Wang
Alibaba Qwen
Senior research scientist in Tongyi Lab whose official profile highlights post-training, AI for science, evaluation and alignment, multimodal reasoning, and large language model reasoning.
Shirong Ma
DeepSeek
Researcher at DeepSeek-AI in Hangzhou, China and co-author of the Nature paper introducing DeepSeek-R1.
Shiyu Wang
DeepSeek
DeepSeek contributor and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Shunfeng Zhou
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Tian Pei
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Ting Song
Microsoft
Ting Song is listed as an author of the BitNet b1.58 2B4T Technical Report; the report states that T. Song is with Microsoft Research.
Wenjun Gao
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Xiaodong Liu
DeepSeek
DeepSeek contributor and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Xiaotao Nie
DeepSeek
DeepSeek contributor and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Xin Xie
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Yanhong Xu
DeepSeek
DeepSeek contributor and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Yan Xia
Microsoft
Co-author of the BitNet b1.58 2B4T Technical Report; the report states Yan Xia is with Microsoft Research.
Yaofeng Sun
DeepSeek
Researcher at DeepSeek-AI in Hangzhou, China and co-author of the Nature paper introducing DeepSeek-R1.
Yaohui Wang
DeepSeek
DeepSeek contributor and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Yao Li
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Yichao Zhang
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Yichi Zhang
Moonshot AI / Z.ai
Researcher at Moonshot AI and co-author of the Kimi K2.5 report on visual agentic intelligence.
Ying He
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Yiyuan Liu
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Yuheng Zou
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Yuxuan Hu
Z.ai
Co-author of GLM-4.1V-Thinking and GLM-4.5V, multimodal reasoning models trained with scalable reinforcement learning.
Zhangli Sha
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Juston Forte
OpenAI
Lead Solutions Architect at OpenAI, based in New York City, according to an official OpenAI Forum speaker page.
Jilong Xue
Microsoft
Co-author of "The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits"; the paper states J. Xue is with Microsoft Research.
Ali Payani
Google Gemini
Cisco Research researcher working in artificial intelligence and machine learning; current public evidence does not support the prior Google DeepMind profile.
Amanpreet Singh
Ai2
Lead of OLMo, AI2's open-source frontier language models, and Associate Director of the OLMo team at the Allen Institute for AI.
Sherwin Wu
OpenAI
Software engineer publicly identified on OpenAI's GPT-4 contributions page and in a 2023 QCon speaker bio as a Member of Technical Staff on OpenAI's Developer Platform team.
Soumya Batra
Meta AI
Soumya Batra is a PhD student in computer science at Stanford University and a former research intern at FAIR (Facebook AI Research).
Ali Payani
NVIDIA
Public report authorship links Ali Payani to the Ai2 report OLMoE: Open Mixture-of-Experts Language Models.
Dahua Lin
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Dirk Groeneveld
Ai2
Public sources identify Dirk Groeneveld as an OLMo 3 coauthor; a 2022 USC ISI speaker bio described him as a principal engineer at AI2 and technical lead for the AllenNLP Platform team.
Jiaqi Wang
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Jin Chen
DeepSeek
Jin Chen is listed as an author of the DeepSeek-V3 Technical Report.
Yu Qiao
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Andrew M. Dai
Google Gemini
Research scientist at Google DeepMind in Mountain View working on machine learning, reinforcement learning, and robotics.
Aurelien Rodriguez
Meta AI
Research scientist at Meta GenAI in Menlo Park working on foundation models, language generation, and evaluation.
Bingxuan Wang
DeepSeek
Research scientist at DeepSeek with public scholarly work on large language models and reasoning.
Demis Hassabis
Google Gemini
Founder and CEO of Google DeepMind, leading AI research and product development; his work spans AI, neuroscience, game playing, and structural biology.
Guowei Li
DeepSeek
Research scientist at DeepSeek with a public Google Scholar profile covering large language models and efficient inference.
Honghui Ding
DeepSeek
Research scientist at DeepSeek with public GitHub work on reinforcement learning from human feedback and language models.
Hui Qu
DeepSeek
Research scientist at DeepSeek with public GitHub work on large language models and AI agents.
Jacob Morrison
Ai2
Jacob Morrison is a researcher whose work spans language model post-training, alignment, and evaluation. His public research page highlights projects including Tulu 2, Tulu 3, OLMo 2, and RewardBench.
Jianzhong Guo
DeepSeek
Research scientist at DeepSeek AI working on multimodal large language models and efficient multimodal alignment. Previously completed a PhD at Nanjing University focused on machine learning and computer vision.
Jinyuan Jia
MiniMax
Researcher working on speech and multimodal language models, including MiniMax-Speech and related speech understanding work.
Kaige Gao
DeepSeek
Research scientist at DeepSeek with public GitHub work spanning AI systems.
Lecong Zhang
DeepSeek
Research scientist at DeepSeek with public GitHub projects on AI systems.
Lucile Saulnier
Mistral AI
AI scientist at Mistral AI and PhD student at ENS Paris and CNRS. Her research focuses on language grounding, multimodality, and evaluation for multimodal and language models.
Ming Ding
MiniMax
Lead of foundation models at MiniMax working on large language models, multimodal pretraining, and efficient training systems. He completed a PhD in computer science at Tsinghua University.
Minghua Zhang
DeepSeek
Research scientist at DeepSeek with public GitHub work on language models and AI systems.
Orhan Firat
Google Gemini
Research scientist at Google Research whose public work spans multilingual and large-scale language modeling; arXiv author results include the PaLM paper.
Oriol Vinyals
Google Gemini
Chief Scientist at Google DeepMind and Vice President of Research leading Gemini, with work spanning scalable sequence learning, large language models, games, and robotics.
Oyvind Tafjord
Ai2
Senior research scientist at Ai2 and affiliate associate professor at the University of Washington, with work spanning NLP, machine reading, and reasoning.
Qiushi Du
DeepSeek
Researcher at DeepSeek AI interested in large language models and multimodal language models.
Rohan Anil
Google Gemini
Rohan Anil is a research scientist at Google DeepMind. His public homepage highlights work on large language models, efficient machine learning systems, and multimodal AI.
Ruiqi Ge
DeepSeek
Researcher affiliated with DeepSeek-AI on Google Scholar and coauthor of the DeepSeek technical reports.
Teven Le Scao
Mistral AI
Research scientist at Mistral AI and co-author of the Mistral 7B report.
Xuecheng Su
DeepSeek
Researcher and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.
Yishi Piao
DeepSeek
Researcher and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.
Yossi Matias
Google Gemini
Vice President of Engineering and Research at Google and site lead for the Google Center in Israel; he also leads Search, Research, and AI for Crisis Response.
Yukun Li
DeepSeek
DeepSeek researcher and coauthor of the DeepSeek LLM, DeepSeek-V2, and DeepSeek-V3 technical reports.
Yusheng Zhao
MiniMax
Research scientist at MiniMax AI Research focused on reinforcement learning, reasoning, multimodal learning, large language models, and large-scale distributed systems. He received a PhD in machine learning from Carnegie Mellon University.
Zehui Ren
DeepSeek
Research scientist at DeepSeek AI and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.
Baobao Chang
Z.ai
Public report authorship links Baobao Chang to the Z.ai report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Chaomin Shen
Z.ai
Chaomin Shen is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Dakun Zhang
Z.ai
Dakun Zhang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Dehao Chen
Google Gemini
Public report authorship links Dehao Chen to the Google report PaLM: Scaling Language Modeling with Pathways.
Dejiang Dou
MiniMax
Public report authorship links Dejiang Dou to the MiniMax report MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention.
Fangrui Zhang
Z.ai
Fangrui Zhang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Gianna Lengyel
Mistral AI
Gianna Lengyel is listed as an author of the Mistral AI technical report Mistral 7B.
Hang Yan
Z.ai
Public report authorship links Hang Yan to the Z.ai report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Haolin Jia
MiniMax
Public report authorship links Haolin Jia to the MiniMax report MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention.
Haoqi Chen
Z.ai
Public report authorship links Haoqi Chen to the GLM-4.5 report at Z.ai.
Hyeonman Lee
LG AI Research
Researcher at LG AI Research and coauthor of the report EXAONE 3.5: Series of Language Models for Real-world Use Cases.
H. Zhang
DeepSeek
H. Zhang is listed as an author of the DeepSeek-V3 Technical Report.
Jamie Kiros
OpenAI / NVIDIA
Public report authorship links Jamie Kiros to the Nemotron-4 340B Technical Report at NVIDIA.
Jiale Zhi
MiniMax
Public report authorship links Jiale Zhi to the MiniMax report MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention.
Jialiang Wang
Z.ai
Jialiang Wang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Jianfeng Gao
NVIDIA
Public report authorship links Jianfeng Gao to the Nemotron-4 15B Technical Report at NVIDIA.
Jiangnan Li
Z.ai
Public report authorship links Jiangnan Li to the GLM-4.5 report at Z.ai.
Jian Ma
MiniMax
Public report authorship links Jian Ma to the MiniMax report MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention.
Jian Song
Z.ai
Public report authorship links Jian Song to the Z.ai report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Jiaqi Gao
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Jinhui Yang
Z.ai
Jinhui Yang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Jong C. Park
LG AI Research
Researcher at LG AI Research and coauthor of the report EXAONE 3.5: Series of Language Models for Real-world Use Cases.
Kaixin Yang
MiniMax
Public report authorship links Kaixin Yang to the MiniMax report MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention.
Li Dong
Z.ai
Public report authorship links Li Dong to the GLM-4.5 report at Z.ai.
Linjie Zhao
Z.ai
Linjie Zhao is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Mahyar Babaei
NVIDIA / Microsoft
Mahyar Babaei is listed as an author of Microsoft's Phi-4 Technical Report.
Mateusz Litwin
OpenAI
Mateusz Litwin is listed as an author of the OpenAI technical report Language Models are Few-Shot Learners.
Ming Ding
Z.ai
Ming Ding is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Mingpeng Zhou
Z.ai
Public report authorship links Mingpeng Zhou to the Z.ai report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Mingxing Zhang
Google Gemini
Public report authorship links Mingxing Zhang to the Gemma 3n Technical Report at Google.
Mingze Li
Alibaba Qwen / Meta AI
Mingze Li is listed as an author of the Qwen3 Technical Report.
Mostafijur Rahman
LG AI Research
Researcher at LG AI Research and coauthor of the report EXAONE 3.5: Series of Language Models for Real-world Use Cases.
M. Saiful Bari
LG AI Research
Researcher at LG AI Research and coauthor of the report EXAONE 3.5: Series of Language Models for Real-world Use Cases.
M. Waqar Azim
LG AI Research
Researcher at LG AI Research and coauthor of the report EXAONE 3.5: Series of Language Models for Real-world Use Cases.
Nhan Tran
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the report Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Niket Tandon
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.
Peiyi Wang
MiniMax
Public report authorship links Peiyi Wang to the MiniMax report MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention.
Prasad Reddy Yadati
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.
Qianlin Peng
MiniMax
Public report authorship links Qianlin Peng to the MiniMax report MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention.
Qiao Zhang
Google Gemini / Moonshot AI
Qiao Zhang is listed as an author of Google's PaLM 2 Technical Report.
Qinyuan Wang
Z.ai
Qinyuan Wang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Rong Wu
Z.ai
Public report authorship links Rong Wu to the GLM-4.5 report at Z.ai.
Sanghyun Choi
LG AI Research
Researcher at LG AI Research and coauthor of the EXAONE 3.5: Series of Language Models for Real-world Use Cases.
Sebastian Goodman
Google Gemini
Public report authorship links Sebastian Goodman to the Gemma 3n Technical Report at Google.
Shuai Wang
Z.ai
Public report authorship links Shuai Wang to the GLM-4.5: Agentic, Reasoning, and Coding Foundation Models at Z.ai.
Siyao Huang
Z.ai
Siyao Huang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Siyuan Fan
MiniMax
Public report authorship links Siyuan Fan to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.
Su Wang
Google Gemini
Public report authorship links Su Wang to the Gemma 3n Technical Report at Google.
Tianchu Gao
Z.ai
Public report authorship links Tianchu Gao to the GLM-4.5 report at Z.ai.
Trevor Cai
Google Gemini / OpenAI
Trevor Cai is listed as an author of the Google technical report RecurrentGemma: Moving Past Transformers for Efficient Open Language Models.
T. Wang
DeepSeek
T. Wang is listed as an author of the DeepSeek technical report DeepSeek-V3 Technical Report.
Wenbo Bi
MiniMax
Public report authorship links Wenbo Bi to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.
Xiang Li
MiniMax
Xiang Li is listed as an author of the MiniMax technical report MiniMax-01: Scaling Foundation Models with Lightning Attention.
Xiao Liu
MiniMax
Public report authorship links Xiao Liu to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.
Xiaowei Jiang
Z.ai
Xiaowei Jiang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Xiaoxin Li
Z.ai
Public report authorship links Xiaoxin Li to the GLM-4.5 report at Z.ai.
Xiaoyi Wang
MiniMax
Public report authorship links Xiaoyi Wang to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.
Xibin Wu
MiniMax
Public report authorship links Xibin Wu to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.
Xin Fu
Z.ai
Xin Fu is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Xuanhao Ge
Z.ai
Xuanhao Ge is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Xudong Xie
Z.ai
Xudong Xie is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Yafei Wen
MiniMax
Yafei Wen is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.
Yali Wang
MiniMax
Public report authorship links Yali Wang to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.
Yang Jiang
Z.ai
Yang Jiang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Yibo Tian
Z.ai
Yibo Tian is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Yifei Lu
MiniMax
Public report authorship links Yifei Lu to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.
Yijia Shao
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.
Yilong Chen
Z.ai
Yilong Chen is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Yingjie Li
MiniMax
Public report authorship links Yingjie Li to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.
Yinlong Xu
Z.ai
Yinlong Xu is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Yi Wang
Z.ai
Yi Wang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Yong Li
Z.ai
Yong Li is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Youngjae Yu
LG AI Research
Researcher at LG AI Research and coauthor of the EXAONE 3.5: Series of Language Models for Real-world Use Cases.
Yuhan Li
Z.ai
Yuhan Li is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Yu Liu
MiniMax
Public report authorship links Yu Liu to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.
Yuxiang Zhao
Z.ai
Yuxiang Zhao is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Yuxuan Wang
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Yuzeng Li
MiniMax
Public report authorship links Yuzeng Li to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.
Yu Zhou
MiniMax
Public report authorship links Yu Zhou to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.
Zehui Wu
Z.ai
Zehui Wu is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Zewen Chi
Microsoft
Zewen Chi is listed as a co-author of the 2026 arXiv paper "Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity," with the first listed affiliation shown as Microsoft Research.
Zhang Zhang
Z.ai
Public report authorship links Zhang Zhang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Zhanhui Kang
Z.ai
Public report authorship links Zhanhui Kang to the GLM-4.5 report at Z.ai.
Zhengduo Zhuo
Z.ai
Zhengduo Zhuo is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Zhe Wang
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Zhe Wang
Z.ai
Zhe Wang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Zhili Liang
Z.ai
Zhili Liang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Zitong Zhang
Z.ai
Zitong Zhang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.
Johannes Heidecke
OpenAI
Head of Safety Systems at OpenAI.
Abhinav Dubey
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Abhinav Kadian
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Abhishek Jauhri
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Abhishek Pandey
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Aditya Nori
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.
Aimin Zhou
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Aiyuan Yang
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Aleksander Ficek
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.
Alexander Lavin
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the BTLM-3B-8K: 7B Performance in a 3B Model.
Alok S. Shah
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Amaar Arora
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Aming Wu
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Andrew W. Lochman
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Anjali Narayan
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.
Ankit Z. Parvez
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Ankur Dave
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.
Apoorv Prakash Saxena
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Arash Mahdavi
Snowflake
Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.
Arjun Ghukasyan
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Armen Aghajanyan
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Arun Jindal
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the BTLM-3B-8K: 7B Performance in a 3B Model.
Ashley Li
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Atri Sharma
Snowflake
Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.
Ayman Bhagia
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Beichen Zhang
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Bingning Wang
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Bin Xiao
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Bokun Wang
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Borong Zhang
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Carlo Strapparava
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the BTLM-3B-8K: 7B Performance in a 3B Model.
Ce Bian
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Chao Yin
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Charlie Wen
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Cheng Luo
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Chenwei Wang
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Chenxu Lv
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Chenxu Lv
Alibaba Qwen
Research scientist in Tongyi Lab whose official profile highlights post-training, efficient reasoning, and multilingual large language models.
Christian R. Shelton
Snowflake
Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.
Christopher J. Hoyt
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Chuanqi Zhao
Z.ai
Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.
Cliff Young
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Daniel Whiteson
Snowflake
Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.
Da Pan
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Dian Wang
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Divya C. Mohan
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Dong Yan
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Douglas Z. Oehm
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Duc Quang Nguyen
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Ernie Chang
Snowflake
Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.
Fan Yang
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Fan Yang
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Fan Zhou
Alibaba Qwen
Research scientist in Tongyi Lab whose official profile highlights post-training, efficient reasoning, and large multimodal models.
Fei Deng
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Fei Tian
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Feng Liu
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Feng Wang
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Filippos Kokkinos
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Fuli Luo
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Gargi Sawarkar
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Guangbo Hao
01.AI
Researcher at 01.AI and coauthor of the Yi: Open Foundation Models by 01.AI.
Guangwei Ai
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Guanhua Chen
Z.ai
Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.
Guoming Wang
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Guosheng Dong
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Haizhou Zhao
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Hangtian Liu
Z.ai
Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.
Hang Xu
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Han Hu
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Haobo Wang
Z.ai
Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.
Hao Ge
Alibaba Qwen
Co-author of the Qwen3 Technical Report.
Haonan Shan
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Haoran Wei
Alibaba Qwen
Research scientist in Tongyi Lab whose official profile highlights post-training and reasoning.
Haoyuan Li
01.AI
Researcher at 01.AI and coauthor of the Yi: Open Foundation Models by 01.AI.
Haoze Sun
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Hao Zhou
Z.ai
Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.
Harsha Chilakapati
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the BTLM-3B-8K: 7B Performance in a 3B Model.
Heri Zhao
Google Gemini
Co-author of the PaLM 2 Technical Report, which describes Google's multilingual and reasoning-oriented language model.
Hongda Zhang
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Hossein Hajimirsadeghi
Snowflake
Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.
Hrayr Harutyunyan
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Huan Sun
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.
Hui Liu
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Iman Mirzadeh
Snowflake
Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.
Izzeddin Gur
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Jason Wei
Snowflake
Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.
Jaxon Wang
Snowflake
Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.
Jesus A. Gonzalez
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Jiahao Peng
Z.ai
Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.
Jiaming Ji
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Jianguo Chen
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Jianguo Zhang
Snowflake
Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.
Jiaxi Yang
Alibaba Qwen
Research scientist in Tongyi Lab whose official profile highlights multimodal understanding, image generation, and efficient large language models.
Jiayang Wu
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Jie Zhang
Z.ai
Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.
Jingting Ye
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Jionghao Bai
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Jon W. Pappas
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Josue You
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Junnan Wang
Z.ai
Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.
JunTao Dai
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Junyu Huang
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Jun Zhang
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Kai-Hsin Chen
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Karan B. Patel
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Kathy Lee
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Kaustabh Goyal
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Kevin Qian
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Kewei Yin
01.AI
Researcher at 01.AI and coauthor of the Yi: Open Foundation Models by 01.AI.
Kun Fang
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Kushal Dave
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Lanxia Wang
01.AI
Researcher at 01.AI and coauthor of the Yi: Open Foundation Models by 01.AI.
Lei Dong
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Lei Su
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Le Xue
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Liang Song
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Liang Zhao
Z.ai
Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.
Lifeng Liu
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Limin Gao
Z.ai
Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.
Li Shen
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Liuzhenghao Lv
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Liyun Ru
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Luyao Ma
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Mang Wang
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Maosong Cao
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Maosong Sun
Z.ai
Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.
Matas Noreika
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Maximilian Welt
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the BTLM-3B-8K: 7B Performance in a 3B Model.
Md. Mofijul Islam
LG AI Research
Researcher at LG AI Research and coauthor of the EXAONE 3.5: Series of Language Models for Real-world Use Cases.
Mehrdad Farajtabar
Snowflake
Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.
Mickel Liu
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Mike Lewis
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.
Milad Faghri
Snowflake
Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.
MingAn Lin
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Mingkai Zhong
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Monika Reszke
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Mor Geva
Technology Innovation Institute
Researcher at Technology Innovation Institute and coauthor of the Falcon Mamba 7B: The First Competitive Attention-free 7B Language Model.
Mo Yu
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.
M. Salman Asif
Technology Innovation Institute
Researcher at Technology Innovation Institute and coauthor of the Falcon Mamba 7B: The First Competitive Attention-free 7B Language Model.
Murali Krishna Ramachandran
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Nianan Wang
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Nicholas O. Hill
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Nicolas Rajbhandari
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the BTLM-3B-8K: 7B Performance in a 3B Model.
Nuolan Nie
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Peidong Guo
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Prasad Reddy Y
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.
Puzhao Zhang
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Qingru Zhang
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Ranjan Saxena
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
R. J. Chen
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Roger Jiang
OpenAI
Official OpenAI materials list Roger Jiang among contributors to instruction-following work, ChatGPT, and GPT-4. These sources support his OpenAI contributions but do not confirm current employment.
Ruiyang Sun
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Ruizhe Pan
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Ryan McCann
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.
Samyam Rajbhandari
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Sankalp Sharma
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Satish Narayanan
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Sean Welleck
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.
Sejal Shekar
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Shengyu Yao
01.AI
Researcher at 01.AI and coauthor of the Yi: Open Foundation Models by 01.AI.
Shijie Zhou
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Shimin Li
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Shriya Chaturvedi
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Shuang Li (李爽)
Z.ai
Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.
Shuang Zhou
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Shunyu Yao
Z.ai
Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.
Shuo Wang
Snowflake
Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.
Siddharth Karamcheti
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Siming Chen
01.AI
Researcher at 01.AI and coauthor of the Yi: Open Foundation Models by 01.AI.
Sneha Agarwal
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Soji James
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Somanshu Gupta
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
S. S. Li
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Sungwon Suh
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
Tanwir Shahriar
Snowflake
Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.
Tao Zhang
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Tianhao Chen
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Tianhao Li
Alibaba Qwen
Research scientist in Tongyi Lab whose official profile highlights post-training, multilingual large language models, and efficient reasoning.
Tianpeng Li
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Tianyu Li
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
T. Y. Lin
Snowflake
Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.
Urvashi Khandelwal
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.
Vaibhav Patil
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.
Varun Arora
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Vishakh Padmakumar
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Viterbi Xiong
Databricks
Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.
Wei Cheng
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Wei Ge
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Weijie Jheng
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Weipeng Chen
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Weiyi Chen
Cerebras Systems
Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.
W. L. Xiao
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Xiang Li
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Xiang Li
Z.ai
Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.
Xiangrong Zeng
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Xiangyu Chen
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Xianzu Wang
DeepSeek
DeepSeek contributor and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Xiaochuan Wang
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Xiaohan Wang
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Xiaojin Shen
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Xiaokang Chen
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Xiaowei Zhou
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Xiaoxi Chen
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Xiaozhong Zhang
01.AI
Researcher at 01.AI and coauthor of the Yi: Open Foundation Models by 01.AI.
Xing Yao
Z.ai
Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.
Xin Jiang
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Xin Men
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Xinnan Song
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Xinting Huang
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Xinyi Lin
Z.ai
Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.
Xinyi Zhou
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Xin Yu
Baichuan
Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.
Xinyu Yang
DeepSeek
Co-author of the Nature paper on DeepSeek-R1; Nature and PubMed list Xinyu Yang with the DeepSeek-AI Team in Hangzhou, China.
X. Q. Li
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Xuehai Pan
Baichuan
Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.
Xuting Xu
Z.ai
Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.
Yangbo Fan
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Yanjun Shen
Baichuan
Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.
Yaohui Li
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Yao Nie
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Yao Wan
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Yash Pathak
Databricks
Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.
Yeqing Wang
Z.ai
Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.
Yichao Jiang
01.AI
Researcher at 01.AI and coauthor of Yi: Open Foundation Models by 01.AI.
Yiding Wang
Baichuan
Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.
Yifan Hu
Z.ai
Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.
Yikang Shen
Z.ai
Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.
Yinghao Zhao
Snowflake
Researcher at Snowflake and coauthor of Snowflake Arctic: An Enterprise LLM.
Ying Tang
DeepSeek
Member of the DeepSeek-AI team in Hangzhou, China and co-author of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Yiyu Li
Baichuan
Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.
Yi Zheng
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Yizhou Yan
Z.ai
Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.
Yonghao Zhuang
Snowflake
Researcher at Snowflake and coauthor of Snowflake Arctic: An Enterprise LLM.
Yongji Wang
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Yongqiang Guo
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Yong Zhou
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Youngjae Yu
Salesforce AI Research
Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.
Young Wang
01.AI
Researcher at 01.AI and coauthor of Yi: Open Foundation Models by 01.AI.
Youxin Jiang
Baichuan
Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.
Yuchen Gao
Baichuan
Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.
Yuchen Zhang
Databricks
Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.
Yuchen Zhu
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Yuchong Hu
Z.ai
Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.
Yuduan Wang
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Yujie Lu
Snowflake
Researcher at Snowflake and coauthor of Snowflake Arctic: An Enterprise LLM.
Yunxian Ma
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Yupeng Zhang
Baichuan
Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.
Yuting Yan
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Yuxuan Liu
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Yuxuan Ma
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Zenglin Xu
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Zhe Feng
Snowflake
Researcher at Snowflake and coauthor of Snowflake Arctic: An Enterprise LLM.
Zheng Cai
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Zhen Huang
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Zhenzhong Lan
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Zhipeng Xu
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Zhiqiang Zhang
Shanghai AI Laboratory
Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.
Zhiying Wu
Baichuan
Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.
Zhongyu Zhang
DeepSeek
Researcher at DeepSeek-AI in Hangzhou, China and co-author of the Nature paper introducing DeepSeek-R1.
Zhuoshu Li
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Zihui Gu
DeepSeek
Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.
Ziwei Wan
01.AI
Researcher at 01.AI and coauthor of Yi: Open Foundation Models by 01.AI.
Zonghan Yang
01.AI
Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.
Florencia Leoni Aleman
OpenAI
Florencia Leoni Aleman is an OpenAI contributor listed on the GPT-4V contributions page.
Y. Q. Wang
DeepSeek
Publicly identifiable information is limited; this name appears as an author on the DeepSeek-V3 technical report.
Molly Lin
OpenAI
Official OpenAI pages list Molly Lin among contributors to GPT-4, embedding-model updates, and rule-based-reward model-safety work.
Bo Liu
DeepSeek
Researcher affiliated with DeepSeek on Google Scholar and coauthor of the DeepSeek LLM and DeepSeek-V2 technical reports.
Chengqi Deng
DeepSeek
Research scientist at DeepSeek with public GitHub projects spanning language models and AI systems.
Huajian Xin
DeepSeek
Research scientist at DeepSeek AI working on reinforcement learning from verifiable rewards, reasoning models, and AI agents.
Mingfeng Xue
Alibaba Qwen
Research scientist at Alibaba's Tongyi Lab and a co-author of the Qwen3 and Qwen2.5 technical reports.
Qin Zhu
Alibaba Qwen
Research scientist at Alibaba Group and a co-author of the Qwen3 and Qwen2.5 technical reports.
Hui-Ling Zhen
NVIDIA
Public report authorship links Hui-Ling Zhen to the Nemotron-4 340B Technical Report at NVIDIA.
Kai Wu
NVIDIA
Public report authorship links Kai Wu to the Nemotron-4 340B Technical Report at NVIDIA.
Yu Qiao
MiniMax
Yu Qiao is listed as an author of the MiniMax technical report MiniMax-01: Scaling Foundation Models with Lightning Attention.
Adam Roberts
Google Gemini
Research scientist at Google Research whose public work spans language modeling and multimodal generation; arXiv author results include PaLM and PaLM 2.
Aixin Liu
DeepSeek
Researcher affiliated with DeepSeek on Google Scholar and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Arman Cohan
Ai2
Senior research scientist at Ai2 and associate professor at Yale working on long-form language understanding, information retrieval, and scientific NLP.
Avinatan Hassidim
Google Gemini
Professor of Computer Science at the Hebrew University of Jerusalem and Visiting Faculty Researcher at Google, with work spanning algorithms, algorithmic economics, and AI-related decision systems.
Baptiste Rozière
Meta AI
Research scientist at Meta working on large language models and coding assistants. He previously worked on pre-training language models and neural machine translation and completed a PhD in natural language processing at Sorbonne University.
Bei Feng
DeepSeek
Research scientist at DeepSeek with a public Google Scholar profile covering large language models and reasoning research.
Christopher Berner
OpenAI
Christopher Berner is an OpenAI contributor listed on the OpenAI GPT-4 contributions page.
Dawei Feng
MiniMax
Co-founder and research scientist at MiniMax AI Research. He received a PhD from Tsinghua University and works on foundation models, reinforcement learning, and data systems, with publications at major machine learning and NLP venues.
Dongjie Ji
DeepSeek
Researcher affiliated with DeepSeek on Google Scholar and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Faisal Azhar
Meta AI
Faisal Azhar is a PhD candidate in computer science at Stanford University. His work focuses on multimodal systems that unify text, image, and speech, together with efficient training and inference for large-scale machine learning.
Hao Xu
Z.ai
Research scientist at Z.ai focused on multimodal understanding and generation, reinforcement learning, AI agents, and end-to-end models. He received a bachelor's degree from Tsinghua University and a master's degree from Peking University.
Hui Li
DeepSeek
Machine learning researcher affiliated with DeepSeek on Google Scholar and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Jialiang Cai
DeepSeek
DeepSeek researcher and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Jian Liang
DeepSeek
Research scientist at DeepSeek with public scholarly work on large language models and efficient inference.
Jiaqi Ni
DeepSeek
Researcher at DeepSeek AI and PhD in computer science at Peking University, focusing on large language models, multimodal large language models, and reinforcement learning.
Jingyang Yuan
DeepSeek
Research scientist at DeepSeek with public GitHub work on language models and AI systems.
Kyle Lo
Ai2
Senior research scientist at Ai2 and affiliate assistant professor at the University of Washington.
Lean Wang
DeepSeek
Research scientist at DeepSeek with public scholarly work on large language models and AI systems.
Lei Xu
DeepSeek
Research scientist at DeepSeek with public scholarly work on large language models and reasoning.
Lelio Renard Lavaud
Mistral AI
Research scientist at Mistral AI working on multimodal models and reinforcement learning. He previously worked as a machine learning engineer and research scientist at Hugging Face.
Liang Zhao
DeepSeek
Researcher affiliated with DeepSeek on Google Scholar and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Marie-Anne Lachaux
Meta AI
Marie-Anne Lachaux is a research scientist at Meta whose public homepage highlights work on large language models, multilingual models, and speech.
Mark Chen
OpenAI
Chief Research Officer at OpenAI.
Melvin Johnson
Google Gemini
Senior Staff Research Scientist at Google DeepMind working on language modeling, speech recognition, machine translation, and multimodal understanding.
Meng Li
DeepSeek
Research scientist at DeepSeek with public scholarly work on language models and AI systems.
Naman Goyal
Meta AI
Research scientist at Meta whose work focuses on efficient large language models and multilingual NLP.
Nicholas Ruas
Ai2
Machine learning engineer at Ai2 whose public work focuses on open language models, post-training, and evaluation.
Pamela Mishkin
OpenAI
Research scientist at OpenAI focused on multimodal models.
Peng Zhang
DeepSeek
Research scientist at DeepSeek with public scholarly work on large language models and AI systems.
Pete Walsh
Ai2
Research scientist at Ai2 working on language model pre-training, adaptation, and alignment.
Rishabh Kabra
Google Gemini
Rishabh Kabra is a research scientist at Google DeepMind. His public homepage highlights work on machine learning systems and large-scale language model research.
Ruilin Jin
DeepSeek
DeepSeek researcher and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Shengfeng Ye
DeepSeek
Researcher at DeepSeek interested in scalable and efficient learning, reinforcement learning, reasoning, and large-scale distributed training and inference.
Tianyu Sun
DeepSeek
LLM researcher at DeepSeek focused on alignment, post-training, and reasoning.
Wei An
DeepSeek
Researcher affiliated with DeepSeek AI on Google Scholar and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Xiangyue Jin
DeepSeek
Member of Technical Staff at DeepSeek.
Xiaosha Chen
DeepSeek
Member of Technical Staff at DeepSeek.
Xiaowen Sun
DeepSeek
Researcher affiliated with DeepSeek-AI on Google Scholar and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Yacine Jernite
Google Gemini
Staff research scientist at Google DeepMind in Cambridge, Massachusetts. His public profile says he works on translating human values and institutional constraints into tests and safeguards for foundation models.
Yi Tay
Google Gemini
Member of Technical Staff at Google DeepMind working on language and multimodal models.
Yixuan Tan
DeepSeek
Master's student at Peking University and research intern at OpenSeek and DeepSeek, with interests in large language models and natural language processing.
Yu Han
Alibaba Qwen
Researcher affiliated with Alibaba Group on Google Scholar and coauthor of the Qwen technical report.
Yuxiang Wei
DeepSeek
DeepSeek researcher and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Yuxiang Zhu
DeepSeek
DeepSeek researcher and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Zhenyang Wu
Z.ai
Research scientist at Z.ai with research interests in multimodal understanding and generation, large language models, and reinforcement learning. He received a bachelor's degree from the University of Science and Technology of China and a master's degree from Tsinghua University.
Zilin Li
DeepSeek
Researcher affiliated with DeepSeek on Google Scholar and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.
Zoubin Ghahramani
Google Gemini
VP of Research at Google DeepMind and Professor of Information Engineering at the University of Cambridge, known for work in probabilistic machine learning and Bayesian statistics.
Aaron Johnson
Cohere
Aaron Johnson is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Abhinav Pandey
Meta AI
Abhinav Pandey is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Adam Lerer
NVIDIA
Adam Lerer is listed as an author of NVIDIA's Nemotron-4 340B Technical Report.
Adi Sinha
NVIDIA
Public report authorship links Adi Sinha to the Nemotron-4 340B Technical Report at NVIDIA.
Adrianna Muras
Cohere
Adrianna Muras is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Ahu Nguyen
Cohere
Ahu Nguyen is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Akshat Shrivastava
Amazon
Akshat Shrivastava is listed as an author of the Amazon Nova Premier Technical Report.
Akshay Kamath
NVIDIA
Public report authorship links Akshay Kamath to the Nemotron-4 340B Technical Report at NVIDIA.
Akshay Narayan Kini
NVIDIA
Public report authorship links Akshay Narayan Kini to the Nemotron-4 340B Technical Report at NVIDIA.
Alan Chung
Google Gemini
Alan Chung is listed as an author of Google's PaLM 2 Technical Report.
Alexey Pugachev
NVIDIA
Alexey Pugachev is listed as an author of NVIDIA's Nemotron-4 340B Technical Report.
Alexis Potapczuk
Cohere
Alexis Potapczuk is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Allison LaPrell
Amazon
Allison LaPrell is listed as an author of the Amazon Nova Premier Technical Report.
Alya Gafni
NVIDIA
Public report authorship links Alya Gafni to the Nemotron-4 340B Technical Report at NVIDIA.
Amanda Zhou
Google Gemini
Amanda Zhou is listed as an author of Google's PaLM 2 Technical Report.
Aman Gupta
Cohere
Aman Gupta is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Amy Yang
Meta AI
Amy Yang is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Ana Constant
Google Gemini
Ana Constant is listed as an author of Google's PaLM 2 Technical Report.
Andrew Austin
Google Gemini
Andrew Austin is listed as an author of Google's PaLM 2 Technical Report.
Andrew Drozdov
Ai2
Public report authorship links Andrew Drozdov to Open Instruct: A Simple Method for Aligning Language Models with Human Preferences at Ai2.
Andrew M. Dai
Cohere
Andrew M. Dai is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Andrew Sohn
NVIDIA
Andrew Sohn is listed as an author of NVIDIA's Nemotron-4 340B Technical Report.
Andy Pillai
Google Gemini
Andy Pillai is listed as an author of Google's PaLM 2 Technical Report.
Angelica Chen
Amazon
Angelica Chen is listed as an author of the Amazon Nova Premier Technical Report.
Anirban Ghosh
Meta AI
Anirban Ghosh is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Ankush Goyal
Meta AI
Ankush Goyal is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Anna Blusys
Cohere
Anna Blusys is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Anna Shlyakhter
NVIDIA
Anna Shlyakhter is listed as an author of NVIDIA's Nemotron-4 340B Technical Report.
Anuj Sharma
Meta AI
Anuj Sharma is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Anupama Mahapatra
Amazon
Anupama Mahapatra is listed as an author of the Amazon Nova Premier Technical Report.
Ashish Mitra
Meta AI
Ashish Mitra is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Ashwin Kumar
NVIDIA
Public report authorship links Ashwin Kumar to the Nemotron-4 340B Technical Report at NVIDIA.
Aswath Kumar
NVIDIA
Public report authorship links Aswath Kumar to the Nemotron-4 340B Technical Report at NVIDIA.
Bahareh H. Hajishirzi
Cohere
Bahareh H. Hajishirzi is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Baojun Wang
Z.ai
Public report authorship links Baojun Wang to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Bartosz Kozdron
NVIDIA
Bartosz Kozdron is listed as an author of NVIDIA's Nemotron-4 340B Technical Report.
Bartosz Ludwiczuk
Cohere
Bartosz Ludwiczuk is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Bayan Bruss
Cohere
Bayan Bruss is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Benoit Lalloz
Cohere
Benoit Lalloz is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Bhagyashree Agrawal
NVIDIA
Public report authorship links Bhagyashree Agrawal to the Nemotron-4 340B Technical Report at NVIDIA.
Bharath Ramavarapu
NVIDIA
Public report authorship links Bharath Ramavarapu to the Nemotron-4 340B Technical Report at NVIDIA.
Bhargav Vaidya
NVIDIA
Bhargav Vaidya is listed as an author of NVIDIA's Nemotron-4 340B Technical Report.
Biao Gong
Z.ai
Public report authorship links Biao Gong to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Biao Gong
Amazon
Biao Gong is listed as an author of the Amazon Nova Premier Technical Report.
Bikash Koley
NVIDIA
Bikash Koley is listed as an author of NVIDIA's Nemotron-4 340B Technical Report.
Bingchen Feng
Cohere
Bingchen Feng is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Błażej Miasojedow
Google Gemini
Błażej Miasojedow is listed as an author of the Google technical report Gemma: Open Models Based on Gemini Research and Technology.
Bobby Nanduri
Amazon
Bobby Nanduri is listed as an author of the Amazon Nova Premier Technical Report.
Bofei Gao
Z.ai
Public report authorship links Bofei Gao to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Brandon McKinzie
Google Gemini
Brandon McKinzie is listed as an author of Google's PaLM 2 Technical Report.
Buro Sim
Meta AI
Buro Sim is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Can Xu
Z.ai
Public report authorship links Can Xu to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Cesi Chen
Cohere
Cesi Chen is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Chenguang Wang
Amazon
Chenguang Wang is listed as an author of the Amazon Nova Premier Technical Report.
Cheng Xing
Cohere
Cheng Xing is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Chen Xing
Meta AI
Chen Xing is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Chong Wang
Google Gemini
Chong Wang is listed as an author of Google's PaLM 2 Technical Report.
Christian R. A. Azevedo
Google Gemini
Christian R. A. Azevedo is listed as an author of Google's PaLM 2 Technical Report.
Christian Wolf
Mistral AI
Christian Wolf is listed as an author of Mistral AI's Mistral Small 3.1 Technical Report.
Danielle Epstein
Google Gemini
Public report authorship links Danielle Epstein to PaLM: Scaling Language Modeling with Pathways at Google.
Daniel M. Ziegler
OpenAI
Daniel M. Ziegler is listed as an author of the OpenAI technical report Language Models are Few-Shot Learners.
Danqi Zhang
Cohere
Danqi Zhang is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Darren Kong
Cohere
Darren Kong is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
David Graham
Ai2
Public sources identify David Graham as a member of Ai2's team and a coauthor of OLMo 3.
David Gritz
Cohere
David Gritz is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
David Li
Google Gemini
David Li is listed as an author of Google's PaLM 2 Technical Report.
David Yu
Cohere
David Yu is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
De-Chang Chen
Google Gemini
De-Chang Chen is listed as an author of Google's PaLM 2 Technical Report.
Deli Zhao
Z.ai
Public report authorship links Deli Zhao to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Denny Zhou
Amazon
Denny Zhou is listed as an author of the Amazon Nova Premier Technical Report.
Dimitris Papailiopoulos
NVIDIA
Dimitris Papailiopoulos is listed as an author of NVIDIA's Nemotron-4 340B Technical Report.
Dirk van Strien
Ai2
Dirk van Strien is listed as an author of the Ai2 technical report OLMo: Accelerating the Science of Language Models.
Elsie Wang
Google Gemini
Elsie Wang is listed as an author of Google's PaLM 2 Technical Report.
Eric Huo
Cohere
Eric Huo is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Esteban Real
Amazon
Esteban Real is listed as an author of the Amazon Nova Premier Technical Report.
Ezekiel Wong
Cohere
Ezekiel Wong is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Fang Liu
Cohere
Fang Liu is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Florian Bressand
Mistral AI
Florian Bressand is listed as an author of the Mistral AI technical report Mistral 7B.
Gabriel Georgiou
Meta AI
Gabriel Georgiou is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Gabriel Gladstone
Cohere
Gabriel Gladstone is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Gang Li
Google Gemini
Gang Li is listed as an author of Google's PaLM 2 Technical Report.
George Thomas
Cohere
George Thomas is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Geunwoo Kim
Cohere
Geunwoo Kim is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Guanyu Feng
Z.ai
Public report authorship links Guanyu Feng to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Guillem Cieslak
Meta AI
Guillem Cieslak is listed as an author of the Meta AI technical report Llama 2: Open Foundation and Fine-Tuned Chat Models.
Hai Hu
Cohere
Hai Hu is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Hanjie Chen
Ai2
Hanjie Chen is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.
Hanlin Zhang
Z.ai
Public report authorship links Hanlin Zhang to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Han Wang
Cohere
Han Wang is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Han Wei
MiniMax
Han Wei is listed as an author of the MiniMax technical report MiniMax-01: Scaling Foundation Models with Lightning Attention.
Han Xu
Z.ai
Public report authorship links Han Xu to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Hao Peng
Cohere
Hao Peng is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Haotian Tang
NVIDIA
Public report authorship links Haotian Tang to the Nemotron-4 340B Technical Report at NVIDIA.
Hariharan Subramoni
NVIDIA
Hariharan Subramoni is listed as an author of NVIDIA's Nemotron-4 340B Technical Report.
Haroon Khan
Google Gemini
Haroon Khan is listed as an author of Google's PaLM 2 Technical Report.
Harshita Diddee
Cohere
Harshita Diddee is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Heiga Zen
Google Gemini
Heiga Zen is listed as an author of Google's PaLM 2 Technical Report.
Hongbo Zhang
Z.ai
Public report authorship links Hongbo Zhang to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Hongwei Cao
Z.ai
Public report authorship links Hongwei Cao to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Hongxing He
Z.ai
Public report authorship links Hongxing He to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Hongyang Li
MiniMax
Hongyang Li is listed as an author of the MiniMax technical report MiniMax-Text-01.
Howie Liu
Google Gemini
Howie Liu is listed as an author of Google's PaLM 2 Technical Report.
Hugo Javaux
Mistral AI
Hugo Javaux is listed as an author of Mistral AI's Mistral Small 3.1 Technical Report.
Hung-Cheng Chen
Google Gemini
Hung-Cheng Chen is listed as an author of Google's PaLM 2 Technical Report.
Hu Zhang
Amazon
Hu Zhang is listed as an author of the Amazon Nova Premier Technical Report.
Hyunewoo Cho
Ai2
Hyunewoo Cho is listed as an author of the Ai2 technical report OLMo: Accelerating the Science of Language Models.
Imene Kerboua
Cohere
Imene Kerboua is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Irwan Bello
OpenAI
Irwan Bello is listed as an author of OpenAI's GPT-4 Technical Report.
Jack Rae
OpenAI
Jack Rae is listed as an author of OpenAI's GPT-4 Technical Report.
Jackson Wu
Cohere
Jackson Wu is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Jacky Peng
NVIDIA
Public report authorship links Jacky Peng to the Nemotron-4 340B Technical Report at NVIDIA.
Jacqueline Hardin
Google Gemini
Jacqueline Hardin is listed as an author of Google's PaLM 2 Technical Report.
Jane Qian
Google Gemini
Jane Qian is listed as an author of Google's PaLM 2 Technical Report.
Jarett Gray
Amazon
Jarett Gray is listed as an author of the Amazon Nova Premier Technical Report.
Jasdeep Singh Pannu
NVIDIA
Jasdeep Singh Pannu is listed as an author of NVIDIA's Nemotron-4 340B Technical Report.
Jasjeet Sekhon
Cohere
Jasjeet Sekhon is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Jason Li
Cohere
Jason Li is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Jason Wei
Amazon
Jason Wei is listed as an author of the Amazon Nova Premier Technical Report.
Jason Wu
Google Gemini
Jason Wu is listed as an author of Google's PaLM 2 Technical Report.
Jason Wu
Ai2
Public report authorship links Jason Wu to Open Instruct: A Simple Method for Aligning Language Models with Human Preferences at Ai2.
Jeremy Dwivedi-Yu
Ai2
Jeremy Dwivedi-Yu is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.
Jerry Wang
Cohere
Jerry Wang is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Jiahao Zhou
Z.ai
Public report authorship links Jiahao Zhou to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Jiahui Li
Cohere
Jiahui Li is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Jianguo Zhang
Z.ai
Public report authorship links Jianguo Zhang to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Jianli Zhang
Z.ai
Public report authorship links Jianli Zhang to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Jianshu Chen
Amazon
Jianshu Chen is listed as an author of the Amazon Nova Premier Technical Report.
Jianshu Chen
Cohere
Jianshu Chen is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Jian Xie
Baichuan
Jian Xie is listed as an author of the Baichuan technical report Baichuan 2: Open Large-scale Language Models.
Jianyu Wang
NVIDIA
Jianyu Wang is listed as an author of NVIDIA's Nemotron-4 340B Technical Report.
Jia Song
Cohere
Jia Song is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Jiaxiang Guo
Google Gemini
Public report authorship links Jiaxiang Guo to RecurrentGemma: Moving Past Transformers for Efficient Open Language Models at Google.
Jie Chen
MiniMax
Jie Chen is listed as an author of the MiniMax technical report MiniMax-Text-01.
Jiefeng Cheng
Z.ai
Public report authorship links Jiefeng Cheng to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Jie Fu
Z.ai
Public report authorship links Jie Fu to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Ji Ma
Amazon
Ji Ma is listed as an author of the Amazon Nova Premier Technical Report.
Ji Ma
Google Gemini
Ji Ma is listed as an author of Google's PaLM 2 Technical Report.
Jimmy Lee
Google Gemini
Public report authorship links Jimmy Lee to Gemma 2: Improving Open Language Models at a Practical Size at Google.
Jincheng Han
MiniMax
Jincheng Han is listed as an author of the MiniMax technical report MiniMax-Text-01.
Jingkuang Wang
Z.ai
Public report authorship links Jingkuang Wang to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Jingwen Leng
Z.ai
Public report authorship links Jingwen Leng to GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Jingyu Hua
NVIDIA
Public report authorship links Jingyu Hua to the Nemotron-4 340B Technical Report at NVIDIA.
Jingyun Jiang
MiniMax
Jingyun Jiang is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.
Jinmian Zhang
MiniMax
Jinmian Zhang is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.
Jiuyang Tang
Z.ai
Public report authorship links Jiuyang Tang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Jiwei Li
Z.ai
Public report authorship links Jiwei Li to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Ji Xu
Google Gemini
Ji Xu is listed as an author of the Google technical report PaLM 2 Technical Report.
Joe Hall
Meta AI
Joe Hall is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Joel Ruben Antony
Google Gemini
Joel Ruben Antony is listed as an author of the Google technical report PaLM 2 Technical Report.
John Gagneur
Cohere
John Gagneur is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
John Lee
Meta AI
John Lee is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Jonathan Branson
Cohere
Jonathan Branson is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Jonathan Lee
Meta AI
Jonathan Lee is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Josh Gorlin
Google Gemini
Josh Gorlin is listed as an author of the Google technical report PaLM 2 Technical Report.
Joshua Stoddard
Amazon
Joshua Stoddard is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Julien Mouchere
Mistral AI
Julien Mouchere is listed as an author of the Mistral AI technical report Mistral Small 3.1 Technical Report.
Junlin Han
Z.ai
Public report authorship links Junlin Han to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Jun Yin
Cohere
Jun Yin is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Jun Zhang
Z.ai
Public report authorship links Jun Zhang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Junzheng Wang
Google Gemini
Junzheng Wang is listed as an author of the Google technical report PaLM 2 Technical Report.
Kailash Karthikeyan
NVIDIA
Public report authorship links Kailash Karthikeyan to the Nemotron-4 340B Technical Report at NVIDIA.
Kaiyan Li
Z.ai
Public report authorship links Kaiyan Li to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Kaiyou Yang
Z.ai
Public report authorship links Kaiyou Yang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Kangning Liu
MiniMax
Kangning Liu is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.
Karan Bhatia
Cohere
Karan Bhatia is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Karthik Ramavarmaraju
NVIDIA
Public report authorship links Karthik Ramavarmaraju to the Nemotron-4 340B Technical Report at NVIDIA.
Katherine Hou
Google Gemini
Katherine Hou is listed as an author of the Google technical report PaLM 2 Technical Report.
Katie Mayer
OpenAI
Katie Mayer is listed as an author of the OpenAI technical report GPT-4 Technical Report.
Kaushal K. Kishore
Google Gemini
Kaushal K. Kishore is listed as an author of the Google technical report PaLM 2 Technical Report.
Kaushik Roy
Google Gemini
Public report authorship links Kaushik Roy to the Google report Gemma 2: Improving Open Language Models at a Practical Size.
Kaushik Sivakumar
Google Gemini
Kaushik Sivakumar is listed as an author of the Google technical report PaLM 2 Technical Report.
Ke Sun
Z.ai
Ke Sun is listed as an author of the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.
Kevin Lin
Cohere
Kevin Lin is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Khaled Saeed
Ai2
Public report authorship links Khaled Saeed to the OLMoE: Open Mixture-of-Experts Language Models at Ai2.
Khe Chai Sim
Google Gemini
Khe Chai Sim is listed as an author of the Google technical report PaLM 2 Technical Report.
Kris Cao
Google Gemini
Kris Cao is listed as an author of the Google technical report PaLM 2 Technical Report.
Kun Li
Cohere
Kun Li is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Lakshminarasimhan Sundararaman
NVIDIA
Public report authorship links Lakshminarasimhan Sundararaman to the Nemotron-4 340B Technical Report at NVIDIA.
Laura Martin
Cohere
Laura Martin is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Le Hou
Meta AI
Le Hou is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Leyang Cui
Cohere
Leyang Cui is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Lia Lee
Cohere
Lia Lee is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Liam Dean
Google Gemini
Liam Dean is listed as an author of the Google technical report PaLM 2 Technical Report.
Lianghao Deng
Alibaba Qwen
Lianghao Deng is listed as an author of the Qwen technical report Qwen3 Technical Report.
Lilian Lee
Google Gemini
Lilian Lee is listed as an author of the Google technical report PaLM 2 Technical Report.
Lokesh Ramamoorthy
NVIDIA
Public report authorship links Lokesh Ramamoorthy to the Nemotron-4 340B Technical Report at NVIDIA.
Longjie Ye
Z.ai
Public report authorship links Longjie Ye to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Madhura Dhuri
NVIDIA
Public report authorship links Madhura Dhuri to the Nemotron-4 340B Technical Report at NVIDIA.
Mahyar Babaei
Cohere
Mahyar Babaei is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Manav Mathur
Meta AI
Manav Mathur is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Manoj Kumar
Google Gemini
Public report authorship links Manoj Kumar to the RecurrentGemma: Moving Past Transformers for Efficient Open Language Models at Google.
Martin Connor
Google Gemini
Martin Connor is listed as an author of the Google technical report PaLM 2 Technical Report.
Marvin Polius
Google Gemini
Marvin Polius is listed as an author of the Google technical report PaLM 2 Technical Report.
Matt Jordan
Amazon
Matt Jordan is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Matt Wood
Amazon
Matt Wood is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Maxim Bluvshtein
NVIDIA
Maxim Bluvshtein is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Maxim Krikun
Amazon
Maxim Krikun is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Maxwell Pirtle
Amazon
Maxwell Pirtle is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Maxwell Roberts
Ai2
Maxwell Roberts is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.
Mayank Bhargava
NVIDIA
Public report authorship links Mayank Bhargava to the Nemotron-4 340B Technical Report at NVIDIA.
Meriem Belkada
Amazon
Meriem Belkada is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Michael Chen
Cohere
Michael Chen is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Michael Flaherty
NVIDIA
Public report authorship links Michael Flaherty to the Nemotron-4 15B Technical Report at NVIDIA.
Michael Tan
Meta AI
Michael Tan is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Michael Tufano
Ai2
Michael Tufano is listed as an author of the Ai2 technical report OLMo: Accelerating the Science of Language Models.
Mike Shumate
Amazon
Mike Shumate is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Mikhail Miller
Amazon
Mikhail Miller is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Mikhail Ponomarev
Amazon
Mikhail Ponomarev is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Mingfeng Wang
Amazon
Mingfeng Wang is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Mingjie Sun
Amazon
Mingjie Sun is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Mingmei Zhang
Z.ai
Public report authorship links Mingmei Zhang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Mingxing Zhang
Amazon
Mingxing Zhang is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Mingyang Wang
Z.ai
Public report authorship links Mingyang Wang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Mingyang Wu
Z.ai
Public report authorship links Mingyang Wu to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Ming Yin
Ai2
Ming Yin is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.
Mitesh Bharadwaj
NVIDIA
Mitesh Bharadwaj is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Mitsuru Yamakawa
Cohere
Mitsuru Yamakawa is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Mohan Kumar
NVIDIA
Mohan Kumar is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Mohan Kumar Ramaswamy
NVIDIA
Mohan Kumar Ramaswamy is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Mohit Bansal
NVIDIA
Public report authorship links Mohit Bansal to the Nemotron-4 15B Technical Report at NVIDIA.
Mona Ranade
Google Gemini
Mona Ranade is listed as an author of the Google technical report PaLM 2 Technical Report.
Murali Emani
Amazon
Murali Emani is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Murray Raum
Google Gemini
Murray Raum is listed as an author of the Google technical report PaLM 2 Technical Report.
Musab Beshir
Cohere
Musab Beshir is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Naman Jain
NVIDIA
Public report authorship links Naman Jain to the Nemotron-4 340B Technical Report at NVIDIA.
Nan Du
Google Gemini
Nan Du is listed as an author of the Google technical report PaLM 2 Technical Report.
Nan Duan
Z.ai
Public report authorship links Nan Duan to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Natalia Slyusarenko
Cohere
Natalia Slyusarenko is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Nathan Knight
NVIDIA
Nathan Knight is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
N. D. R. Chinnakonduru
Google Gemini
N. D. R. Chinnakonduru is listed as an author of the Google technical report PaLM 2 Technical Report.
Neil Bhatnagar
NVIDIA
Public report authorship links Neil Bhatnagar to the Nemotron-4 340B Technical Report at NVIDIA.
Nesreen K. Ahmed
NVIDIA
Public report authorship links Nesreen K. Ahmed to the Nemotron-4 340B Technical Report at NVIDIA.
Nicholas Tomlin
Cohere
Nicholas Tomlin is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Nicola Rossi
Cohere
Nicola Rossi is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Nicolas Uzondu
Google Gemini
Nicolas Uzondu is listed as an author of the Google technical report PaLM 2 Technical Report.
Nikita Gokarn
Meta AI
Nikita Gokarn is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Nikita Pavlichenko
Amazon
Nikita Pavlichenko is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Ning Ding
Z.ai
Public report authorship links Ning Ding to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Nino Vieillard
Ai2
Public report authorship links Nino Vieillard to the Open Instruct: A Simple Method for Aligning Language Models with Human Preferences at Ai2.
Nithin Raveendran
Google Gemini
Nithin Raveendran is listed as an author of the Google technical report PaLM 2 Technical Report.
Nitin Suresh
Cohere
Nitin Suresh is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Noah Constant
Google Gemini
Noah Constant is listed as an author of the Google technical report PaLM 2 Technical Report.
Noam Toister
Cohere
Noam Toister is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Noel Nabeshima
Ai2
Noel Nabeshima is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.
Oliver Firat
Google Gemini
Oliver Firat is listed as an author of the Google technical report PaLM 2 Technical Report.
Omer Goldman
Cohere
Omer Goldman is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Osama Ramadan
Ai2
Public report authorship links Osama Ramadan to the Open Instruct: A Simple Method for Aligning Language Models with Human Preferences at Ai2.
Pablo Salinas
Cohere
Pablo Salinas is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Parham Alaei
Cohere
Parham Alaei is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Parth Chadha
NVIDIA
Parth Chadha is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Patricia Lue
OpenAI
Patricia Lue is listed as an author of the OpenAI technical report GPT-4 Technical Report.
Pavan K. Murali
NVIDIA
Pavan K. Murali is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Peilin Dong
Meta AI
Peilin Dong is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Pengyang Wang
MiniMax
Pengyang Wang is listed as an author of the MiniMax technical report MiniMax-01: Scaling Foundation Models with Lightning Attention.
P. Lu
Cohere
P. Lu is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Pramod Kandala
NVIDIA
Public report authorship links Pramod Kandala to the Nemotron-4 340B Technical Report at NVIDIA.
Prasann Singhal
NVIDIA
Public report authorship links Prasann Singhal to the Nemotron-4 340B Technical Report at NVIDIA.
Prashanth Rajput
NVIDIA
Public report authorship links Prashanth Rajput to the Nemotron-4 340B Technical Report at NVIDIA.
Prathyusha Kamesetty
NVIDIA
Public report authorship links Prathyusha Kamesetty to the Nemotron-4 15B Technical Report at NVIDIA.
Pratyush Patil
NVIDIA
Pratyush Patil is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Purnendu Patil
NVIDIA
Purnendu Patil is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Puyang Huang
Z.ai
Puyang Huang is listed as an author of the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.
Qian Liu
Cohere
Qian Liu is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Qian Liu
Amazon
Qian Liu is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Qian Liu
Z.ai
Public report authorship links Qian Liu to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Qi Dou
MiniMax
Qi Dou is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.
Qiming Zhang
Z.ai
Public report authorship links Qiming Zhang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Qinglin Lu
Z.ai
Public report authorship links Qinglin Lu to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Qingquan Li
Z.ai
Public report authorship links Qingquan Li to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Qingyun Wu
Amazon
Qingyun Wu is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Qiqi Wang
Z.ai
Public report authorship links Qiqi Wang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Quanting Du
Cohere
Quanting Du is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Raghavendra Amin
NVIDIA
Raghavendra Amin is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Raghavendra Kokku
NVIDIA
Public report authorship links Raghavendra Kokku to the Nemotron-4 340B Technical Report at NVIDIA.
Raghu Ganti
Cohere
Raghu Ganti is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Rajan Chari
Amazon
Rajan Chari is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Rajkumar Krishnakumar
Google Gemini
Rajkumar Krishnakumar is listed as an author of the Google technical report PaLM 2 Technical Report.
Ram Rajagopal
NVIDIA
Public report authorship links Ram Rajagopal to the Nemotron-4 340B Technical Report at NVIDIA.
Rebecca Cohen
Google Gemini
Rebecca Cohen is listed as an author of the Google technical report PaLM 2 Technical Report.
Renjie Pi
NVIDIA
Public report authorship links Renjie Pi to the Nemotron-4 340B Technical Report at NVIDIA.
Robert Berry
Ai2
Public sources list Robert Berry in the Ai2 team directory and among the authors of the OLMo 3 technical report.
Rodrigo Gomes
Google Gemini
Rodrigo Gomes is listed as an author of the Google technical report Gemma: Open Models Based on Gemini Research and Technology.
Rohan Ringwala
NVIDIA
Rohan Ringwala is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Ronan Rozen
NVIDIA
Public report authorship links Ronan Rozen to the Nemotron-4 340B Technical Report at NVIDIA.
Roy Bar-Haim
Cohere
Roy Bar-Haim is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Rui Hu
Z.ai
Public report authorship links Rui Hu to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Rui Wang
Z.ai
Public report authorship links Rui Wang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Ruoming Pang
Google Gemini
Ruoming Pang is listed as an author of the Google technical report RecurrentGemma: Moving Past Transformers for Efficient Open Language Models.
Rushil Anirudh
NVIDIA
Rushil Anirudh is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Ryan Ann Molander
Meta AI
Ryan Ann Molander is listed as an author of the Meta AI technical report Llama 2: Open Foundation and Fine-Tuned Chat Models.
Ryan Lowe
OpenAI
Ryan Lowe is listed as an author of the OpenAI technical report GPT-4 Technical Report.
Ryan Zhao
Google Gemini
Ryan Zhao is listed as an author of the Google technical report PaLM 2 Technical Report.
Said X. Wang
NVIDIA
Said X. Wang is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Sajjad A. Mozaffari
Amazon
Sajjad A. Mozaffari is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Salman Khan
NVIDIA
Salman Khan is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Salvatore Romano
Cohere
Salvatore Romano is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Sameer Khan
Cohere
Sameer Khan is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Sameer Singhal
NVIDIA
Sameer Singhal is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Sam Power
Cohere
Sam Power is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Samuel Albanie
NVIDIA
Samuel Albanie is listed as an author of the NVIDIA technical report NVIDIA Nemotron 3: Efficient and Open Intelligence.
Sangeetha Abdu Jyothi
Amazon
Sangeetha Abdu Jyothi is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Sarah Laszlo
Cohere
Sarah Laszlo is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Saurabh Dolgall
NVIDIA
Saurabh Dolgall is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Saurav Muralidharan
NVIDIA
Public report authorship links Saurav Muralidharan to the Nemotron-4 15B Technical Report at NVIDIA.
Scott Mayer McKinney
OpenAI
Scott Mayer McKinney is listed as an author of the OpenAI technical report GPT-4 Technical Report.
Sean Foo
NVIDIA
Public report authorship links Sean Foo to the Nemotron-4 340B Technical Report at NVIDIA.
Sebastian Lee
Google Gemini
Sebastian Lee is listed as an author of the Google technical report PaLM: Scaling Language Modeling with Pathways.
Sebastian Norgaard
NVIDIA
Public report authorship links Sebastian Norgaard to the Nemotron-4 340B Technical Report at NVIDIA.
Sha Cai
Z.ai
Public report authorship links Sha Cai to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Shen Gao
Z.ai
Public report authorship links Shen Gao to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Shibo Hao
Google Gemini
Public report authorship links Shibo Hao to the RecurrentGemma: Moving Past Transformers for Efficient Open Language Models at Google.
Shi Dong
NVIDIA
Shi Dong is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Shijie Geng
Z.ai
Public report authorship links Shijie Geng to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Shikib Rahman
Cohere
Shikib Rahman is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Shiqi Wang
Z.ai
Public report authorship links Shiqi Wang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Shital Shah
NVIDIA
Shital Shah is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Shiyue Zhang
Meta AI
Shiyue Zhang is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Shourya Khare
Google Gemini
Shourya Khare is listed as an author of the Google technical report PaLM 2 Technical Report.
Shreyas Paliwal
NVIDIA
Public report authorship links Shreyas Paliwal to the Nemotron-4 340B Technical Report at NVIDIA.
Shuai Wang
Cohere
Shuai Wang is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Shuangfei Zhai
Amazon
Shuangfei Zhai is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Shuliang Wang
MiniMax
Shuliang Wang is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.
Shuo Wang
Z.ai
Public report authorship links Shuo Wang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Shuyan Zhou
Cohere
Shuyan Zhou is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Shyam Mohan Srinivasan
NVIDIA
Public report authorship links Shyam Mohan Srinivasan to the Nemotron-4 340B Technical Report at NVIDIA.
Siddharth Jain
NVIDIA
Public report authorship links Siddharth Jain to the Nemotron-4 340B Technical Report at NVIDIA.
Sidharth Karamcheti
Mistral AI
Sidharth Karamcheti is listed as an author of the Mistral AI technical report Mistral Small 3.1 Technical Report.
Siliang Tang
MiniMax
Siliang Tang is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.
Siyuan Yin
Z.ai
Siyuan Yin is listed as an author of the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.
Songfang Huang
Z.ai
Public report authorship links Songfang Huang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Souradip Chakraborty
Amazon
Souradip Chakraborty is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Spike Gururangan
Meta AI
Spike Gururangan is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Steve Dowling
OpenAI
Steve Dowling is listed as an author of the OpenAI technical report GPT-4 Technical Report.
Sujith Ravi
Google Gemini
Sujith Ravi is listed as an author of the Google technical report PaLM 2 Technical Report.
Sukannya Purkayastha
Amazon
Sukannya Purkayastha is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Sundararajan Srinivasan
NVIDIA
Sundararajan Srinivasan is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Taira Anderson
Ai2
Taira Anderson is a program manager supporting Ai2's multimodal teams and is listed as an author on the Olmo 3 report.
Tanay Datta
NVIDIA
Public report authorship links Tanay Datta to the Nemotron-4 340B Technical Report at NVIDIA.
Tanmay Shah
Amazon
Tanmay Shah is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Tao Ge
Z.ai
Public report authorship links Tao Ge to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Tao Ge
Amazon
Tao Ge is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Tarek Eloundou Nekoul
Ai2
Public report authorship links Tarek Eloundou Nekoul to the OLMoE: Open Mixture-of-Experts Language Models at Ai2.
Teng Xu
Z.ai
Public report authorship links Teng Xu to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Theo Garcin
Mistral AI
Theo Garcin is listed as an author of the Mistral AI technical report Mistral Small 3.1 Technical Report.
Thomas Lavril
Mistral AI
Thomas Lavril is listed as an author of the Mistral AI technical report Mistral 7B.
Thomas Scialom
Ai2
Public report authorship links Thomas Scialom to the Open Instruct: A Simple Method for Aligning Language Models with Human Preferences at Ai2.
Tianhao Xu
NVIDIA
Tianhao Xu is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Tianmin Shu
Cohere
Tianmin Shu is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Tianshu Wang
Z.ai
Public report authorship links Tianshu Wang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Tianyi Zhang
Meta AI
Tianyi Zhang is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Tianyi Zhang
Ai2
Public report authorship links Tianyi Zhang to the Open Instruct: A Simple Method for Aligning Language Models with Human Preferences at Ai2.
Tielong Chen
Z.ai
Public report authorship links Tielong Chen to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Tim Dodds
Ai2
Public report authorship links Tim Dodds to the Open Instruct: A Simple Method for Aligning Language Models with Human Preferences at Ai2.
Ting Chen
Google Gemini
Ting Chen is listed as an author of the Google technical report PaLM 2 Technical Report.
Tolly Powell
OpenAI
Tolly Powell is listed as an author of the OpenAI technical report GPT-4 Technical Report.
Tong Wang
NVIDIA
Public report authorship links Tong Wang to the Nemotron-4 340B Technical Report at NVIDIA.
Tyler A. Chang
Ai2
Public report authorship links Tyler A. Chang to the OLMoE: Open Mixture-of-Experts Language Models at Ai2.
Upal Ayyalasomayajula
NVIDIA
Upal Ayyalasomayajula is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Valentin Nechifor
NVIDIA
Public report authorship links Valentin Nechifor to the Nemotron-4 340B Technical Report at NVIDIA.
Vartika Bhandari
NVIDIA
Vartika Bhandari is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Venkata Pingali
NVIDIA
Public report authorship links Venkata Pingali to the Nemotron-4 340B Technical Report at NVIDIA.
Venkatesh Korthikanti
NVIDIA
Public report authorship links Venkatesh Korthikanti to the Nemotron-4 340B Technical Report at NVIDIA.
Vikas Chandra
NVIDIA
Vikas Chandra is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Vikas Ours
NVIDIA
Vikas Ours is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Vineet Gupta
NVIDIA
Public report authorship links Vineet Gupta to the Nemotron-4 340B Technical Report at NVIDIA.
Viraj Bagal
Ai2
Public report authorship links Viraj Bagal to the OLMoE: Open Mixture-of-Experts Language Models at Ai2.
Vishakha Gupta
NVIDIA
Vishakha Gupta is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Wei Dai
Google Gemini
Wei Dai is listed as an author of the Google technical report PaLM 2 Technical Report.
Wei Li
Google Gemini
Wei Li is listed as an author of the Google technical report PaLM 2 Technical Report.
Wenhui Huang
MiniMax
Wenhui Huang is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.
Wenping Wang
Z.ai
Public report authorship links Wenping Wang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Wenxuan Tan
Z.ai
Public report authorship links Wenxuan Tan to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Wenyi Hong
Z.ai
Public report authorship links Wenyi Hong to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Wenyuan Han
Z.ai
Public report authorship links Wenyuan Han to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Wojciech Zaremba
OpenAI
Wojciech Zaremba is listed as an author of the OpenAI technical report GPT-4 Technical Report.
Xiangyong Deng
Z.ai
Public report authorship links Xiangyong Deng to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Xiao Feng
Z.ai
Public report authorship links Xiao Feng to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Xiaoming Zhang
MiniMax
Xiaoming Zhang is listed as an author of the MiniMax technical report MiniMax-01: Scaling Foundation Models with Lightning Attention.
Xiaosi Yang
NVIDIA
Xiaosi Yang is listed as an author of the NVIDIA technical report Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models.
Xiaoyang Liu
NVIDIA
Public report authorship links Xiaoyang Liu to the Nemotron-4 340B Technical Report at NVIDIA.
Xiaoze Liu
Z.ai
Public report authorship links Xiaoze Liu to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Xiaozhi Wang
Z.ai
Public report authorship links Xiaozhi Wang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Xinggong Zhang
Z.ai
Public report authorship links Xinggong Zhang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Xin Jiang
Google Gemini
Xin Jiang is listed as an author of the Google technical report PaLM 2 Technical Report.
Xinlong Wang
Z.ai
Public report authorship links Xinlong Wang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Xintao Wang
MiniMax
Xintao Wang is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.
Xinyu Xu
Z.ai
Public report authorship links Xinyu Xu to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Xuesong Jin
Z.ai
Public report authorship links Xuesong Jin to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Xu Ming
Z.ai
Xu Ming is listed as an author of the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.
Xun Gao
Cohere
Xun Gao is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Xupeng Miao
Z.ai
Public report authorship links Xupeng Miao to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yadong Yin
Z.ai
Public report authorship links Yadong Yin to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yajun Wang
Z.ai
Public report authorship links Yajun Wang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yang Gao
Z.ai
Public report authorship links Yang Gao to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yang Liu
Z.ai
Public report authorship links Yang Liu to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yangyou Huang
NVIDIA
Yangyou Huang is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Yanhui Guo
Z.ai
Public report authorship links Yanhui Guo to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yann Barbaron
Mistral AI
Yann Barbaron is listed as an author of the Mistral AI technical report Mixtral of Experts.
Yann Dubois
Ai2
Public report authorship links Yann Dubois to the Open Instruct: A Simple Method for Aligning Language Models with Human Preferences at Ai2.
Yan Zhou
Z.ai
Public report authorship links Yan Zhou to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yao Wu
Google Gemini
Yao Wu is listed as an author of the Google technical report PaLM 2 Technical Report.
Yaqin Fan
Z.ai
Public report authorship links Yaqin Fan to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yash Sharma
Meta AI
Yash Sharma is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Yecong Wei
Z.ai
Public report authorship links Yecong Wei to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yejin Choi
NVIDIA
Public report authorship links Yejin Choi to the Nemotron-4 15B Technical Report at NVIDIA.
Yejin Choi
Cohere
Yejin Choi is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Yelin Zhou
Google Gemini
Yelin Zhou is listed as an author of the Google technical report PaLM 2 Technical Report.
Yeqi Bai
Z.ai
Public report authorship links Yeqi Bai to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yevgeny Kuratov
Amazon
Yevgeny Kuratov is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Yibo Xu
Z.ai
Public report authorship links Yibo Xu to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yicheng Jin
Z.ai
Yicheng Jin is listed as an author of the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.
Yicheng Wang
Z.ai
Public report authorship links Yicheng Wang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yichen Wang
Z.ai
Yichen Wang is listed as an author of the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.
Yichen Zhu
MiniMax
Yichen Zhu is listed as an author of the MiniMax technical report MiniMax-01: Scaling Foundation Models with Lightning Attention.
Yi Cui
Google Gemini
Yi Cui is listed as an author of the Google technical report PaLM 2 Technical Report.
Yifan Li
MiniMax
Yifan Li is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.
Yijia Shao
Cohere
Yijia Shao is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Yile Wang
Cohere
Yile Wang is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Yile Wang
Z.ai
Public report authorship links Yile Wang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yimeng Zhu
Z.ai
Public report authorship links Yimeng Zhu to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yiming Li
Z.ai
Public report authorship links Yiming Li to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yingbo Zhou
Amazon
Yingbo Zhou is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Yingqing He
MiniMax
Yingqing He is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.
Yinqiang Zheng
MiniMax
Yinqiang Zheng is listed as an author of the MiniMax technical report MiniMax-01: Scaling Foundation Models with Lightning Attention.
Yiran Fang
MiniMax
Yiran Fang is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.
Yixing Liao
Amazon
Yixing Liao is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Yizhu Jiao
Ai2
Yizhu Jiao is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.
Yongbin Li
Z.ai
Yongbin Li is listed as an author of the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.
Yong Lee
NVIDIA
Yong Lee is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Yonghong Tian
MiniMax
Yonghong Tian is listed as an author of the MiniMax technical report MiniMax-01: Scaling Foundation Models with Lightning Attention.
Yong Wang
Z.ai
Public report authorship links Yong Wang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yongxin Guo
Z.ai
Public report authorship links Yongxin Guo to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yoni Halpern
Amazon
Yoni Halpern is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Youjia Zhang
MiniMax
Youjia Zhang is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.
Yuan Gu
Z.ai
Public report authorship links Yuan Gu to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yubo Miao
Z.ai
Public report authorship links Yubo Miao to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yu Cheng
Z.ai
Public report authorship links Yu Cheng to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yucheng Zheng
Z.ai
Public report authorship links Yucheng Zheng to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yudong Chen
Z.ai
Public report authorship links Yudong Chen to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yudong Hao
NVIDIA
Yudong Hao is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Yue Zhou
Z.ai
Public report authorship links Yue Zhou to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yujiu Yang
Cohere
Yujiu Yang is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Yukun Zhu
Z.ai
Public report authorship links Yukun Zhu to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yuliya Simachev
NVIDIA
Yuliya Simachev is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.
Yunan Hu
Z.ai
Public report authorship links Yunan Hu to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yunjia Xi
Z.ai
Public report authorship links Yunjia Xi to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yunyang Zhao
NVIDIA
Public report authorship links Yunyang Zhao to the Nemotron-4 340B Technical Report at NVIDIA.
Yuqi Zhou
Google Gemini
Public report authorship links Yuqi Zhou to the PaLM: Scaling Language Modeling with Pathways at Google.
Yury Malkov
Google Gemini
Yury Malkov is listed as an author of the Google technical report PaLM 2 Technical Report.
Yury Malkov
Amazon
Yury Malkov is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Yu Wang
Z.ai
Public report authorship links Yu Wang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yuxiang Luo
DeepSeek
Yuxiang Luo is listed as an author of the DeepSeek technical report DeepSeek-V3 Technical Report.
Yuxi Li
Cohere
Yuxi Li is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Yuxiong He
NVIDIA
Public report authorship links Yuxiong He to the Nemotron-4 340B Technical Report at NVIDIA.
Yu Zhang
Z.ai
Public report authorship links Yu Zhang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Yuzheng Zhuang
Cohere
Yuzheng Zhuang is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Zachary Yung
Google Gemini
Zachary Yung is listed as an author of the Google technical report PaLM 2 Technical Report.
Zefeng Wang
Z.ai
Public report authorship links Zefeng Wang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Zehua Wang
NVIDIA
Public report authorship links Zehua Wang to the Nemotron-4 340B Technical Report at NVIDIA.
Zhangliang Chen
Z.ai
Public report authorship links Zhangliang Chen to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Zhaocheng Fang
Z.ai
Public report authorship links Zhaocheng Fang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Zhenhai Zhu
Cohere
Zhenhai Zhu is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Zhenhuan Yang
Z.ai
Public report authorship links Zhenhuan Yang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Zhen Zhang
Amazon
Zhen Zhang is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.
Zheyu Song
Z.ai
Public report authorship links Zheyu Song to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Zhibin Chen
Z.ai
Public report authorship links Zhibin Chen to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Zhiming Yang
Meta AI
Zhiming Yang is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Zhiqiang Zhang
Z.ai
Public report authorship links Zhiqiang Zhang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Zhiwen Fan
Cohere
Zhiwen Fan is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.
Zhiyao Li
Z.ai
Public report authorship links Zhiyao Li to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Zhuohan Li
Ai2
Public report authorship links Zhuohan Li to the OLMoE: Open Mixture-of-Experts Language Models at Ai2.
Zihang Li
Meta AI
Zihang Li is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.
Zihan Qiao
MiniMax
Zihan Qiao is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.
Ziyang Song
DeepSeek
Ziyang Song is listed as an author of the DeepSeek technical report DeepSeek-V3 Technical Report.
Ziyi Yang
Ai2
Ziyi Yang is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.
Z. Olivia Li
Google Gemini
Z. Olivia Li is listed as an author of the Google technical report PaLM 2 Technical Report.
Zongze Kou
Z.ai
Public report authorship links Zongze Kou to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.
Chengen Huang
Alibaba Qwen
Research scientist in Tongyi Lab whose official profile highlights coding large language models and AI agents.
Gao Liu
Alibaba Qwen
Research scientist in Tongyi Lab whose official profile highlights post-training, reinforcement learning, reasoning, and AI agents.
Jialong Tang
Alibaba Qwen
Qwen researcher whose work focuses on alignment, reasoning, and multilingual large language models.
Jing Zhou
Alibaba Qwen
Research scientist in Tongyi Lab whose official profile highlights post-training and multimodal large language models.
Luo Ji
Alibaba Qwen
Leader of Qwen's reasoning and agent team.
Shixuan Liu
Alibaba Qwen
Research scientist in Tongyi Lab whose official profile highlights post-training, large language models, and AI agents.
Tianhang Zhu
Alibaba Qwen
Research scientist in Tongyi Lab whose official profile highlights post-training, reasoning, and large multimodal models.
Xingxuan Zhang
Alibaba Qwen
Research scientist in Tongyi Lab whose official profile highlights reasoning, large-scale reinforcement learning, and multilingual large language models.
Yang Yao
Alibaba Qwen
Research scientist in Tongyi Lab whose official profile highlights post-training, efficient reasoning, and multilingual large language models.
Zheng Yuan
Alibaba Qwen
Research scientist in Tongyi Lab whose official profile highlights multimodal reasoning, agentic reinforcement learning, and post-training for large language models.
Wei Wang
Moonshot AI / Alibaba Qwen
Public report authorship links Wei Wang to Qwen Technical Report, Qwen-Image-2.0 Technical Report, and Kimi K2.5.
Bailey Kuehl
Ai2
Ai2's public team directory lists Bailey Kuehl, and recent public papers credit Kuehl on OLMo Hybrid and scientific-literature work.
Christopher Wilhelm
Ai2
Research scientist at Ai2 working in NLP and machine learning, with interests in efficient training, pruning, and sparsity.
DJ Matusz
Ai2
Public sources confirm DJ Matusz only as a named author on the OLMo Hybrid report and Lambda's MFU optimization white paper.
Xingxing Zhang
Microsoft
Xingxing Zhang is a researcher at Microsoft Research and a co-author of the BitNet b1.58 2B4T Technical Report.
Xun Wu
Microsoft
Xun Wu is listed as a coauthor of the arXiv paper "Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity," with Microsoft Research shown as the first affiliation.
Yingbo Hao
Microsoft
Public Microsoft-linked sources identify Yingbo Hao as a coauthor on recent efficient language-model technical reports.
Aakanksha Naik
Google Gemini
Research scientist at Google DeepMind working on natural language understanding, generation, question answering, and evaluation, with current interests in multilinguality, attribution, and agent evaluation.
Aalok Mehta
OpenAI
Research engineer on OpenAI's safety systems team. Previously studied at Carnegie Mellon University and is interested in safety, alignment, and evaluation of language models.
Abhimanyu Dubey
Meta AI
Research scientist focused on parallel and distributed machine learning systems.
Abhinav Sethy
Cohere
Cohere researcher working on speech, multilinguality, multilingual speech, and speech translation, and a co-author of the Command A technical report.
Adam Perelman
OpenAI
Software engineer at OpenAI; previously worked at Palantir.
Adrian Barbu
Google Gemini
Researcher working on large language model training and evaluation, including PaLM 2.
Adrian Mollov
Google Gemini
Adrian Mollov is a machine learning engineer and researcher interested in generative models, large language models, reinforcement learning, and scalable multimodal systems; his profile notes experience at Google DeepMind.
Aimin Yang
Moonshot AI
Aimin Yang is a Moonshot AI contributor and co-author of Kimi k1.5: Scaling Reinforcement Learning with LLMs.
Aishwarya Agrawal
Amazon
Assistant professor at the Mila-Quebec AI Institute and the University of Montreal, and Canada CIFAR AI Chair; her research focuses on vision-and-language and multimodal representation learning.
Akila Welihinda
OpenAI
Product professional at OpenAI focused on models, products, and agents; previously led AI transformation and platform efforts at HubSpot.
Alekh Agarwal
Cohere
Principal scientist at Cohere and adjunct associate professor at Columbia University working on reinforcement learning, active learning, and representation learning.
Alex Berard
Google Gemini
Research scientist at Google Research working on machine learning and natural language processing.
Alex Passos
OpenAI
OpenAI contributor credited on the GPT-4 technical report.
Aliaksei Severyn
Cohere
Senior research scientist at Cohere Labs researching multilingual and multimodal large language models.
Alice Meyers
Databricks
Researcher working on open foundation models and evaluation, including DBRX.
Ali Kamali
OpenAI
Senior Software Engineer at OpenAI. Previously studied computer science at the University of Waterloo.
Ali Shoeybi
NVIDIA
Vice president of applied deep learning research at NVIDIA, leading work on foundation models, large language models, computer vision, robotics, and autonomous driving.
Allison Moreira
Google Gemini
Google researcher whose publications include the PaLM 2 technical report and related work on multilingual language models and dialogue systems.
Alvin Wang
OpenAI
Alvin Wang is a software engineer at OpenAI focused on speech and audio. His official profile lists contributions to GPT-4.1, the audio models in the API, Sora, GPT-4, DALL-E 3, and Whisper.
Aman Rangapur
Ai2
Research Engineer at Ai2 on the OLMo team, focused on reliable and interpretable AI, transformers, retrieval, and autonomous agents.
Amin Tootoonchian
OpenAI
Computer systems researcher and GPT-4 coauthor whose public work centers on networking and large-scale distributed systems.
Andrew Healey
Google Gemini
AI researcher at Google DeepMind with interests in natural language processing, machine learning, and neuroscience.
Andrew Kondrich
OpenAI
Member of Technical Staff at OpenAI. Previously worked at Scale AI and studied computer science at Stanford University.
Andrew Mayne
OpenAI
Author and communicator who led science communication work at OpenAI and helped present GPT-4; later continued working on AI communication and startups.
Andrew Peng
OpenAI
Software engineer at OpenAI whose official profile focuses on multimodal models and AI agents.
Andrey Mishchenko
OpenAI
Researcher at OpenAI working on post-training and reasoning, after earlier research on self-supervised learning, reinforcement learning, and robotics.
Andy Sonneveld
Google Gemini
Researcher working on large language model training and evaluation, including PaLM 2.
Anfeng Li
Alibaba Qwen
Anfeng Li works in AI model research at Alibaba DAMO Academy and is a co-author of the Qwen3 Technical Report.
Anna Petrovic
Google Gemini
Google researcher whose publications include the PaLM 2 technical report.
Anthony Hartshorn
Meta AI
Machine learning engineer at Meta working on Llama post-training, with interests in reinforcement learning, generative AI, digital humans, AI at the edge, and ML systems.
Antoine Bosselut
Google Gemini
Antoine Bosselut is an assistant professor in the Language Technologies Institute at Carnegie Mellon University. His research develops language technologies that align with people's social and cultural values and stay robust in realistic settings.
Aparna Cholakkal Hombali
Cohere
Research scientist at Cohere working on machine learning, computer vision, and explainable AI, including the Command A model family.
Archit Narang
Meta AI
Machine learning researcher at Meta AI and visiting scientist at the University of Illinois Urbana-Champaign focused on language models and multimodal learning.
Arian Hosseini
Cohere
Research scientist at Cohere working on language modeling, reasoning, and vision.
Ari Morcos
NVIDIA
Research scientist at NVIDIA working on foundation models, large-scale training, and model capabilities and safety.
Arun Vijayvergiya
OpenAI
Research scientist at OpenAI whose official profile highlights multimodal speech.
Aryo Pradipta Gema
Ai2
Research engineer at Ai2 focused on post-training and data for open language models.
A. X. Liu
DeepSeek
Researcher affiliated with DeepSeek AI on Google Scholar and co-author of the DeepSeek LLM Technical Report.
Balazs Farkas
NVIDIA
NVIDIA researcher with a Google Scholar profile that includes the Nemotron-4 340B technical report.
Baoyuan Zhang
DeepSeek
DeepSeek researcher and coauthor of the DeepSeek LLM Technical Report.
Behdad Tavakoli
Google Gemini
Research scientist at Google DeepMind working on large-scale language models, multimodal learning, and efficient deep learning systems.
Ben Chess
OpenAI
Product leader at OpenAI working on post-training, voice, and applied reasoning, after earlier product work on GPT-4V and developer products.
Benjamin Chess
OpenAI
Engineer and infrastructure specialist who co-authored OpenAI scaling work, led supercomputing at OpenAI, and later joined Magic.
Benjamin Sokolowsky
OpenAI
Researcher at OpenAI working on multimodal models, reinforcement learning, and AI safety.
Bhagyashree N. Patil
Cerebras Systems
Bhagyashree N. Patil is a Research Scientist at Cohere whose public profile lists prior work at Cerebras Systems, Brown NLP, and IIT Bombay.
Bhavana Dalvi
Ai2
Researcher whose public work spans NLP, reasoning, and evaluation; arXiv author results include OLMo and related language model work.
Billie Jonn
OpenAI
OpenAI finance and accounting professional who was publicly credited on the GPT-4 technical report.
Binh Tang
Meta AI
Binh Tang is an AI researcher at Meta. His public homepage highlights work in machine learning, natural language processing, and large-scale AI systems.
Bob McGrew
OpenAI
Research leader and GPT-4 coauthor who joined OpenAI in 2017 and later served as chief research officer after earlier engineering leadership work at Palantir.
Bob Rotsted
OpenAI
Member of Technical Staff at OpenAI with a background in machine learning, computer vision, robotics, and cyber-physical systems.
Bochao Wu
DeepSeek
Research scientist at DeepSeek with public work on large language models and reasoning.
Bogdan Mazoure
Cohere
Research scientist at Cohere working on machine learning and reinforcement learning.
Boris Power
OpenAI
Research scientist at OpenAI designing language environments and agents; previously built peer-to-peer applications and holds a PhD from Oxford.
Brandon Houghton
OpenAI
Strategy and operations leader at OpenAI whose official profile highlights interests in learning, biology, and language.
Brendan Bosma
Google Gemini
Research engineer at Google DeepMind and PhD candidate at UC Berkeley working on multimodal understanding, reasoning, and agentic systems.
Brian Hawkins
Meta AI
Research scientist at Meta working on post-training, alignment, and reinforcement learning for language and multimodal models.
Brittany Carey
OpenAI
Research scientist at OpenAI whose official profile highlights multimodal speech and audio.
Brooke Chan
OpenAI
Software engineer at OpenAI. Previously studied electrical engineering and computer science at the University of California, Berkeley.
Casey Chu
OpenAI
OpenAI contributor credited on the GPT-4 technical report.
Chak Ming Li
OpenAI
Researcher focused on speech AI, computer vision, and audio-visual learning. He has worked at OpenAI and lists his publications and projects on his personal website.
Charles Sutton
Google Gemini
Professor at Carnegie Mellon University whose public research spans machine learning and natural language processing; arXiv author results include the PaLM paper.
Che Chang
OpenAI
Research scientist at OpenAI whose official profile highlights multimodal reasoning and agents.
Chelsea Carlson
OpenAI
Research engineer at OpenAI working on AI agents and coding systems. Previously worked on data and product efforts at Stripe and studied computer science and philosophy at Harvard University.
Chengda Lu
DeepSeek
Research scientist at DeepSeek with public work on language models and multimodal learning.
Chester Cho
OpenAI
Researcher at OpenAI working on reasoning, multimodal systems, and products that make AI more useful in everyday life.
Chris Carothers
Cerebras Systems
Principal engineer at Cerebras working on systems for training large language models.
Christian Gibson
OpenAI
Software engineer at OpenAI whose official profile focuses on speech and audio.
CJ Weinmann
OpenAI
Member of Technical Staff at OpenAI working on reliability engineering. Previously worked on software in cloud, AI, and hardware, and earned a BSE in computer systems engineering from Arizona State University.
Colin Kasson
Amazon
Applied scientist at Amazon working on large language models, responsible AI, and evaluation for foundation model systems.
Colton Wei
Google Gemini
Google researcher whose publications include the PaLM 2 technical report.
Connor Rees
Meta AI
Research scientist at Meta working on reinforcement learning and generative models. His homepage highlights robotics, computer vision, and simulation, alongside work related to Llama 3.
Cory Decareaux
OpenAI
Engineering manager at OpenAI with a background in privacy and security engineering across biotech, payments, and advertising.
Crystal Nam
Ai2
Member of Ai2's data team interested in data management systems and workflow tooling for research and model development.
Daisy Han
Google Gemini
Research scientist at Google Research focused on natural language processing and machine learning.
Daniel Kokotajlo
OpenAI
Former OpenAI governance researcher and GPT-4 coauthor known for AI forecasting and alignment writing.
Daniel Selsam
OpenAI
Research scientist at OpenAI focused on machine learning and theorem proving.
Dave Willner
OpenAI
Investor and advisor at OpenAI; previously on the management team at Stripe.
David Farhi
OpenAI
OpenAI contributor credited for business strategy and investor relations support on GPT-4.
David Golovin
Google Gemini
Principal Scientist at Google DeepMind focused on machine learning systems and AI research.
Dawei Lu
Cohere
Research scientist at Cohere whose public OpenReview profile lists interests in natural language processing, efficiency, and large language models.
Denny Jin
OpenAI
Researcher at OpenAI working on multimodal systems, post-training, and evaluations, with earlier work across 3D, computer vision, and embodied AI.
Denny Zhou
Google Gemini
Research scientist at Google Research working on large language models, natural language processing, and multi-agent systems.
Dmitry Yarats
NVIDIA
Research scientist at NVIDIA and assistant professor at NYU, working on machine learning, deep learning, and robotics.
Dustin Schwenk
Ai2
Research scientist at Ai2 whose work focuses on open language models and multimodal systems.
Eli Collins
Google Gemini
Vice President of Product at Google DeepMind and head of the Google DeepMind product team.
Elie Georges
OpenAI
Works at OpenAI.
Elizabeth Proehl
OpenAI
OpenAI staff contributor whose public research profile spans public policy and frontier evaluations, following earlier operations leadership at OpenResearch.
Emilia Antonucci
Google Gemini
Google researcher whose public profile lists work across machine learning, natural language processing, and generative AI.
Emilie Morand
Cohere
Senior machine learning researcher at Cohere working on pretraining, post-training, and safety for large language models, according to her public OpenReview profile.
Emily Schwenk
Ai2
Final-year PhD candidate at the University of Washington whose public research focuses on evaluation, uncertainty, and multimodal grounding in language models; her arXiv author results include OLMo and OLMES papers.
Emma Hughes
Google Gemini
Researcher working on open language models and multimodal systems, including Gemma.
Emy Parparita
OpenAI
Research scientist at OpenAI focused on machine learning, multimodal world models, and representation learning.
Eric Rehm
Technology Innovation Institute
Principal researcher in the Technology Innovation Institute's AI Cross-Center Unit focused on language model architectures, long-context LLMs, and vision-language models.
Erik Lindholm
NVIDIA
Researcher at NVIDIA working on computer architecture, machine learning, and large language models, including Nemotron-4.
Fahimeh Saleh
Cohere
Research scientist at Cohere working on reasoning, simulation, and agentic AI.
Feliciano Giannone
Google Gemini
Research scientist at Google DeepMind in Cambridge working on large language models and generative AI.
Felipe Petroski Such
OpenAI
Member of Technical Staff at OpenAI focused on data quality, data curation, and training methods for frontier models.
Feng Hu
Alibaba Qwen
Researcher affiliated with Alibaba Group on Google Scholar and coauthor of the Qwen3 technical report.
Fnu Ruchi
Amazon
Principal applied scientist in Amazon AGI whose publications include the Amazon Nova Premier technical report.
Fotis Chantzis
OpenAI
Security engineer at OpenAI with a background in network and information security, including prior work at Nmap, Mayo Clinic, and CENSUS.
Francis Real
OpenAI
Software engineer at OpenAI whose official profile focuses on data and evaluations for language models.
Gabriel Bernadett-Shapiro
OpenAI
Gabriel Bernadett-Shapiro is an OpenAI contributor listed on the OpenAI GPT-4 contributions page.
Gabriel Goh
OpenAI
Research scientist and writer at OpenAI who works on the societal impact and interpretability of advanced AI systems.
Gautier Izacard
Meta AI
Researcher whose public work focuses on retrieval, question answering, and large language models; arXiv author results include the LLaMA paper.
Gregory Piché
Cohere
Researcher working on enterprise language models and multilingual systems, including Cohere Command A and Aya.
Guillem Cucurull
Google Gemini
Research scientist at Google DeepMind focused on machine learning, optimization, computer vision, and AI for science.
Haiming Bao
OpenAI
Haiming Bao is an OpenAI contributor listed on the OpenAI o1 contributions page.
Hamza Nassar
Cohere
Researcher working on enterprise and multilingual language models, including Cohere Command A.
Harish P V
NVIDIA
Researcher at NVIDIA working on computer vision, machine learning, and 3D vision, including Nemotron-4.
Heather Schmidt
OpenAI
Product manager at OpenAI working on bringing AI models into products, after earlier product and go-to-market work at Spotify and Soundtrap.
Hobin Ryu
Meta AI
Research scientist working on generative AI and multimodal intelligence.
Hongrae Lee
Google Gemini
Google researcher whose public profile lists work across data management, distributed systems, data mining, human-computer interaction, and natural language processing.
Ishita Dasgupta
Meta AI
Research scientist at Meta FAIR working on language modeling, generalization, reasoning, and AI agents.
Jade Leung
OpenAI
Jade Leung leads OpenAI's work on frontier model policy, governance, and partnerships. She was formerly a Governance and Public Policy fellow at the University of Oxford.
Jakub Pachocki
OpenAI
Chief Scientist at OpenAI. Previously served as Director of Research at OpenAI and studied computer science at the University of Warsaw.
James Pennington
Google Gemini
Research scientist at Google DeepMind working on large language models, multimodal models, and language design.
Janny Hong
Cohere
Research scientist at Cohere with research interests in computer vision, multimodal learning, and model editing.
Jared Chow
Google Gemini
Google researcher whose publications include the PaLM 2 technical report.
Jared Quincy Davis
Cohere
Research scientist at Cohere working on language models, post-training, and evaluation.
Jasmine Hall
Google Gemini
Staff research engineer at Google DeepMind working on language models and technical infrastructure.
Jason Chen
OpenAI
Software engineer at OpenAI whose official profile highlights reinforcement learning, robotics, and AI for health.
Jason Lee
Cohere
Researcher working on large language models and enterprise NLP systems, including Cohere Command A.
Jeff Belgum
OpenAI
Jeff Belgum is an OpenAI contributor listed on the DALL·E 2 preview contributions page.
Jeffrey Ritter
Google Gemini
Staff software engineer at Google DeepMind interested in systems and applied machine learning.
Jeremiah Currier
OpenAI
Founding engineer at Galactica and former staff software engineer at OpenAI.
Jesse Han
OpenAI
Research scientist at OpenAI whose official profile focuses on multimodal speech.
Jian Cai
NVIDIA
Researcher working on large language models and multimodal systems, including NVIDIA Nemotron-4.
Jiawei Wang
DeepSeek
Research scientist at DeepSeek with public GitHub projects spanning language models and AI systems.
Ji Lin
Google Gemini
Research scientist at Google working on efficient AI models for mobile devices and ML systems, with interests in systems, algorithms, and co-design.
Jingchang Chen
DeepSeek
Researcher at DeepSeek focused on large language models, multimodal systems, and AI agents.
João Monteiro
Cohere
Research scientist at Cohere Labs and technical lead of the multimodal team.
Joel Parish
OpenAI
OpenAI contributor focused on security applications of large language models; public author and event pages identify Joel Parish as a security staff member at OpenAI.
Joe Palermo
OpenAI
AI research engineer at OpenAI focused on reinforcement learning, and co-author of the GPT-4 Technical Report.
Jonathan Tow
Cohere
Jonathan Tow is a research scientist at Cohere whose public profile highlights work on efficient large language model adaptation and finetuning. He works on practical methods for training and aligning language models.
Jonathan Ward
OpenAI
Research scientist at OpenAI working on general intelligence; previously studied at Princeton University and Williams College.
Jong Wook Kim
OpenAI
Member of technical staff at OpenAI who worked on CLIP and Whisper; previously a PhD student in computer science at NYU.
Jordan Sitkin
OpenAI
Founding member of OpenAI's alignment team. Public profile notes prior software engineering work at Google.
Joshua A. Gonzalez
Google Gemini
Researcher working on large language model training and evaluation, including PaLM 2.
Joshua Gross
OpenAI
Product and engineering leader who worked on GPT-4, ChatGPT Enterprise, and advanced data analysis at OpenAI. He later became CTO of Khan Academy.
Joshua Maynez
Google Gemini
Research Scientist at Google working on making language models more factual and controllable.
Justin Gu
Google Gemini
Research scientist at Google DeepMind working on natural language processing and machine learning systems.
Justin Wang
Google Gemini
Google researcher whose publications include the PaLM 2 technical report.
Kai Hu
DeepSeek
Research scientist at DeepSeek with public scholarly work on large language models and reasoning.
Kai Xiao
01.AI
Senior research scientist and research lead at 01.AI; previously worked at Microsoft on language and search systems.
Karan Ahuja
Google Gemini
Research scientist at Google DeepMind in Seattle.
Karen Simonyan
NVIDIA
Research scientist at NVIDIA working on computer vision, deep learning, and reinforcement learning.
Karishma Prasad
Ai2
Research engineer at Ai2 working on open language models, machine learning systems, and evaluation.
Kasra Abbasi
Snowflake
Researcher working on enterprise large language models, including Snowflake Arctic.
Katherine Hui-Ling Chiu
Google Gemini
Research engineer at Google DeepMind whose research interests include computer vision, geometric deep learning, and generative models.
Kautuva Huang
Google Gemini
Research scientist at Google DeepMind working on natural language generation, multimodality, and on-device machine learning.
Kelsey Kuan
Meta AI
Machine learning researcher at Meta focused on Llama model adaptation, post-training, and data-centric learning.
Kendra Rimbach
OpenAI
Staff Research Engineer at OpenAI working on language modeling and reasoning.
Kenny Hsu
OpenAI
Research scientist at OpenAI focused on multimodal language and large language model alignment.
Kevin Gu
Ai2
Research scientist at Ai2 working on large language models, machine learning with human feedback, and related topics; previously at Stanford and MIT.
Kevin Stone
Meta AI
Research scientist at Meta FAIR working on multimodal and speech-language AI; his personal site highlights projects including the Large Concept Model family.
Kuai Yu
DeepSeek
Research scientist at DeepSeek with public work on large language models and reasoning.
Kyle Kosic
OpenAI
Member of technical staff at OpenAI.
Lama Ahmad
OpenAI
Vice President of Global Affairs at OpenAI and a co-author of the GPT-4 Technical Report.
Lauren Workman
OpenAI
Lauren Workman is a research scientist at OpenAI working on deep learning, robotics, and multimodal AI. Her official profile lists contributions across GPT-4.1, o3 and o4-mini, GPT-4o, robotics, and DALL-E 3.
Leandro Almeida
NVIDIA
Research scientist at NVIDIA working on machine learning, natural language processing, and vision-language models, including the NVLM project.
Leonard Berrada
Google Gemini
Research scientist at Google DeepMind in London working on efficient, robust, and reasoning-capable language models. His homepage also notes interests in language model alignment and a mathematics background from Cambridge.
Lifu Tu
Google Gemini
Research scientist at Google Research working on large language models, natural language understanding, and reasoning.
Lilit Yasakova
Amazon
Researcher with public work on fairness, robustness, and language technologies. Public profiles connect her research to both UNC Charlotte and Amazon.
Luca Muttini
Amazon
Applied scientist at Amazon working on large language models, multimodal AI, and audio generation.
Lucas Atkins
Cohere
Research scientist at Cohere working on machine learning and natural language processing, including the Command A model family.
Łukasz Kaiser
OpenAI
Research scientist and engineer whose public work spans deep learning, language models, and efficient inference.
Maddie Simens
OpenAI
Member of Technical Staff at OpenAI with interests spanning AI for education, machine learning, and human-computer interaction.
Marc Greenberg
Amazon
Researcher working on foundation models and agentic systems, including Amazon Nova Premier.
Mario Saltarelli
OpenAI
Product designer at OpenAI whose official profile explores human-computer interaction.
Marta Garnelo
Google Gemini
Research scientist at Google DeepMind whose public profiles describe work in machine learning and generative modeling, with a personal homepage covering neural networks, variational inference, and program synthesis.
Matthew Botvinick
Google Gemini
Matthew Botvinick is a Google DeepMind scientist whose public profile focuses on neuroscience-inspired AI, cognitive science, and machine learning.
Matthias Lampe
Google Gemini
Research scientist at Google working on large-scale language models and natural language processing.
Matt Knight
OpenAI
Technical and creative leader who has worked across games, startups, and AI products. He has held engineering and product roles at OpenAI and previously worked in the Los Angeles games industry.
Matt Wiethoff
OpenAI
Member of technical staff at OpenAI whose official profile focuses on deep learning, robotics, and speech.
Melanie Subbiah
OpenAI
Melanie Subbiah is an independent AI researcher based in New York City. Her work focuses on language and multimodal AI, especially commonsense reasoning and communicative behavior.
Mia Xu Chen
Google Gemini
Research scientist at Google Research working on machine learning and natural language processing.
Michael Kasner
Google Gemini
Research scientist at Google DeepMind working on AI for language.
Michael Schmitz
Ai2
Research scientist at Ai2 whose interests include natural language processing, machine learning, and computer vision.
Miguel Larrauri Iñigo
Google Gemini
Research engineer at Google DeepMind working on large language models and generative AI.
Mike Ranzinger
Cohere
Research scientist at Cohere focused on language modeling, post-training, and multimodality.
Mikhail Chernyshev
Meta AI
Mikhail Chernyshev is a systems engineer focused on efficient distributed systems for large-scale deep learning, spanning compiler stacks, networking, operating systems, and hardware.
Mirko Tuset
Amazon
Applied scientist in Amazon AGI whose publications include the Amazon Nova Premier technical report.
Mohammad Shoeybi
Amazon
Senior principal scientist at Amazon AGI focusing on efficient training and inference of foundation models, large language models, and multimodal models.
Morgan Grafstein
OpenAI
Morgan Grafstein is a software engineer and artist at OpenAI. His personal site says his work spans hardware, software, and rendering.
Mourad Si Bac
NVIDIA
Director of engineering at NVIDIA working on machine learning, computer vision, and agentic AI systems.
Mrinal Kalakrishnan
Google Gemini
Research scientist at Google DeepMind working on robot learning, real-world robotics, and reinforcement learning.
Mukul Agrawal
NVIDIA
Research scientist at NVIDIA working on training large language and multimodal models.
Mustafa Hajij
Ai2
Mustafa Hajij is a research scientist at Ai2 and an adjunct professor in the Department of Computer Science at the University of Southern Maine. His research spans graph machine learning, geometric learning, and applied mathematics.
Namit Katariya
Meta AI
Namit Katariya is a computer science PhD student at Stanford University. His interests include large-scale machine learning, natural language processing, ranking, and recommendation systems.
Neeraj Gaur
Google Gemini
Staff software engineer at Google specializing in machine learning model development and performance optimization for TPUs and other accelerators.
Nenad Tomasev
Google Gemini
Research scientist and research director at Google DeepMind working on machine learning for health, sustainability, and human-centered AI.
Ngoc-Quan Pham
Cohere
Ngoc-Quan Pham is a research scientist at Cohere whose work spans large language models, natural language generation and understanding, multimodal learning, continual learning, and transfer learning.
Niket Tandon
Snowflake
Niket Tandon is a Research Scientist at Snowflake AI Research whose work focuses on natural language processing, reasoning, and language models.
Nikhil Chandra
NVIDIA
Researcher at NVIDIA working on computer vision and machine learning, including large language model work such as Nemotron-4.
Niko Felix
OpenAI
Engineer at OpenAI whose official profile focuses on compute systems and inference.
Nikolas Nado
Google Gemini
Research Scientist at Google DeepMind focused on post-training large language models and evaluation.
Nikolas Tezak
OpenAI
OpenAI contributor credited on the GPT-4 technical report; his personal website describes interests in machine learning, software engineering, and data visualization.
Nikolay Bashlykov
Meta AI
Research engineer whose public work spans language models and multimodal systems; arXiv author results include the Llama 2 paper.
Nishank Kedia
NVIDIA
NVIDIA researcher with a Google Scholar profile that includes the Nemotron-4 340B technical report.
Nitish Shirish Keskar
OpenAI
Member of Technical Staff at OpenAI and co-author of the GPT-4 Technical Report.
Noah Shazeer
Google Gemini
Google researcher and engineer best known as a co-inventor of the Transformer and a contributor to systems including T5, LaMDA, and Gemini.
Oleg Boiko
OpenAI
Oleg Boiko is an OpenAI contributor listed on the OpenAI GPT-4 contributions page.
Oleg Murk
OpenAI
Research scientist at OpenAI and contributor to the o1 reasoning launch.
Pablo Foradada
Technology Innovation Institute
Researcher in the Technology Innovation Institute's AI Cross-Center Unit focused on large language models, multimodality, and visual representation learning.
Parker Schuh
Google Gemini
Researcher working on large language model scaling and evaluation, including Google PaLM.
Paul McMillan
OpenAI
Software engineer at OpenAI whose official profile focuses on multimodal models.
Pavankumar Balachandran
NVIDIA
Research scientist at NVIDIA working on large language model training and machine learning systems.
Petar Veličković
Google Gemini
Petar Veličković is a Senior Staff Research Scientist at Google DeepMind and an affiliated lecturer at Cambridge. His research explores building general-purpose reasoning agents by combining inference, search, and learning.
Peter Albert
Meta AI
Postdoctoral researcher at FAIR with interests in multilingual and multimodal language models, including work associated with Llama 2.
Peter Hoeschele
OpenAI
OpenAI infrastructure leader focused on industrial compute, large-scale AI systems, and strategic buildout of compute and energy capacity.
J. Michael Pokorny
OpenAI
J. Michael Pokorny is a software engineer at OpenAI focused on inference. His official profile lists contributions to GPT-4.1, the Realtime API, API cost optimizations, GPT-4o, and gpt-oss.
Praneet Dutta
Cohere
Praneet Dutta is a researcher affiliated with Cohere whose publications focus on large language models and related model development work.
Pratik Chaudhari
NVIDIA
Associate Professor at the University of Pennsylvania working on machine learning, optimization, and large-scale AI systems.
Praveen Kumar Challa
Cerebras Systems
Praveen Kumar Challa is a software engineer at Cerebras Systems whose interests include machine learning, large language models, and efficient systems.
P. S. Huang
Cohere
Researcher currently at UC Berkeley whose homepage notes prior work at Cohere on pretraining and post-training for large language models.
Puja Pudipeddi
Google Gemini
Research scientist at Google working on speech, language, and multimodal machine learning systems.
Qiming Yuan
OpenAI
Scientist and engineer at OpenAI working to build AGI that benefits all of humanity.
Quoc Le
Google Gemini
Research scientist at Google Research known for large-scale deep learning and language models; arXiv author results include the PaLM 2 technical report.
Rachel Lim
OpenAI
Rachel Lim is a technical staff manager at OpenAI focused on speech and audio. Her official profile lists contributions across GPT-4.1, ChatGPT search, deep research, the audio models in the API, GPT-4o mini, GPT-4.5, and GPT-4.
Radek Strope
Google Gemini
Research scientist at Google DeepMind interested in large language models, efficient transformers, AI for coding, and AI4Mat.
Rajeev Nayak
OpenAI
Research scientist at OpenAI working across computer vision, robotics, and language.
Ramesh Nallapati
Google Gemini
Research scientist at Google Research focused on machine learning and natural language processing, including text generation and summarization.
Rami Globerson
Google Gemini
Research scientist at Google Research and professor at Tel Aviv University whose work spans machine learning, optimization, and natural language processing.
Rashmi Gangadhar
NVIDIA
Research scientist at NVIDIA working on large language models and natural language processing.
Rewon Child
OpenAI
Rewon Child is a machine learning researcher whose public homepage notes previous work at OpenAI and current work on developer tools. His public research profile highlights sparse transformers and generative modeling.
Rishabh Joshi
Google Gemini
Google researcher whose publications include the PaLM 2 technical report.
Robin Sikand
Google Gemini
Robin Sikand is a machine learning researcher at Google working mostly on large language models.
Rosie Campbell
OpenAI
DPhil student in robotics and AI at the University of Oxford and Rhodes Scholar interested in the safe deployment of autonomous systems.
Ruby Chen
OpenAI
Software engineer at OpenAI whose official profile focuses on speech and audio.
Ruibin Chen
Tencent Hunyuan
Researcher at Tencent Hunyuan with public publications on large language models, multimodal systems, and efficient post-training, including Hunyuan-Large, LLaVA-Video-R1, and Rest-R1.
Ryan Greene
OpenAI
Chief Global Affairs Officer at OpenAI.
Sam Manning
OpenAI
Systems engineer at OpenAI working on production AI systems.
Sam Skjonsberg
Ai2
Research engineer at Ai2 working on open language models, post-training, evaluations, and data.
Samuel Wolrich
OpenAI
PhD student in EECS at UC Berkeley advised by David Wagner and Jacob Steinhardt, with research focused on trustworthy machine learning and AI safety.
Sana Saxena
Meta AI
Research scientist at Meta working on large language models, foundation models, and reasoning.
Sarah Collier
Google Gemini
Research scientist at Google DeepMind in Mountain View focused on AI red teaming, post-deployment risk management, and language model evaluation. She previously worked in cyber policy and software engineering and earned a PhD from the NYU Center for Data Science.
Sarah Shoker
OpenAI
Researcher whose public work focuses on AI safety, alignment, and governance.
Sasha Tsvyashchenko
Google Gemini
Research scientist at Google DeepMind working on large language models and artificial intelligence, based on his public OpenReview profile.
Sean K. R. Tan
Meta AI
Incoming assistant professor in Carnegie Mellon's School of Computer Science and current research scientist at Meta AI, working on long-context language models and reasoning.
Sean Narenthiran
Cohere
Researcher at Cohere working on machine learning and natural language processing, including the Command A model family.
Shanghang Zhang
01.AI
Associate Professor at Peking University and Principal Researcher at 01.AI, working on computer vision and embodied AI including Yi-Lightning.
Shawn Jain
OpenAI
Researcher at OpenAI focused on reasoning, agency, and post-training to make AI systems more useful.
Sheila Dunning
OpenAI
Sheila Dunning is an operations leader at OpenAI whose official profile focuses on helping research, governance, and safety teams work effectively. Her public contributions page lists work across GPT-4.1, the Preparedness Framework, safety and policy efforts, and GPT-4.
Shuang Luo
Alibaba Qwen
Researcher affiliated with Alibaba Group on Google Scholar and coauthor of the Qwen3 technical report.
Sibylle Hess
Google Gemini
Senior research scientist at Google DeepMind in Zurich working on the Gemini team.
Sinan Tan
Alibaba Qwen
Research scientist on the Alibaba Qwen Team and co-author of the Qwen Technical Report.
Siva Reddy
Snowflake
Researcher whose public work focuses on natural language processing, reasoning, and machine learning.
Soroush GholamZadeh
Mistral AI
Research scientist at Mistral AI.
Spencer Kinney
Ai2
Research scientist at Ai2 whose research interests include language grounding and visual understanding.
Sreekanth K. Mandava
NVIDIA
Research scientist at NVIDIA with interests in machine learning systems, deep learning, and distributed systems.
Stéphane Albinet
Google Gemini
Staff research engineer at Google DeepMind in London working on the Gemini team.
Stephanie Lin
OpenAI
Research scientist at OpenAI working on reasoning, code generation, and interpretability.
Sujith Raghuraman
Google Gemini
Research scientist at Google DeepMind working on machine learning and large language models.
Sumanth Doddapaneni
Databricks
Member of technical staff at Databricks working on open source language models.
Szymon Sidor
OpenAI
Engineer at OpenAI focused on software and hardware systems.
Tabarak Khan
OpenAI
Member of technical staff at OpenAI and co-author of the GPT-4 Technical Report.
Tarun Gogineni
OpenAI
Research scientist at OpenAI working on AI safety; previously a graduate student at UC Berkeley studying machine learning and neural signal processing.
Taylor Dai
Cohere
Research scientist at Cohere focused on long-context language models and efficient training.
Teddy Lee
OpenAI
Member of technical staff at OpenAI whose official profile highlights multimodal models, audio, and video.
Theresa Lopez
OpenAI
Software engineer at OpenAI.
Thibault Biervoye
Google Gemini
Thibault Biervoye is a Research Scientist at Google DeepMind focused on large language models, memory-augmented transformers, and knowledge transfer.
Thibaut Lavril
Meta AI
Research scientist at Meta working on multimodal foundation models and open-source AI. His homepage says he leads multimodal efforts at Meta AI and contributes to open-source model releases.
Thomas Cho
NVIDIA
NVIDIA researcher with a Google Scholar profile that includes the Nemotron-4 340B technical report.
Tianyi Zhang
Cohere
Researcher at Cohere in Toronto working on large language models.
Timothy I. Wang
Google Gemini
Machine learning researcher and engineer working on large-scale AI systems, and a co-author of the PaLM 2 technical report.
Tobias Fevry
Amazon
Senior applied scientist at Amazon AGI focused on multimodal deep learning and computer vision, especially video generation, diffusion, and representation learning.
Toki Sherbakov
OpenAI
OpenAI researcher and software engineer based in San Francisco, and co-author of the GPT-4 Technical Report.
Tom Bosc
Mistral AI
Research scientist at Mistral AI working on efficient training and low-latency inference for large models.
Tomer Ronen
Cohere
Tomer Ronen is a research scientist at Cohere Labs whose work focuses on efficient training and fine-tuning of large language models, natural language processing, and applications in genomics and biology.
Tongzheng Ren
DeepSeek
Research scientist at DeepSeek working on deep learning, large language models, multimodal systems, and trustworthy machine learning.
Ty Dunn
Ai2
Research scientist at Ai2 working on efficient and transparent large language models.
Valerie Balcom
OpenAI
Senior researcher at OpenAI whose official profile focuses on AI robustness and the future of compute.
Victoria McGough
Google Gemini
Google researcher whose publications include the Gemma 2 technical report.
William Zhuk
OpenAI
William Zhuk is an OpenAI contributor credited on the GPT-4 Technical Report and listed on the OpenAI GPT-4 contributions page.
Xiaokang Zhang
DeepSeek
Researcher and co-author of the DeepSeek-V3 Technical Report.
Xiao Wang
Moonshot AI
Xiao Wang is a Moonshot AI contributor and co-author of Kimi k1.5: Scaling Reinforcement Learning with LLMs.
Xinyuan Li
DeepSeek
Developer and researcher with public GitHub activity on DeepSeek-AI repositories including DeepSeek-V3 and DeepEP, plus coauthor credit on the DeepSeek-V3 technical report.
Xinyu Zhou
01.AI
Researcher working on language model efficiency and reasoning, including Yi-Lightning.
Xi Victoria Lin
Cohere
Research scientist at Cohere and associate at Harvard Business School whose work focuses on natural language processing and multimodal machine learning.
Xixuan Song
Moonshot AI
Xixuan Song is a Moonshot AI contributor and co-author of Kimi k1.5: Scaling Reinforcement Learning with LLMs.
Yaliang Li
01.AI
Senior research scientist and research lead at 01.AI; previously a senior principal researcher and research manager at Microsoft Research.
Yanfei Huang
Google Gemini
Software engineer at Google DeepMind interested in machine learning, applied mathematics, and scalable systems.
Yifan Shi
DeepSeek
Member of Technical Staff at DeepSeek.
Yilong Zhao
DeepSeek
Researcher affiliated with DeepSeek on Google Scholar and coauthor of the DeepSeek-V2 technical report.
Yingtao Zhang
Moonshot AI
Yingtao Zhang is a Moonshot AI contributor and co-author of Kimi k1.5: Scaling Reinforcement Learning with LLMs.
Ying Zhang
NVIDIA
Ying Zhang is a senior applied research scientist on NVIDIA's foundation model team.
Yining Wang
Cohere
Researcher working on large language models and enterprise NLP systems, including Cohere Command A.
Yiping Cao
Google Gemini
Researcher working on large language model training and evaluation, including PaLM 2.
Yixin Dong
DeepSeek
Final-year PhD student at Peking University and research intern at DeepSeek AI, focusing on machine learning and large language models.
Yi Yu
DeepSeek
Researcher affiliated with DeepSeek-AI on Google Scholar and coauthor of the DeepSeek-V3 technical report.
Youlong Cheng
Google Gemini
Software engineer at Google DeepMind and PhD student at UCLA focusing on efficient large language models.
Yuanzhong Xu
Google Gemini
Research scientist at Google working on large language models, AI agents, systems for machine learning, and optimization.
Yuchen He
OpenAI
Yuchen He is a technical staff manager at OpenAI focused on speech and audio. His official profile lists contributions across GPT-4.1, the audio models in the API, Sora, GPT-4, DALL-E 3, and Whisper.
Yue Wu
Google Gemini
Research scientist at Google Research focused on language and multimodal models, with interests spanning natural language processing, computer vision, and speech.
Yulia Tsvetkov
Ai2
Associate professor at Carnegie Mellon University whose public research spans NLP and machine learning; arXiv author results include OLMo and related open language model work.
Yuqing Cao
Google Gemini
Research scientist at Google DeepMind working on post-training, alignment, and multilinguality.
Yury Zemlyanskiy
Google Gemini
Staff Research Scientist at Google DeepMind working on foundation models for language, speech, and multimodal systems.
Yutao Zhou
Moonshot AI
Applied research scientist at Moonshot AI Labs working on large language model reasoning and multimodal systems.
Yu Wu
DeepSeek
AI researcher at DeepSeek focused on reasoning and large language models, according to his public homepage and DeepSeek author listings.
Zejiang Shen
Ai2
Zejiang Shen is a research scientist at the Allen Institute for AI whose interests include representation learning, generative AI, and multimodal learning.
Zekun Wang
Alibaba Qwen
Research Scientist at Alibaba Group, Qwen Team.
Zhaofeng Wu
Ai2
PhD student in computer science at Stanford University working on natural language processing and machine learning.
Zhibin Gou
DeepSeek
Researcher and co-author of the DeepSeek-V3 Technical Report.
Zhifu Wu
DeepSeek
DeepSeek researcher and coauthor of the DeepSeek-V3 technical report.
Zhigang Yan
DeepSeek
Researcher affiliated with DeepSeek AI on Google Scholar and coauthor of the DeepSeek-V3 technical report.
Zhiyue Fan
Cohere
Member of technical staff at Cohere Labs working on reinforcement learning and post-training for language models and agents.
Zora Tung
Google Gemini
Zora Tung is a Research Scientist at Google DeepMind.