Researcher 7 reports

Luca Soldaini

Ai2

Lead research scientist at Ai2 and a founding member of the OLMo team. He works on open language models, language agents, evaluation, and reasoning, and helps build open-model research infrastructure.

Researcher 13 reports

Daya Guo

DeepSeek / Moonshot AI

AI researcher at DeepSeek working on natural language processing, code intelligence, and large language model reasoning.

Researcher 7 reports

Amanda Askell

Anthropic / OpenAI

Amanda Askell is a philosopher and AI alignment researcher at Anthropic. Her personal site says she previously worked as a research scientist on the policy team at OpenAI.

Researcher 5 reports

Baosong Yang

Alibaba Qwen

Senior Algorithm Expert in Alibaba Tongyi Lab's Language Technology Lab and Qwen Team member whose work focuses on multilingual large language models and machine translation.

Researcher 5 reports

Mona Jalal

NVIDIA

Mona Jalal is a computer vision research engineer at Toyota Material Handling. Her homepage says she leads 3D computer vision work such as 6D object pose estimation and synthetic data generation, previously worked as an R&D engineer at UC Berkeley's FHL VIVE Center, completed doctoral study in computer vision at Boston University, and earlier earned master's degrees in computer sciences and electrical engineering from the University of Wisconsin-Madison.

Researcher 3 reports

Yuxiang Zheng

Z.ai

Yuxiang Zheng is a Ph.D. student in electrical and computer engineering at Memorial University of Newfoundland. The homepage says Yuxiang Zheng received double B.Eng. degrees in electronics and electrical engineering from the University of Glasgow and the University of Electronic Science and Technology of China in 2021, an M.Sc. with distinction in communications and signal processing from Imperial College London in 2022, and studies reinforcement learning and quantum machine learning for 6G vehicular networks.

Researcher 4 reports

Ming-Yu Liu

NVIDIA

Ming-Yu Liu is a vice president of research at NVIDIA and an IEEE Fellow. He leads the Deep Imagination Research group, which focuses on deep generative models for content creation and foundation models for physical AI. He previously held research leadership roles at Mercedes-Benz Research and Development North America and National Tsing Hua University, and he earned degrees from National Taiwan University, National Chiao Tung University, and the University of Illinois Urbana-Champaign.

Researcher 4 reports

Xinyun Chen

Google Gemini / Meta AI

Xinyun Chen's homepage identifies her as an AI research scientist at Meta Superintelligence Labs, previously a staff research scientist at Google DeepMind. It also lists a PhD in Computer Science from UC Berkeley and a BS in Computer Science from Shanghai Jiao Tong University.

Researcher 3 reports

Xueyan Zou

Z.ai

Xueyan Zou is an assistant professor in the College of AI at Tsinghua University. Her official Tsinghua page says she works on interactive embodied intelligence powered by world models, dexterous control and sensing, and embodied foundation models, after receiving a Ph.D. in computer science from the University of Wisconsin-Madison and then becoming a postdoctoral fellow at UC San Diego's Contextual Robotics Institute.

Researcher 3 reports

Arash Vahdat

NVIDIA

Arash Vahdat is a research director at NVIDIA Research, where he leads the GenAIR team on fundamental generative AI research. Before joining NVIDIA, he was a research scientist at D-Wave Systems and a research faculty member at Simon Fraser University, where he led video analysis research and taught machine learning for big data. He earned a PhD in computing science from Simon Fraser University.

Researcher 3 reports

Haoyu Lu

DeepSeek / Moonshot AI

Haoyu Lu is a Ph.D. student at Renmin University of China working on multimodal foundation models and video understanding. His homepage highlights papers and code including DeepSeek-VL, UniAdapter, and VDT.

Researcher 3 reports

Hyung Won Chung

Google Gemini / OpenAI

Research Scientist at OpenAI focused on reasoning and agents. Previously at Google Brain, he worked on T5X, PaLM, Flan-PaLM, and Flan-T5, and he earned a PhD at MIT.

Researcher 3 reports

Dilek Hakkani-Tur

NVIDIA

Dilek Hakkani-Tur is a Professor of Computer Science at the University of Illinois Urbana-Champaign and an Amazon Scholar at Amazon Health Science. Her UIUC faculty profile says her research interests include conversational AI, natural language and speech processing, spoken dialogue systems, and machine learning for language processing.

Researcher 2 reports

Jialong Tang

Z.ai

Jialong Tang works at Alibaba Tongyi in Hangzhou. The homepage says Jialong Tang received a Ph.D. from the Chinese Information Processing Laboratory at the Institute of Software, Chinese Academy of Sciences in 2023, a bachelor's degree from Xiamen University in 2018, and studies natural language processing, event understanding, multilingual systems, and Qwen large language models.

Researcher 3 reports

Xinyu Zhang

Alibaba Qwen

Research scientist at Tongyi Lab, Alibaba Group, working on multimodal large language models, machine reasoning, and efficient learning.

Researcher 2 reports

Alexandre Ramé

Ai2

Alexandre Ramé is a research scientist at Google DeepMind and an adjunct professor at École Polytechnique. His homepage says he previously held research roles at NYU and SCAI / Sorbonne Université, completed a PhD in machine learning at École Polytechnique and ENS Paris-Saclay, and works on post-training and alignment for Gemma LLMs.

Researcher 2 reports

Benyou Wang

MiniMax

Benyou Wang is an assistant professor at the Chinese University of Hong Kong, Shenzhen. His official school profile lists academic areas in artificial intelligence, computer science, and speech and natural language processing, and his personal homepage says he leads the CUHKSZ LLM group after earning a Ph.D. in information engineering from the University of Padua plus earlier degrees from Tianjin University and Hubei University of Automotive Technology.

Researcher 2 reports

Brian Lester

Google Gemini / Cohere

Brian Lester is a Senior Research Engineer at Google Brain and a PhD student at the University of Toronto advised by Colin Raffel. His public work focuses on efficient adaptation of large language models, including prompt tuning and related methods.

Researcher 2 reports

Kartikeya Mangalam

NVIDIA

Kartikeya Mangalam is a PhD student in computer vision at UC Berkeley advised by Jitendra Malik. His public homepage says he previously held a visiting researcher role at Meta AI, studied computer science at Stanford, graduated summa cum laude in electrical engineering with a machine learning minor from IIT Kanpur, and works on computer vision, video understanding, and human motion analysis.

Researcher 2 reports

Wanli Ouyang

MiniMax

Wanli Ouyang is a professor at Shanghai AI Laboratory. His homepage says he is also with MMLab and the SIGMA lab, obtained a PhD from the Chinese University of Hong Kong, and works on AI4Science, computer vision, and pattern recognition.

Researcher 1 reports

Ashvin Nair

OpenAI

Researcher at OpenAI. He recently completed a PhD at UC Berkeley's BAIR lab, where he worked on machine learning for robotics, offline reinforcement learning, and self-supervised reinforcement learning.

Researcher 2 reports

Kyle Richardson

Ai2 / Cohere

Senior research scientist at the Allen Institute for Artificial Intelligence working on natural language processing, machine learning, and reasoning on the Aristo project; previously a researcher at the University of Stuttgart.

Researcher 1 reports

Matei Zaharia

NVIDIA

Computer scientist, UC Berkeley professor, and Databricks co-founder and CTO. He created Apache Spark and works on systems for large-scale data, machine learning, and compound AI applications.

Researcher 2 reports

Tulsee Doshi

Google Gemini

Tulsee Doshi is a Senior Director of Product Management at Google DeepMind and currently leads product for Gemini Model. She previously served as Head of Product for Responsible AI at Google and holds both an M.S. and a Ph.D. in Symbolic Systems from Stanford.

Researcher 1 reports

Jason Phang

Cohere

Jason Phang is a researcher at OpenAI working on the Foundations team. His homepage highlights transfer learning and natural language understanding with large-scale language models.

Researcher 1 reports

Juntang Zhuang

OpenAI

Juntang Zhuang leads pretraining at xAI and previously worked at OpenAI. His homepage highlights Grok pretraining work and contributions to GPT-4o, GPT-4 Turbo long-context, DALL-E 3, and OpenAI embeddings.

Researcher 1 reports

Minhao Cheng

Google Gemini

Minhao Cheng is an assistant professor in the College of Information Sciences and Technology at Penn State. His homepage says he earned a PhD in computer science from UCLA under Cho-Jui Hsieh, served as an assistant professor at HKUST before joining Penn State in Spring 2024, and broadly studies machine learning with a focus on trustworthy machine learning and AutoML.

Researcher 1 reports

Sang Michael Xie

NVIDIA

Researcher at OpenAI focused on data-centric methods for foundation models, including synthetic data and reinforcement learning. Previously a research scientist at Meta GenAI; earned BS, MS, and PhD degrees in computer science at Stanford.

Researcher 1 reports

Sarath Chandar

Google Gemini

Associate professor at Polytechnique Montreal, core academic member at Mila, and CIFAR AI Chair whose research spans continual learning, reasoning, meta-learning, and reinforcement learning.

Researcher 1 reports

Tim Brooks

OpenAI

AI researcher who co-created Sora at OpenAI and invented InstructPix2Pix during his PhD at Berkeley AI Research.

Researcher 1 reports

Tong Wu

DeepSeek

Tong Wu is a researcher at BIGAI in Beijing. His homepage says he works on diffusion language models, long context, and long sequence generation, and that he holds a master's from Tsinghua University and a bachelor's from Beijing Institute of Technology.

Researcher 1 reports

Utku Evci

NVIDIA

Utku Evci is a researcher at Google. His public Google Research profile describes work on making neural network training faster and more efficient, with public materials centered on sparsity, transfer learning, and machine intelligence.

Researcher 1 reports

David Mély

OpenAI

Researcher and engineer working on artificial general intelligence; his public experience page lists him as a Member of Technical Staff at OpenAI, after roles at Google/X and Vicarious.

Researcher 1 reports

Joost Huizinga

OpenAI

OpenAI researcher whose work spans alignment, machine learning, complexity science, open-ended evolution, and artificial life.

Researcher 1 reports

David Brandfonbrener

Google Gemini

Member of Technical Staff at Anthropic working on reinforcement learning for LLMs; previously at Meta FAIR and the Kempner Institute at Harvard, with a Ph.D. from NYU and undergraduate study at Yale.

Researcher 13 reports

Junyang Lin

Alibaba Qwen

Junyang Lin (Justin Lin) is a researcher and open-source maintainer known for the Qwen family of models. His public profiles list interests in LLMs, AI agents, multimodal learning, long-horizon reasoning, world models, and reinforcement learning; multiple March 2026 news reports said he stepped down from the Qwen tech lead role.

Researcher 1 reports

Chinmay Hegde

NVIDIA

Chinmay Hegde is an associate professor at the NYU Tandon School of Engineering. His public homepage and NYU faculty profile say he works on machine learning, optimization, and inverse problems, joined NYU in 2021 after three years at Iowa State University, previously completed a postdoc at Stanford University, and earned a PhD in electrical and computer engineering from Georgia Tech in 2016.

Researcher 1 reports

Jason Yosinski

Google Gemini

Scientist at OpenAI and president of the ML Collective research group, focused on understanding and controlling large neural networks. He completed a PhD at Cornell and previously helped start Uber AI and later founded Windscape AI.

Researcher 1 reports

Kelvin Guu

Cohere

Research leader at Google DeepMind working on language models, retrieval-augmented generation, post-training, and factuality. His homepage highlights foundational contributions including REALM, FLAN, and Gemini factuality work.

Researcher 5 reports

Jean-Baptiste Alayrac

Google Gemini / Meta AI

DeepMind researcher working on machine learning, computer vision, and structured learning from video and language.

Researcher 3 reports

Le Yu

Alibaba Qwen

Algorithm Engineer at the Qwen Team, Alibaba Group. He received his Ph.D. in computer science from Beihang University in 2024 and works on post-training alignment for Qwen models, with research interests in large language models, graph neural networks, and recommender systems.

Researcher 3 reports

Hao Zhang

Moonshot AI / NVIDIA

Researcher at NVIDIA Research. Previously a PhD student in Computer Science and Engineering at HKUST, with earlier internships at International Digital Economy Academy and Microsoft Research.

Researcher 6 reports

Luke Zettlemoyer

Ai2

Luke Zettlemoyer works on empirical methods for natural language semantics, machine learning, new tasks and datasets, and self-supervision for pre-training.

Researcher 2 reports

Shyamal Anadkat

OpenAI

Engineer and product leader who worked on OpenAI's Applied AI team and now advises startups on AI products. He writes publicly about agents, retrieval, and evaluation on his personal site.

Researcher 2 reports

Yang Song

OpenAI / Alibaba Qwen

Research Principal at Meta Superintelligence Labs. He previously led the strategic explorations team at OpenAI and is known for foundational work on score-based diffusion models.

Researcher 2 reports

Rowan Zellers

Google Gemini / OpenAI

Researcher at Thinking Machines, formerly at OpenAI, working on real-time multimodal systems across language, vision, and voice.

Researcher 1 reports

Adrià Garriga-Alonso

Google Gemini

Adrià Garriga-Alonso is a research scientist at Google DeepMind focused on machine learning and AI safety. His public profile notes a PhD in computer science from the University of Cambridge in 2020.

Researcher 1 reports

Adrien Ecoffet

OpenAI

Research scientist at OpenAI working on AI agents, large models, and reinforcement learning. Previously worked as an applied scientist at Uber and completed a PhD in computer science at ENS Paris-Saclay under Marc Schoenauer and Olivier Teytaud.

Researcher 2 reports

Claire Cui

Google Gemini

Claire Cui is a Google Fellow in the Google Brain team, where she leads research on deep generalist learning, large language models, multimodal learning, and interpretable AI. During her tenure at Google she helped found AdSense for Content and co-founded Google Health Research and Medical Brain; she holds a Ph.D. in computer science from Stanford and a B.S. in computer science from Tsinghua University.

Researcher 2 reports

Hamish Ivison

Ai2

PhD student at the University of Washington advised by Luke Zettlemoyer and researcher at Ai2. His work focuses on open language models, instruction tuning, data curation, and evaluation.

Researcher 2 reports

Valentina Pyatkin

Ai2

Postdoctoral researcher at the Allen Institute for AI and the University of Washington whose work focuses on open language model post-training, steerability, and evaluation.

Researcher 1 reports

William Merrill

Ai2

TTIC and William Merrill's homepage describe him as currently at the Allen Institute for AI and incoming to TTIC in Fall 2026 after a PhD from New York University. Public profile pages describe research spanning transformer expressivity, formal language theory, computational linguistics, and language models.

Researcher 1 reports

Lester James V. Miranda

Ai2

Lester James V. Miranda previously worked as a Predoctoral Young Investigator at Ai2 and as a machine learning engineer on the spaCy team at ExplosionAI.

Researcher 1 reports

Yonggan Fu

NVIDIA

Public profiles say he completed a Georgia Tech PhD in 2025 after earlier study at Rice and USTC, and his current work focuses on bringing frontier AI to everyday devices.

Researcher 1 reports

Xiaohuan Zhou

Alibaba Qwen

Alibaba Qwen report author with public bibliography links for work on large language models, multimodal foundation models, and AI agents.

Researcher 1 reports

Xin Hu

OpenAI

Xin Hu is a Member of Technical Staff at OpenAI focused on privacy and security. Xin previously held cybersecurity and machine learning roles at Robinhood, Pinterest, IBM Research, and Intel, and earned a PhD in Computer Science and Engineering from the University of Michigan.

Researcher 6 reports

Binyuan Hui

DeepSeek / MiniMax

Staff research scientist at Alibaba's Qwen Team and initiator of OpenDevin, focused on foundation models, reasoning models, coding agents, and computer-use agents.

Researcher 5 reports

Bowen Yu

Alibaba Qwen

Bowen Yu is an Algorithm Expert at Qwen (Alibaba Group) working on LLM post-training and automated alignment.

Researcher 4 reports

Sanjiv Kumar

Google Gemini / NVIDIA

Sanjiv Kumar is a Google Fellow and vice president at Google Research. His public homepage says he leads teams working on large machine learning foundation models and generative AI, has spent more than 25 years building machine learning systems and products, and received a PhD in computer science from Carnegie Mellon University in 2005.

Researcher 1 reports

Shantanu Jain

OpenAI

Shantanu Jain has worked on language models at OpenAI since 2020, focusing on training data and infrastructure. His homepage also highlights work in Python's open-source and static type-checking ecosystems.

Researcher 1 reports

Josh Achiam

OpenAI

Joshua (Josh) Achiam is Chief Futurist at OpenAI. He joined OpenAI in 2017 as a research scientist and has worked on AI safety research and operations, AI impacts research, and educational resources (including Spinning Up in Deep RL).

Researcher 1 reports

Isaac MacMillan

Ai2

Research scientist at Ai2; previously a machine learning engineer at Koko, where he worked on safety systems for language models.

Researcher 1 reports

Diederik P. Kingma

NVIDIA

Machine learning researcher focused on scalable learning and generative modeling, known for work on variational autoencoders, Adam, Glow, and variational diffusion models; his homepage says he is currently at Anthropic.

Researcher 1 reports

Tao Xu

OpenAI

Research scientist at OpenAI working on intelligent agents, computer vision, natural language processing, and robotics; he previously completed a PhD in computer science at Penn State.

Researcher 1 reports

Tomer Kaftan

OpenAI

Researcher working on machine learning systems and infrastructure at OpenAI. He earned his PhD from UCLA.

Researcher 4 reports

Ming Lin

NVIDIA

Ming Lin is a professor of computer science at the University of Maryland, College Park and the chief executive officer and co-founder of Aitrios.ai. Her public homepage says she was the founding director of the GAMMA lab, previously served as assistant dean of the University of Maryland's Institute for Advanced Computer Studies, and earned her BS, MS, and PhD in EECS from the University of California, Berkeley.

Researcher 3 reports

Louis Martin

Meta AI / Mistral AI

Research scientist at Meta AI working on natural language processing and AI safety. His homepage says he completed a PhD at Facebook AI Research and Inria focused on text simplification and accessibility.

Researcher 1 reports

Yuning Mao

Meta AI

Research scientist at Meta GenAI working on efficient and safe language and multimodal systems. She completed a Ph.D. in computer science at Carnegie Mellon University.

Researcher 3 reports

Hannaneh Hajishirzi

Ai2

Hannaneh Hajishirzi is Senior Director at Ai2 and a professor of computer science at the University of Washington. Her public materials focus on generative AI, reasoning, open language models, pre-training, and evaluation.

Researcher 2 reports

Junxian He

NVIDIA

Junxian He is an assistant professor in computer science and engineering at the Hong Kong University of Science and Technology. His public homepage and Princeton profile say he works at the intersection of natural language processing and machine learning, focusing on scalable methods for language model training and reasoning, completed a PhD in computer science at Princeton University in 2024, and previously earned bachelor's degrees in economics and computer science from Peking University.

Researcher 1 reports

Tyler Murray

Ai2

Tyler Murray is a research scientist on Ai2's PRIOR team working on vision and multimodal models. His OpenReview profile shows previous research engineering roles at Google and Facebook and degrees from Stanford University and the University of California, Berkeley.

Researcher 2 reports

Yukun Zha

DeepSeek

Yukun Zha is a research scientist at DeepSeek. His public profile lists research interests in machine learning, reinforcement learning, and efficient inference, alongside a PhD in computer science from Dartmouth College.

Researcher 2 reports

Arvind Neelakantan

OpenAI

Research scientist at Google DeepMind working on Gemini. He previously worked at Meta, OpenAI, and Google Brain, and earned a PhD from UMass Amherst.

Researcher 2 reports

Hugo Touvron

Meta AI

Research scientist at Meta FAIR focused on computer vision and multimodal understanding, including image and video models, visual reasoning, large-scale training, and evaluation.

Researcher 2 reports

Pang Wei Koh

Ai2

The Allen School lists Pang Wei Koh as an assistant professor at the University of Washington and a visiting research scientist at the Allen Institute for AI. The same page lists expertise in explainable AI, generative AI, machine learning, and natural language processing.

Researcher 1 reports

Daogao Liu

Ai2

Daogao Liu's homepage says he is currently a visiting postdoctoral researcher at Google. It says his research focuses on machine learning and theoretical computer science, especially differential privacy, stochastic optimization, and algorithm design.

Researcher 1 reports

Rulin Shao

Ai2

Rulin Shao's homepage says he is a third-year PhD student at the University of Washington advised by Pang Wei Koh and Luke Zettlemoyer, and also a visiting researcher at Meta.

Researcher 1 reports

Wonmin Byeon

NVIDIA

NVIDIA Research and Wonmin Byeon's personal site identify him as a researcher at NVIDIA Research in California. Public site materials describe interests in computer vision, robotics, recurrent and state-space models, sequence learning, and spatio-temporal learning.

Researcher 1 reports

Mohammad Norouzi

Cohere

Co-founder and CEO of Ideogram whose public homepage highlights work on self-supervised representation learning, generative modeling, and multimodal learning; previously a senior staff research scientist at Google Brain Toronto.

Researcher 1 reports

Trevor Gale

NVIDIA

Research scientist at NVIDIA working on compression, efficient inference, post-training, multimodality, and evaluation for large language models. He completed a PhD in machine learning at Stanford and a master's in artificial intelligence at the University of Edinburgh.

Researcher 5 reports

Hao Yang

DeepSeek / Moonshot AI

Hao Yang works on multimodal data infrastructure at Moonshot.ai. He previously worked at ByteDance ICVG and Microsoft Research Asia, and received BS and PhD degrees from Tsinghua University.

Researcher 1 reports

Jerry Tworek

OpenAI

Research scientist at OpenAI whose public profiles highlight multi-agent alignment, scalable training, and reinforcement learning.

Researcher 1 reports

Leon Derczynski

NVIDIA

Leon Derczynski is a professor in natural language processing at DTU Compute whose research focuses on information extraction, online harms, and the dynamics of abusive language and misinformation.

Researcher 3 reports

Jin Xu

Alibaba Qwen

Jin Xu's homepage says he leads the audio group at Qwen Team, Alibaba, working on audio understanding, real-time multimodal interaction, speech synthesis, general audio synthesis, and audio-centered chat models. He previously completed a Ph.D. at IIIS, Tsinghua University and received a BSc in 2018 from Beijing University of Posts and Telecommunications.

Researcher 3 reports

Noah A. Smith

Ai2

Noah A. Smith is the University of Washington's inaugural Vice Provost for Artificial Intelligence and a professor in the Paul G. Allen School of Computer Science & Engineering. He is also Senior Director of NLP Research at the Allen Institute for AI.

Researcher 2 reports

Shengjia Zhao

OpenAI / Meta AI

AI researcher and former OpenAI scientist who became chief scientist of Meta Superintelligence Labs in July 2025; public academic sources also show earlier Stanford and Tsinghua training.

Researcher 1 reports

Jan Kautz

NVIDIA

NVIDIA's research page describes Jan Kautz as vice president of Learning and Perception Research, working across computer vision, machine learning, computational photography, and geometric vision.

Researcher 1 reports

Valentina Pyatkin

Ai2

Valentina Pyatkin works on open language models, post-training, steerability, instruction following, and evaluation as a postdoctoral researcher and Young Investigator at Ai2 and the University of Washington.

Researcher 1 reports

Yufei Guo

OpenAI

Yufei Guo is an assistant professor in the UCLA Computer Science Department. His public profiles note prior work as a research scientist at OpenAI and research interests in AI agents, reinforcement learning, and large language models.

Researcher 1 reports

Tom Henighan

OpenAI

Works on large language model interpretability at Anthropic; previously worked on scaling laws at OpenAI and ML engineering at Beehive AI, and completed a Ph.D. in physics at Stanford.

Researcher 1 reports

Dan Roth

Cohere

Head of AI at Cohere and professor at the University of Pennsylvania whose public profiles focus on natural language understanding, reasoning, and grounding.

Researcher 3 reports

David Dohan

Google Gemini / OpenAI

David Dohan is a computer scientist at OpenAI studying scalable alignment of language models and generally intelligent reasoning systems. His personal site also notes prior work at Google Brain on foundation model programs, code generation, protein engineering, and scientific reasoning.

Researcher 1 reports

Evgenia Rusak

Google Gemini

Research scientist at Google DeepMind working on trustworthy deep learning, multimodal modeling, and generalization in AI systems.

Researcher 1 reports

Ahmad Al-Dahle

Meta AI

Ahmad Al-Dahle joined Airbnb as Chief Technology Officer on January 14, 2026. Airbnb's announcement says he previously led Generative AI at Meta, founded Meta's Generative AI group in early 2023, and earlier studied engineering at the University of Waterloo and held senior technical leadership roles at Apple.

Researcher 1 reports

Hyung Won Chung

Cohere

Hyung Won Chung is a Research Scientist at OpenAI working on reasoning and agents. His public site says he previously worked at Google Brain and completed his PhD at MIT.

Researcher 1 reports

Mitesh M. Khapra

NVIDIA

Mitesh M. Khapra is a professor at IIT Madras and co-founder of AI4Bharat whose research spans natural language processing, machine learning, transfer learning, and deep learning.

Researcher 8 reports

Y. Wu

DeepSeek

Yu Wu's public homepage says he is a technical staff member at DeepSeek AI who leads the LLM Alignment Team and works on writing, QA, AI search, reasoning, and safety.

Researcher 1 reports

Trung H. Bui

Cohere

Trung H. Bui is a staff researcher at Cohere working on generative AI, large language models, natural language processing, machine learning, and computer vision.

Researcher 3 reports

Alec Radford

OpenAI

Alec Radford is a researcher and a co-author of the GPT-4 Technical Report. His GitHub profile links to his personal website at newmu.github.io.

Researcher 1 reports

Seungyoun Hong

Cohere

Seungyoun Hong is a research scientist and postdoctoral scholar at Stanford University working on machine unlearning, data attribution, hallucination, and generalization.

Researcher 3 reports

Chuanqi Tan

Alibaba Qwen / Z.ai

Chuanqi Tan's homepage says he received a PhD from Tsinghua University in July 2019, is currently focused on LLM research and applications, and is also a postdoctoral fellow at the University of Hong Kong.

Researcher 3 reports

Vahid Noroozi

Google Gemini / NVIDIA

Vahid Noroozi is an applied research scientist at NVIDIA. His NVIDIA author profile says his work focuses on deep learning for speech and natural language processing and that he received a PhD in computer science from the University of Illinois Chicago. His homepage says he previously worked on post-training large language models at Google DeepMind after earlier multimedia and neuroscience research at TU Delft and the Max Planck Institute for Biological Cybernetics.

Researcher 1 reports

Jan Leike

OpenAI

Jan Leike is a machine learning and alignment researcher who leads the Alignment Science team at Anthropic. He previously co-led the Superalignment team at OpenAI and earlier worked on alignment research at DeepMind.

Researcher 1 reports

Shuyue Stella Li

Ai2

Shuyue Stella Li is a Ph.D. student in the Allen School at the University of Washington advised by Yulia Tsvetkov, with research interests including proactive learning, social reasoning, and AI for health.

Researcher 1 reports

Logan Kilpatrick

OpenAI

Member of the Technical Staff at Google DeepMind; his homepage says he previously led developer relations at OpenAI from 2022 through early 2024 and earlier worked at NASA and Apple.

Pengcheng Yin portrait
Researcher 1 reports

Pengcheng Yin

Meta AI

Pengcheng Yin is a research scientist at Google DeepMind working on Gemini Code and natural-language-to-code generation. His homepage also lists earlier experience as a part-time research collaborator at Facebook AI Research and research internships at Microsoft research groups.

Runxin Xu portrait
Researcher 6 reports

Runxin Xu

DeepSeek

Researcher at DeepSeek whose public homepage describes work on DeepSeek R1, V1, V2, V3, Math, Coder, and mixture-of-experts systems.

Yizhong Wang portrait
Researcher 4 reports

Yizhong Wang

Ai2

Yizhong Wang is a research scientist at the Allen Institute for AI and incoming assistant professor at the University of Washington whose work focuses on language models, agents, reasoning, and open-source AI.

Vasu Singhal portrait
Researcher 1 reports

Vasu Singhal

Meta AI

Vasu Singhal is an AI researcher focused on natural language processing, generative AI, and machine learning. He studied computer science at Stanford University and has worked on NLP research at FAIR.

Chris Alberti portrait
Researcher 3 reports

Chris Alberti

Google Gemini / Ai2

Chris Alberti's Google Research profile lists natural language processing and speech processing as his research areas, and his OpenReview profile lists him as a researcher at Google.

Barret Zoph portrait
Researcher 2 reports

Barret Zoph

OpenAI / Meta AI

Co-founder and CTO at Thinking Machines. Previously VP of Research (Post-Training) at OpenAI, Staff Research Scientist at Google Brain, and a researcher at the Information Sciences Institute.

Amjad Almahairi portrait
Researcher 2 reports

Amjad Almahairi

Meta AI / Mistral AI

Amjad Almahairi is a researcher at Anyscale. His OpenReview profile lists work spanning LLMs, VLLMs, generative models, and deep learning, with earlier roles at Facebook and Element AI.

Prasanna Parthasarathi portrait
Researcher 2 reports

Prasanna Parthasarathi

NVIDIA

Prasanna Parthasarathi is a research scientist at Huawei Noah's Ark Lab in Montreal. His public speaker and lab-profile pages say he collaborates with Mila and McGill University, works on natural language processing, dialogue systems, and social simulation, and completed a PhD at McGill University in 2022 under Joelle Pineau.

Sebastian Gehrmann portrait
Researcher 2 reports

Sebastian Gehrmann

Google Gemini / Mistral AI

Sebastian Gehrmann leads Responsible AI in the office of the CTO at Bloomberg and works on natural language generation, model evaluation, and interpretability.

Can Cao portrait
Researcher 2 reports

Can Cao

Z.ai

Can Cao is a Ph.D. student at Tsinghua University advised by Ming Zhou and Yuxuan Wang. The homepage says Can Cao received a bachelor's degree from Tsinghua University in 2022, previously interned at Microsoft Research Asia in summer 2022 and Meta AI in summer 2025, and studies large language models with a focus on pre-training, reasoning, agents, post-training, and multimodal learning.

Azade Nova portrait
Researcher 2 reports

Azade Nova

Google Gemini

Staff Research Scientist at Google DeepMind. Public Google profiles describe earlier work at Google Brain and Microsoft Research and research spanning machine learning, graph mining, and unstructured data analytics.

Hanrong Ye portrait
Researcher 1 reports

Hanrong Ye

NVIDIA

Hanrong Ye is a research scientist at NVIDIA Research in Santa Clara working on multi-task, multi-media, and multimodality models for machine understanding and generation. He earned a Ph.D. from HKUST, a master's degree from Peking University, and a B.S. from Sun Yat-sen University.

Matt Jordan portrait
Researcher 1 reports

Matt Jordan

Ai2

Public profiles identify Matt Jordan as an Allen Institute for Artificial Intelligence researcher and a former University of Texas at Austin PhD student advised by Alex Dimakis.

Shashank Gupta portrait
Researcher 1 reports

Shashank Gupta

Ai2

Shashank Gupta's public homepage describes him as an applied scientist at Ai2 focused on agentic LLMs, test-time compute, post-training, and reasoning, after earlier work in Microsoft AI.

Jiatao Gu portrait
Researcher 2 reports

Jiatao Gu

Google Gemini / Meta AI

Assistant Professor in Penn CIS, a primary GRASP faculty member, and a member of the ASSET Center; he also works part-time as a Staff Research Scientist at Apple MLR on multimodal generative models, reasoning, and embodied AI.

Sydney Z. Li portrait
Researcher 1 reports

Sydney Z. Li

Cohere

Research scientist at Cohere Labs and PhD candidate at Stanford University focused on language models, machine learning systems, and AI safety.

Jiahui Yu portrait
Researcher 8 reports

Jiahui Yu

Google Gemini

Jiahui Yu is a Research Lead at OpenAI leading the Perception team. His homepage notes prior co-leadership on Gemini Multimodal at Google DeepMind and work on deep learning and high-performance computing.

Ching-Yao Chuang portrait
Researcher 4 reports

Ching-Yao Chuang

NVIDIA

Ching-Yao Chuang is a researcher at OpenAI working on multimodal and generative AI systems. His homepage lists prior roles at xAI and Meta GenAI and a PhD from MIT.

Zihan Wang portrait
Researcher 3 reports

Zihan Wang

DeepSeek / Moonshot AI

Northwestern PhD student working on agentic RL who previously interned at DeepSeek, Microsoft, and Yutori.

Pradeep Dasigi portrait
Researcher 1 reports

Pradeep Dasigi

NVIDIA

Research scientist on the AllenNLP team at the Allen Institute for AI, where his homepage highlights work on open language models such as OLMo and Tulu and a focus on post-training language models.

Shuai Bai portrait
Researcher 6 reports

Shuai Bai

Alibaba Qwen

Senior algorithm expert at Alibaba Group working on large language models, multimodal large language models, and diffusion models.

Ben Wang portrait
Researcher 2 reports

Ben Wang

Google Gemini / OpenAI

OpenAI's GPT-4 contributions page credits Ben Wang as attention architecture lead for long context. Public profiles identify him as a University of Pennsylvania undergraduate and an OpenAI researcher from 2021 to 2022.

Wenchao Zhou portrait
Researcher 3 reports

Wenchao Zhou

MiniMax

Public profiles describe Wenchao Zhou as Director of Data Product and Data Analytics at Alibaba Cloud Intelligence and a former tenured computer science faculty member at Georgetown University. His work centers on databases and distributed systems.

Costa Huang portrait
Researcher 1 reports

Costa Huang

Ai2

Costa Huang is a reinforcement learning researcher and the creator of CleanRL; his public GitHub profile lists Periodic Labs as his current affiliation and AllenAI and Hugging Face as previous affiliations.

Jan Hendrik Kirchner portrait
Researcher 1 reports

Jan Hendrik Kirchner

OpenAI

Jan Hendrik Kirchner is a researcher at Anthropic. His public OpenReview profile lists prior research work at OpenAI and a Brain Science PhD track through Max Planck Schools.

Maarten Sap portrait
Researcher 1 reports

Maarten Sap

Ai2

Carnegie Mellon says Maarten Sap is an assistant professor in the Language Technologies Institute with a courtesy appointment in HCII, and also a part-time research scientist and AI safety lead at the Allen Institute for AI. His CMU profile says he received his PhD from the University of Washington.

Matt Jordan portrait
Researcher 1 reports

Matt Jordan

Ai2

Matt Jordan is a researcher at Ai2 focused on making language models safe and reliable. His public profile says he previously spent more than four years at Google and is currently a PhD student in computer science at the University of Texas at Austin.

Jiatao Gu portrait
Researcher 1 reports

Jiatao Gu

Amazon

Jiatao Gu is an assistant professor in the University of Pennsylvania Department of CIS and a primary faculty member of GRASP Lab and the ASSET Center. His homepage also states that he works part-time as a staff research scientist at Apple ML Research after earlier work at Meta AI.

Shannon Zejiang Shen portrait
Researcher 1 reports

Shannon Zejiang Shen

Ai2

Shannon Zejiang Shen is a PhD student at MIT CSAIL researching collaborative AI systems that augment human intelligence.

Will Merrill portrait
Researcher 1 reports

Will Merrill

Ai2

William Merrill is a Young Investigator at the Allen Institute for AI and an incoming assistant professor at TTIC. His public faculty profile describes research on the computational power and limits of transformers, computational linguistics, and deep learning theory.

Yuling Gu portrait
Researcher 1 reports

Yuling Gu

Ai2

Yuling Gu's NYU and personal pages describe her as a PhD student at NYU Center for Data Science who previously worked at Ai2 and has contributed to OLMo and OLMES.

Carl Ble portrait
Researcher 1 reports

Carl Ble

Google Gemini

Research scientist at Google DeepMind interested in efficient and reliable language models, reasoning, and model security. His personal site also highlights prior work on planning, geometry, and representation learning.

Boris Ginsburg portrait
Researcher 2 reports

Boris Ginsburg

NVIDIA

Boris Ginsburg is a principal engineer and research scientist at NVIDIA whose work focuses on efficient machine learning and deep learning for speech recognition, language processing, and computer vision.

Tom B. Brown portrait
Researcher 1 reports

Tom B. Brown

OpenAI

Tom B. Brown's GitHub profile says he works on robust and aligned AI at Anthropic and previously worked at OpenAI and Google Brain. He is the first author of the GPT-3 paper 'Language Models are Few-Shot Learners'.

Amelie Haghighat portrait
Researcher 1 reports

Amelie Haghighat

NVIDIA

Amelie Haghighat is an incoming assistant professor at the University of Illinois Chicago whose research focuses on trustworthy and resource-efficient machine learning systems.

Ana Klimovic portrait
Researcher 1 reports

Ana Klimovic

NVIDIA

Ana Klimovic is an assistant professor of computer science at ETH Zurich whose research focuses on machine learning systems, efficient hardware acceleration, and sustainable AI.

Uday Bondhugula portrait
Researcher 1 reports

Uday Bondhugula

NVIDIA

Compiler researcher and entrepreneur, creator of the PolyMage DSL, and Professor at the Indian Institute of Science whose work spans compilers, AI systems, and programming languages.

Rogerio Feris portrait
Researcher 3 reports

Rogerio Feris

NVIDIA

Rogerio Feris is a principal scientist and manager at the MIT-IBM Watson AI Lab within IBM Research. His public bios emphasize multimodal perception, data and model efficiency, and large language models with memory.

Jacob Morrison portrait
Researcher 3 reports

Jacob Morrison

Ai2

Jacob Morrison's homepage says he is a PhD student at UW and a student researcher on the AllenNLP team at Ai2, where he mainly works on language modeling.

Jiacheng Liu portrait
Researcher 2 reports

Jiacheng Liu

Ai2

Jiacheng Liu is a researcher at Ai2 whose work focuses on improving the capabilities and understanding of language models. His public homepage says he is currently a PhD student at New York University and has previously spent time at Princeton and Google Research.

Mayee Chen portrait
Researcher 1 reports

Mayee Chen

Ai2

Mayee Chen's public pages describe her as a Stanford computer science Ph.D. student and an Ai2 research intern working on data-centric machine learning and natural language processing.

Yashaswi Karnati portrait
Researcher 1 reports

Yashaswi Karnati

NVIDIA

OpenReview identifies Yashaswi Karnati as a researcher at NVIDIA. His personal homepage describes prior work across intelligent transportation, climate science, data compression, and healthcare, and records completed degrees from the University of Florida and IIT (ISM) Dhanbad.

Ashwin Paranjape portrait
Researcher 1 reports

Ashwin Paranjape

Meta AI

Founding AI Lead at Samaya AI with research interests in reasoning, retrieval, agentic systems, and reinforcement learning; previously completed a Stanford PhD in computer science under Christopher Manning.

Jingren Zhou portrait
Researcher 23 reports

Jingren Zhou

MiniMax / Moonshot AI

Jingren Zhou is Chief Technology Officer of Alibaba Cloud. Public speaker biographies describe him as a computer scientist and entrepreneur whose work includes large-scale AI and cloud systems.

Jian Yang portrait
Researcher 4 reports

Jian Yang

Alibaba Qwen

Jian Yang is an Associate Professor at Beihang University whose research focuses on code intelligence, large language models, and AI agents. He worked with Alibaba Qwen from 2023 to July 2025.

Aston Zhang portrait
Researcher 1 reports

Aston Zhang

Meta AI

Aston Zhang is a member of technical staff at OpenAI, author of Dive into Deep Learning, and a core contributor to Llama 3. His homepage says he earned his Ph.D. from the University of Illinois Urbana-Champaign.

Thomas B. Brown portrait
Researcher 1 reports

Thomas B. Brown

Google Gemini

Thomas B. Brown is a research scientist at Google DeepMind whose work focuses on language models, multimodality, and reasoning.

Boris Ivanovic portrait
Researcher 1 reports

Boris Ivanovic

NVIDIA

Boris Ivanovic is a postdoctoral scholar in Stanford's Autonomous Systems Lab whose research focuses on human-centered machine learning and data-driven prediction and planning for autonomous systems.

Geri Skenderi portrait
Researcher 1 reports

Geri Skenderi

Cohere

Geri Skenderi is a research scientist at Cohere focusing on multilingual language technology, evaluation of large language models, and natural language processing for low-resource settings.

Hanlin Tang portrait
Researcher 3 reports

Hanlin Tang

Cohere / NVIDIA

Hanlin Tang is a researcher at Cohere. His public page says his work focuses on foundation models, large language model post-training, reinforcement learning, and vision-language or language-model agents, and that he previously held research internships at NVIDIA and the Vector Institute.

Saumya Malik portrait
Researcher 2 reports

Saumya Malik

Ai2

Saumya Malik is a predoctoral young investigator at the Allen Institute for Artificial Intelligence working on open language modeling, reward models, and instruction following.

Yifei Qu portrait
Researcher 2 reports

Yifei Qu

Z.ai

Yifei Qu is a technical staff member at JD Explore Academy in Beijing and a Ph.D. student at the University of Texas at Austin. The homepage says Yifei Qu received a master's degree from Carnegie Mellon University in 2023, a bachelor's degree from Sun Yat-sen University in 2021, expects to graduate in 2027, and works on learning and optimization, reinforcement learning, reasoning, and alignment for large language models under Peter Stone.

Yutao Sun portrait
Researcher 2 reports

Yutao Sun

Z.ai

Yutao Sun is a fifth-year Ph.D. student at Zhejiang University supervised by Minlie Huang. The homepage says Yutao Sun has been a visiting student researcher at Stanford University since September 2023 under Tatsunori Hashimoto, earned a bachelor's degree from Tsinghua University in 2019, and studies code generation, code agents, execution feedback, and reasoning in large language models.

Arka Dhar portrait
Researcher 1 reports

Arka Dhar

OpenAI

Public bios identify Arka Dhar as Product Management Director at Google DeepMind and former Head of Data at OpenAI. OpenAI contribution pages list him among contributors to GPT-4o-era model work.

Michael Noukhovitch portrait
Researcher 1 reports

Michael Noukhovitch

Ai2

Michael Noukhovitch is a PhD candidate in artificial intelligence at Mila associated with Universite de Montreal. His public homepage lists interests including reinforcement learning, multi-agent systems, natural language processing, and agents with web and computer use.

Chunting Zhou portrait
Researcher 1 reports

Chunting Zhou

Meta AI

Research scientist at Meta AI working on natural language processing, dialogue systems, and large language models.

Dayiheng Liu portrait
Researcher 7 reports

Dayiheng Liu

Alibaba Qwen

Algorithm expert at Alibaba DAMO Academy focused on natural language generation and large language models.

Huazuo Gao portrait
Researcher 7 reports

Huazuo Gao

DeepSeek

Researcher at DeepSeek AI working on decision-making and post-training for large language models.

Fuli Luo portrait
Researcher 5 reports

Fuli Luo

DeepSeek

Research scientist working on large language models and retrieval-augmented generation; creator of the open-source project tiny-universe.

Hannaneh Hajishirzi portrait
Researcher 4 reports

Hannaneh Hajishirzi

Ai2

Professor at the University of Washington and Senior Director of NLP at Ai2. Her work spans natural language processing, computer vision, responsible AI, and applications to science and medicine.

Mingchuan Zhang portrait
Researcher 4 reports

Mingchuan Zhang

DeepSeek

Research scientist at DeepSeek interested in large language models, reinforcement learning, robot learning, and machine learning.

Yuling Gu portrait
Researcher 3 reports

Yuling Gu

Ai2

Yuling Gu is a PhD student at the NYU Center for Data Science studying large language models, machine reasoning, and robust evaluation. She was previously a predoctoral researcher at the Allen Institute for AI, where she contributed to OLMo, OLMo 2, OLMo 3, Tulu 3, OLMoE, and OLMES.

Aakanksha Chowdhery portrait
Researcher 3 reports

Aakanksha Chowdhery

Google Gemini

Aakanksha Chowdhery is a machine learning researcher based in New York City. She works on large-scale machine learning across pre-training, post-training, inference, and system efficiency, and is known for contributions such as PaLM, Pathways, and Gemini.

Jinze Bai portrait
Researcher 3 reports

Jinze Bai

Alibaba Qwen

PhD student at The Hong Kong University of Science and Technology (Guangzhou) whose research interests include large language models, vision-language models, AI agents, and multimodal retrieval.

Liyue Zhang portrait
Researcher 3 reports

Liyue Zhang

DeepSeek

Liyue Zhang is a research scientist at DeepSeek whose public profiles highlight multimodal models, reasoning, and AI agents.

Shanghao Lu portrait
Researcher 3 reports

Shanghao Lu

DeepSeek

Researcher at DeepSeek AI focusing on multimodal large language models and multimodal understanding.

Yuxiang You portrait
Researcher 3 reports

Yuxiang You

DeepSeek

Research scientist at DeepSeek AI. Previously completed a PhD at Carnegie Mellon University.

Steven Adler portrait
Researcher 1 reports

Steven Adler

OpenAI

Steven Adler is a former OpenAI safety researcher and technical program manager (2020-2024). He writes Clear-Eyed AI, analyzing AI news, policy, and research with a focus on risk reduction.

Luca Soldaini portrait
Researcher 1 reports

Luca Soldaini

Cohere

Luca Soldaini is a lead research scientist at Ai2 on the OLMo team. He previously worked at Amazon Alexa and completed a PhD in computer science at Georgetown University in 2018.

Wangding Zeng portrait
Researcher 3 reports

Wangding Zeng

DeepSeek

Wangding Zeng is a researcher at DeepSeek. His OpenReview profile also lists graduate and undergraduate study at Beijing University of Posts and Telecommunications.

Jesse Dodge portrait
Researcher 2 reports

Jesse Dodge

Google Gemini / Ai2

Jesse Dodge is a research scientist at Meta Superintelligence Labs. His public work spans evaluation, reproducibility, efficiency, and open large language models, and his homepage says he was at the Allen Institute for AI from 2019 to 2025.

Siyuan Li portrait
Researcher 2 reports

Siyuan Li

Google Gemini / NVIDIA

Siyuan Li is a research scientist at NVIDIA working on large language models, multimodal foundation models, and reinforcement learning. His homepage says he received a PhD in computer science from the University of Toronto in 2024 and previously worked at Meta AI, Microsoft Research, and Mila.

Anej Svete portrait
Researcher 1 reports

Anej Svete

Ai2

Anej Svete is a PhD Fellow at the ETH AI Center. His homepage says he studies language models with formal language theory and completed a 9-month research internship at Ai2 in 2025.

Carlos Riquelme portrait
Researcher 1 reports

Carlos Riquelme

Google Gemini

Carlos Riquelme's public website describes him as a senior research scientist at Google Brain who previously completed a PhD in statistical machine learning at Stanford. His public materials emphasize machine intelligence, efficient and sparse models, and reinforcement learning.

Giambattista Parascandolo portrait
Researcher 1 reports

Giambattista Parascandolo

OpenAI

Research Fellow at OpenAI focused on fundamental research on reasoning in neural networks. Previously completed a PhD in machine learning through the Center for Learning Systems at ETH Zurich and the Max Planck Institute for Intelligent Systems.

Jiacheng Liu portrait
Researcher 1 reports

Jiacheng Liu

Ai2

Stanford Data Science lists Jiacheng Liu as a PhD candidate in the Paul G. Allen School of Computer Science and Engineering at the University of Washington. Public sources also link this profile to OLMo 3 authorship and a past Allen Institute for Artificial Intelligence internship.

Shizhe Diao portrait
Researcher 1 reports

Shizhe Diao

NVIDIA

Shizhe Diao develops methods to scale post-training and reinforcement learning for large language models and AI agents.

Biao Zhang portrait
Researcher 1 reports

Biao Zhang

Amazon

Public sources identify Biao Zhang as a researcher at Google DeepMind whose recent work covers multilingual machine translation, speech translation, and fine-tuning methods for large language models.

Matthijs Van keirsbilck portrait
Researcher 1 reports

Matthijs Van keirsbilck

NVIDIA

Matthijs Van keirsbilck is a Senior Research Scientist at NVIDIA working on neural network architecture design, structural sparsity, quantization, and training dynamics.

Shane Arora portrait
Researcher 1 reports

Shane Arora

Ai2

OpenReview lists Shane Arora as a researcher at the Allen Institute for Artificial Intelligence from 2023 to 2025, an MS student at UT Austin from 2021 to 2024, and an undergraduate student at the University of Sydney from 2016 to 2019. Ai2's public directory and papers pages also list Shane Arora.

Tim Green portrait
Researcher 1 reports

Tim Green

Google Gemini

Tim Green is a staff research scientist on Google's ML Foundations team. His homepage says he leads research at the interface of compression and memory to unlock more efficient language models, with interests spanning compression, memorization, and efficient model architectures. The same page says he previously completed a PhD in neuroscience at Stanford and a bachelor's degree in computer science and mathematics at Vanderbilt.

Biao Zhang portrait
Researcher 1 reports

Biao Zhang

Google Gemini

Researcher at Google DeepMind working on multilingual machine translation, speech-to-text translation, and efficient neural machine translation.

Kaustubh Jha portrait
Researcher 1 reports

Kaustubh Jha

NVIDIA

Research scientist at NVIDIA and University of Maryland PhD student whose public site highlights speech, audio, and video generation plus alignment for large language and vision-language models.

Siddhartha Brahma portrait
Researcher 1 reports

Siddhartha Brahma

Google Gemini

Research scientist at Google Research in New York City. He earned a PhD in computer science from NYU Tandon and works on language models, multimodal learning, NLP, and deep learning.

Jack Clark portrait
Researcher 7 reports

Jack Clark

Anthropic / OpenAI

Co-founder and Head of Policy at Anthropic. His public biography also notes earlier work as Policy Director at OpenAI and as a technical journalist, and identifies him as author of the Import AI newsletter.

Akshita Bhagia portrait
Researcher 5 reports

Akshita Bhagia

Ai2

Research scientist at Ai2 and PhD candidate at the University of Washington. Her work focuses on language models, evaluation, and building practical open-source AI systems.

Ameet Talwalkar portrait
Researcher 2 reports

Ameet Talwalkar

NVIDIA

Ameet Talwalkar is an associate professor in the Machine Learning Department at Carnegie Mellon University and Chief Scientist at Datadog. His public research spans AI for science, human-AI interaction, and specialized models and agents.

Keqin Bao portrait
Researcher 2 reports

Keqin Bao

Alibaba Qwen

Ph.D. candidate at the University of Science and Technology of China whose research focuses on improving the general reasoning abilities of foundational large language models, LLMs for recommendation, and scaling reinforcement reasoning; he lists himself as a core contributor to the Qwen2.5 and Qwen3 technical reports.

Mujeen Sung portrait
Researcher 2 reports

Mujeen Sung

LG AI Research

Mujeen Sung is an assistant professor at Kyung Hee University and formerly served as a technical lead at LG AI Research. His research focuses on natural language processing, controllable and efficient large language models, and language generation and interpretation.

Ruyi Chen portrait
Researcher 2 reports

Ruyi Chen

DeepSeek

Researcher at DeepSeek focused on large language models, reinforcement learning, multimodal large language models, and generative AI.

Slava Petrov portrait
Researcher 2 reports

Slava Petrov

Google Gemini

Research leader at Google DeepMind and Google Research focused on natural language processing and machine learning, including large language models such as PaLM and Gemini.

Yuntian Deng portrait
Researcher 2 reports

Yuntian Deng

Google Gemini

Yuntian Deng is a machine learning researcher whose public work spans language modeling, reasoning, and large multimodal systems.

Bo Zheng portrait
Researcher 2 reports

Bo Zheng

Alibaba Qwen

Senior Research Manager and Senior Director at Tongyi Lab, Alibaba Group, leading agents and AI infrastructure research.

Chunlin Li portrait
Researcher 2 reports

Chunlin Li

MiniMax

Chunlin Li is a research scientist at MiniMax.

Clémentine Fourrier portrait
Researcher 2 reports

Clémentine Fourrier

Mistral AI

AI researcher focused on evaluating language models and agents, open NLP research, and historical linguistics. She led evaluation efforts at Hugging Face between 2023 and 2025 and helped build LightEval and the Open LLM Leaderboard.

Dirk Groeneveld portrait
Researcher 2 reports

Dirk Groeneveld

Ai2

Ai2 engineer whose public GitHub and Hugging Face profiles describe him as building LLMs at Ai2; he is also listed on OpenReview as an OLMo author.

Katarina Slama portrait
Researcher 2 reports

Katarina Slama

OpenAI

Research scientist at the UK AI Security Institute and former OpenAI member of technical staff who worked on model behavior and post-training research. Previously conducted computational neuroscience research at UC Berkeley.

Nouha Dziri portrait
Researcher 2 reports

Nouha Dziri

Ai2

Research scientist at Ai2 focused on language models, evaluation, and AI safety. Her work studies reasoning, truthfulness, and how to make large models more reliable.

Scott Gray portrait
Researcher 2 reports

Scott Gray

OpenAI

OpenAI engineer focused on GPU systems and large-model infrastructure, with public contributions to GPT-4 and earlier OpenAI research on efficient GPU kernels and code models.

Tianyi Tang portrait
Researcher 2 reports

Tianyi Tang

Alibaba Qwen

Research Scientist on the Qwen team at Alibaba Cloud.

Stanislas Polu portrait
Researcher 1 reports

Stanislas Polu

Cohere

Stanislas Polu is Co-Founder and Engineer at Dust. His public site also documents earlier work at OpenAI, Stripe, and projects in theorem proving, browsers, and Bitcoin-related systems.

Angela Fan portrait
Researcher 5 reports

Angela Fan

Meta AI / Mistral AI

Recent public bios describe Angela Fan as a researcher at Meta working on large language models, machine translation, multilingual generation, and story generation.

Kai Dong portrait
Researcher 4 reports

Kai Dong

DeepSeek

DeepSeek engineer and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

Sandhini Agarwal portrait
Researcher 4 reports

Sandhini Agarwal

OpenAI

Sandhini Agarwal is a researcher at OpenAI. Her OpenReview profile lists her as a researcher at OpenAI (2020–present) and an undergraduate student at Stanford University (2015–2019).

Shane Arora portrait
Researcher 2 reports

Shane Arora

Ai2

Shane Arora is a research scientist at Ai2. His profile on The Org identifies that current role, and his OpenReview profile records earlier work as a senior applied scientist at Amazon as well as bachelor's and master's degrees in computer science from the University of Texas at Austin.

Allyson Ettinger portrait
Researcher 2 reports

Allyson Ettinger

Ai2

Allyson Ettinger is a senior research scientist at Ai2 working on the OLMo pre-training team. Her public materials focus on natural language understanding, language model behavior, and learning from text and experience.

Finbarr Timbers portrait
Researcher 2 reports

Finbarr Timbers

Ai2

Finbarr Timbers is an AI researcher and investor. His homepage says he works at Ai2 on RL for LLMs and previously worked at Midjourney and DeepMind.

Nathan Lambert portrait
Researcher 2 reports

Nathan Lambert

Ai2

Nathan Lambert's homepage says he is the post-training lead at Ai2, where he works on open language models and responsible autonomous systems. His public CV lists a PhD in Computer Science from UC Berkeley.

Dan Busbridge portrait
Researcher 1 reports

Dan Busbridge

Meta AI

Dan Busbridge's public OpenReview profile lists him as a researcher at Apple, and his Hugging Face profile highlights interests including deep learning, optimization, and large language modeling.

Joshua Clymer portrait
Researcher 1 reports

Joshua Clymer

NVIDIA

Joshua Clymer works on AI safety and theoretical questions around superposition and representation learning. Current public mentor material places him at Redwood Research and previously at METR.

Samy Bengio portrait
Researcher 1 reports

Samy Bengio

Meta AI

Senior Director, AI and Machine Learning Research at Apple and adjunct professor at EPFL. His homepage highlights work on reasoning limits for autoregressive models, deep sequence architectures, and ML systems.

Victoria Graf portrait
Researcher 1 reports

Victoria Graf

Ai2

Victoria Graf studies at the University of Washington and is publicly credited on open-model work including Tulu 3 and OLMo 3.

Daniel Khashabi portrait
Researcher 1 reports

Daniel Khashabi

Cohere

Assistant Professor of Computer Science at Johns Hopkins University working on helpful, reliable, and efficient language-driven AI systems. Previously a postdoctoral researcher at the Allen Institute for AI.

Samyam Rajbhandari portrait
Researcher 1 reports

Samyam Rajbhandari

NVIDIA

Samyam Rajbhandari is a principal AI research scientist at Snowflake AI Research. His public profile describes work on large-scale deep learning systems, distributed training, and efficient optimization, and lists prior Microsoft research experience.

Saurabh Shah portrait
Researcher 1 reports

Saurabh Shah

Ai2

Saurabh Shah's public site says he works at humans& and previously worked on post-training the OLMo language model at Ai2.

Wen-tau Yih portrait
Researcher 1 reports

Wen-tau Yih

Ai2

Scott Wen-tau Yih is a research scientist at Meta FAIR whose recent work focuses on neural retrieval and retrieval-augmented generation; his homepage lists prior roles at Ai2 and Microsoft Research.

Stephen James portrait
Researcher 1 reports

Stephen James

NVIDIA

Assistant professor at the University of Oxford and research scientist at NVIDIA. His public work focuses on robot learning, robotics, reinforcement learning, computer vision, and multimodal models.

Furu Wei portrait
Researcher 10 reports

Furu Wei

Microsoft

Furu Wei is a Distinguished Scientist and Chief Scientist of Microsoft Research Asia, listed on Microsoft Research and connected in LLMpeople to Microsoft technical reports including Kosmos, VALL-E, BitNet, and Multilingual E5.

Carroll Wainwright portrait
Researcher 2 reports

Carroll Wainwright

OpenAI

Founder and AI Advisor at Metaculus and a named contributor to OpenAI's GPT-4 Technical Report.

Chengyuan Li portrait
Researcher 2 reports

Chengyuan Li

Alibaba Qwen

Chengyuan Li is a research intern with Alibaba's Qwen team. His homepage highlights work on reasoning and post-training for large language models.

R. X. Xu portrait
Researcher 2 reports

R. X. Xu

DeepSeek

R. X. Xu is a research scientist at DeepSeek AI. His homepage says he works on trustworthy and efficient large language models, open-ended reasoning, and AI for healthcare.

Shengding Hu portrait
Researcher 2 reports

Shengding Hu

Shanghai AI Laboratory

Shengding Hu is a researcher at Shanghai AI Laboratory and a PhD candidate at Tsinghua University working on large language models, multimodal models, and AI agents.

Carl Ross portrait
Researcher 1 reports

Carl Ross

OpenAI

Research scientist at OpenAI focused on neural scaling, autonomous learning systems, language models, and reinforcement learning.

David Schnurr portrait
Researcher 1 reports

David Schnurr

OpenAI

Software engineer focused on developer tools, infrastructure, and AI products. He has worked at OpenAI in San Francisco and publishes engineering projects and writing on his website and GitHub.

Junlong Li portrait
Researcher 1 reports

Junlong Li

DeepSeek

MPhil student in the Department of Computer Science and Engineering at HKUST interested in LLM agents and multimodal models; previously interned with the DeepSeek LLM Alignment Team.

Justin Jay Wang portrait
Researcher 1 reports

Justin Jay Wang

OpenAI

Designer focused on brands, websites, and creative tools. He helped grow design at OpenAI from 0 to 1 and later joined Cursor.

Madelaine Boyd portrait
Researcher 1 reports

Madelaine Boyd

OpenAI

Madelaine Boyd is a staff engineer at OpenAI. Her public GitHub profile links to her personal website, and official OpenAI contributions pages list her work on GPT-4 and GPT-4.5.

Michelle Pokrass portrait
Researcher 1 reports

Michelle Pokrass

OpenAI

Michelle Pokrass is an OpenAI researcher working on multimodal AI, especially speech and audio. Her public profiles also show work on ChatGPT and model releases including GPT-4.1.

Qiancheng Wang portrait
Researcher 1 reports

Qiancheng Wang

DeepSeek

Researcher at DeepSeek focused on multimodal large language models, world models, reinforcement learning, and embodied AI.

Quinten Anthony portrait
Researcher 1 reports

Quinten Anthony

Cohere

Research scientist at Cohere focused on scaling machine learning systems and improving training efficiency.

Shengguang Wu portrait
Researcher 1 reports

Shengguang Wu

Alibaba Qwen

Ph.D. student in Computer Science at Stanford University who previously worked on LLMs and agents with Alibaba's Qwen team.

Alejandro Lopez-Lira portrait
Researcher 1 reports

Alejandro Lopez-Lira

Cohere

Alejandro Lopez-Lira is an assistant professor of finance at the University of Florida whose research interests include investments, machine learning, and empirical asset pricing.

Alethea Power portrait
Researcher 1 reports

Alethea Power

OpenAI

Alethea Power is a research scientist at OpenAI. Her personal website notes earlier PhD work at Georgia Tech and research interests spanning language, reasoning, cognitive science, and brain-inspired approaches to intelligence.

Andrew Cann portrait
Researcher 1 reports

Andrew Cann

OpenAI

Andrew Cann is a software engineer at OpenAI. His public website and GitHub profile identify him as an OpenAI engineer, and OpenAI's contributions pages list his work on GPT-4 and GPT-4.5.

Benfeng Xu portrait
Researcher 1 reports

Benfeng Xu

Alibaba Qwen

Benfeng Xu is an LLM post-training researcher at Alibaba Group and an incoming assistant professor at Carnegie Mellon University. His homepage says he works on language models, efficient and trustworthy machine learning, AI for science, and biomedicine.

Bill Yuchen Lin portrait
Researcher 1 reports

Bill Yuchen Lin

Ai2

Researcher working on language models, agents, and retrieval-augmented generation; currently at xAI and incoming assistant professor at the University of Washington, previously a research scientist at the Allen Institute for AI.

Bingchen Feng portrait
Researcher 1 reports

Bingchen Feng

Amazon

Applied scientist at Amazon focused on natural language and speech processing, machine learning, and multimodal learning.

Bing Xue portrait
Researcher 1 reports

Bing Xue

DeepSeek

Research scientist at DeepSeek interested in reinforcement learning, large language models, and reasoning.

Chang Gao portrait
Researcher 1 reports

Chang Gao

Alibaba Qwen

Researcher at Alibaba's Qwen team, previously a PhD student in the CUHK Text Mining Group, working on RL for LLM reasoning and alignment.

Christina Kim portrait
Researcher 1 reports

Christina Kim

OpenAI

Researcher at OpenAI on the mid-training team, previously working on post-training and reinforcement learning including WebGPT, ChatGPT with Browsing, and GPT-4.

Chujie Zheng portrait
Researcher 1 reports

Chujie Zheng

Alibaba Qwen

Chujie Zheng is an incoming senior researcher at Alibaba Group. His academic homepage says he is finishing a Ph.D. at the University of Wisconsin-Madison and works on large language model reasoning, alignment, and reinforcement learning.

Deepak Narayanan portrait
Researcher 1 reports

Deepak Narayanan

NVIDIA

Research scientist at NVIDIA working on AI systems and large-scale machine learning, with research spanning distributed systems, deep learning, and networking.

Dibya Ghosh portrait
Researcher 1 reports

Dibya Ghosh

Cohere

Dibya Ghosh is a machine learning researcher at Cohere and a PhD student in computer science at UC Berkeley advised by Sergey Levine. His work spans reinforcement learning and large language models, with a focus on how foundation models can improve learning agents.

Diyi Yang portrait
Researcher 1 reports

Diyi Yang

Cohere

Diyi Yang is an assistant professor of computer science at Stanford University. Her research focuses on natural language processing and machine learning, especially human-centered AI, social computing, and computational social science. She earned her PhD in language technologies from Carnegie Mellon University.

Emilie Grave portrait
Researcher 1 reports

Emilie Grave

Meta AI

Emilie Grave is a research scientist at FAIR. Her research interests include representation learning, natural language processing, and machine learning.

Gaurav Dhuliawala portrait
Researcher 1 reports

Gaurav Dhuliawala

Google Gemini

Postdoctoral fellow at Johns Hopkins University and incoming assistant professor at UNC Chapel Hill whose work focuses on multilingual NLP, grounded language understanding, and societal impacts of language technologies.

Han Bao portrait
Researcher 1 reports

Han Bao

DeepSeek

Han Bao is a research scientist at DeepSeek AI. His homepage says he works on machine learning theory, including out-of-distribution generalization, robust machine learning, self-supervised learning, and learning-based control.

Hao Fei portrait
Researcher 1 reports

Hao Fei

Tencent Hunyuan

Researcher currently at Tencent Hunyuan with public work in natural language processing, structured information extraction, and large language models.

Hongyi Yuan portrait
Researcher 1 reports

Hongyi Yuan

Alibaba Qwen

Engineer on the Qwen team at Alibaba Cloud.

Isabella Fulford portrait
Researcher 1 reports

Isabella Fulford

OpenAI

Member of Technical Staff at OpenAI and co-instructor of DeepLearning.AI's ChatGPT Prompt Engineering for Developers and Building Systems with the ChatGPT API courses; listed as a co-author of the GPT-4 Technical Report.

Iz Beltagy portrait
Researcher 1 reports

Iz Beltagy

Ai2

Iz Beltagy is an NLP researcher known for work on OLMo, Longformer, SciBERT, and other open language technologies. His public profile highlights research on open language models, scientific NLP, and long-context modeling.

Jessica Shieh portrait
Researcher 1 reports

Jessica Shieh

OpenAI

Research scientist at OpenAI interested in general intelligence; previously studied computer science and cognitive science at Stanford and UC Berkeley.

Jingxiang Sun portrait
Researcher 1 reports

Jingxiang Sun

DeepSeek

PhD student at Tsinghua University researching multimodal large language models, digital avatars, and 2D/3D content generation; previously an AGI student researcher at DeepSeek AI who led DreamCraft3D and contributed to DeepSeek-VL.

Kexin Huang portrait
Researcher 1 reports

Kexin Huang

DeepSeek

PhD student at Shanghai Jiao Tong University advised by Prof. Xueqi Cheng. Her research interests include retrieval and agents, multimodal large language models, and biomedical large language models, and she is currently an intern at DeepSeek.

Leo Gao portrait
Researcher 1 reports

Leo Gao

OpenAI

Research scientist at OpenAI working on core post-training and previously on pre-training, with interests in personalized and safe AI.

Lewis Tunstall portrait
Researcher 1 reports

Lewis Tunstall

Cohere

Lewis Tunstall is a principal scientist at Cohere Labs who works on open-source language models, evaluation, and multilingual NLP.

Litong Wang portrait
Researcher 1 reports

Litong Wang

DeepSeek

Researcher at DeepSeek AI and incoming Ph.D. student at the University of Hong Kong working on efficient and trustworthy LLMs, multimodal large language models, and reasoning.

Machel Reid portrait
Researcher 1 reports

Machel Reid

Meta AI

Research scientist at Meta FAIR focused on multilinguality, machine translation, speech, and web-scale processing; previously a postdoctoral scholar at Stanford's Center for Research on Foundation Models.

Michael Petrov portrait
Researcher 1 reports

Michael Petrov

OpenAI

Technical staff at OpenAI interested in the theoretical foundations and algorithms for large language models and reinforcement learning.

Mingxiao Li portrait
Researcher 1 reports

Mingxiao Li

Moonshot AI

Mingxiao Li is a machine learning researcher focused on vision-language models, generative models, and multimodal learning. His public homepage lists a PhD at KU Leuven, and public research outputs also list him with a Moonshot AI Ltd. affiliation.

Moin Nadeem portrait
Researcher 1 reports

Moin Nadeem

Ai2

Empirical AI researcher at the Allen Institute for AI whose work focuses on post-training, synthetic data, and evaluation.

Paul Baltescu portrait
Researcher 1 reports

Paul Baltescu

OpenAI

Machine learning researcher and engineer; contributed to the GPT-4 Technical Report; previously an MSc by Research student at the University of Oxford Department of Computer Science.

Peter Welinder portrait
Researcher 1 reports

Peter Welinder

OpenAI

Founding member of OpenAI and vice president of product who writes publicly about building things and machine learning.

Saurabh Pan portrait
Researcher 1 reports

Saurabh Pan

Meta AI

Saurabh Pan is an AI researcher at Meta FAIR and a PhD student at Carnegie Mellon University. His work focuses on self-supervised learning, generative modeling, multimodal learning, and representation learning.

Sebastian M. R. Arnold portrait
Researcher 1 reports

Sebastian M. R. Arnold

Google Gemini

Research software engineer at Google DeepMind working on machine learning for code and formal reasoning; completed a PhD at TU Darmstadt on uncertainty quantification in deep learning.

Ted Sanders portrait
Researcher 1 reports

Ted Sanders

OpenAI

Technical staff at OpenAI working on machine learning research, large language models, multimodal models, and robotics.

Todor Markov portrait
Researcher 1 reports

Todor Markov

OpenAI

Research scientist at OpenAI working on pretraining and reasoning; previously studied computer science at Stanford University.

Tyna Eloundou portrait
Researcher 1 reports

Tyna Eloundou

OpenAI

Member of Technical Staff at OpenAI focused on safety evaluations, economic impact evaluations, and democratic inputs to AI.

Ujjwal Malladi portrait
Researcher 1 reports

Ujjwal Malladi

NVIDIA

Software engineer at NVIDIA with public work spanning machine learning systems and large language models, including Nemotron-4.

Vijay Janapa Reddi portrait
Researcher 1 reports

Vijay Janapa Reddi

NVIDIA

Vijay Janapa Reddi is a professor of electrical engineering and computer science at Harvard University whose research spans computer architecture, systems, compilers, and machine learning.

Vinnie Monaco portrait
Researcher 1 reports

Vinnie Monaco

OpenAI

Associate Professor in the Department of Computer Science at the University of Alabama at Birmingham whose research spans computer security, machine learning, pattern recognition, biometrics, and formal methods.

Wanjia Zhao portrait
Researcher 1 reports

Wanjia Zhao

DeepSeek

Research scientist at DeepSeek interested in post-training, tool use, and in-context learning.

Wen Ouyang portrait
Researcher 1 reports

Wen Ouyang

Tencent Hunyuan

Wen Ouyang is a Vice President of AI at Tencent and Director of the Visual Computing Center, where he leads the Hunyuan multimodal foundation model team. His research covers computer vision, multimodal models, and visual understanding.

William Merrill portrait
Researcher 1 reports

William Merrill

Ai2

Assistant Professor at the NYU Courant Institute of Mathematical Sciences. He studies formal models of intelligence, including language, learning, and efficient reasoning, and previously worked as a research scientist at Ai2.

Xiang Lisa Li portrait
Researcher 1 reports

Xiang Lisa Li

Cohere

Xiang Lisa Li is a researcher focused on controllable and steerable language models. Her public profile highlights work including Diffusion-LM, Prefix-Tuning, Contrastive Decoding, and evaluation methods such as AutoBencher.

Yiyang Ma portrait
Researcher 1 reports

Yiyang Ma

DeepSeek

Research scientist at DeepSeek AI focused on multimodal large language models, reinforcement learning, and efficient post-training.

Yunxing Dai portrait
Researcher 1 reports

Yunxing Dai

OpenAI

Yunxing Dai is a researcher at OpenAI and a co-author of the GPT-4 Technical Report. His public homepage and Google Scholar profile provide verified research identity links.

Zhengyan Zhang portrait
Researcher 1 reports

Zhengyan Zhang

DeepSeek

Researcher in natural language processing and pre-trained language models, with public academic work from Tsinghua University and related NLP publications.

Zhen Qin portrait
Researcher 1 reports

Zhen Qin

Cohere

Research scientist at Cohere working on large language models; previously a postdoctoral researcher in machine learning at Carnegie Mellon University.

Zhiniu Wen portrait
Researcher 1 reports

Zhiniu Wen

DeepSeek

Zhiniu Wen is a research scientist at DeepSeek. His homepage says his research interests include natural language processing, machine learning, large language models, and their applications.

Adriana Hummel portrait
Researcher 1 reports

Adriana Hummel

NVIDIA

Adriana Hummel is a PhD student in computer science at Stanford University working on efficient and trustworthy AI systems.

Anjali Gupte portrait
Researcher 1 reports

Anjali Gupte

NVIDIA

Anjali Gupte is a PhD student in computer science at the University of Southern California whose research focuses on trustworthy AI, natural language processing, and information retrieval.

Caia Costello portrait
Researcher 1 reports

Caia Costello

Ai2

Public sources identify Caia Costello as a Stanford computer science MS student and quote her in 2026 as an Applied ML Researcher at Lambda; she is also a coauthor of OLMo Hybrid.

Ming-Wei Chang portrait
Researcher 1 reports

Ming-Wei Chang

Amazon

Research Scientist at Google DeepMind working on machine learning and artificial intelligence.

Zizheng Pan portrait
Researcher 1 reports

Zizheng Pan

DeepSeek

Zizheng Pan's homepage says he has worked on multimodal LLMs at DeepSeek since 2024. The same page says he previously completed a PhD in Computer Science at Monash University and a B.E. in Software Engineering at Harbin Institute of Technology, Weihai.

Size Zheng portrait
Researcher 1 reports

Size Zheng

DeepSeek

ByteDance researcher working on high-performance inference systems, AI compilers, and distributed systems; completed a PhD in computer science at Peking University in 2024.

Supriya Kalluri portrait
Researcher 1 reports

Supriya Kalluri

Cohere

PhD candidate at the University of Washington and research scientist at Cohere working on natural language processing and machine learning.

Ilge Akkaya portrait
Researcher 1 reports

Ilge Akkaya

OpenAI

Ilge Akkaya is a research scientist at OpenAI working on reinforcement learning and robotics. His personal website lists a PhD in EECS from UC Berkeley and a BS in Electrical and Electronics Engineering from Bilkent University.

Noah A. Smith portrait
Researcher 7 reports

Noah A. Smith

Ai2

Noah A. Smith is a computer scientist and professor at the University of Washington, where he serves as Vice Provost for Artificial Intelligence and co-directs the OLMo open language modeling effort with Ai2. His research focuses on natural language processing, machine learning, and evaluation methodology.

Arthur Mensch portrait
Researcher 4 reports

Arthur Mensch

Mistral AI

Co-founder and CEO of Mistral AI and a researcher on efficient large language models and mixture-of-experts systems.

David Atkinson portrait
Researcher 1 reports

David Atkinson

Ai2

David Atkinson is a senior research scientist at Ai2 working on multimodal learning, document understanding, visual question answering, and natural language processing.

Peter Clark portrait
Researcher 1 reports

Peter Clark

Ai2

Peter Clark is Senior Director for AI at the Allen Institute for Artificial Intelligence (Ai2), where he works on natural language understanding, reasoning, and question answering.

Guillaume Lample portrait
Researcher 8 reports

Guillaume Lample

Meta AI / Mistral AI

Mistral AI's about page lists Guillaume Lample as one of the company's three founders. His OpenReview profile lists expertise in machine translation and natural language processing and a PhD in computer science at Université Pierre et Marie Curie - Paris 6.

Li Dong portrait
Researcher 5 reports

Li Dong

Microsoft

Li Dong is a Microsoft Research principal researcher focused on human language technologies and machine intelligence.

Yichang Zhang portrait
Researcher 4 reports

Yichang Zhang

Alibaba Qwen

Yichang Zhang is a research scientist in Alibaba's Tongyi Lab and part of the Qwen team. His public profiles describe work on machine reasoning, agent systems, post-training, synthetic data, evaluation, large language models, multimodal learning, and perception.

Zhihong Shao portrait
Researcher 5 reports

Zhihong Shao

DeepSeek

Research scientist at DeepSeek AI working on multimodal large language models and end-to-end autonomous driving. Earned a PhD in computer science from the Chinese University of Hong Kong.

Ashish Sabharwal portrait
Researcher 2 reports

Ashish Sabharwal

Ai2

Ashish Sabharwal is publicly listed as Principal Researcher at the Allen Institute for AI. Public sources also list him as a coauthor of the OLMo 3 technical report.

Alisa Liu portrait
Researcher 1 reports

Alisa Liu

Ai2

Alisa Liu researches language models, including tokenization, data creation, and inference-time adaptation.

Aman Rangapur portrait
Researcher 1 reports

Aman Rangapur

Ai2

Aman Rangapur says he works on language models at Ai2 with a focus on modeling, scaling, and infrastructure, and lists prior research roles at Illinois Institute of Technology and VIT-AP.

Xin Dong portrait
Researcher 1 reports

Xin Dong

NVIDIA

Xin Dong's homepage says he leads a research team on LLM training at ByteDance Seed. It also states that he earned a Harvard PhD in 2023 and previously worked at NVIDIA, Meta, and Tencent.

Yoram Bachrach portrait
Researcher 1 reports

Yoram Bachrach

Google Gemini

Yoram Bachrach is a research scientist at Meta in London focused on LLM agents and multi-agent reasoning. He previously worked at Google DeepMind and Microsoft Research in Cambridge.

Peter J. Liu portrait
Researcher 1 reports

Peter J. Liu

Cohere

Co-founder of Twenty Labs. His homepage says he previously worked as a research scientist at Google DeepMind, after earlier production machine learning work on Gmail Spam.

Suchir Balaji portrait
Researcher 1 reports

Suchir Balaji

OpenAI

Suchir Balaji was a Member of Technical Staff at OpenAI (Nov 2020 to Aug 2024). His personal site states he studied Computer Science at UC Berkeley (2017 to 2021).

Natalie Summers portrait
Researcher 1 reports

Natalie Summers

OpenAI

OpenAI staff member whose public speaker bio says she ran the Artist Access Program; earlier roles included Apple and journalism work at WIRED and USA TODAY.

Bryan Catanzaro portrait
Researcher 7 reports

Bryan Catanzaro

NVIDIA

Vice President of Applied Deep Learning Research at NVIDIA, leading work on conversational AI, generative AI, and accelerated deep learning software.

Caiming Xiong portrait
Researcher 5 reports

Caiming Xiong

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the xLAM-2 Technical Report.

Dario Amodei portrait
Researcher 5 reports

Dario Amodei

Anthropic / OpenAI

Co-founder and CEO of Anthropic.

Silvio Savarese portrait
Researcher 4 reports

Silvio Savarese

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the paper "xLAM: A Family of Large Action Models to Empower AI Agent Systems."

Nathan Lambert portrait
Researcher 4 reports

Nathan Lambert

Ai2

Machine learning scientist at Ai2 working on reinforcement learning, language models, and online social systems.

Albert Q. Jiang portrait
Researcher 3 reports

Albert Q. Jiang

Mistral AI

Researcher and engineer at Mistral AI focused on distributed machine learning and efficient large language models.

Minghui Tang portrait
Researcher 3 reports

Minghui Tang

DeepSeek

Research scientist at DeepSeek.

Zhewen Hao portrait
Researcher 3 reports

Zhewen Hao

DeepSeek

First-year Ph.D. student in Information Engineering at the Chinese University of Hong Kong advised by Dahua Lin and Jifeng Dai. His research interests include understanding and building intelligence in multimodal large language models.

Shaohan Huang portrait
Researcher 6 reports

Shaohan Huang

Microsoft

Shaohan Huang is a senior researcher in the General Artificial Intelligence Group at Microsoft Research Asia in Beijing. OpenReview lists him as a Microsoft researcher and a former master's student at Beihang University.

Diogo Almeida portrait
Researcher 2 reports

Diogo Almeida

OpenAI

Diogo Almeida is an AI researcher and a co-author of the InstructGPT paper (arXiv:2203.02155).

Alexander Keller portrait
Researcher 1 reports

Alexander Keller

NVIDIA

NVIDIA Research identifies Alexander Keller as a senior director of research, formerly chief scientist at mental images and previously a professor at Ulm University. His research interests are at the intersection of graphics, communications, and machine learning.

Mary Phuong portrait
Researcher 1 reports

Mary Phuong

Cohere

Mary Phuong's 2026 MATS mentor profile describes her as a research scientist at Google DeepMind working on frontier safety loss-of-control problems, including AGI control, security, and monitoring.

Yanhong Li portrait
Researcher 1 reports

Yanhong Li

Ai2

Yanhong Li is a pre-doctoral researcher at the Allen Institute for AI working on data-efficient pretraining, language model efficiency, and hybrid model theory.

Jaron Lochner portrait
Researcher 1 reports

Jaron Lochner

Ai2

Publicly listed as a researcher at the Allen Institute for Artificial Intelligence and named on the OLMo 3 author list.

Pavlo Molchanov portrait
Researcher 1 reports

Pavlo Molchanov

NVIDIA

Pavlo Molchanov leads deep learning efficiency work at NVIDIA Research, with public profiles covering LLM and VLM efficiency, model compression, adaptive inference, and earlier computer vision research.

Scott Geng portrait
Researcher 1 reports

Scott Geng

Ai2

Scott Geng's homepage identifies him as a University of Washington PhD student and visiting researcher at FAIR; the OLMo 3 technical report lists him as a coauthor.

Teng Xiao portrait
Researcher 1 reports

Teng Xiao

Ai2

Teng Xiao is a Young Investigator at the Allen Institute for AI and the University of Washington. His homepage says his research interests are in machine learning, natural language processing, and reinforcement learning.

Myle Ott portrait
Researcher 1 reports

Myle Ott

Meta AI

Myle Ott is a research scientist at Meta working on the Llama models. His interests include scalable optimization, distributed systems, and language model pretraining.

Panpan Huang portrait
Researcher 4 reports

Panpan Huang

DeepSeek

DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

Gretchen Krueger portrait
Researcher 2 reports

Gretchen Krueger

OpenAI

Software engineer at OpenAI focused on inference. Previously studied computer science at Princeton University.

Lester James V. Miranda portrait
Researcher 2 reports

Lester James V. Miranda

Ai2

Machine learning researcher and PhD student at the University of Cambridge focused on multilingual and equitable NLP, especially for low-resource languages such as Filipino. He previously worked on open language model efforts at Ai2 including Tulu 3 and OLMo 2.

Long Ouyang portrait
Researcher 2 reports

Long Ouyang

OpenAI

Researcher at OpenAI who led InstructGPT and GPT-4 post-training. He previously co-founded Merlyn Mind and was an engineering director at Quora.

Lucas Beyer portrait
Researcher 2 reports

Lucas Beyer

Google Gemini

Lucas Beyer is an ML researcher at Google DeepMind in Zurich. His public homepage highlights prior work at Google Brain and a PhD at ETH Zurich.

Maarten Sap portrait
Researcher 2 reports

Maarten Sap

Ai2

Maarten Sap is an assistant professor at the University of Washington and a senior research scientist at the Allen Institute for AI. His work focuses on human-centered language technologies and social NLP.

Nick Ryder portrait
Researcher 2 reports

Nick Ryder

OpenAI

Nick Ryder is VP of Research, Foundations at OpenAI and a contributor to GPT-4. He previously studied mathematics at UC Berkeley and Rice University.

Nikolay Savinov portrait
Researcher 2 reports

Nikolay Savinov

Google Gemini

Research scientist at Google DeepMind on the Gemini team, working on multimodal AI.

Pranav Shyam portrait
Researcher 2 reports

Pranav Shyam

OpenAI

Engineer and founder who previously worked at OpenAI, Palantir, and Lyft and now builds the AI search startup Basalt. He publishes public writing on AI products and software on his personal site.

Xiaoxiang Wang portrait
Researcher 2 reports

Xiaoxiang Wang

DeepSeek

Research scientist at DeepSeek working on post-training and reinforcement learning, with interests in systems and AI for science.

Yunfei Chu portrait
Researcher 2 reports

Yunfei Chu

Alibaba Qwen

Algorithm expert at Alibaba Group working on computer vision, multimodal learning, and large language models.

Junxiao Song portrait
Researcher 8 reports

Junxiao Song

DeepSeek

DeepSeek report author whose DBLP record includes DeepSeek LLM, DeepSeekMath, DeepSeek-Coder-V2, DeepSeek-V3, DeepSeek-R1, Janus, and JanusFlow work.

Haowei Zhang portrait
Researcher 7 reports

Haowei Zhang

DeepSeek

DeepSeek report author whose DBLP-linked publication record includes DeepSeek LLM, DeepSeek-Coder-V2, Janus, DeepSeek-V3, and DeepSeek-R1 work.

Dejian Yang portrait
Researcher 7 reports

Dejian Yang

DeepSeek

DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

Peiyi Wang portrait
Researcher 6 reports

Peiyi Wang

DeepSeek

DeepSeek report author whose DBLP record includes DeepSeek LLM, DeepSeekMath, DeepSeek-Coder-V2, DeepSeek-V2, DeepSeek-V3, and DeepSeek-R1 work.

Jianwei Zhang portrait
Researcher 5 reports

Jianwei Zhang

Alibaba Qwen

Alibaba Qwen report author whose DBLP profile identifies an Alibaba Group DAMO Academy affiliation and Qwen technical report authorship.

Peng Wang portrait
Researcher 5 reports

Peng Wang

Alibaba Qwen

Alibaba Qwen report author whose DBLP profile identifies an Alibaba Group affiliation and Qwen technical report authorship.

Wenbin Ge portrait
Researcher 5 reports

Wenbin Ge

Alibaba Qwen

Alibaba Qwen report author whose DBLP record includes Qwen2.5-VL and Qwen technical report work on multimodal and large language models.

Yonghui Wu portrait
Researcher 5 reports

Yonghui Wu

Google Gemini

Google researcher whose official profile says he joined Google in September 2008 and has been with Google Brain since January 2015, with research interests spanning information retrieval, machine learning, machine translation, and natural language processing.

Chenggang Zhao portrait
Researcher 4 reports

Chenggang Zhao

DeepSeek

Research engineer at DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Jiashi Li portrait
Researcher 4 reports

Jiashi Li

DeepSeek

DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

Damai Dai portrait
Researcher 4 reports

Damai Dai

DeepSeek

DeepSeek report author whose DBLP publication record includes DeepSeek LLM, DeepSeekMoE, DeepSeek-V2, DeepSeek-V3, and DeepSeek-R1 work.

Deli Chen portrait
Researcher 4 reports

Deli Chen

DeepSeek

DeepSeek report author whose DBLP record includes DeepSeek LLM, DeepSeekMoE, DeepSeek-Coder-V2, DeepSeek-V3, and DeepSeek-R1 work.

Guanting Chen portrait
Researcher 4 reports

Guanting Chen

DeepSeek

DeepSeek-AI researcher whose DBLP profile lists a DeepSeek-AI affiliation and DeepSeek-Coder, DeepSeek-V2, and DeepSeek-V3 authorship.

Jianhong Tu portrait
Researcher 5 reports

Jianhong Tu

Alibaba Qwen

Researcher on Alibaba's Qwen team focused on pre-training and post-training for multimodal large language models and reasoning.

Pei Zhang portrait
Researcher 4 reports

Pei Zhang

Alibaba Qwen

Alibaba Qwen report author whose DBLP profile identifies an Alibaba Group affiliation and Qwen technical report authorship.

Xiao Bi portrait
Researcher 4 reports

Xiao Bi

DeepSeek

DeepSeek report author whose DBLP record includes DeepSeek LLM, DeepSeek-Coder, DeepSeek-Coder-V2, DeepSeek-V2, DeepSeek-V3, and DeepSeek-R1 work.

Xin Liu portrait
Researcher 4 reports

Xin Liu

DeepSeek

DeepSeek-AI researcher whose DBLP profile lists a DeepSeek-AI affiliation and DeepSeek LLM, DeepSeek-V2, and DeepSeek-V3 authorship.

Shuming Ma portrait
Researcher 4 reports

Shuming Ma

Microsoft

Co-author of the BitNet b1.58 2B4T Technical Report; the paper's author note states that S. Ma is with Microsoft Research.

Andrea Vallone portrait
Researcher 1 reports

Andrea Vallone

OpenAI

Andrea Vallone is publicly credited on OpenAI's GPT-4 safety and policy evaluations, and late-2025 to early-2026 reporting said she left OpenAI for Anthropic's alignment team.

David Heineman portrait
Researcher 2 reports

David Heineman

Ai2

David Heineman works on the OLMo project at Ai2 and says he completed his undergraduate studies at Georgia Tech.

Weijia Shi portrait
Researcher 1 reports

Weijia Shi

Ai2

Weijia Shi is a Ph.D. candidate at the University of Washington. Her research focuses on building augmented and modular architectures to improve the quality, efficiency, and capabilities of language models.

Yejin Choi portrait
Researcher 2 reports

Yejin Choi

Ai2

Dieter Schwarz Foundation Professor and Senior Fellow in Stanford Computer Science and HAI. Her public homepage notes previous roles as professor at the University of Washington and senior director at Ai2.

Benjamin Mann portrait
Researcher 1 reports

Benjamin Mann

OpenAI

Public conference bios describe Benjamin Mann as a co-founder and member of the technical staff at Anthropic. They also describe earlier OpenAI work on infrastructure, efficiency, and safety for GPT-3.

Fucong Dai portrait
Researcher 1 reports

Fucong Dai

DeepSeek

Research scientist at DeepSeek with public GitHub projects on large language models and AI systems.

Hamish Ivison portrait
Researcher 1 reports

Hamish Ivison

Ai2

Hamish Ivison's homepage says he is a PhD student in Computer Science and Engineering at the University of Washington working on natural language processing and post-training for language models. Public sources also list him as a coauthor of OLMo 3.

Ian Magnusson portrait
Researcher 1 reports

Ian Magnusson

Ai2

Ian Magnusson's public site says he does AI research at the University of Washington and the Allen Institute for AI, focused on language-model science and evaluation.

Niklas Muennighoff portrait
Researcher 1 reports

Niklas Muennighoff

Ai2

Niklas Muennighoff researches language models and evaluation at Stanford and has publicly led open-model work including OLMoE.

Akhil Mathur portrait
Researcher 1 reports

Akhil Mathur

Meta AI

Akhil Mathur is a postdoctoral research scientist at Meta FAIR whose public homepage highlights work in natural language processing and machine learning.

Alan Soldaini portrait
Researcher 1 reports

Alan Soldaini

Ai2

Research scientist at Ai2 working on language models and open research infrastructure. His interests include natural language processing, machine learning, and building reliable open-source systems.

Angela Jiang portrait
Researcher 1 reports

Angela Jiang

OpenAI

Engineer at OpenAI. Formerly worked on generative AI at Disney and on data science projects with organizations including MIT and Tufts, and studied at MIT.

Atty Eleti portrait
Researcher 1 reports

Atty Eleti

OpenAI

Research scientist at Thinking Machines Lab with prior language-model work at OpenAI and earlier quantitative research at Bridgewater. Her public profile highlights interests in scaling language models, new architectures, and adaptive inference.

Baptiste Rolland portrait
Researcher 1 reports

Baptiste Rolland

Meta AI

Research scientist at Meta AI working on applied mathematics, machine learning, and artificial intelligence.

Billy Roziere portrait
Researcher 1 reports

Billy Roziere

Meta AI

Research scientist at FAIR Meta and teacher affiliated with ENS, INRIA, and PSL.

Braden Hancock portrait
Researcher 1 reports

Braden Hancock

Google Gemini

Braden Hancock is a researcher at Google DeepMind whose public homepage highlights work on large language models, weak supervision, and knowledge representation.

Cameron Raymond portrait
Researcher 1 reports

Cameron Raymond

OpenAI

Cameron Raymond is a software engineer at OpenAI. His personal site says he previously worked at Asana, led developer experience at Stripe, and is an author of Staff Engineer.

Chandra Bhagavatula portrait
Researcher 1 reports

Chandra Bhagavatula

Ai2

Research scientist at Ai2 focused on natural language processing, commonsense reasoning, long-form generation, narrative intelligence, and text-based games.

Chelsea Voss portrait
Researcher 1 reports

Chelsea Voss

OpenAI

Research manager at OpenAI with a public site covering publications and projects.

Christine McLeavey portrait
Researcher 1 reports

Christine McLeavey

OpenAI

Audio research leader at OpenAI. She previously worked on MuseNet and Jukebox, and earlier studied physics at Princeton and neuroscience at Stanford before a professional career as a classical pianist.

Daniel Levy portrait
Researcher 1 reports

Daniel Levy

OpenAI

Research Scientist at OpenAI working on deep learning theory, AI alignment, and large language models; he previously completed a PhD at Stanford advised by Percy Liang.

David Hall portrait
Researcher 1 reports

David Hall

NVIDIA

Principal research scientist at NVIDIA working on recursive intelligence, test-time reasoning, AI for biology, multimodal generation, and code generation.

David Karamcheti portrait
Researcher 1 reports

David Karamcheti

Google Gemini

David Karamcheti is a research scientist at Google DeepMind whose public homepage highlights work on multimodal foundation models and embodied intelligence.

David Medina portrait
Researcher 1 reports

David Medina

OpenAI

Researcher at OpenAI working on multimodal and agentic systems. He previously worked on machine learning systems and video understanding, and is completing a PhD at Columbia University.

David Wadden portrait
Researcher 1 reports

David Wadden

Ai2

David Wadden is an assistant professor at the University of Washington working on natural language processing and AI systems for science and high-stakes domains. His public profile highlights information extraction, scientific NLP, and trustworthy language model applications.

Derek Chen portrait
Researcher 1 reports

Derek Chen

OpenAI

Research scientist at OpenAI focused on multimodal reinforcement learning and natural language processing; PhD candidate at Stanford University.

Dmitriy Lepikhin portrait
Researcher 1 reports

Dmitriy Lepikhin

Google Gemini

Distinguished engineer on Google's Gemini LLM team working on machine learning systems, generative AI, and data center infrastructure.

Edward J Hu portrait
Researcher 1 reports

Edward J Hu

Cohere

Edward J. Hu is a machine learning researcher known for efficient adaptation methods for large language models, including LoRA. His public profile focuses on parameter-efficient fine-tuning, model editing, and practical LLM systems.

Evan Morikawa portrait
Researcher 1 reports

Evan Morikawa

OpenAI

Member of Technical Staff at OpenAI working on large language models. He has also worked on software at YouTube and Google and studied computer science at Stanford University.

Gustavo de Rosa portrait
Researcher 1 reports

Gustavo de Rosa

Google Gemini

Research scientist at Google DeepMind working on language models, multimodal learning, evaluation, and alignment.

Haozhun Jin portrait
Researcher 1 reports

Haozhun Jin

OpenAI

OpenAI team member in the San Francisco Bay Area and co-author of the GPT-4 Technical Report.

Heewoo Jun portrait
Researcher 1 reports

Heewoo Jun

OpenAI

Member of Technical Staff at OpenAI. Previously at New York University, where he completed a PhD in computer science advised by Kyunghyun Cho. His interests include natural language processing, machine learning, and deep learning.

Heng-Tze Cheng portrait
Researcher 1 reports

Heng-Tze Cheng

Google Gemini

Senior Staff Software Engineer at Google working on machine intelligence, data management, and large-scale machine learning systems.

Hrant Khachatrian portrait
Researcher 1 reports

Hrant Khachatrian

Cerebras Systems

Hrant Khachatrian is a machine learning researcher and engineer at FAST and a co-founder of YerevaNN, with work spanning natural language processing, computer vision, and efficient language models.

Ikai Lan portrait
Researcher 1 reports

Ikai Lan

OpenAI

Engineer at OpenAI credited in the GPT-4 technical report for reliability engineering. Previously worked on Google App Engine and at LinkedIn, according to his PyCon 2011 speaker profile.

Jake Kenevey portrait
Researcher 1 reports

Jake Kenevey

Meta AI

Software engineer at Meta building intelligence for wearables; he previously worked on developer infrastructure for language models and assistant products, as well as on-device NLP systems.

Jan Kautz portrait
Researcher 1 reports

Jan Kautz

NVIDIA

Vice President of Learning and Perception Research at NVIDIA, leading work in computer vision, graphics, and machine learning.

Jianmo Ni portrait
Researcher 1 reports

Jianmo Ni

Cohere

Jianmo Ni is a researcher focused on information retrieval, question answering, and large language model systems. His public profile highlights retrieval-augmented generation, ranking, and efficient NLP methods.

Jiayi Weng portrait
Researcher 1 reports

Jiayi Weng

OpenAI

PhD student in Computer Science at Stanford University and affiliated with OpenAI. Her research focuses on natural language processing and machine learning, and she is a co-author of the GPT-4 Technical Report.

Karthik Pattabiraman portrait
Researcher 1 reports

Karthik Pattabiraman

NVIDIA

Karthik Pattabiraman is a professor of computer science at the University of British Columbia whose research spans software engineering, dependable distributed systems, systems performance, and software security.

Keunhong Park portrait
Researcher 1 reports

Keunhong Park

NVIDIA

Researcher focused on world models, 3D generation, and neural rendering. He is a founding member of World Labs, previously worked at Google on 3D asset generation, and received a PhD from the University of Washington in 2021.

Liam Fedus portrait
Researcher 1 reports

Liam Fedus

OpenAI

Liam Fedus is a research scientist at OpenAI. OpenAI's staff page says his research interests include reinforcement learning and language models, with a focus on improving capabilities, safety, and robustness.

Lilian Weng portrait
Researcher 1 reports

Lilian Weng

OpenAI

Lilian Weng is vice president of research at OpenAI. OpenAI's staff page says she works on AI systems that can mimic and reason about our world to improve intelligence.

Łukasz Kondraciuk portrait
Researcher 1 reports

Łukasz Kondraciuk

OpenAI

Member of Technical Staff at OpenAI focused on advancing the capabilities of large language models.

Lysa Dash portrait
Researcher 1 reports

Lysa Dash

Mistral AI

Research scientist at Mistral AI and PhD candidate at Mila whose interests include causal representation learning, world models, and reinforcement learning.

Mike Heaton portrait
Researcher 1 reports

Mike Heaton

OpenAI

Engineer and product builder who has worked on ChatGPT, Codex, the OpenAI API, and developer tools. He publishes writing and code on his personal site and GitHub.

Moyin Olorunnisola portrait
Researcher 1 reports

Moyin Olorunnisola

Cohere

Machine learning researcher and engineer whose public work includes multimodal models, robotics, and reinforcement learning, including research at Cohere.

Peter West portrait
Researcher 1 reports

Peter West

Cohere

Peter West is a PhD candidate at the University of Washington studying aligned, grounded, and trustworthy language models. His work focuses on evaluation, human feedback, model behavior, and safer NLP systems.

Prajjwal Bhargava portrait
Researcher 1 reports

Prajjwal Bhargava

Meta AI

Research scientist at FAIR Meta working on multimodal learning, image-text retrieval, and language technologies.

Pramod Kumbhare portrait
Researcher 1 reports

Pramod Kumbhare

NVIDIA

Member of technical staff at NVIDIA Research focused on language models, deep learning, and efficient training systems.

Ruize Gao portrait
Researcher 1 reports

Ruize Gao

Alibaba Qwen

Research engineer at Alibaba Tongyi Lab focusing on pre-training, post-training, and efficient architecture for large language models.

Runji Wang portrait
Researcher 1 reports

Runji Wang

DeepSeek

Researcher at DeepSeek and incoming assistant professor at the University of Hong Kong specializing in language, vision, and speech.

Ruobing Xie portrait
Researcher 1 reports

Ruobing Xie

Z.ai

Researcher whose public work spans information retrieval, recommendation systems, and large language models.

Sarah Yoo portrait
Researcher 1 reports

Sarah Yoo

OpenAI

Researcher at OpenAI focused on large language models and embodied AI. Her public background includes AI and recommendation work at TikTok, computer vision and robotics research at Stanford, and an ML master's at Princeton.

Shivanshu Gupta portrait
Researcher 1 reports

Shivanshu Gupta

Ai2

Research scientist at the Allen Institute for AI working on large language models, multilingual NLP, machine reasoning, and model evaluation.

Shixiang Shane Gu portrait
Researcher 1 reports

Shixiang Shane Gu

OpenAI

Research scientist at OpenAI.

Shusheng Yang portrait
Researcher 1 reports

Shusheng Yang

Alibaba Qwen

Shusheng Yang is a research scientist in Tongyi Lab at Alibaba Group. His work focuses on natural language processing and machine learning, especially pre-trained language models, text generation, and efficient NLP.

Taya Hartman portrait
Researcher 1 reports

Taya Hartman

Cohere

Taya Hartman is a product and brand designer whose public portfolio spans digital, visual, and interactive work. Public sources tie her to Cohere as a senior designer, and she is credited as a contributor to the Command A technical report.

Thomas Degry portrait
Researcher 1 reports

Thomas Degry

OpenAI

Thomas Degry is a design engineer at OpenAI based in San Francisco. Public profiles also describe prior work as a design engineer at Stripe and as tech lead for openai.com.

Tomi Pott portrait
Researcher 1 reports

Tomi Pott

Meta AI

Tomi Pott is a research engineer at Meta Research whose public homepage highlights work in multi-agent reinforcement learning, deep learning, and AI systems.

Tongfei Chen portrait
Researcher 1 reports

Tongfei Chen

Cohere

Principal scientist at Cohere and adjunct professor at McGill University, with research spanning NLP, machine learning systems, speech, and biometrics.

Vitchyr H. Pong portrait
Researcher 1 reports

Vitchyr H. Pong

OpenAI

Research scientist at OpenAI working on reinforcement learning and efficient machine learning; previously completed a PhD at UC Berkeley advised by Sergey Levine.

Xinyu Wang portrait
Researcher 1 reports

Xinyu Wang

Alibaba Qwen

Researcher in multimodal large language models at the Alibaba Qwen team and student at Johns Hopkins University.

Yaniv Markovski portrait
Researcher 1 reports

Yaniv Markovski

OpenAI

Senior member of technical staff at OpenAI focused on machine learning systems and AGI. He previously co-founded DeepGPU and publishes public writing and code on his website and GitHub.

Yongjik Kim portrait
Researcher 1 reports

Yongjik Kim

OpenAI

Research scientist at OpenAI focused on applying AI to medicine and science; previously worked on AI for vision, language, healthcare, and robotics.

Yuyang Zhou portrait
Researcher 1 reports

Yuyang Zhou

DeepSeek

Research intern at DeepSeek and a Tsinghua University undergraduate focused on machine learning systems and large language model inference.

Zhenru Zhang portrait
Researcher 4 reports

Zhenru Zhang

Alibaba Qwen

Ph.D. student in Computer Science at Zhejiang University and researcher on the Qwen series at Alibaba Group.

Kyle Lo portrait
Researcher 3 reports

Kyle Lo

Ai2

Kyle Lo's public homepage says he formerly served as a research scientist at the Allen Institute for AI, where he co-led OLMo. It describes current work on large-scale pretraining, data mixing, benchmark validity, and research-supporting data and software artifacts.

Yanping Huang portrait
Researcher 3 reports

Yanping Huang

DeepSeek / Google Gemini

Yanping Huang's OpenReview profile says he has been an engineer at Google since 2015 and lists research interests in machine learning systems, machine translation, and image classification. Public report records also list him as a co-author on the DeepSeek-V3 technical report.

Jake Poznanski portrait
Researcher 2 reports

Jake Poznanski

Ai2

Jakub "Jake" Poznanski identifies himself on his personal site as a Carnegie Mellon computer science graduate and describes leading an OCR toolkit on GitHub.

Filipe de Avila Belbute Peres portrait
Researcher 1 reports

Filipe de Avila Belbute Peres

OpenAI

Member of Technical Staff at OpenAI. His public homepage lists research and publications spanning GPT-4, differentiable physics, and scientific machine learning.

Michael Schmitz portrait
Researcher 1 reports

Michael Schmitz

Ai2

Michael Schmitz is a software engineer and manager in the Pacific Northwest. His personal site says his current position is Director of Engineering at the Allen Institute for AI.

Jared D. Kaplan portrait
Researcher 1 reports

Jared D. Kaplan

OpenAI

Jared D. Kaplan is a co-founder and the chief science officer of Anthropic. Public biographies also describe his earlier work on GPT-3 and Codex at OpenAI and his background in theoretical physics.

Kevin Farhat portrait
Researcher 1 reports

Kevin Farhat

Ai2

Kevin Farhat's homepage says he is a research engineer at Ai2 working on language models, distributed AI, and privacy-preserving training. Public sources also list him as a coauthor of FlexOlmo.

Lucas Liebenwein portrait
Researcher 1 reports

Lucas Liebenwein

NVIDIA

Works on high-performance LLM inference and AutoDeploy at NVIDIA; previously led efficient-AI work at OmniML and earned graduate degrees at MIT CSAIL.

Jacob Devlin portrait
Researcher 1 reports

Jacob Devlin

Google Gemini

Jacob Devlin is a research scientist at Google DeepMind whose work focuses on language models, long-context modeling, and tool use.

Kim Malfacini portrait
Researcher 1 reports

Kim Malfacini

OpenAI

Public profiles list Kim Malfacini as head of product policy at OpenAI, with earlier work in public policy and trust and safety for AI systems.

Sam McCandlish portrait
Researcher 1 reports

Sam McCandlish

OpenAI

Anthropic co-founder and chief architect working on large-scale model training, research productivity, and reinforcement learning infrastructure.

Sebastian Nowozin portrait
Researcher 1 reports

Sebastian Nowozin

Google Gemini

Partner Research Manager at Microsoft Research Cambridge, with official Microsoft pages describing work in artificial intelligence, computer vision, and human language technologies.

Jason Baldridge portrait
Researcher 1 reports

Jason Baldridge

Google Gemini

Jason Baldridge is a Research Scientist at Google DeepMind working on natural language understanding, multimodal learning, and evaluation.

Ali Farhadi portrait
Researcher 7 reports

Ali Farhadi

Ai2

CEO of the Allen Institute for AI and professor of computer science at the University of Washington. His work spans computer vision, multimodal learning, reasoning, and embodied AI.

An Yang portrait
Researcher 9 reports

An Yang

Alibaba Qwen

Alibaba researcher working on large language models and multimodal pretraining; public research profiles connect An Yang to Qwen-related work and earlier study at Peking University.

Kai Dang portrait
Researcher 8 reports

Kai Dang

Alibaba Qwen

Researcher on Alibaba's Qwen team focused on large language models and NLP, with public research profiles listing a Nankai University background.

Zhifeng Chen portrait
Researcher 8 reports

Zhifeng Chen

Google Gemini / Z.ai

Zhifeng Chen's public homepage describes him as a distinguished software engineer at Google Brain focused on large-scale computer systems and machine learning applications.

Timothée Lacroix portrait
Researcher 6 reports

Timothée Lacroix

Meta AI / Mistral AI

Timothée Lacroix is a machine learning researcher and one of the founders of Mistral AI.

Jie Tang portrait
Researcher 3 reports

Jie Tang

OpenAI / Alibaba Qwen

Computer scientist and engineer credited on OpenAI's GPT-4 public contributions page; OpenAI's 2016 team update says he previously led Dropbox's core file sync team after earlier work in Pieter Abbeel's Berkeley robotics lab.

Koray Kavukcuoglu portrait
Researcher 3 reports

Koray Kavukcuoglu

Google Gemini

Chief Technology Officer at Google DeepMind, with work spanning machine learning and reinforcement learning.

Shuiping Yu portrait
Researcher 3 reports

Shuiping Yu

DeepSeek

Researcher at DeepSeek interested in efficient large language models and multimodal large language models.

Wen Liu portrait
Researcher 3 reports

Wen Liu

DeepSeek

Research scientist at DeepSeek.

Yifeng Lu portrait
Researcher 3 reports

Yifeng Lu

Google Gemini

Member of Technical Staff at Google DeepMind working on machine learning, natural language processing, and large language models.

Ziwei Xie portrait
Researcher 3 reports

Ziwei Xie

DeepSeek

ML researcher at DeepSeek.

Hongkun Yu portrait
Researcher 2 reports

Hongkun Yu

Google Gemini / Cohere

Hongkun Yu is a research scientist at Google Research. His official Google Research profile says his work lies at the intersection of machine learning, signal processing, and sequence modeling.

Pradeep Dasigi portrait
Researcher 2 reports

Pradeep Dasigi

Ai2

Pradeep Dasigi is a research scientist on the AllenNLP team at Ai2. His homepage says he has been actively involved in OLMo and Tulu and currently focuses on post-training language models.

Greg Brockman portrait
Researcher 1 reports

Greg Brockman

OpenAI

OpenAI co-founder and president whose public work spans the OpenAI API, Codex, Gym, OpenAI Five, and broader large-scale model systems.

Raia Hadsell portrait
Researcher 2 reports

Raia Hadsell

Google Gemini

VP of Research at Google DeepMind working on robotics and embodied intelligence, with expertise in machine learning, reinforcement learning, neuroscience, and computer vision.

Rui Xin portrait
Researcher 1 reports

Rui Xin

Ai2

Rui Xin's public homepage identifies him as a PhD student at the University of Washington working on reinforcement learning, privacy-preserving machine learning, and memorization.

Sam Altman portrait
Researcher 1 reports

Sam Altman

OpenAI

Sam Altman is the CEO of OpenAI.

Shayne Longpre portrait
Researcher 1 reports

Shayne Longpre

Ai2

Shayne Longpre's public profiles identify him as a PhD candidate at MIT focused on data-centric AI, language models, and their societal impact.

Anna Makanju portrait
Researcher 1 reports

Anna Makanju

OpenAI

Head of AI for Civil Society and Philanthropy at the OpenAI Foundation. An OpenAI update says she previously served as Vice President of Global Impact at OpenAI.

Hannah Wong portrait
Researcher 1 reports

Hannah Wong

OpenAI

Former Chief Communications Officer at OpenAI. Public reporting says she joined OpenAI in 2021 after seven years at Apple and announced her departure in December 2025.

Mohammad Bavarian portrait
Researcher 1 reports

Mohammad Bavarian

OpenAI

Mohammad Bavarian is a researcher at OpenAI. His personal site states that he earned a PhD from MIT in 2017 and previously worked as a researcher at Rubrik.

Erhang Li portrait
Researcher 3 reports

Erhang Li

DeepSeek

DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

Fangyun Lin portrait
Researcher 3 reports

Fangyun Lin

DeepSeek

DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

Guangbo Hao portrait
Researcher 3 reports

Guangbo Hao

DeepSeek

DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

Hanwei Xu portrait
Researcher 3 reports

Hanwei Xu

DeepSeek

DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

Junjie Qiu portrait
Researcher 3 reports

Junjie Qiu

DeepSeek

DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

Saumya Malik portrait
Researcher 3 reports

Saumya Malik

Ai2

Saumya Malik is an incoming postdoctoral scholar at Johns Hopkins University and recently completed a PhD in the Machine Learning Department at Carnegie Mellon University. Her interests include interpretability, evaluation, reward learning, and language model agents.

Shanhuang Chen portrait
Researcher 3 reports

Shanhuang Chen

DeepSeek

DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

Zhe Fu portrait
Researcher 3 reports

Zhe Fu

DeepSeek

DeepSeek team member and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

Kexin Yang portrait
Researcher 3 reports

Kexin Yang

Alibaba Qwen

Researcher at Qwen, Alibaba Group since 2021, with publications on large language model evaluation and pre-training data management.

Runji Lin portrait
Researcher 3 reports

Runji Lin

Alibaba Qwen

Machine learning researcher whose public profile highlights work across Berkeley AI Research, RISELab, Sky Computing, and the Ray team.

Faeze Brahman portrait
Researcher 2 reports

Faeze Brahman

Ai2

Research Scientist at Ai2 focused on natural language processing, reasoning, and grounded commonsense understanding.

Sandeep Subramanian portrait
Researcher 2 reports

Sandeep Subramanian

Mistral AI

Sandeep Subramanian is an Assistant Professor at MBZUAI whose research focuses on grounded language generation, multimodal learning, and large language models.

Yang Su portrait
Researcher 2 reports

Yang Su

Alibaba Qwen

Research scientist at Alibaba DAMO Academy focusing on aligning and scaling superintelligence.

Chong Zhang portrait
Researcher 2 reports

Chong Zhang

OpenAI

Member of Technical Staff at OpenAI working on machine learning, reinforcement learning, natural language processing, and large language models.

Girish Sastry portrait
Researcher 2 reports

Girish Sastry

OpenAI

Research scientist at OpenAI interested in AI safety, alignment, governance, and technical safety.

Ilya Sutskever portrait
Researcher 2 reports

Ilya Sutskever

OpenAI

Computer scientist and AI researcher who co-founded OpenAI and served as its chief scientist.

Jianxin Yang portrait
Researcher 2 reports

Jianxin Yang

Alibaba Qwen

Member of the Qwen Team at Alibaba Group and a Ph.D. candidate at the Institute of Automation, Chinese Academy of Sciences. His research interests include multimodal reasoning and agents.

Leyi Xia portrait
Researcher 2 reports

Leyi Xia

DeepSeek

Research scientist at DeepSeek AI interested in language models, multimodal large language models, and deep generative models.

Nando de Freitas portrait
Researcher 2 reports

Nando de Freitas

Google Gemini

Vice president at Google DeepMind in London working on machine learning and reinforcement learning. His public profile notes work spanning policy-gradient methods, distributed and generative models, and robotics.

Xavier Martinet portrait
Researcher 2 reports

Xavier Martinet

Meta AI

Research scientist at Meta working on AI agents, language, and reinforcement learning.

Yuqiong Liu portrait
Researcher 2 reports

Yuqiong Liu

Alibaba Qwen

Researcher at Alibaba Qwen; formerly a postdoctoral researcher at the University of Cambridge, with a PhD in NLP from the University of Edinburgh.

Akari Asai portrait
Researcher 1 reports

Akari Asai

Ai2

Akari Asai's public homepage identifies her as a research scientist at OLMo at Ai2 for 2025-2026 and an incoming assistant professor at Carnegie Mellon University starting in fall 2026.

Joanne Jang portrait
Researcher 1 reports

Joanne Jang

OpenAI

Leads OpenAI Labs, a research team prototyping new interfaces for people and AI. She studied computer science and applied math at Stanford.

Matt Gardner portrait
Researcher 1 reports

Matt Gardner

Cohere

Senior research scientist at Ai2 on the AllenNLP team. His homepage says his work focuses on question answering, semantic parsing, and reasoning over open-domain text.

Nikolaus Binder portrait
Researcher 1 reports

Nikolaus Binder

NVIDIA

Nikolaus Binder is a senior research scientist at NVIDIA whose public research profile focuses on quasi-Monte Carlo methods, photorealistic image synthesis, ray tracing, and rendering algorithms.

Zihan Qiu portrait
Researcher 2 reports

Zihan Qiu

Alibaba Qwen

Research scientist in Tongyi Lab whose official profile highlights multimodal large language models, efficient reasoning, and AI infrastructure.

Christopher Clark portrait
Researcher 2 reports

Christopher Clark

Ai2

Christopher Clark is a researcher working on language models, efficient inference, and trustworthy NLP systems. His public profile highlights work at the intersection of NLP, efficiency, and model evaluation.

Douglas Eck portrait
Researcher 2 reports

Douglas Eck

Google Gemini

Research director at Google working on music AI, multimodal generation, and human-AI interaction. He co-founded the Magenta project and has led widely used work on music generation with neural networks.

Phil Tillet portrait
Researcher 1 reports

Phil Tillet

OpenAI

OpenAI researcher and software engineer known for creating Triton, an open-source GPU programming language, and co-authoring the GPT-4 Technical Report.

Qihao Zhu portrait
Researcher 10 reports

Qihao Zhu

DeepSeek

Research scientist focused on foundation models and multimodal large language models; his homepage notes earlier work at DeepSeek AI and current research at the University of Southern California.

Wenfeng Liang portrait
Researcher 8 reports

Wenfeng Liang

DeepSeek

Wenfeng Liang, also known as Liang Wenfeng, is linked to DeepSeek technical reports in LLMpeople and is identified in public references as the founder and CEO of DeepSeek.

Fei Huang portrait
Researcher 7 reports

Fei Huang

Alibaba Qwen

Alibaba Qwen report author listed on Qwen, Qwen2.5, Qwen2.5-1M, Qwen3, Qwen3 Embedding, QwQ-32B, and Qwen-VL reports, with report-backed work on large language models, embeddings, reranking, and multimodal models.

Chong Ruan portrait
Researcher 6 reports

Chong Ruan

DeepSeek

Researcher at DeepSeek and former master's student in EECS at Peking University. Public profiles list work on LLMs, multimodal systems, transformers, BERT, and machine translation.

Zeyu Cui portrait
Researcher 5 reports

Zeyu Cui

Alibaba Qwen

Zeyu Cui is listed as an author of the Qwen3 Technical Report.

Jie Zhou portrait
Researcher 5 reports

Jie Zhou

DeepSeek / MiniMax

Public report authorship links Jie Zhou to the MiniMax report MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention.

Rui Men portrait
Researcher 5 reports

Rui Men

Alibaba Qwen

Alibaba Qwen report author listed on Qwen, Qwen2.5, Qwen2.5-1M, Qwen3, and Qwen-Image reports, with report-backed work on large language models, long-context models, and image generation.

Yang Fan portrait
Researcher 5 reports

Yang Fan

Alibaba Qwen

Alibaba Qwen report author listed on Qwen, Qwen2.5, Qwen3, Qwen-VL, and Qwen-Image technical reports, with report-backed work on large language models, vision-language models, and image generation.

Zhenda Xie portrait
Researcher 5 reports

Zhenda Xie

DeepSeek

DeepSeek report author listed on DeepSeek LLM, DeepSeekMoE, DeepSeek-Coder, DeepSeek-V2, and DeepSeek-V3 reports, with report-backed work on large language models, mixture-of-experts systems, and code models.

Alexandre Sablayrolles portrait
Researcher 4 reports

Alexandre Sablayrolles

Google Gemini / Mistral AI

Alexandre Sablayrolles is listed as an author of Google's PaLM 2 Technical Report.

Amin Firooz portrait
Researcher 5 reports

Amin Firooz

NVIDIA

Senior research scientist at NVIDIA focused on large language models, reinforcement learning, and inference-time scaling for AI agents. His public NVIDIA author page also notes prior work on robotic grasping, pose estimation, and language understanding.

Armand Joulin portrait
Researcher 4 reports

Armand Joulin

Meta AI

Armand Joulin is listed as an author of the Meta AI technical report Llama 2: Open Foundation and Fine-Tuned Chat Models.

Brennan Saeta portrait
Researcher 4 reports

Brennan Saeta

Google Gemini

Public report authorship links Brennan Saeta to the Google report Gemma 2: Improving Open Language Models at a Practical Size.

Jason Wei portrait
Researcher 4 reports

Jason Wei

Google Gemini / OpenAI

Public report authorship links Jason Wei to Google's Gemma 3n Technical Report.

Qinyu Chen portrait
Researcher 4 reports

Qinyu Chen

DeepSeek / Alibaba Qwen

Qinyu Chen is listed as an author of the DeepSeek-V3 Technical Report.

Xingkai Yu portrait
Researcher 4 reports

Xingkai Yu

DeepSeek

Xingkai Yu is a report-backed author in the LLMpeople atlas, connected through DeepSeek technical reports including DeepSeek-V3, DeepSeek-V2, DeepSeek LLM, and DeepSeekMoE; his public GitHub profile lists a DeepSeek affiliation.

Xingzhang Ren portrait
Researcher 5 reports

Xingzhang Ren

Alibaba Qwen

Researcher at Alibaba working on foundation models and agents for enterprise applications.

Donald W. McFadden portrait
Researcher 4 reports

Donald W. McFadden

Google Gemini

Google Gemini report author listed on Gemini, Gemini 1.5, RecurrentGemma, and CodeGemma technical reports, with report-backed work on multimodal models, long-context models, efficient architectures, and code models.

Mei Li portrait
Researcher 4 reports

Mei Li

Alibaba Qwen

Alibaba Qwen report author listed on Qwen, Qwen2.5, Qwen2.5-1M, and Qwen3 technical reports, with report-backed work on large language models, long-context modeling, and reasoning.

Wentao Zhang portrait
Researcher 4 reports

Wentao Zhang

DeepSeek

DeepSeek report author listed on DeepSeek LLM, DeepSeek-Coder, DeepSeek-V2, and DeepSeek-V3 technical reports, with report-backed work on code models and large language models.

Yipeng Wang portrait
Researcher 4 reports

Yipeng Wang

Z.ai

Z.ai report author listed on GLM-Z1, GLM-4.5, GLM-4.1V/4.5V, and GLM-5 materials, with report-backed work on reasoning, coding, agentic, and multimodal models.

Zihan Jiang portrait
Researcher 4 reports

Zihan Jiang

Z.ai

Z.ai report author listed on GLM-Z1, GLM-4.5, GLM-4.1V/4.5V, and GLM-5 materials, with report-backed work on reasoning, agentic, and multimodal models.

Clemens Winter portrait
Researcher 3 reports

Clemens Winter

OpenAI / Meta AI

Clemens Winter writes publicly about reinforcement learning and machine learning systems, and official OpenAI materials credit him on GPT-4 and GPT-4.5 work. Accessible public sources do not clearly establish a current employer.

Xiaodong Deng portrait
Researcher 4 reports

Xiaodong Deng

Alibaba Qwen

Research scientist in Tongyi Lab whose official profile highlights post-training and multimodal large language models.

Yang Yue portrait
Researcher 4 reports

Yang Yue

MiniMax / Moonshot AI

Researcher at Moonshot AI and co-author of the Kimi K2.5 report on visual agentic intelligence.

Yao Zhao portrait
Researcher 4 reports

Yao Zhao

DeepSeek / Google Gemini

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Sharan Narang portrait
Researcher 3 reports

Sharan Narang

Google Gemini / NVIDIA

Researcher at Meta whose public OpenReview profile lists previous research roles at Google and Baidu.

Tyler Romero portrait
Researcher 2 reports

Tyler Romero

Ai2

Groundlight AI lists Tyler Romero as Lead ML Engineer. Public sources also list him as a co-author of OLMo 3.

Aidan Gomez portrait
Researcher 2 reports

Aidan Gomez

Google Gemini

Co-founder and CEO of Cohere; previously a doctoral student at Oxford and a student researcher and intern at Google Brain.

Andrew Drozdov portrait
Researcher 1 reports

Andrew Drozdov

Cohere

Researcher at Cohere working on machine learning, natural language processing, and computer vision.

Devendra Singh Chaplot portrait
Researcher 2 reports

Devendra Singh Chaplot

Mistral AI

Devendra Singh Chaplot is a research scientist at Mistral AI working on language models and multimodal AI.

Ingmar Kanitscheider portrait
Researcher 1 reports

Ingmar Kanitscheider

OpenAI

Research scientist at OpenAI with a background in theoretical physics, computational neuroscience, and theoretical AI.

Mikhail Pavlov portrait
Researcher 1 reports

Mikhail Pavlov

OpenAI

Mikhail Pavlov is publicly credited on OpenAI's GPT-4 contributions page as software correctness lead and on GPT-4V technical work as a scaling engineering lead.

Xuezhi Wang portrait
Researcher 1 reports

Xuezhi Wang

Google Gemini

Xuezhi Wang is a Research Scientist at Google DeepMind whose public work focuses on reasoning and systematic generalization in language models, along with robustness and fairness in natural language processing. She received her PhD from Carnegie Mellon University in 2016.

Yejin Choi portrait
Researcher 1 reports

Yejin Choi

Cerebras Systems

Professor at Stanford University and senior director at the Allen Institute for AI. Her research spans natural language processing, commonsense reasoning, and multimodal AI.

Yinger Zhang portrait
Researcher 1 reports

Yinger Zhang

Alibaba Qwen

Research scientist at Alibaba Qwen working on LLM post-training, RLHF, reasoning, and evaluation; incoming assistant professor at Renmin University of China.

Zijia Zhu portrait
Researcher 1 reports

Zijia Zhu

DeepSeek

Member of Technical Staff at DeepSeek focusing on large language models and reinforcement learning.

Abhinav Jauhri portrait
Researcher 1 reports

Abhinav Jauhri

Meta AI

Research scientist at Meta working on post-training and model personalization. He previously worked on retrieval systems, recommendations, and ranking, and earned his Ph.D. from Stanford University.

Aiesha Letman portrait
Researcher 1 reports

Aiesha Letman

Meta AI

Staff research scientist at Meta whose work focuses on democratizing natural language processing tools, especially for low-resource languages.

Alan Hickey portrait
Researcher 1 reports

Alan Hickey

OpenAI

Member of the Cybersecurity Policy and Insights team at OpenAI covering the intersection of technology, international security, and geopolitical competition.

Alexander Wettig portrait
Researcher 1 reports

Alexander Wettig

Ai2

Alexander Wettig is a Princeton computer science PhD student advised by Danqi Chen. His homepage says he studies language models and their training data, previously interned at Ai2, and is currently working on training coding agents at Cursor.

Amanda Bertsch portrait
Researcher 1 reports

Amanda Bertsch

Ai2

Amanda Bertsch is a Ph.D. student in the Language Technologies Institute at Carnegie Mellon University. Her research focuses on making language models faster and more memory efficient at test time and improving performance on very long contexts.

Benjamin Minixhofer portrait
Researcher 1 reports

Benjamin Minixhofer

Ai2

Benjamin Minixhofer is a third-year PhD student at the University of Cambridge's Language Technology Lab. His homepage focuses on resource-efficient language models, modularity, and adaptive tokenization, and notes prior research experience at Ai2.

Damien Deville portrait
Researcher 1 reports

Damien Deville

OpenAI

AI researcher at OpenAI working on reinforcement learning and reasoning.

Fangzhou Su portrait
Researcher 1 reports

Fangzhou Su

Moonshot AI

Researcher at Moonshot AI focused on reinforcement learning and robotics. He earned a Ph.D. in robotics and machine learning from Carnegie Mellon University.

Janko Altenschmidt portrait
Researcher 1 reports

Janko Altenschmidt

OpenAI

OpenAI contributor with a personal homepage and GitHub profile covering NLP and language model research.

Jianxin Ma portrait
Researcher 1 reports

Jianxin Ma

Alibaba Qwen

Researcher at Alibaba DAMO Academy with a background in natural language processing, speech, and recommender systems.

Kevin Yu portrait
Researcher 1 reports

Kevin Yu

OpenAI

Member of technical staff at OpenAI whose public profile notes a PhD in computational neuroscience from Princeton University.

Luke Metz portrait
Researcher 1 reports

Luke Metz

OpenAI

Researcher at OpenAI focused on improving the efficiency and reliability of AI systems.

Nima Pourdamghani portrait
Researcher 1 reports

Nima Pourdamghani

Amazon

Researcher whose public profiles include Amazon Science and USC ISI; his work spans dialogue systems, temporal reasoning, and document retrieval.

Red Avila portrait
Researcher 1 reports

Red Avila

OpenAI

OpenAI contributor with a personal homepage and GitHub profile covering software and AI engineering projects.

Richard Ngo portrait
Researcher 1 reports

Richard Ngo

OpenAI

Researcher and writer focused on safe and beneficial AI; listed as a contributor to GPT-4 by OpenAI.

Skye Wanderman-Milne portrait
Researcher 1 reports

Skye Wanderman-Milne

Google Gemini

Research scientist at Google working on multilingual generation, machine translation, and large language model systems. His public work spans translation quality, evaluation, and scalable language technology.

Stephanie C. Y. Chan portrait
Researcher 1 reports

Stephanie C. Y. Chan

Cohere

Applied scientist at Cohere focused on natural language processing and machine learning.

Tingyu Xia portrait
Researcher 1 reports

Tingyu Xia

Alibaba Qwen

Researcher with public work on large language models. Official Qwen materials credit her on Qwen releases, and public profiles include DBLP and OpenReview.

Tong Mu portrait
Researcher 1 reports

Tong Mu

OpenAI

Member of technical staff at OpenAI. Previously pursued a PhD at Stanford University, focusing on practical reinforcement learning and bandits.

Vik Goel portrait
Researcher 1 reports

Vik Goel

OpenAI

Engineer at OpenAI working on the API; previously built products and customer experience systems at Stripe and Instacart.

Vishal Kuo portrait
Researcher 1 reports

Vishal Kuo

OpenAI

Vishal Kuo is a member of technical staff at OpenAI.

Xingchao Liu portrait
Researcher 1 reports

Xingchao Liu

DeepSeek

Member of technical staff at xAI; previously a research scientist at DeepSeek AI and a PhD student in Computer Science and Engineering at the University of Michigan.

Yuxi Xie portrait
Researcher 1 reports

Yuxi Xie

Moonshot AI

Ph.D. candidate in computer science at Princeton University working on LLM agents and machine learning. He is a co-author of Moonshot AI's Kimi k1.5 report.

Zhipeng Zhou portrait
Researcher 1 reports

Zhipeng Zhou

Alibaba Qwen

Researcher with public work on large language models. Official Qwen materials credit him on Qwen releases, and public profiles include DBLP and OpenReview.

Zijun Liu portrait
Researcher 1 reports

Zijun Liu

DeepSeek

Tsinghua University PhD student whose public OpenReview profile lists a 2024 DeepSeek-AI research internship and interests in scalable foundation models, multi-agent systems, and controllable AIGC.

Daniel Mossing portrait
Researcher 1 reports

Daniel Mossing

OpenAI

Daniel Mossing is a research scientist at OpenAI. His OpenReview profile also lists past affiliations with UC Berkeley and New York University.

Kai Xiao portrait
Researcher 1 reports

Kai Xiao

OpenAI

Member of technical staff at OpenAI interested in empirical machine learning and exploration in reinforcement learning; previously a PhD candidate at MIT.

Aleksandar Botev portrait
Researcher 1 reports

Aleksandar Botev

Google Gemini

Staff research scientist at Google DeepMind interested in deep learning theory and applications, with a focus on scaling and reinforcement learning.

Alex Paino portrait
Researcher 1 reports

Alex Paino

OpenAI

OpenAI team member and co-author of the GPT-4 Technical Report.

Aman Mathur portrait
Researcher 1 reports

Aman Mathur

NVIDIA

Senior applied research scientist at NVIDIA focused on pretraining and post-training methods for large language models, multimodal learning, and synthetic data generation.

Carolyn Zhang portrait
Researcher 1 reports

Carolyn Zhang

Meta AI

Carolyn Zhang is a researcher focused on large language models, natural language processing, and machine learning.

Cathy Jiao portrait
Researcher 1 reports

Cathy Jiao

Google Gemini

Research scientist at Google DeepMind.

Chenyu Zhang portrait
Researcher 1 reports

Chenyu Zhang

DeepSeek

DeepSeek team member and co-author of the DeepSeek-V3 Technical Report.

Christopher Hesse portrait
Researcher 1 reports

Christopher Hesse

OpenAI

Research scientist and engineer working on large-scale machine learning. His personal site highlights contributions to GPT-3, GitHub Copilot, and Stable Diffusion.

Clément Gehring portrait
Researcher 1 reports

Clément Gehring

Mistral AI

Clément Gehring is a research scientist at Mistral AI working on efficient, multilingual, and general-purpose language models. He is also known for influential early work on neural machine translation, including convolutional sequence-to-sequence models.

Deepak Ramachandran portrait
Researcher 1 reports

Deepak Ramachandran

Cohere

Member of technical staff at Cohere Labs, where he works on models for code, reasoning, and agents and contributes to post-training.

Dragomir Radev portrait
Researcher 1 reports

Dragomir Radev

NVIDIA

Dragomir Radev is an Eminent Professor of natural language processing at MBZUAI. His research spans NLP, information retrieval, question answering, and summarization, and he is also a co-author of the Nemotron-4 15B technical report.

Henri Roussez portrait
Researcher 1 reports

Henri Roussez

OpenAI

Research engineer at OpenAI.

Jake McNeil portrait
Researcher 1 reports

Jake McNeil

OpenAI

OpenAI team member and co-author of the GPT-4 Technical Report.

Kestutis Patiejunas portrait
Researcher 1 reports

Kestutis Patiejunas

NVIDIA

Senior research scientist at NVIDIA focused on large language models and reasoning systems. His public NVIDIA author page notes more than eight years of machine learning work spanning NLP, forecasting, recommender systems, and computer vision.

Kushal Kalra portrait
Researcher 1 reports

Kushal Kalra

NVIDIA

Senior research scientist at NVIDIA working on multimodal and embodied AI, including multilingual multimodal large language models and human-centered robot learning.

Laura Buesing portrait
Researcher 1 reports

Laura Buesing

Google Gemini

Laura Buesing is a research scientist at Google DeepMind working on machine learning and computational neuroscience, with a focus on agents that remember, reason, and plan.

Manaal Faruqui portrait
Researcher 1 reports

Manaal Faruqui

Cohere

Manaal Faruqui is an NLP and large language model researcher whose public profile covers factuality, instruction following, conversational systems, and language understanding. His work spans large language model post-training as well as earlier research on word representations, question quality, and dialogue systems.

Michael Lampe portrait
Researcher 1 reports

Michael Lampe

OpenAI

Researcher at OpenAI and a co-author of the GPT-4 Technical Report.

Moein Khazraee portrait
Researcher 1 reports

Moein Khazraee

Google Gemini

Moein Khazraee is a postdoctoral researcher at LMU Munich and a visitor at Google DeepMind. His research focuses on reliable and trustworthy machine learning for language, knowledge, and reasoning.

Murali Emani portrait
Researcher 1 reports

Murali Emani

NVIDIA

Senior research scientist at NVIDIA focused on embodied agent AI, simulation and rendering, computer vision, robot learning, and AI in science.

Nima Rajani portrait
Researcher 1 reports

Nima Rajani

Ai2

Nima Rajani is a research scientist at Ai2 whose work focuses on trustworthy, interpretable, and verifiable AI systems.

Prafulla Dhariwal portrait
Researcher 1 reports

Prafulla Dhariwal

OpenAI

Research scientist at OpenAI focused on generative models and unsupervised learning. He studied computers, mathematics, and physics at MIT and has contributed to models including GPT-3, Jukebox, and DALL·E.

Rapha Gontijo-Lopes portrait
Researcher 1 reports

Rapha Gontijo-Lopes

OpenAI

Researcher and engineer whose public homepage lists work on multimodal reasoning, robustness, and mechanistic interpretability; he was an OpenAI Resident and is credited on the GPT-4 technical report.

Ray Kurzweil portrait
Researcher 1 reports

Ray Kurzweil

Google Gemini

Inventor, author, and futurist serving as Principal Researcher and AI Visionary at Google, known for long-running work on AI, language technology, and human-computer systems.

Shengli Hu portrait
Researcher 1 reports

Shengli Hu

OpenAI

OpenAI team member and co-author of the GPT-4 Technical Report.

Siddharth Narayan portrait
Researcher 1 reports

Siddharth Narayan

Meta AI

Siddharth Narayan is a researcher focused on natural language processing and AI, with applications that include healthcare.

Sumedh Palkar portrait
Researcher 1 reports

Sumedh Palkar

NVIDIA

Applied research scientist at NVIDIA working on reinforcement learning for large language models, post-training, and data curation.

Tianhao Zheng portrait
Researcher 1 reports

Tianhao Zheng

OpenAI

OpenAI team member and co-author of the GPT-4 Technical Report.

Wade Hickey portrait
Researcher 1 reports

Wade Hickey

OpenAI

OpenAI team member and co-author of the GPT-4 Technical Report.

Yaron Lipman portrait
Researcher 1 reports

Yaron Lipman

Amazon

Research scientist at Amazon AGI and assistant professor at Yale School of Medicine whose work focuses on diffusion models, equivariant deep learning, and generative models for molecules and proteins.

Zhean Xu portrait
Researcher 1 reports

Zhean Xu

DeepSeek

DeepSeek team member and co-author of the DeepSeek-V3 Technical Report.

Ziyi Gao portrait
Researcher 1 reports

Ziyi Gao

DeepSeek

DeepSeek team member and co-author of the DeepSeek-V3 Technical Report.

Pete Walsh portrait
Researcher 3 reports

Pete Walsh

Ai2

OpenReview lists Evan Pete Walsh as a researcher at the Allen Institute for Artificial Intelligence, and recent public reports credit Pete Walsh on OLMo Hybrid and related open-model work.

Yu Wan portrait
Researcher 2 reports

Yu Wan

Alibaba Qwen

Yu Wan is publicly listed as an author on the Qwen2.5 and Qwen3 technical reports.

Huan Lin portrait
Researcher 3 reports

Huan Lin

Alibaba Qwen

Staff scientist at Tongyi Lab, Alibaba Group, and Qwen team member working on multimodal language models, AI agents, evaluation, reasoning, and information extraction.

Chloe Anastasiades portrait
Researcher 1 reports

Chloe Anastasiades

Ai2

Chloe Anastasiades is a software engineer at the Allen Institute for Artificial Intelligence and a co-author on Ai2 papers including OLMo 3 and Ai2 Scholar QA.

Zhifang Sui portrait
Researcher 1 reports

Zhifang Sui

Microsoft

Public Peking University faculty pages list Zhifang Sui as a professor in the Institute of Computational Linguistics with research interests in natural language processing and computational linguistics.

Alan Schelten portrait
Researcher 1 reports

Alan Schelten

Meta AI

Research engineer at Meta with a verified OpenReview profile that lists a current affiliation with Facebook/Meta.

Ivo Danihelka portrait
Researcher 1 reports

Ivo Danihelka

Google Gemini

Ivo Danihelka is a research scientist at Google DeepMind whose public profile highlights deep learning research and machine learning publications.

Tara N. Sainath portrait
Researcher 1 reports

Tara N. Sainath

Google Gemini

Tara N. Sainath is a distinguished research scientist and director at Google DeepMind whose work focuses on speech recognition, spoken language understanding, and multimodal large language models.

Diego de las Casas portrait
Researcher 3 reports

Diego de las Casas

Mistral AI

Research scientist at Mistral AI working on machine reasoning and probabilistic deep learning.

Edouard Grave portrait
Researcher 3 reports

Edouard Grave

Meta AI

Edouard Grave is a machine learning researcher whose public work spans representation learning, natural language processing, and large language models.

Jena D. Hwang portrait
Researcher 3 reports

Jena D. Hwang

Ai2

Research scientist at the Allen Institute for AI (Ai2) whose work focuses on natural language understanding and commonsense reasoning.

Kang Guan portrait
Researcher 3 reports

Kang Guan

DeepSeek

DeepSeek team member focused on pretraining, post-training, and efficient scaling for large language models.

Yiliang Xiong portrait
Researcher 3 reports

Yiliang Xiong

DeepSeek

Research scientist at DeepSeek AI. His homepage also describes him as a PhD candidate in computer sciences at the University of Wisconsin-Madison.

Chris Hallacy portrait
Researcher 2 reports

Chris Hallacy

OpenAI

Engineer at OpenAI working on inference infrastructure, product engineering, and prototyping for systems including GPT-4 and DALL·E 2.

Armand Joulin portrait
Researcher 1 reports

Armand Joulin

Mistral AI

A 2024 dotAI speaker bio describes Armand Joulin as a Research Director at Google DeepMind working on Gemma, after leading FAIR EMEA at Meta.

Nawshad Farruque portrait
Researcher 1 reports

Nawshad Farruque

Cohere

The University of Lethbridge Experts Database lists Nawshad Farruque in Maths and Computer Science, with research expertise in large-scale GIS database management, distributed spatial query processing and optimization, software engineering, and artificial intelligence.

Yingyan Celine Lin portrait
Researcher 1 reports

Yingyan Celine Lin

NVIDIA

Official Georgia Tech and NVIDIA DLER pages list Yingyan Celine Lin as a Georgia Tech associate professor and a visiting professor collaborating with NVIDIA's deep learning research group.

Keming Lu portrait
Researcher 3 reports

Keming Lu

Alibaba Qwen

Keming Lu is a research scientist at Tongyi Lab, Alibaba Group in Beijing and a co-author of the Qwen and Qwen2.5 technical reports.

Pierre Stock portrait
Researcher 3 reports

Pierre Stock

Mistral AI

AI Scientist & Observability Lead at Mistral AI.

Pradeep Dasigi portrait
Researcher 3 reports

Pradeep Dasigi

Ai2

Research scientist on the AllenNLP team at the Allen Institute for AI, focused on post-training language models.

Aäron van den Oord portrait
Researcher 2 reports

Aäron van den Oord

Google Gemini

Aäron van den Oord is a Google DeepMind researcher known for generative and sequence-model research.

Aditya Ramesh portrait
Researcher 2 reports

Aditya Ramesh

OpenAI

Researcher at OpenAI and co-creator of DALL·E who works on multimodal generative models and image generation.

Alek Andreev portrait
Researcher 2 reports

Alek Andreev

Google Gemini

Research scientist at Google DeepMind working on multimodal and generative AI for healthcare.

Chris Bamford portrait
Researcher 2 reports

Chris Bamford

Mistral AI

Chris Bamford is a researcher at Mistral AI working on foundation models and multimodal systems.

Eric Hambro portrait
Researcher 2 reports

Eric Hambro

Meta AI

Eric Hambro is a machine learning researcher and entrepreneur whose prior work includes contributions at Meta AI and the LLaMA project.

Eric Sigler portrait
Researcher 2 reports

Eric Sigler

OpenAI

Member of technical staff at OpenAI whose public profile notes leadership of the MT+MX team from 2020 to 2025 after earlier engineering work at Google.

John Schulman portrait
Researcher 2 reports

John Schulman

OpenAI

Research scientist at OpenAI working on reinforcement learning and robotics, with a PhD from UC Berkeley.

Jonathan Bradley portrait
Researcher 2 reports

Jonathan Bradley

NVIDIA

Senior research scientist at NVIDIA focused on language and vision models and applied multimodal AI.

Julian Schrittwieser portrait
Researcher 2 reports

Julian Schrittwieser

Google Gemini

Julian Schrittwieser is a Google DeepMind researcher known for reinforcement learning and game-playing systems.

Ling Chen portrait
Researcher 2 reports

Ling Chen

Z.ai

Z.ai researcher focused on multimodal large language models and computer vision, with interests in large-model training and post-training.

Matthieu Devin portrait
Researcher 2 reports

Matthieu Devin

Google Gemini

Research scientist at Google DeepMind based in Paris, focused on deep learning and computer vision.

Mingming Li portrait
Researcher 2 reports

Mingming Li

DeepSeek

DeepSeek team member focused on scaling large language models, multimodal models, and large-scale reinforcement learning.

Mohammad Ghassemi portrait
Researcher 2 reports

Mohammad Ghassemi

Technology Innovation Institute

Mohammad Ghassemi is a distinguished researcher at TII, AI71, and MBZUAI in Abu Dhabi. His work spans machine learning, digital health, and trustworthy AI, and he previously held faculty roles at MIT and the University of Toronto.

Moutasem R. Alfarra portrait
Researcher 2 reports

Moutasem R. Alfarra

Technology Innovation Institute

Senior AI scientist at the Technology Innovation Institute and machine learning researcher at MBZUAI.

Ning Tian portrait
Researcher 2 reports

Ning Tian

DeepSeek

Developer and researcher with public GitHub activity on DeepSeek-AI repositories including DeepSeek-VL2, plus co-author credit on the DeepSeek-V2 and DeepSeek-V3 technical reports.

Pang Wei Koh portrait
Researcher 2 reports

Pang Wei Koh

Ai2

Assistant Professor at the University of Washington whose research focuses on machine learning, AI safety, and evaluation; he is a co-lead of the OLMo language model project.

Rohan Taori portrait
Researcher 2 reports

Rohan Taori

NVIDIA

Researcher and engineer working on language models and reinforcement learning, with public work spanning Alpaca, emerging agents, and large-model evaluation.

Shengyi Huang portrait
Researcher 2 reports

Shengyi Huang

Ai2

Research scientist at Ai2 specializing in open language models, reasoning, and multi-agent systems.

Z. Z. Ren portrait
Researcher 2 reports

Z. Z. Ren

DeepSeek

DeepSeek researcher. Public OpenReview and Hugging Face profiles identify Z. Z. Ren with DeepSeek and list work on theorem proving and synthetic training data for reasoning models.

Igor Babuschkin portrait
Researcher 1 reports

Igor Babuschkin

OpenAI

Igor Babuschkin's homepage describes him as an AI researcher, builder, and investor, and highlights Babuschkin Ventures. Public reporting also describes him as a former DeepMind researcher and former OpenAI technical staff member.

Maksim Khadkevich portrait
Researcher 1 reports

Maksim Khadkevich

NVIDIA

NVIDIA's public author page identifies Maksim Khadkevich as a Senior Software Engineering Manager specializing in distributed inference systems and large language models. Public arXiv sources also list him as a co-author of Nemotron-Flash.

Ronan Le Bras portrait
Researcher 1 reports

Ronan Le Bras

Ai2

Ai2's team directory lists Ronan Le Bras, and public report sources credit him as an author on open-language-model work.

Maxwell Horton portrait
Researcher 1 reports

Maxwell Horton

Apple

Co-author of the OpenELM paper, whose arXiv HTML version shows an Apple author affiliation.

Madeleine B. Thompson portrait
Researcher 1 reports

Madeleine B. Thompson

OpenAI

Member of technical staff at OpenAI working on post-training. Previously worked on climate and sustainability software and on policy and product efforts related to the electric transition.

Sewon Min portrait
Researcher 3 reports

Sewon Min

Ai2

Assistant professor at the University of California, Berkeley and research scientist at the Allen Institute for AI. Her public homepage highlights work on language models, natural language processing, pretraining, post-training, and test-time computation.

Wenbiao Yin portrait
Researcher 2 reports

Wenbiao Yin

Alibaba Qwen

Senior research scientist at Alibaba focused on large language models and agent learning.

Abhilasha Ravichander portrait
Researcher 1 reports

Abhilasha Ravichander

Ai2

Assistant professor at Carnegie Mellon University and scientist at Ai2, working on natural language processing and machine learning with a focus on grounding, multilinguality, and robustness.

Abhishek Kadian portrait
Researcher 1 reports

Abhishek Kadian

Meta AI

Machine learning engineer at FAIR at Meta AI and incoming PhD student at Carnegie Mellon University.

Abhishek Rao portrait
Researcher 1 reports

Abhishek Rao

Google Gemini

Abhishek Rao is a Research Scientist at Google DeepMind working on machine learning and natural language processing, with interests including structured prediction, reasoning, compositionality, and controllable generation. He has contributed to large language model research including PaLM.

Alex Maas portrait
Researcher 1 reports

Alex Maas

Salesforce AI Research

Senior research director at Salesforce AI Research and associate adjunct professor at UC Berkeley.

András Sárközy portrait
Researcher 1 reports

András Sárközy

Google Gemini

András Sárközy is a senior staff research scientist at Google DeepMind in London. His public profiles describe work on machine learning and natural language processing, with interests in code generation and reasoning in language models.

Ankur Harlalka portrait
Researcher 1 reports

Ankur Harlalka

NVIDIA

Research scientist at NVIDIA working on large language model post-training and inference.

Anna-Luisa Brakman portrait
Researcher 1 reports

Anna-Luisa Brakman

OpenAI

OpenAI contributor with a personal website covering research and writing.

Ariel Herbert-Voss portrait
Researcher 1 reports

Ariel Herbert-Voss

OpenAI

Ariel Herbert-Voss is a computer scientist whose public homepage notes previous work on AI safety and special projects at OpenAI, along with later work on digital humans and universal basic income initiatives.

Aris Konstantinidis portrait
Researcher 1 reports

Aris Konstantinidis

OpenAI

Research engineer at OpenAI focused on reinforcement learning and human-computer interaction; previously co-founded the local-first AI startup Ted AI.

Ashley Pantuliano portrait
Researcher 1 reports

Ashley Pantuliano

OpenAI

Program manager at OpenAI working across finance, legal, operations, and communications, after earlier work in philanthropy, marketing, and project management.

Bianca Martin portrait
Researcher 1 reports

Bianca Martin

OpenAI

Public profiles list Bianca Martin as chief of staff at OpenAI, with an Oxford background in philosophy, politics, and economics.

Carlos E. Jimenez portrait
Researcher 1 reports

Carlos E. Jimenez

NVIDIA

Research scientist at NVIDIA with publications in machine learning and embodied AI.

Chengqiang Lu portrait
Researcher 1 reports

Chengqiang Lu

Alibaba Qwen

Researcher with public work on large language models. Official Qwen materials credit him on Qwen releases, and public profiles include DBLP and OpenReview.

Chengqi Deng portrait
Researcher 1 reports

Chengqi Deng

DeepSeek

Researcher and engineer with public work on similarity search and large model systems. Public profiles list Zhejiang University, and DeepSeek open-source releases credit him on projects including DeepSeek-VL and DualPipe.

Chris Hesse portrait
Researcher 1 reports

Chris Hesse

OpenAI

Research Scientist at OpenAI with public work in machine learning and generative models.

Colin Raffel portrait
Researcher 1 reports

Colin Raffel

Google Gemini

Research scientist at Google DeepMind whose work focuses on machine learning, speech, and large language models.

Cullen O'Keefe portrait
Researcher 1 reports

Cullen O'Keefe

OpenAI

AI law and policy researcher and Director of Research at the Institute for Law & AI; previously held policy and legal roles at OpenAI.

Currey Garnier portrait
Researcher 1 reports

Currey Garnier

Google Gemini

Google researcher working on machine learning, computer vision, and natural language, and a co-author of the PaLM 2 technical report.

Cyrus Hessel portrait
Researcher 1 reports

Cyrus Hessel

Ai2

Ai2 researcher working on multimodal language models, evaluations, instruction tuning, and preference alignment, and a co-author of Open Instruct.

Daniel Farrugia-Roberts portrait
Researcher 1 reports

Daniel Farrugia-Roberts

Google Gemini

AI researcher focused on epistemic safety for advanced AI systems, including uncertainty and out-of-distribution detection.

Dara Bahri portrait
Researcher 1 reports

Dara Bahri

Cohere

Researcher at Cohere working on large language models. Her public homepage highlights interests in privacy-preserving machine learning, robust machine learning, natural language processing, and reinforcement learning.

Dave Cummings portrait
Researcher 1 reports

Dave Cummings

OpenAI

Engineer at OpenAI who spoke publicly about fine-tuning GPT-4 at DevDay.

David R. So portrait
Researcher 1 reports

David R. So

Google Gemini

Research scientist at Google working on machine learning and language models. His public work includes efficient model architectures, language understanding, and large-scale training.

Dhruti Sreedhar portrait
Researcher 1 reports

Dhruti Sreedhar

Databricks

Dhruti Sreedhar is a research scientist at Databricks whose interests include generative models, post-training, and long-context language models. Previously, she studied at MIT and IIT Madras.

Di He portrait
Researcher 1 reports

Di He

01.AI

Di He is a machine learning researcher whose public homepage and Google Scholar profile highlight work on large language models, efficient training, and foundation models.

Dmitry Lepikhin portrait
Researcher 1 reports

Dmitry Lepikhin

Google Gemini

Research scientist and engineer working on machine learning and distributed systems, with large-scale language model contributions including PaLM 2.

Ed Chi portrait
Researcher 1 reports

Ed Chi

Google Gemini

Vice President of Research at Google DeepMind. His work spans search, recommendation systems, human-computer interaction, and practical applications of large language models.

Elizabeth Tseng portrait
Researcher 1 reports

Elizabeth Tseng

OpenAI

Lead researcher at OpenAI whose official profile focuses on speech and audio.

Eric Hu portrait
Researcher 1 reports

Eric Hu

Cohere

Eric Hu is a designer and creative director whose public portfolio spans technology, fashion, music, and architecture. At Cohere he served as VP of Design and is credited as a contributor to the Command A technical report; his public work also includes earlier design leadership roles at Nike and SSENSE.

Fraser Green portrait
Researcher 1 reports

Fraser Green

Google Gemini

Research scientist at Google DeepMind working on language models and efficient sampling and serving.

Gaurav Kapadia portrait
Researcher 1 reports

Gaurav Kapadia

Meta AI

Research scientist at Meta working on large language models, large-scale reinforcement learning from human feedback, retrieval, recommendation systems, and machine learning systems.

Guillaume Ramé portrait
Researcher 1 reports

Guillaume Ramé

Meta AI

AI researcher at Meta with public work on foundation models, optimization, and machine learning evaluation.

Gururaj Kasinathan portrait
Researcher 1 reports

Gururaj Kasinathan

NVIDIA

Research scientist at NVIDIA working on language and vision models.

Hailey Schoelkopf portrait
Researcher 1 reports

Hailey Schoelkopf

Ai2

Allen Institute for AI researcher working on NLP, machine learning, and AI, and a co-author of Open Instruct.

Haocheng Wang portrait
Researcher 1 reports

Haocheng Wang

DeepSeek

PhD student at KAUST and predoctoral researcher at DeepSeek focused on reasoning, alignment, post-training, and multimodal large language models.

Hyeonwoo Noh portrait
Researcher 1 reports

Hyeonwoo Noh

OpenAI

Research scientist at OpenAI working on multimodal understanding, reasoning, and agents; formerly a Ph.D. student at Seoul National University.

Ian Sohl portrait
Researcher 1 reports

Ian Sohl

OpenAI

Research engineer at OpenAI and contributor to GPT-4.

Ilia Kava portrait
Researcher 1 reports

Ilia Kava

Databricks

Ilia Kava is a member of technical staff at Databricks who works on language models and machine learning systems. He contributed to Databricks' DBRX open large language model research.

Jacob Menick portrait
Researcher 1 reports

Jacob Menick

OpenAI

Research scientist at OpenAI working on reinforcement learning and AI alignment. Previously a PhD student in computer science at Harvard University.

Jake Berdine portrait
Researcher 1 reports

Jake Berdine

OpenAI

OpenAI contributor with a public GitHub profile covering engineering projects.

Jean-Marc Andreoli portrait
Researcher 1 reports

Jean-Marc Andreoli

Cohere

Professor emeritus of computer science at Jean Monnet University whose work includes natural language processing and large language model adaptation and evaluation.

Jesujoba Alabi portrait
Researcher 1 reports

Jesujoba Alabi

Ai2

Researcher in natural language processing, low-resource languages, machine translation, and responsible AI; publicly listed as a PhD candidate at UC Santa Barbara and a co-author of Tulu 2.

Jonathan Gordon portrait
Researcher 1 reports

Jonathan Gordon

OpenAI

Works at OpenAI on reasoning and general intelligence, and co-authored the GPT-4 Technical Report.

Juan Felipe Cerón Uribe portrait
Researcher 1 reports

Juan Felipe Cerón Uribe

OpenAI

Member of technical staff at OpenAI whose official profile highlights multimodal models and intelligence.

Julian Martin Eisenschlos portrait
Researcher 1 reports

Julian Martin Eisenschlos

Ai2

Julian Martin Eisenschlos is a Research Scientist at Ai2. His work focuses on natural language processing, language models, and instruction tuning, including contributions to the Tulu 2 project.

Jyrki Alakuijala portrait
Researcher 1 reports

Jyrki Alakuijala

Google Gemini

Research scientist at Google DeepMind working on data compression, efficient language models, and multilingual modeling.

Karina Nguyen portrait
Researcher 1 reports

Karina Nguyen

Amazon

Research scientist at Amazon AGI and former Stanford University PhD student.

Karthik Prasad portrait
Researcher 1 reports

Karthik Prasad

Meta AI

Staff research scientist at Meta whose work focuses on multilingual and multimodal AI, reasoning, and safety.

Kateryna Gashteovski portrait
Researcher 1 reports

Kateryna Gashteovski

Google Gemini

Research scientist at Google DeepMind working on natural language understanding and machine learning.

Kensen Shi portrait
Researcher 1 reports

Kensen Shi

Google Gemini

Research scientist at Google DeepMind working on machine learning and natural language processing.

Kewei Tu portrait
Researcher 1 reports

Kewei Tu

Cohere

Associate professor at ShanghaiTech University working on natural language processing and machine learning.

Khalid Salama portrait
Researcher 1 reports

Khalid Salama

Google Gemini

Professor of Electrical and Computer Engineering at KAUST whose research focuses on machine learning and AI, including edge intelligence, tinyML, efficient AI, and uncertainty-aware learning.

Kyla Sheppard portrait
Researcher 1 reports

Kyla Sheppard

OpenAI

Program manager at OpenAI leading contracting, revenue, pricing, and finance operations, after earlier operations and finance roles at Tesla and Capchase.

Lenny Bogdonoff portrait
Researcher 1 reports

Lenny Bogdonoff

OpenAI

OpenAI contributor with a personal website and public engineering projects.

Maarten Bosma portrait
Researcher 1 reports

Maarten Bosma

Google Gemini

Research scientist at Google DeepMind whose work focuses on pre-training, post-training, and reasoning for large language models.

Maneesh Bhadauria portrait
Researcher 1 reports

Maneesh Bhadauria

NVIDIA

Research scientist at NVIDIA Research working on computer vision, machine learning, and deep learning, and a co-author of the Nemotron-4 340B technical report.

Mao Zheng portrait
Researcher 1 reports

Mao Zheng

DeepSeek

Co-founder of DeepSeek who leads pre-training and post-training research, with a focus on language models, reasoning models, and reinforcement learning.

Marie-Therese C. Mercier portrait
Researcher 1 reports

Marie-Therese C. Mercier

Mistral AI

Research scientist at Mistral AI with a public publication record on Google Scholar and DBLP.

Mark Sandler portrait
Researcher 1 reports

Mark Sandler

Google Gemini

Research scientist at Google DeepMind focused on multimodal and embodied AI. His work covers computer vision, video understanding, and machine learning for interactive systems.

Marvin Zhang portrait
Researcher 1 reports

Marvin Zhang

OpenAI

OpenAI contributor to GPT-4 evaluation and analysis, including Evals open-sourcing, instruction-following evaluation, and API model evaluations.

Matej Balog portrait
Researcher 1 reports

Matej Balog

Google Gemini

Senior staff research scientist at Google DeepMind and assistant professor at the Czech Technical University in Prague.

Mehdi Rezagholizadeh portrait
Researcher 1 reports

Mehdi Rezagholizadeh

NVIDIA

Senior applied scientist at NVIDIA focusing on generative and agentic AI, language models, information extraction, multimodality, and reasoning.

Merry Mou portrait
Researcher 1 reports

Merry Mou

Ai2

Merry Mou is a researcher at Ai2 whose public homepage and Google Scholar profile highlight work on language model post-training, evaluation, and open-source foundation models.

Michael Mitzenmacher portrait
Researcher 1 reports

Michael Mitzenmacher

NVIDIA

Harvard computer scientist and professor whose research spans algorithms, probability, information theory, and data streams.

Miles Brundage portrait
Researcher 1 reports

Miles Brundage

OpenAI

Senior research fellow at the Institute for AI Policy and Strategy and former OpenAI research scientist focused on AI governance and policy.

Mira Murati portrait
Researcher 1 reports

Mira Murati

OpenAI

Founder and CEO of Thinking Machines Lab and former CTO of OpenAI.

Mustafa El-Kishky portrait
Researcher 1 reports

Mustafa El-Kishky

Google Gemini

Research scientist at Google DeepMind working on natural language processing, speech, and multimodal machine learning.

Natalie Staudacher portrait
Researcher 1 reports

Natalie Staudacher

OpenAI

Researcher at OpenAI working across multimodal systems, reasoning, and new product experiences, with earlier roles at Google and YouTube.

Nick Turley portrait
Researcher 1 reports

Nick Turley

OpenAI

Head of product for ChatGPT at OpenAI. He works on ChatGPT and related product experiences, including agent-style workflows and consumer AI tools.

Noah Deutsch portrait
Researcher 1 reports

Noah Deutsch

OpenAI

Research scientist at OpenAI.

Ofer Arazi portrait
Researcher 1 reports

Ofer Arazi

Cohere

Machine learning researcher at Cohere whose work focuses on large language models and post-training.

Olivier Delalleau portrait
Researcher 1 reports

Olivier Delalleau

Cohere

Research scientist at Cohere whose work focuses on deep learning and large language models.

Patricio Lopez Macedo portrait
Researcher 1 reports

Patricio Lopez Macedo

Google Gemini

Research scientist at Google DeepMind focused on learning with reward feedback and reasoning in large language models.

Paul Michel portrait
Researcher 1 reports

Paul Michel

Cohere

Research scientist working on large language models, scaling, alignment, and language; public work includes machine translation robustness, data selection, CodeGemma, and reward overoptimization in LLMs.

Pavan Balaji portrait
Researcher 1 reports

Pavan Balaji

NVIDIA

Pavan Balaji is a computer scientist and research leader in high-performance computing. His public profiles include work at NVIDIA and leadership of the Argonne Leadership Computing Facility, with research spanning parallel programming models, communication libraries, and runtime systems.

Prasad Sattigeri portrait
Researcher 1 reports

Prasad Sattigeri

Snowflake

Prasad Sattigeri is a machine learning engineer at Snowflake working on agentic AI systems and machine learning.

Pratyusha Kamesetty portrait
Researcher 1 reports

Pratyusha Kamesetty

NVIDIA

Pratyusha Kamesetty is a deep learning research scientist at NVIDIA and a PhD candidate in computer science at Stanford University. Her research focuses on language models, recommendation systems, and generative AI for music and video.

Preston Tuggle portrait
Researcher 1 reports

Preston Tuggle

OpenAI

Preston Tuggle is a distributed systems engineer at OpenAI.

Rajarshi Das portrait
Researcher 1 reports

Rajarshi Das

NVIDIA

Research scientist at NVIDIA.

Reiichiro Nakano portrait
Researcher 1 reports

Reiichiro Nakano

OpenAI

Technical staff member at OpenAI. Previously completed a PhD at Stanford University and was a research fellow at the Simons Institute for the Theory of Computing.

Rishiraj Adhikary portrait
Researcher 1 reports

Rishiraj Adhikary

NVIDIA

Rishiraj Adhikary is a senior applied research scientist at NVIDIA working on AI systems and generative AI.

Roger Waleffe portrait
Researcher 1 reports

Roger Waleffe

NVIDIA

Research scientist at NVIDIA.

Rory Carmichael portrait
Researcher 1 reports

Rory Carmichael

OpenAI

DPhil candidate in Engineering Science at the University of Oxford whose work spans cybersecurity, machine learning, and natural language semantics.

Ross Alberger portrait
Researcher 1 reports

Ross Alberger

Meta AI

Research scientist at Meta working on language models and machine translation.

Ruisong Zhang portrait
Researcher 1 reports

Ruisong Zhang

DeepSeek

Researcher at DeepSeek focused on multimodal models, AI agents, and efficient large language models; previously worked on large-scale recommender systems at Tencent and Kuaishou.

Sadanand Rajaram portrait
Researcher 1 reports

Sadanand Rajaram

Cohere

Research scientist at Cohere working on multimodal and embodied AI, robotics, and machine learning systems.

Sameer Singh portrait
Researcher 1 reports

Sameer Singh

Snowflake

Associate professor at UC Irvine and research director at Snowflake AI Research.

Saurabh Kumar portrait
Researcher 1 reports

Saurabh Kumar

Cohere

Research scientist at Cohere whose homepage highlights work on efficient and reliable natural language processing, following earlier research engineering work at Google.

Shawn Henry portrait
Researcher 1 reports

Shawn Henry

NVIDIA

Principal applied scientist at NVIDIA whose public work focuses on applied machine learning and large language models.

Shibani Santurkar portrait
Researcher 1 reports

Shibani Santurkar

OpenAI

Machine learning researcher focused on reliable ML systems. She was a postdoc in Computer Science at Stanford and earned her PhD from MIT.

Shino Jomoto portrait
Researcher 1 reports

Shino Jomoto

OpenAI

OpenAI contributor with an official OpenAI profile page; public contribution pages list work on GPT-4, GPT-4o, and DALL-E 2.

Sho Takase portrait
Researcher 1 reports

Sho Takase

Cohere

Research scientist at Cohere whose work focuses on natural language processing and language models.

Shruti Bhosale portrait
Researcher 1 reports

Shruti Bhosale

Meta AI

Research engineer at Meta working on Llama models and generative AI. She previously completed an MS in computer science at New York University.

Shuyan Zhou portrait
Researcher 1 reports

Shuyan Zhou

Ai2

Research scientist at the Allen Institute for AI focused on training, evaluating, and interpreting language models, with work spanning NLP and computational social science.

Simón Posada Fishman portrait
Researcher 1 reports

Simón Posada Fishman

OpenAI

OpenAI member building agentic products; previously founded the AI note-taking app Circleback and writes about AI, product, and startups.

Sina Samangooei portrait
Researcher 1 reports

Sina Samangooei

Google Gemini

Sina Samangooei is a machine learning researcher whose public work includes multimodal learning and open language models.

Sudharsan Kodikala portrait
Researcher 1 reports

Sudharsan Kodikala

NVIDIA

Machine learning researcher at NVIDIA focused on large language models and reinforcement learning.

Sully Chen portrait
Researcher 1 reports

Sully Chen

OpenAI

Engineer and builder who co-founded comma.ai and previously led self-driving at OpenAI, with public work spanning autonomous driving, AGI, and music.

Toby Jia-Jun Li portrait
Researcher 1 reports

Toby Jia-Jun Li

Google Gemini

Assistant professor at Cornell Tech whose research focuses on human-AI interaction, AI copilots, and tools that help people learn, create, and make decisions with AI.

Trieu Trinh portrait
Researcher 1 reports

Trieu Trinh

Google Gemini

Mathematician and machine learning researcher whose NYU profile describes him as a PhD student in mathematics, advising at NYU Shanghai and a research intern at Google DeepMind; he also co-authored AlphaGeometry and the Gemma report.

Tyler Scialom portrait
Researcher 1 reports

Tyler Scialom

Ai2

Research scientist at Ai2 working on personalized language models, instruction tuning, and reinforcement learning from human feedback.

Vedaant Madan portrait
Researcher 1 reports

Vedaant Madan

Cohere

Researcher at Cohere.

Venkatesh Akella portrait
Researcher 1 reports

Venkatesh Akella

NVIDIA

Venkatesh Akella is a professor of electrical and computer engineering at the University of California, Davis and director of the Center for AI and Experimental Futures. His research includes machine learning, trustworthy AI, causal AI, computer vision, robotics, and intelligent systems.

Virginia Hofmann portrait
Researcher 1 reports

Virginia Hofmann

Ai2

Research scientist at Ai2 and PhD student at UC Berkeley working on language models, multilingual NLP, and evaluation.

Wendi Gu portrait
Researcher 1 reports

Wendi Gu

Google Gemini

Research scientist at Google DeepMind and incoming assistant professor at Stanford GSB working on AI and economics.

Wenjie Hu portrait
Researcher 1 reports

Wenjie Hu

DeepSeek

Developer and researcher with public GitHub activity on DeepSeek-AI repositories including DeepSeek-R1 and DeepSeek-VL2, plus coauthor credit on the DeepSeek LLM Technical Report.

Xipeng Qiu portrait
Researcher 1 reports

Xipeng Qiu

Meta AI

Xipeng Qiu is a Fudan University professor whose public work focuses on natural language processing and large language models.

Yusuke Iwasawa portrait
Researcher 1 reports

Yusuke Iwasawa

Cohere

Yusuke Iwasawa is an Assistant Professor in Computer Science at the University of Tokyo whose work spans human-computer interaction and machine learning.

Zenan Zhou portrait
Researcher 1 reports

Zenan Zhou

Baichuan

Researcher at Baichuan focused on large language models and multimodal systems. Public publication profiles include work on Baichuan 2, Baichuan-M1, and Baichuan-M2.

Zhicheng Ma portrait
Researcher 1 reports

Zhicheng Ma

DeepSeek

Research scientist at DeepSeek and a Ph.D. student in the School of Software at Tsinghua University. His research interests include large language models and multimodal learning.

Tony Gracious portrait
Researcher 1 reports

Tony Gracious

Ai2

Tony Gracious completed his PhD in the Department of Computer Science and Automation at IISc Bangalore. His work includes representation learning, temporal point processes, and higher-order interaction forecasting, and he later joined Dolby's Advanced Technology Group in Bangalore.

Jinliang Guo portrait
Researcher 3 reports

Jinliang Guo

NVIDIA

Jinliang Guo is listed as an author of the NVIDIA technical report Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models.

Pengxiang Cheng portrait
Researcher 3 reports

Pengxiang Cheng

Z.ai

Public report authorship links Pengxiang Cheng to the Z.ai report GLM-Z1-Rumination: An Open Frontier-Class Reasoning Model Through Test-Time Scaling.

Qingyang Ge portrait
Researcher 3 reports

Qingyang Ge

MiniMax

Public report authorship links Qingyang Ge to the MiniMax report MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention.

Rudra Murthy portrait
Researcher 3 reports

Rudra Murthy

NVIDIA

Rudra Murthy is listed as an author of the NVIDIA technical report Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models.

Wenhui Wang portrait
Researcher 3 reports

Wenhui Wang

Microsoft

Co-author of "The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits"; the paper's author notes list Wenhui Wang with Microsoft Research.

Xi Chen portrait
Researcher 3 reports

Xi Chen

Z.ai

Xi Chen is listed as an author of the Z.ai technical report GLM-4.1V-Thinking and GLM-4.5V: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning.

Xinhua Cheng portrait
Researcher 3 reports

Xinhua Cheng

Z.ai

Public report authorship links Xinhua Cheng to the Z.ai report GLM-Z1-Rumination: An Open Frontier-Class Reasoning Model Through Test-Time Scaling.

Xuancheng Ren portrait
Researcher 3 reports

Xuancheng Ren

Alibaba Qwen

Xuancheng Ren is listed as an author of the Qwen3 Technical Report.

Yizhe Yang portrait
Researcher 3 reports

Yizhe Yang

Z.ai

Public report authorship links Yizhe Yang to the Z.ai report GLM-Z1-Rumination: An Open Frontier-Class Reasoning Model Through Test-Time Scaling.

Zhongzheng Ren portrait
Researcher 3 reports

Zhongzheng Ren

Z.ai

Public report authorship links Zhongzheng Ren to the Z.ai report GLM-Z1-Rumination: An Open Frontier-Class Reasoning Model Through Test-Time Scaling.

Jeff Wu portrait
Researcher 3 reports

Jeff Wu

OpenAI

Researcher at OpenAI working on language model training and evaluation, and co-author of the GPT-4 Technical Report.

Beichen Zhang portrait
Researcher 3 reports

Beichen Zhang

Alibaba Qwen

Research scientist in Tongyi Lab whose official profile highlights post-training, efficient reasoning, and coding large language models.

Chang Zhou portrait
Researcher 3 reports

Chang Zhou

Alibaba Qwen

Qwen researcher and co-lead whose work focuses on pretraining and post-training, multimodal models, agent systems, and large-scale model infrastructure.

Ruoyu Zhang portrait
Researcher 3 reports

Ruoyu Zhang

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Shangyan Zhou portrait
Researcher 3 reports

Shangyan Zhou

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Shijie Wang portrait
Researcher 3 reports

Shijie Wang

Alibaba Qwen

Senior research scientist in Tongyi Lab whose official profile highlights post-training, AI for science, evaluation and alignment, multimodal reasoning, and large language model reasoning.

Shirong Ma portrait
Researcher 3 reports

Shirong Ma

DeepSeek

Researcher at DeepSeek-AI in Hangzhou, China and co-author of the Nature paper introducing DeepSeek-R1.

Shiyu Wang portrait
Researcher 3 reports

Shiyu Wang

DeepSeek

DeepSeek contributor and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Shunfeng Zhou portrait
Researcher 3 reports

Shunfeng Zhou

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Tian Pei portrait
Researcher 3 reports

Tian Pei

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Ting Song portrait
Researcher 2 reports

Ting Song

Microsoft

Ting Song is listed as an author of the BitNet b1.58 2B4T Technical Report; the report states that T. Song is with Microsoft Research.

Wenjun Gao portrait
Researcher 3 reports

Wenjun Gao

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Xiaodong Liu portrait
Researcher 3 reports

Xiaodong Liu

DeepSeek

DeepSeek contributor and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Xiaotao Nie portrait
Researcher 3 reports

Xiaotao Nie

DeepSeek

DeepSeek contributor and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Xin Xie portrait
Researcher 3 reports

Xin Xie

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Yanhong Xu portrait
Researcher 3 reports

Yanhong Xu

DeepSeek

DeepSeek contributor and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Yan Xia portrait
Researcher 2 reports

Yan Xia

Microsoft

Co-author of the BitNet b1.58 2B4T Technical Report; the report states Yan Xia is with Microsoft Research.

Yaofeng Sun portrait
Researcher 3 reports

Yaofeng Sun

DeepSeek

Researcher at DeepSeek-AI in Hangzhou, China and co-author of the Nature paper introducing DeepSeek-R1.

Yaohui Wang portrait
Researcher 3 reports

Yaohui Wang

DeepSeek

DeepSeek contributor and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Yao Li portrait
Researcher 3 reports

Yao Li

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Yichao Zhang portrait
Researcher 3 reports

Yichao Zhang

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Yichi Zhang portrait
Researcher 3 reports

Yichi Zhang

Moonshot AI / Z.ai

Researcher at Moonshot AI and co-author of the Kimi K2.5 report on visual agentic intelligence.

Ying He portrait
Researcher 3 reports

Ying He

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Yiyuan Liu portrait
Researcher 3 reports

Yiyuan Liu

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Yuheng Zou portrait
Researcher 3 reports

Yuheng Zou

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Yuxuan Hu portrait
Researcher 3 reports

Yuxuan Hu

Z.ai

Co-author of GLM-4.1V-Thinking and GLM-4.5V, multimodal reasoning models trained with scalable reinforcement learning.

Zhangli Sha portrait
Researcher 3 reports

Zhangli Sha

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Juston Forte portrait
Researcher 1 reports

Juston Forte

OpenAI

Lead Solutions Architect at OpenAI, based in New York City, according to an official OpenAI Forum speaker page.

Jilong Xue portrait
Researcher 1 reports

Jilong Xue

Microsoft

Co-author of "The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits"; the paper states J. Xue is with Microsoft Research.

Ali Payani portrait
Researcher 1 reports

Ali Payani

Google Gemini

Cisco Research researcher working in artificial intelligence and machine learning; current public evidence does not support the prior Google DeepMind profile.

Amanpreet Singh portrait
Researcher 1 reports

Amanpreet Singh

Ai2

Lead of OLMo, AI2's open-source frontier language models, and Associate Director of the OLMo team at the Allen Institute for AI.

Sherwin Wu portrait
Researcher 1 reports

Sherwin Wu

OpenAI

Software engineer publicly identified on OpenAI's GPT-4 contributions page and in a 2023 QCon speaker bio as a Member of Technical Staff on OpenAI's Developer Platform team.

Soumya Batra portrait
Researcher 1 reports

Soumya Batra

Meta AI

Soumya Batra is a PhD student in computer science at Stanford University and a former research intern at FAIR (Facebook AI Research).

Ali Payani portrait
Researcher 3 reports

Ali Payani

NVIDIA

Public report authorship links Ali Payani to the Ai2 report OLMoE: Open Mixture-of-Experts Language Models.

Dahua Lin portrait
Researcher 3 reports

Dahua Lin

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Dirk Groeneveld portrait
Researcher 3 reports

Dirk Groeneveld

Ai2

Public sources identify Dirk Groeneveld as an OLMo 3 coauthor; a 2022 USC ISI speaker bio described him as a principal engineer at AI2 and technical lead for the AllenNLP Platform team.

Jiaqi Wang portrait
Researcher 3 reports

Jiaqi Wang

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Jin Chen portrait
Researcher 3 reports

Jin Chen

DeepSeek

Jin Chen is listed as an author of the DeepSeek-V3 Technical Report.

Yu Qiao portrait
Researcher 3 reports

Yu Qiao

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Andrew M. Dai portrait
Researcher 3 reports

Andrew M. Dai

Google Gemini

Research scientist at Google DeepMind in Mountain View working on machine learning, reinforcement learning, and robotics.

Aurelien Rodriguez portrait
Researcher 3 reports

Aurelien Rodriguez

Meta AI

Research scientist at Meta GenAI in Menlo Park working on foundation models, language generation, and evaluation.

Bingxuan Wang portrait
Researcher 3 reports

Bingxuan Wang

DeepSeek

Research scientist at DeepSeek with public scholarly work on large language models and reasoning.

Demis Hassabis portrait
Researcher 3 reports

Demis Hassabis

Google Gemini

Founder and CEO of Google DeepMind, leading AI research and product development; his work spans AI, neuroscience, game playing, and structural biology.

Guowei Li portrait
Researcher 3 reports

Guowei Li

DeepSeek

Research scientist at DeepSeek with a public Google Scholar profile covering large language models and efficient inference.

Honghui Ding portrait
Researcher 3 reports

Honghui Ding

DeepSeek

Research scientist at DeepSeek with public GitHub work on reinforcement learning from human feedback and language models.

Hui Qu portrait
Researcher 3 reports

Hui Qu

DeepSeek

Research scientist at DeepSeek with public GitHub work on large language models and AI agents.

Jacob Morrison portrait
Researcher 3 reports

Jacob Morrison

Ai2

Jacob Morrison is a researcher whose work spans language model post-training, alignment, and evaluation. His public research page highlights projects including Tulu 2, Tulu 3, OLMo 2, and RewardBench.

Jianzhong Guo portrait
Researcher 3 reports

Jianzhong Guo

DeepSeek

Research scientist at DeepSeek AI working on multimodal large language models and efficient multimodal alignment. Previously completed a PhD at Nanjing University focused on machine learning and computer vision.

Jinyuan Jia portrait
Researcher 3 reports

Jinyuan Jia

MiniMax

Researcher working on speech and multimodal language models, including MiniMax-Speech and related speech understanding work.

Kaige Gao portrait
Researcher 3 reports

Kaige Gao

DeepSeek

Research scientist at DeepSeek with public GitHub work spanning AI systems.

Lecong Zhang portrait
Researcher 3 reports

Lecong Zhang

DeepSeek

Research scientist at DeepSeek with public GitHub projects on AI systems.

Lucile Saulnier portrait
Researcher 3 reports

Lucile Saulnier

Mistral AI

AI scientist at Mistral AI and PhD student at ENS Paris and CNRS. Her research focuses on language grounding, multimodality, and evaluation for multimodal and language models.

Ming Ding portrait
Researcher 3 reports

Ming Ding

MiniMax

Lead of foundation models at MiniMax working on large language models, multimodal pretraining, and efficient training systems. He completed a PhD in computer science at Tsinghua University.

Minghua Zhang portrait
Researcher 3 reports

Minghua Zhang

DeepSeek

Research scientist at DeepSeek with public GitHub work on language models and AI systems.

Orhan Firat portrait
Researcher 3 reports

Orhan Firat

Google Gemini

Research scientist at Google Research whose public work spans multilingual and large-scale language modeling; arXiv author results include the PaLM paper.

Oriol Vinyals portrait
Researcher 3 reports

Oriol Vinyals

Google Gemini

Chief Scientist at Google DeepMind and Vice President of Research leading Gemini, with work spanning scalable sequence learning, large language models, games, and robotics.

Oyvind Tafjord portrait
Researcher 3 reports

Oyvind Tafjord

Ai2

Senior research scientist at Ai2 and affiliate associate professor at the University of Washington, with work spanning NLP, machine reading, and reasoning.

Qiushi Du portrait
Researcher 3 reports

Qiushi Du

DeepSeek

Researcher at DeepSeek AI interested in large language models and multimodal language models.

Rohan Anil portrait
Researcher 3 reports

Rohan Anil

Google Gemini

Rohan Anil is a research scientist at Google DeepMind. His public homepage highlights work on large language models, efficient machine learning systems, and multimodal AI.

Ruiqi Ge portrait
Researcher 3 reports

Ruiqi Ge

DeepSeek

Researcher affiliated with DeepSeek-AI on Google Scholar and coauthor of the DeepSeek technical reports.

Teven Le Scao portrait
Researcher 3 reports

Teven Le Scao

Mistral AI

Research scientist at Mistral AI and co-author of the Mistral 7B report.

Xuecheng Su portrait
Researcher 3 reports

Xuecheng Su

DeepSeek

Researcher and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

Yishi Piao portrait
Researcher 3 reports

Yishi Piao

DeepSeek

Researcher and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

Yossi Matias portrait
Researcher 3 reports

Yossi Matias

Google Gemini

Vice President of Engineering and Research at Google and site lead for the Google Center in Israel; he also leads Search, Research, and AI for Crisis Response.

Yukun Li portrait
Researcher 3 reports

Yukun Li

DeepSeek

DeepSeek researcher and coauthor of the DeepSeek LLM, DeepSeek-V2, and DeepSeek-V3 technical reports.

Yusheng Zhao portrait
Researcher 3 reports

Yusheng Zhao

MiniMax

Research scientist at MiniMax AI Research focused on reinforcement learning, reasoning, multimodal learning, large language models, and large-scale distributed systems. He received a PhD in machine learning from Carnegie Mellon University.

Zehui Ren portrait
Researcher 3 reports

Zehui Ren

DeepSeek

Research scientist at DeepSeek AI and co-author of the DeepSeek-V3, DeepSeek-V2, and DeepSeek LLM technical reports.

Baobao Chang portrait
Researcher 2 reports

Baobao Chang

Z.ai

Public report authorship links Baobao Chang to the GLM-4.5: Agentic, Reasoning, and Coding Foundation Models at Z.ai.

Chaomin Shen portrait
Researcher 2 reports

Chaomin Shen

Z.ai

Chaomin Shen is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Dakun Zhang portrait
Researcher 2 reports

Dakun Zhang

Z.ai

Dakun Zhang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Dehao Chen portrait
Researcher 2 reports

Dehao Chen

Google Gemini

Public report authorship links Dehao Chen to the PaLM: Scaling Language Modeling with Pathways at Google.

Dejiang Dou portrait
Researcher 2 reports

Dejiang Dou

MiniMax

Public report authorship links Dejiang Dou to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Fangrui Zhang portrait
Researcher 2 reports

Fangrui Zhang

Z.ai

Fangrui Zhang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Gianna Lengyel portrait
Researcher 2 reports

Gianna Lengyel

Mistral AI

Gianna Lengyel is listed as an author of the Mistral AI technical report Mistral 7B.

Hang Yan portrait
Researcher 2 reports

Hang Yan

Z.ai

Public report authorship links Hang Yan to the GLM-4.5: Agentic, Reasoning, and Coding Foundation Models at Z.ai.

Haolin Jia portrait
Researcher 2 reports

Haolin Jia

MiniMax

Public report authorship links Haolin Jia to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Haoqi Chen portrait
Researcher 2 reports

Haoqi Chen

Z.ai

Public report authorship links Haoqi Chen to the GLM-4.5 report at Z.ai.

Hyeonman Lee portrait
Researcher 2 reports

Hyeonman Lee

LG AI Research

Researcher at LG AI Research and coauthor of the EXAONE 3.5: Series of Language Models for Real-world Use Cases.

H. Zhang portrait
Researcher 2 reports

H. Zhang

DeepSeek

H. Zhang is listed as an author of the DeepSeek-V3 Technical Report.

Jamie Kiros portrait
Researcher 2 reports

Jamie Kiros

OpenAI / NVIDIA

Public report authorship links Jamie Kiros to the Nemotron-4 340B Technical Report at NVIDIA.

Jiale Zhi portrait
Researcher 2 reports

Jiale Zhi

MiniMax

Public report authorship links Jiale Zhi to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Jialiang Wang portrait
Researcher 2 reports

Jialiang Wang

Z.ai

Jialiang Wang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Jianfeng Gao portrait
Researcher 2 reports

Jianfeng Gao

NVIDIA

Public report authorship links Jianfeng Gao to the Nemotron-4 15B Technical Report at NVIDIA.

Jiangnan Li portrait
Researcher 2 reports

Jiangnan Li

Z.ai

Public report authorship links Jiangnan Li to the GLM-4.5 report at Z.ai.

Jian Ma portrait
Researcher 2 reports

Jian Ma

MiniMax

Public report authorship links Jian Ma to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Jian Song portrait
Researcher 2 reports

Jian Song

Z.ai

Public report authorship links Jian Song to the GLM-4.5: Agentic, Reasoning, and Coding Foundation Models at Z.ai.

Jiaqi Gao portrait
Researcher 2 reports

Jiaqi Gao

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Jinhui Yang portrait
Researcher 2 reports

Jinhui Yang

Z.ai

Jinhui Yang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Jong C. Park portrait
Researcher 2 reports

Jong C. Park

LG AI Research

Researcher at LG AI Research and coauthor of the EXAONE 3.5: Series of Language Models for Real-world Use Cases.

Kaixin Yang portrait
Researcher 2 reports

Kaixin Yang

MiniMax

Public report authorship links Kaixin Yang to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Li Dong portrait
Researcher 2 reports

Li Dong

Z.ai

Public report authorship links Li Dong to the GLM-4.5 report at Z.ai.

Linjie Zhao portrait
Researcher 2 reports

Linjie Zhao

Z.ai

Linjie Zhao is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Mahyar Babaei portrait
Researcher 2 reports

Mahyar Babaei

NVIDIA / Microsoft

Mahyar Babaei is listed as an author of Microsoft's Phi-4 Technical Report.

Mateusz Litwin portrait
Researcher 2 reports

Mateusz Litwin

OpenAI

Mateusz Litwin is listed as an author of the OpenAI technical report Language Models are Few-Shot Learners.

Ming Ding portrait
Researcher 2 reports

Ming Ding

Z.ai

Ming Ding is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Mingpeng Zhou portrait
Researcher 2 reports

Mingpeng Zhou

Z.ai

Public report authorship links Mingpeng Zhou to the GLM-4.5: Agentic, Reasoning, and Coding Foundation Models at Z.ai.

Mingxing Zhang portrait
Researcher 2 reports

Mingxing Zhang

Google Gemini

Public report authorship links Mingxing Zhang to the Gemma 3n Technical Report at Google.

Mingze Li portrait
Researcher 2 reports

Mingze Li

Alibaba Qwen / Meta AI

Mingze Li is listed as an author of the Qwen3 Technical Report.

Mostafijur Rahman portrait
Researcher 2 reports

Mostafijur Rahman

LG AI Research

Researcher at LG AI Research and coauthor of the EXAONE 3.5: Series of Language Models for Real-world Use Cases.

M. Saiful Bari portrait
Researcher 2 reports

M. Saiful Bari

LG AI Research

Researcher at LG AI Research and coauthor of the EXAONE 3.5: Series of Language Models for Real-world Use Cases.

M. Waqar Azim portrait
Researcher 2 reports

M. Waqar Azim

LG AI Research

Researcher at LG AI Research and coauthor of the EXAONE 3.5: Series of Language Models for Real-world Use Cases.

Nhan Tran portrait
Researcher 2 reports

Nhan Tran

Cerebras Systems

Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Niket Tandon portrait
Researcher 2 reports

Niket Tandon

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.

Peiyi Wang portrait
Researcher 2 reports

Peiyi Wang

MiniMax

Public report authorship links Peiyi Wang to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Prasad Reddy Yadati portrait
Researcher 2 reports

Prasad Reddy Yadati

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.

Qianlin Peng portrait
Researcher 2 reports

Qianlin Peng

MiniMax

Public report authorship links Qianlin Peng to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Qiao Zhang portrait
Researcher 2 reports

Qiao Zhang

Google Gemini / Moonshot AI

Qiao Zhang is listed as an author of Google's PaLM 2 Technical Report.

Qinyuan Wang portrait
Researcher 2 reports

Qinyuan Wang

Z.ai

Qinyuan Wang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Rong Wu portrait
Researcher 2 reports

Rong Wu

Z.ai

Public report authorship links Rong Wu to the GLM-4.5 report at Z.ai.

Sanghyun Choi portrait
Researcher 2 reports

Sanghyun Choi

LG AI Research

Researcher at LG AI Research and coauthor of the EXAONE 3.5: Series of Language Models for Real-world Use Cases.

Sebastian Goodman portrait
Researcher 2 reports

Sebastian Goodman

Google Gemini

Public report authorship links Sebastian Goodman to the Gemma 3n Technical Report at Google.

Shuai Wang portrait
Researcher 2 reports

Shuai Wang

Z.ai

Public report authorship links Shuai Wang to the GLM-4.5: Agentic, Reasoning, and Coding Foundation Models at Z.ai.

Siyao Huang portrait
Researcher 2 reports

Siyao Huang

Z.ai

Siyao Huang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Siyuan Fan portrait
Researcher 2 reports

Siyuan Fan

MiniMax

Public report authorship links Siyuan Fan to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Su Wang portrait
Researcher 2 reports

Su Wang

Google Gemini

Public report authorship links Su Wang to the Gemma 3n Technical Report at Google.

Tianchu Gao portrait
Researcher 2 reports

Tianchu Gao

Z.ai

Public report authorship links Tianchu Gao to the GLM-4.5 report at Z.ai.

Trevor Cai portrait
Researcher 2 reports

Trevor Cai

Google Gemini / OpenAI

Trevor Cai is listed as an author of the Google technical report RecurrentGemma: Moving Past Transformers for Efficient Open Language Models.

T. Wang portrait
Researcher 2 reports

T. Wang

DeepSeek

T. Wang is listed as an author of the DeepSeek-V3 Technical Report.

Wenbo Bi portrait
Researcher 2 reports

Wenbo Bi

MiniMax

Public report authorship links Wenbo Bi to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Xiang Li portrait
Researcher 2 reports

Xiang Li

MiniMax

Xiang Li is listed as an author of the MiniMax technical report MiniMax-01: Scaling Foundation Models with Lightning Attention.

Xiao Liu portrait
Researcher 2 reports

Xiao Liu

MiniMax

Public report authorship links Xiao Liu to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Xiaowei Jiang portrait
Researcher 2 reports

Xiaowei Jiang

Z.ai

Xiaowei Jiang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Xiaoxin Li portrait
Researcher 2 reports

Xiaoxin Li

Z.ai

Public report authorship links Xiaoxin Li to the GLM-4.5 report at Z.ai.

Xiaoyi Wang portrait
Researcher 2 reports

Xiaoyi Wang

MiniMax

Public report authorship links Xiaoyi Wang to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Xibin Wu portrait
Researcher 2 reports

Xibin Wu

MiniMax

Public report authorship links Xibin Wu to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Xin Fu portrait
Researcher 2 reports

Xin Fu

Z.ai

Xin Fu is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Xuanhao Ge portrait
Researcher 2 reports

Xuanhao Ge

Z.ai

Xuanhao Ge is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Xudong Xie portrait
Researcher 2 reports

Xudong Xie

Z.ai

Xudong Xie is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Yafei Wen portrait
Researcher 2 reports

Yafei Wen

MiniMax

Yafei Wen is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.

Yali Wang portrait
Researcher 2 reports

Yali Wang

MiniMax

Public report authorship links Yali Wang to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Yang Jiang portrait
Researcher 2 reports

Yang Jiang

Z.ai

Yang Jiang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Yibo Tian portrait
Researcher 2 reports

Yibo Tian

Z.ai

Yibo Tian is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Yifei Lu portrait
Researcher 2 reports

Yifei Lu

MiniMax

Public report authorship links Yifei Lu to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Yijia Shao portrait
Researcher 2 reports

Yijia Shao

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.

Yilong Chen portrait
Researcher 2 reports

Yilong Chen

Z.ai

Yilong Chen is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Yingjie Li portrait
Researcher 2 reports

Yingjie Li

MiniMax

Public report authorship links Yingjie Li to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Yinlong Xu portrait
Researcher 2 reports

Yinlong Xu

Z.ai

Yinlong Xu is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Yi Wang portrait
Researcher 2 reports

Yi Wang

Z.ai

Yi Wang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Yong Li portrait
Researcher 2 reports

Yong Li

Z.ai

Yong Li is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Youngjae Yu portrait
Researcher 2 reports

Youngjae Yu

LG AI Research

Researcher at LG AI Research and coauthor of the EXAONE 3.5: Series of Language Models for Real-world Use Cases.

Yuhan Li portrait
Researcher 2 reports

Yuhan Li

Z.ai

Yuhan Li is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Yu Liu portrait
Researcher 2 reports

Yu Liu

MiniMax

Public report authorship links Yu Liu to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Yuxiang Zhao portrait
Researcher 2 reports

Yuxiang Zhao

Z.ai

Yuxiang Zhao is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Yuxuan Wang portrait
Researcher 2 reports

Yuxuan Wang

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Yuzeng Li portrait
Researcher 2 reports

Yuzeng Li

MiniMax

Public report authorship links Yuzeng Li to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Yu Zhou portrait
Researcher 2 reports

Yu Zhou

MiniMax

Public report authorship links Yu Zhou to the MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention at MiniMax.

Zehui Wu portrait
Researcher 2 reports

Zehui Wu

Z.ai

Zehui Wu is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Zewen Chi portrait
Researcher 2 reports

Zewen Chi

Microsoft

Zewen Chi is listed as a co-author of the 2026 arXiv paper "Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity," which shows Microsoft Research as affiliation 1.

Zhang Zhang portrait
Researcher 2 reports

Zhang Zhang

Z.ai

Public report authorship links Zhang Zhang to the GLM-5: Thinking, Coding, and Agentic Intelligence at Z.ai.

Zhanhui Kang portrait
Researcher 2 reports

Zhanhui Kang

Z.ai

Public report authorship links Zhanhui Kang to the GLM-4.5 report at Z.ai.

Zhengduo Zhuo portrait
Researcher 2 reports

Zhengduo Zhuo

Z.ai

Zhengduo Zhuo is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Zhe Wang portrait
Researcher 2 reports

Zhe Wang

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Zhe Wang portrait
Researcher 2 reports

Zhe Wang

Z.ai

Zhe Wang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Zhili Liang portrait
Researcher 2 reports

Zhili Liang

Z.ai

Zhili Liang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Zitong Zhang portrait
Researcher 2 reports

Zitong Zhang

Z.ai

Zitong Zhang is listed as an author of the Z.ai technical report GLM-4.5: Agentic, Reasoning, and Coding Foundation Models.

Johannes Heidecke portrait
Researcher 2 reports

Johannes Heidecke

OpenAI

Head of Safety Systems at OpenAI.

Abhinav Dubey portrait
Researcher 1 reports

Abhinav Dubey

Databricks

Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.

Abhinav Kadian portrait
Researcher 1 reports

Abhinav Kadian

Databricks

Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.

Abhishek Jauhri portrait
Researcher 1 reports

Abhishek Jauhri

Databricks

Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.

Abhishek Pandey portrait
Researcher 1 reports

Abhishek Pandey

Databricks

Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.

Aditya Nori portrait
Researcher 1 reports

Aditya Nori

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.

Aimin Zhou portrait
Researcher 1 reports

Aimin Zhou

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Aiyuan Yang portrait
Researcher 1 reports

Aiyuan Yang

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Aleksander Ficek portrait
Researcher 1 reports

Aleksander Ficek

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.

Alexander Lavin portrait
Researcher 1 reports

Alexander Lavin

Cerebras Systems

Researcher at Cerebras Systems and coauthor of the BTLM-3B-8K: 7B Performance in a 3B Model.

Alok S. Shah portrait
Researcher 1 reports

Alok S. Shah

Cerebras Systems

Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Amaar Arora portrait
Researcher 1 reports

Amaar Arora

Databricks

Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.

Aming Wu portrait
Researcher 1 reports

Aming Wu

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Andrew W. Lochman portrait
Researcher 1 reports

Andrew W. Lochman

Cerebras Systems

Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Anjali Narayan portrait
Researcher 1 reports

Anjali Narayan

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.

Ankit Z. Parvez portrait
Researcher 1 reports

Ankit Z. Parvez

Databricks

Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.

Ankur Dave portrait
Researcher 1 reports

Ankur Dave

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.

Apoorv Prakash Saxena portrait
Researcher 1 reports

Apoorv Prakash Saxena

Databricks

Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.

Arash Mahdavi portrait
Researcher 1 reports

Arash Mahdavi

Snowflake

Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.

Arjun Ghukasyan portrait
Researcher 1 reports

Arjun Ghukasyan

Databricks

Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.

Armen Aghajanyan portrait
Researcher 1 reports

Armen Aghajanyan

Databricks

Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.

Arun Jindal portrait
Researcher 1 reports

Arun Jindal

Cerebras Systems

Researcher at Cerebras Systems and coauthor of the BTLM-3B-8K: 7B Performance in a 3B Model.

Ashley Li portrait
Researcher 1 reports

Ashley Li

Databricks

Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.

Atri Sharma portrait
Researcher 1 reports

Atri Sharma

Snowflake

Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.

Ayman Bhagia portrait
Researcher 1 reports

Ayman Bhagia

Databricks

Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.

Beichen Zhang portrait
Researcher 1 reports

Beichen Zhang

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Bingning Wang portrait
Researcher 1 reports

Bingning Wang

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Bin Xiao portrait
Researcher 1 reports

Bin Xiao

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Bokun Wang portrait
Researcher 1 reports

Bokun Wang

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Borong Zhang portrait
Researcher 1 reports

Borong Zhang

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Carlo Strapparava portrait
Researcher 1 reports

Carlo Strapparava

Cerebras Systems

Researcher at Cerebras Systems and coauthor of the BTLM-3B-8K: 7B Performance in a 3B Model.

Ce Bian portrait
Researcher 1 reports

Ce Bian

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Chao Yin portrait
Researcher 1 reports

Chao Yin

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Charlie Wen portrait
Researcher 1 reports

Charlie Wen

Cerebras Systems

Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Cheng Luo portrait
Researcher 1 reports

Cheng Luo

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Chenwei Wang portrait
Researcher 1 reports

Chenwei Wang

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Chenxu Lv portrait
Researcher 1 reports

Chenxu Lv

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Chenxu Lv portrait
Researcher 2 reports

Chenxu Lv

Alibaba Qwen

Research scientist in Tongyi Lab whose official profile highlights post-training, efficient reasoning, and multilingual large language models.

Christian R. Shelton portrait
Researcher 1 reports

Christian R. Shelton

Snowflake

Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.

Christopher J. Hoyt portrait
Researcher 1 reports

Christopher J. Hoyt

Cerebras Systems

Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Chuanqi Zhao portrait
Researcher 1 reports

Chuanqi Zhao

Z.ai

Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.

Cliff Young portrait
Researcher 1 reports

Cliff Young

Cerebras Systems

Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Daniel Whiteson portrait
Researcher 1 reports

Daniel Whiteson

Snowflake

Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.

Da Pan portrait
Researcher 1 reports

Da Pan

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Dian Wang portrait
Researcher 1 reports

Dian Wang

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Divya C. Mohan portrait
Researcher 1 reports

Divya C. Mohan

Cerebras Systems

Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Dong Yan portrait
Researcher 1 reports

Dong Yan

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Douglas Z. Oehm portrait
Researcher 1 reports

Douglas Z. Oehm

Cerebras Systems

Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Duc Quang Nguyen portrait
Researcher 1 reports

Duc Quang Nguyen

Cerebras Systems

Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Ernie Chang portrait
Researcher 1 reports

Ernie Chang

Snowflake

Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.

Fan Yang portrait
Researcher 1 reports

Fan Yang

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Fan Yang portrait
Researcher 1 reports

Fan Yang

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Fan Zhou portrait
Researcher 2 reports

Fan Zhou

Alibaba Qwen

Research scientist in Tongyi Lab whose official profile highlights post-training, efficient reasoning, and large multimodal models.

Fei Deng portrait
Researcher 1 reports

Fei Deng

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Fei Tian portrait
Researcher 1 reports

Fei Tian

Databricks

Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.

Feng Liu portrait
Researcher 1 reports

Feng Liu

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Feng Wang portrait
Researcher 1 reports

Feng Wang

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Filippos Kokkinos portrait
Researcher 1 reports

Filippos Kokkinos

Databricks

Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.

Fuli Luo portrait
Researcher 1 reports

Fuli Luo

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Gargi Sawarkar portrait
Researcher 1 reports

Gargi Sawarkar

Databricks

Researcher at Databricks and coauthor of the DBRX: A Generalist Open Source LLM.

Guangbo Hao portrait
Researcher 1 reports

Guangbo Hao

01.AI

Researcher at 01.AI and coauthor of the Yi: Open Foundation Models by 01.AI.

Guangwei Ai portrait
Researcher 1 reports

Guangwei Ai

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Guanhua Chen portrait
Researcher 1 reports

Guanhua Chen

Z.ai

Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.

Guoming Wang portrait
Researcher 1 reports

Guoming Wang

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Guosheng Dong portrait
Researcher 1 reports

Guosheng Dong

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Haizhou Zhao portrait
Researcher 1 reports

Haizhou Zhao

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Hangtian Liu portrait
Researcher 1 reports

Hangtian Liu

Z.ai

Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.

Hang Xu portrait
Researcher 1 reports

Hang Xu

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Han Hu portrait
Researcher 1 reports

Han Hu

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Haobo Wang portrait
Researcher 1 reports

Haobo Wang

Z.ai

Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.

Hao Ge portrait
Researcher 2 reports

Hao Ge

Alibaba Qwen

Co-author of the Qwen3 Technical Report.

Haonan Shan portrait
Researcher 1 reports

Haonan Shan

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Haoran Wei portrait
Researcher 2 reports

Haoran Wei

Alibaba Qwen

Research scientist in Tongyi Lab whose official profile highlights post-training and reasoning.

Haoyuan Li portrait
Researcher 1 reports

Haoyuan Li

01.AI

Researcher at 01.AI and coauthor of the Yi: Open Foundation Models by 01.AI.

Haoze Sun portrait
Researcher 1 reports

Haoze Sun

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Hao Zhou portrait
Researcher 1 reports

Hao Zhou

Z.ai

Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.

Harsha Chilakapati portrait
Researcher 1 reports

Harsha Chilakapati

Cerebras Systems

Researcher at Cerebras Systems and coauthor of the BTLM-3B-8K: 7B Performance in a 3B Model.

Heri Zhao portrait
Researcher 2 reports

Heri Zhao

Google Gemini

Co-author of the PaLM 2 Technical Report, which describes Google's multilingual and reasoning-oriented language model.

Hongda Zhang portrait
Researcher 1 reports

Hongda Zhang

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Hossein Hajimirsadeghi portrait
Researcher 1 reports

Hossein Hajimirsadeghi

Snowflake

Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.

Hrayr Harutyunyan portrait
Researcher 1 reports

Hrayr Harutyunyan

Cerebras Systems

Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Huan Sun portrait
Researcher 1 reports

Huan Sun

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.

Hui Liu portrait
Researcher 1 reports

Hui Liu

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Iman Mirzadeh portrait
Researcher 1 reports

Iman Mirzadeh

Snowflake

Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.

Izzeddin Gur portrait
Researcher 1 reports

Izzeddin Gur

Cerebras Systems

Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Jason Wei portrait
Researcher 1 reports

Jason Wei

Snowflake

Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.

Jaxon Wang portrait
Researcher 1 reports

Jaxon Wang

Snowflake

Researcher at Snowflake and coauthor of the Snowflake Arctic: An Enterprise LLM.

Jesus A. Gonzalez portrait
Researcher 1 reports

Jesus A. Gonzalez

Cerebras Systems

Researcher at Cerebras Systems and coauthor of the Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Jiahao Peng portrait
Researcher 1 reports

Jiahao Peng

Z.ai

Researcher at Z.ai and coauthor of the GLM-5: Thinking, Coding, and Agentic Intelligence.

Jiaming Ji portrait
Researcher 1 reports

Jiaming Ji

Baichuan

Researcher at Baichuan and coauthor of the Baichuan 2: Open Large-scale Language Models.

Jianguo Chen portrait
Researcher 1 reports

Jianguo Chen

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Jianguo Zhang portrait
Researcher 1 reports

Jianguo Zhang

Snowflake

Researcher at Snowflake and coauthor of Snowflake Arctic: An Enterprise LLM.

Jiaxi Yang portrait
Researcher 2 reports

Jiaxi Yang

Alibaba Qwen

Research scientist in Tongyi Lab whose official profile highlights multimodal understanding, image generation, and efficient large language models.

Jiayang Wu portrait
Researcher 1 reports

Jiayang Wu

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Jie Zhang portrait
Researcher 1 reports

Jie Zhang

Z.ai

Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.

Jingting Ye portrait
Researcher 1 reports

Jingting Ye

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Jionghao Bai portrait
Researcher 1 reports

Jionghao Bai

Cerebras Systems

Researcher at Cerebras Systems and coauthor of Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Jon W. Pappas portrait
Researcher 1 reports

Jon W. Pappas

Cerebras Systems

Researcher at Cerebras Systems and coauthor of Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Josue You portrait
Researcher 1 reports

Josue You

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Junnan Wang portrait
Researcher 1 reports

Junnan Wang

Z.ai

Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.

JunTao Dai portrait
Researcher 1 reports

JunTao Dai

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Junyu Huang portrait
Researcher 1 reports

Junyu Huang

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Jun Zhang portrait
Researcher 1 reports

Jun Zhang

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Kai-Hsin Chen portrait
Researcher 1 reports

Kai-Hsin Chen

Cerebras Systems

Researcher at Cerebras Systems and coauthor of Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Karan B. Patel portrait
Researcher 1 reports

Karan B. Patel

Cerebras Systems

Researcher at Cerebras Systems and coauthor of Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Kathy Lee portrait
Researcher 1 reports

Kathy Lee

Databricks

Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.

Kaustabh Goyal portrait
Researcher 1 reports

Kaustabh Goyal

Databricks

Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.

Kevin Qian portrait
Researcher 1 reports

Kevin Qian

Cerebras Systems

Researcher at Cerebras Systems and coauthor of Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Kewei Yin portrait
Researcher 1 reports

Kewei Yin

01.AI

Researcher at 01.AI and coauthor of Yi: Open Foundation Models by 01.AI.

Kun Fang portrait
Researcher 1 reports

Kun Fang

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Kushal Dave portrait
Researcher 1 reports

Kushal Dave

Databricks

Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.

Lanxia Wang portrait
Researcher 1 reports

Lanxia Wang

01.AI

Researcher at 01.AI and coauthor of Yi: Open Foundation Models by 01.AI.

Lei Dong portrait
Researcher 1 reports

Lei Dong

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Lei Su portrait
Researcher 1 reports

Lei Su

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Le Xue portrait
Researcher 1 reports

Le Xue

Cerebras Systems

Researcher at Cerebras Systems and coauthor of Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Liang Song portrait
Researcher 1 reports

Liang Song

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Liang Zhao portrait
Researcher 1 reports

Liang Zhao

Z.ai

Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.

Lifeng Liu portrait
Researcher 1 reports

Lifeng Liu

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Limin Gao portrait
Researcher 1 reports

Limin Gao

Z.ai

Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.

Li Shen portrait
Researcher 1 reports

Li Shen

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Liuzhenghao Lv portrait
Researcher 1 reports

Liuzhenghao Lv

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Liyun Ru portrait
Researcher 1 reports

Liyun Ru

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Luyao Ma portrait
Researcher 1 reports

Luyao Ma

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Mang Wang portrait
Researcher 1 reports

Mang Wang

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Maosong Cao portrait
Researcher 1 reports

Maosong Cao

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Maosong Sun portrait
Researcher 1 reports

Maosong Sun

Z.ai

Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.

Matas Noreika portrait
Researcher 1 reports

Matas Noreika

Databricks

Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.

Maximilian Welt portrait
Researcher 1 reports

Maximilian Welt

Cerebras Systems

Researcher at Cerebras Systems and coauthor of BTLM-3B-8K: 7B Performance in a 3B Model.

Md. Mofijul Islam portrait
Researcher 1 reports

Md. Mofijul Islam

LG AI Research

Researcher at LG AI Research and coauthor of EXAONE 3.5: Series of Language Models for Real-world Use Cases.

Mehrdad Farajtabar portrait
Researcher 1 reports

Mehrdad Farajtabar

Snowflake

Researcher at Snowflake and coauthor of Snowflake Arctic: An Enterprise LLM.

Mickel Liu portrait
Researcher 1 reports

Mickel Liu

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Mike Lewis portrait
Researcher 1 reports

Mike Lewis

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.

Milad Faghri portrait
Researcher 1 reports

Milad Faghri

Snowflake

Researcher at Snowflake and coauthor of Snowflake Arctic: An Enterprise LLM.

MingAn Lin portrait
Researcher 1 reports

MingAn Lin

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Mingkai Zhong portrait
Researcher 1 reports

Mingkai Zhong

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Monika Reszke portrait
Researcher 1 reports

Monika Reszke

Cerebras Systems

Researcher at Cerebras Systems and coauthor of Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Mor Geva portrait
Researcher 1 reports

Mor Geva

Technology Innovation Institute

Researcher at Technology Innovation Institute and coauthor of Falcon Mamba 7B: The First Competitive Attention-free 7B Language Model.

Mo Yu portrait
Researcher 1 reports

Mo Yu

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.

M. Salman Asif portrait
Researcher 1 reports

M. Salman Asif

Technology Innovation Institute

Researcher at Technology Innovation Institute and coauthor of Falcon Mamba 7B: The First Competitive Attention-free 7B Language Model.

Murali Krishna Ramachandran portrait
Researcher 1 reports

Murali Krishna Ramachandran

Databricks

Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.

Nianan Wang portrait
Researcher 1 reports

Nianan Wang

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Nicholas O. Hill portrait
Researcher 1 reports

Nicholas O. Hill

Cerebras Systems

Researcher at Cerebras Systems and coauthor of Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Nicolas Rajbhandari portrait
Researcher 1 reports

Nicolas Rajbhandari

Cerebras Systems

Researcher at Cerebras Systems and coauthor of BTLM-3B-8K: 7B Performance in a 3B Model.

Nuolan Nie portrait
Researcher 1 reports

Nuolan Nie

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Peidong Guo portrait
Researcher 1 reports

Peidong Guo

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Prasad Reddy Y portrait
Researcher 1 reports

Prasad Reddy Y

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.

Puzhao Zhang portrait
Researcher 1 reports

Puzhao Zhang

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Qingru Zhang portrait
Researcher 1 reports

Qingru Zhang

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Ranjan Saxena portrait
Researcher 1 reports

Ranjan Saxena

Databricks

Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.

R. J. Chen portrait
Researcher 2 reports

R. J. Chen

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Roger Jiang portrait
Researcher 1 reports

Roger Jiang

OpenAI

Official OpenAI materials list Roger Jiang among contributors to instruction-following work, ChatGPT, and GPT-4. These sources confirm contributions to OpenAI but do not establish current employment.

Ruiyang Sun portrait
Researcher 1 reports

Ruiyang Sun

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Ruizhe Pan portrait
Researcher 2 reports

Ruizhe Pan

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Ryan McCann portrait
Researcher 1 reports

Ryan McCann

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.

Samyam Rajbhandari portrait
Researcher 1 reports

Samyam Rajbhandari

Cerebras Systems

Researcher at Cerebras Systems and coauthor of Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Sankalp Sharma portrait
Researcher 1 reports

Sankalp Sharma

Databricks

Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.

Satish Narayanan portrait
Researcher 1 reports

Satish Narayanan

Cerebras Systems

Researcher at Cerebras Systems and coauthor of Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Sean Welleck portrait
Researcher 1 reports

Sean Welleck

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.

Sejal Shekar portrait
Researcher 1 reports

Sejal Shekar

Databricks

Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.

Shengyu Yao portrait
Researcher 1 reports

Shengyu Yao

01.AI

Researcher at 01.AI and coauthor of Yi: Open Foundation Models by 01.AI.

Shijie Zhou portrait
Researcher 1 reports

Shijie Zhou

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Shimin Li portrait
Researcher 1 reports

Shimin Li

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Shriya Chaturvedi portrait
Researcher 1 reports

Shriya Chaturvedi

Cerebras Systems

Researcher at Cerebras Systems and coauthor of Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Shuang Li (李爽) portrait
Researcher 1 reports

Shuang Li (李爽)

Z.ai

Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.

Shuang Zhou portrait
Researcher 2 reports

Shuang Zhou

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Shunyu Yao portrait
Researcher 1 reports

Shunyu Yao

Z.ai

Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.

Shuo Wang portrait
Researcher 1 reports

Shuo Wang

Snowflake

Researcher at Snowflake and coauthor of Snowflake Arctic: An Enterprise LLM.

Siddharth Karamcheti portrait
Researcher 1 reports

Siddharth Karamcheti

Databricks

Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.

Siming Chen portrait
Researcher 1 reports

Siming Chen

01.AI

Researcher at 01.AI and coauthor of Yi: Open Foundation Models by 01.AI.

Sneha Agarwal portrait
Researcher 1 reports

Sneha Agarwal

Databricks

Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.

Soji James portrait
Researcher 1 reports

Soji James

Cerebras Systems

Researcher at Cerebras Systems and coauthor of Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Somanshu Gupta portrait
Researcher 1 reports

Somanshu Gupta

Databricks

Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.

S. S. Li portrait
Researcher 2 reports

S. S. Li

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Sungwon Suh portrait
Researcher 1 reports

Sungwon Suh

Cerebras Systems

Researcher at Cerebras Systems and coauthor of Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

Tanwir Shahriar portrait
Researcher 1 reports

Tanwir Shahriar

Snowflake

Researcher at Snowflake and coauthor of Snowflake Arctic: An Enterprise LLM.

Tao Zhang portrait
Researcher 1 reports

Tao Zhang

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Tianhao Chen portrait
Researcher 1 reports

Tianhao Chen

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Tianhao Li portrait
Researcher 2 reports

Tianhao Li

Alibaba Qwen

Research scientist in Tongyi Lab whose official profile highlights post-training, multilingual large language models, and efficient reasoning.

Tianpeng Li portrait
Researcher 1 reports

Tianpeng Li

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Tianyu Li portrait
Researcher 1 reports

Tianyu Li

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

T. Y. Lin portrait
Researcher 1 reports

T. Y. Lin

Snowflake

Researcher at Snowflake and coauthor of Snowflake Arctic: An Enterprise LLM.

Urvashi Khandelwal portrait
Researcher 1 reports

Urvashi Khandelwal

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.

Vaibhav Patil portrait
Researcher 1 reports

Vaibhav Patil

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.

Varun Arora portrait
Researcher 1 reports

Varun Arora

Databricks

Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.

Vishakh Padmakumar portrait
Researcher 1 reports

Vishakh Padmakumar

Databricks

Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.

Viterbi Xiong portrait
Researcher 1 reports

Viterbi Xiong

Databricks

Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.

Wei Cheng portrait
Researcher 1 reports

Wei Cheng

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Wei Ge portrait
Researcher 1 reports

Wei Ge

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Weijie Jheng portrait
Researcher 1 reports

Weijie Jheng

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Weipeng Chen portrait
Researcher 1 reports

Weipeng Chen

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Weiyi Chen portrait
Researcher 1 reports

Weiyi Chen

Cerebras Systems

Researcher at Cerebras Systems and coauthor of Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster.

W. L. Xiao portrait
Researcher 2 reports

W. L. Xiao

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Xiang Li portrait
Researcher 1 reports

Xiang Li

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Xiang Li portrait
Researcher 1 reports

Xiang Li

Z.ai

Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.

Xiangrong Zeng portrait
Researcher 1 reports

Xiangrong Zeng

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Xiangyu Chen portrait
Researcher 1 reports

Xiangyu Chen

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Xianzu Wang portrait
Researcher 2 reports

Xianzu Wang

DeepSeek

DeepSeek contributor and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Xiaochuan Wang portrait
Researcher 1 reports

Xiaochuan Wang

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Xiaohan Wang portrait
Researcher 2 reports

Xiaohan Wang

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Xiaojin Shen portrait
Researcher 2 reports

Xiaojin Shen

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Xiaokang Chen portrait
Researcher 2 reports

Xiaokang Chen

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Xiaowei Zhou portrait
Researcher 1 reports

Xiaowei Zhou

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Xiaoxi Chen portrait
Researcher 1 reports

Xiaoxi Chen

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Xiaozhong Zhang portrait
Researcher 1 reports

Xiaozhong Zhang

01.AI

Researcher at 01.AI and coauthor of Yi: Open Foundation Models by 01.AI.

Xing Yao portrait
Researcher 1 reports

Xing Yao

Z.ai

Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.

Xin Jiang portrait
Researcher 1 reports

Xin Jiang

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Xin Men portrait
Researcher 1 reports

Xin Men

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Xinnan Song portrait
Researcher 2 reports

Xinnan Song

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Xinting Huang portrait
Researcher 1 reports

Xinting Huang

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Xinyi Lin portrait
Researcher 1 reports

Xinyi Lin

Z.ai

Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.

Xinyi Zhou portrait
Researcher 2 reports

Xinyi Zhou

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Xin Yu portrait
Researcher 1 reports

Xin Yu

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Xinyu Yang portrait
Researcher 2 reports

Xinyu Yang

DeepSeek

Co-author of the Nature paper on DeepSeek-R1; Nature and PubMed list Xinyu Yang with the DeepSeek-AI Team in Hangzhou, China.

X. Q. Li portrait
Researcher 2 reports

X. Q. Li

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Xuehai Pan portrait
Researcher 1 reports

Xuehai Pan

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Xuting Xu portrait
Researcher 1 reports

Xuting Xu

Z.ai

Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.

Yangbo Fan portrait
Researcher 1 reports

Yangbo Fan

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Yanjun Shen portrait
Researcher 1 reports

Yanjun Shen

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Yaohui Li portrait
Researcher 2 reports

Yaohui Li

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Yao Nie portrait
Researcher 1 reports

Yao Nie

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Yao Wan portrait
Researcher 1 reports

Yao Wan

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Yash Pathak portrait
Researcher 1 reports

Yash Pathak

Databricks

Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.

Yeqing Wang portrait
Researcher 1 reports

Yeqing Wang

Z.ai

Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.

Yichao Jiang portrait
Researcher 1 reports

Yichao Jiang

01.AI

Researcher at 01.AI and coauthor of Yi: Open Foundation Models by 01.AI.

Yiding Wang portrait
Researcher 1 reports

Yiding Wang

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Yifan Hu portrait
Researcher 1 reports

Yifan Hu

Z.ai

Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.

Yikang Shen portrait
Researcher 1 reports

Yikang Shen

Z.ai

Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.

Yinghao Zhao portrait
Researcher 1 reports

Yinghao Zhao

Snowflake

Researcher at Snowflake and coauthor of Snowflake Arctic: An Enterprise LLM.

Ying Tang portrait
Researcher 2 reports

Ying Tang

DeepSeek

Member of the DeepSeek-AI team in Hangzhou, China and co-author of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Yiyu Li portrait
Researcher 1 reports

Yiyu Li

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Yi Zheng portrait
Researcher 2 reports

Yi Zheng

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Yizhou Yan portrait
Researcher 1 reports

Yizhou Yan

Z.ai

Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.

Yonghao Zhuang portrait
Researcher 1 reports

Yonghao Zhuang

Snowflake

Researcher at Snowflake and coauthor of Snowflake Arctic: An Enterprise LLM.

Yongji Wang portrait
Researcher 2 reports

Yongji Wang

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Yongqiang Guo portrait
Researcher 2 reports

Yongqiang Guo

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Yong Zhou portrait
Researcher 1 reports

Yong Zhou

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Youngjae Yu portrait
Researcher 1 reports

Youngjae Yu

Salesforce AI Research

Researcher at Salesforce AI Research and coauthor of the XGen-7B Technical Report.

Young Wang portrait
Researcher 1 reports

Young Wang

01.AI

Researcher at 01.AI and coauthor of Yi: Open Foundation Models by 01.AI.

Youxin Jiang portrait
Researcher 1 reports

Youxin Jiang

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Yuchen Gao portrait
Researcher 1 reports

Yuchen Gao

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Yuchen Zhang portrait
Researcher 1 reports

Yuchen Zhang

Databricks

Researcher at Databricks and coauthor of DBRX: A Generalist Open Source LLM.

Yuchen Zhu portrait
Researcher 2 reports

Yuchen Zhu

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Yuchong Hu portrait
Researcher 1 reports

Yuchong Hu

Z.ai

Researcher at Z.ai and coauthor of GLM-5: Thinking, Coding, and Agentic Intelligence.

Yuduan Wang portrait
Researcher 2 reports

Yuduan Wang

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Yujie Lu portrait
Researcher 1 reports

Yujie Lu

Snowflake

Researcher at Snowflake and coauthor of Snowflake Arctic: An Enterprise LLM.

Yunxian Ma portrait
Researcher 2 reports

Yunxian Ma

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Yupeng Zhang portrait
Researcher 1 reports

Yupeng Zhang

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Yuting Yan portrait
Researcher 2 reports

Yuting Yan

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Yuxuan Liu portrait
Researcher 2 reports

Yuxuan Liu

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Yuxuan Ma portrait
Researcher 1 reports

Yuxuan Ma

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Zenglin Xu portrait
Researcher 1 reports

Zenglin Xu

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Zhe Feng portrait
Researcher 1 reports

Zhe Feng

Snowflake

Researcher at Snowflake and coauthor of Snowflake Arctic: An Enterprise LLM.

Zheng Cai portrait
Researcher 1 reports

Zheng Cai

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Zhen Huang portrait
Researcher 2 reports

Zhen Huang

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Zhenzhong Lan portrait
Researcher 1 reports

Zhenzhong Lan

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Zhipeng Xu portrait
Researcher 2 reports

Zhipeng Xu

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Zhiqiang Zhang portrait
Researcher 1 reports

Zhiqiang Zhang

Shanghai AI Laboratory

Researcher at Shanghai AI Laboratory and coauthor of the InternLM2 Technical Report.

Zhiying Wu portrait
Researcher 1 reports

Zhiying Wu

Baichuan

Researcher at Baichuan and coauthor of Baichuan 2: Open Large-scale Language Models.

Zhongyu Zhang portrait
Researcher 2 reports

Zhongyu Zhang

DeepSeek

Researcher at DeepSeek-AI in Hangzhou, China and co-author of the Nature paper introducing DeepSeek-R1.

Zhuoshu Li portrait
Researcher 2 reports

Zhuoshu Li

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Zihui Gu portrait
Researcher 2 reports

Zihui Gu

DeepSeek

Researcher affiliated with DeepSeek-AI and co-author of the Nature paper introducing DeepSeek-R1.

Ziwei Wan portrait
Researcher 1 reports

Ziwei Wan

01.AI

Researcher at 01.AI and coauthor of Yi: Open Foundation Models by 01.AI.

Zonghan Yang portrait
Researcher 1 reports

Zonghan Yang

01.AI

Researcher at 01.AI and coauthor of the Yi-Lightning Technical Report.

Florencia Leoni Aleman portrait
Researcher 1 reports

Florencia Leoni Aleman

OpenAI

Florencia Leoni Aleman is an OpenAI contributor listed on the GPT-4V contributions page.

Y. Q. Wang portrait
Researcher 1 reports

Y. Q. Wang

DeepSeek

Publicly identifiable information is limited; this name appears as an author on the DeepSeek-V3 technical report.

Molly Lin portrait
Researcher 1 reports

Molly Lin

OpenAI

Official OpenAI pages list Molly Lin among contributors to GPT-4, embedding-model updates, and rule-based-reward model-safety work.

Bo Liu portrait
Researcher 3 reports

Bo Liu

DeepSeek

Researcher affiliated with DeepSeek on Google Scholar and coauthor of the DeepSeek LLM and DeepSeek-V2 technical reports.

Chengqi Deng portrait
Researcher 3 reports

Chengqi Deng

DeepSeek

Research scientist at DeepSeek with public GitHub projects spanning language models and AI systems.

Huajian Xin portrait
Researcher 3 reports

Huajian Xin

DeepSeek

Research scientist at DeepSeek AI working on reinforcement learning from verifiable rewards, reasoning models, and AI agents.

Mingfeng Xue portrait
Researcher 3 reports

Mingfeng Xue

Alibaba Qwen

Research scientist at Alibaba's Tongyi Lab and a co-author of the Qwen3 and Qwen2.5 technical reports.

Qin Zhu portrait
Researcher 3 reports

Qin Zhu

Alibaba Qwen

Research scientist at Alibaba Group and a co-author of the Qwen3 and Qwen2.5 technical reports.

Hui-Ling Zhen portrait
Researcher 2 reports

Hui-Ling Zhen

NVIDIA

Public report authorship links Hui-Ling Zhen to the Nemotron-4 340B Technical Report at NVIDIA.

Kai Wu portrait
Researcher 2 reports

Kai Wu

NVIDIA

Public report authorship links Kai Wu to the Nemotron-4 340B Technical Report at NVIDIA.

Yu Qiao portrait
Researcher 2 reports

Yu Qiao

MiniMax

Yu Qiao is listed as an author of the MiniMax technical report MiniMax-01: Scaling Foundation Models with Lightning Attention.

Adam Roberts portrait
Researcher 2 reports

Adam Roberts

Google Gemini

Research scientist at Google Research whose public work spans language modeling and multimodal generation; arXiv author results include PaLM and PaLM 2.

Aixin Liu portrait
Researcher 2 reports

Aixin Liu

DeepSeek

Researcher affiliated with DeepSeek on Google Scholar and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Arman Cohan portrait
Researcher 2 reports

Arman Cohan

Ai2

Senior research scientist at Ai2 and associate professor at Yale working on long-form language understanding, information retrieval, and scientific NLP.

Avinatan Hassidim portrait
Researcher 2 reports

Avinatan Hassidim

Google Gemini

Professor of Computer Science at the Hebrew University of Jerusalem and Visiting Faculty Researcher at Google, with work spanning algorithms, algorithmic economics, and AI-related decision systems.

Baptiste Rozière portrait
Researcher 2 reports

Baptiste Rozière

Meta AI

Research scientist at Meta working on large language models and coding assistants. He previously worked on pre-training language models and neural machine translation and completed a PhD in natural language processing at Sorbonne University.

Bei Feng portrait
Researcher 2 reports

Bei Feng

DeepSeek

Research scientist at DeepSeek with a public Google Scholar profile covering large language models and reasoning research.

Christopher Berner portrait
Researcher 2 reports

Christopher Berner

OpenAI

Christopher Berner is an OpenAI contributor listed on the OpenAI GPT-4 contributions page.

Dawei Feng portrait
Researcher 2 reports

Dawei Feng

MiniMax

Co-founder and research scientist at MiniMax AI Research. He received a PhD from Tsinghua University and works on foundation models, reinforcement learning, and data systems, with publications at major machine learning and NLP venues.

Dongjie Ji portrait
Researcher 2 reports

Dongjie Ji

DeepSeek

Researcher affiliated with DeepSeek on Google Scholar and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Faisal Azhar portrait
Researcher 2 reports

Faisal Azhar

Meta AI

Faisal Azhar is a PhD candidate in computer science at Stanford University. His work focuses on multimodal systems that unify text, image, and speech, together with efficient training and inference for large-scale machine learning.

Hao Xu portrait
Researcher 2 reports

Hao Xu

Z.ai

Research scientist at Z.ai focused on multimodal understanding and generation, reinforcement learning, AI agents, and end-to-end models. He received a bachelor's degree from Tsinghua University and a master's degree from Peking University.

Hui Li portrait
Researcher 2 reports

Hui Li

DeepSeek

Machine learning researcher affiliated with DeepSeek on Google Scholar and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Jialiang Cai portrait
Researcher 2 reports

Jialiang Cai

DeepSeek

DeepSeek researcher and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Jian Liang portrait
Researcher 2 reports

Jian Liang

DeepSeek

Research scientist at DeepSeek with public scholarly work on large language models and efficient inference.

Jiaqi Ni portrait
Researcher 2 reports

Jiaqi Ni

DeepSeek

Researcher at DeepSeek AI and PhD in computer science at Peking University, focusing on large language models, multimodal large language models, and reinforcement learning.

Jingyang Yuan portrait
Researcher 2 reports

Jingyang Yuan

DeepSeek

Research scientist at DeepSeek with public GitHub work on language models and AI systems.

Kyle Lo portrait
Researcher 2 reports

Kyle Lo

Ai2

Senior research scientist at Ai2 and affiliate assistant professor at the University of Washington.

Lean Wang portrait
Researcher 2 reports

Lean Wang

DeepSeek

Research scientist at DeepSeek with public scholarly work on large language models and AI systems.

Lei Xu portrait
Researcher 2 reports

Lei Xu

DeepSeek

Research scientist at DeepSeek with public scholarly work on large language models and reasoning.

Lelio Renard Lavaud portrait
Researcher 2 reports

Lelio Renard Lavaud

Mistral AI

Research scientist at Mistral AI working on multimodal models and reinforcement learning. He previously worked as a machine learning engineer and research scientist at Hugging Face.

Liang Zhao portrait
Researcher 2 reports

Liang Zhao

DeepSeek

Researcher affiliated with DeepSeek on Google Scholar and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Marie-Anne Lachaux portrait
Researcher 2 reports

Marie-Anne Lachaux

Meta AI

Marie-Anne Lachaux is a research scientist at Meta whose public homepage highlights work on large language models, multilingual models, and speech.

Mark Chen portrait
Researcher 2 reports

Mark Chen

OpenAI

Chief Research Officer at OpenAI.

Melvin Johnson portrait
Researcher 2 reports

Melvin Johnson

Google Gemini

Senior Staff Research Scientist at Google DeepMind working on language modeling, speech recognition, machine translation, and multimodal understanding.

Meng Li portrait
Researcher 2 reports

Meng Li

DeepSeek

Research scientist at DeepSeek with public scholarly work on language models and AI systems.

Naman Goyal portrait
Researcher 2 reports

Naman Goyal

Meta AI

Research scientist at Meta whose work focuses on efficient large language models and multilingual NLP.

Nicholas Ruas portrait
Researcher 2 reports

Nicholas Ruas

Ai2

Machine learning engineer at Ai2 whose public work focuses on open language models, post-training, and evaluation.

Pamela Mishkin portrait
Researcher 2 reports

Pamela Mishkin

OpenAI

Research scientist at OpenAI focused on multimodal models.

Peng Zhang portrait
Researcher 2 reports

Peng Zhang

DeepSeek

Research scientist at DeepSeek with public scholarly work on large language models and AI systems.

Pete Walsh portrait
Researcher 2 reports

Pete Walsh

Ai2

Research scientist at Ai2 working on language model pre-training, adaptation, and alignment.

Rishabh Kabra portrait
Researcher 2 reports

Rishabh Kabra

Google Gemini

Rishabh Kabra is a research scientist at Google DeepMind. His public homepage highlights work on machine learning systems and large-scale language model research.

Ruilin Jin portrait
Researcher 2 reports

Ruilin Jin

DeepSeek

DeepSeek researcher and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Shengfeng Ye portrait
Researcher 2 reports

Shengfeng Ye

DeepSeek

Researcher at DeepSeek interested in scalable and efficient learning, reinforcement learning, reasoning, and large-scale distributed training and inference.

Tianyu Sun portrait
Researcher 2 reports

Tianyu Sun

DeepSeek

LLM researcher at DeepSeek focused on alignment, post-training, and reasoning.

Wei An portrait
Researcher 2 reports

Wei An

DeepSeek

Researcher affiliated with DeepSeek AI on Google Scholar and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Xiangyue Jin portrait
Researcher 2 reports

Xiangyue Jin

DeepSeek

Member of Technical Staff at DeepSeek.

Xiaosha Chen portrait
Researcher 2 reports

Xiaosha Chen

DeepSeek

Member of Technical Staff at DeepSeek.

Xiaowen Sun portrait
Researcher 2 reports

Xiaowen Sun

DeepSeek

Researcher affiliated with DeepSeek-AI on Google Scholar and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Yacine Jernite portrait
Researcher 2 reports

Yacine Jernite

Google Gemini

Staff research scientist at Google DeepMind in Cambridge, Massachusetts. His public profile says he works on translating human values and institutional constraints into tests and safeguards for foundation models.

Yi Tay portrait
Researcher 2 reports

Yi Tay

Google Gemini

Member of Technical Staff at Google DeepMind working on language and multimodal models.

Yixuan Tan portrait
Researcher 2 reports

Yixuan Tan

DeepSeek

Master's student at Peking University and research intern at OpenSeek and DeepSeek, with interests in large language models and natural language processing.

Yu Han portrait
Researcher 2 reports

Yu Han

Alibaba Qwen

Researcher affiliated with Alibaba Group on Google Scholar and coauthor of the Qwen technical report.

Yuxiang Wei portrait
Researcher 2 reports

Yuxiang Wei

DeepSeek

DeepSeek researcher and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Yuxiang Zhu portrait
Researcher 2 reports

Yuxiang Zhu

DeepSeek

DeepSeek researcher and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Zhenyang Wu portrait
Researcher 2 reports

Zhenyang Wu

Z.ai

Research scientist at Z.ai with research interests in multimodal understanding and generation, large language models, and reinforcement learning. He received a bachelor's degree from the University of Science and Technology of China and a master's degree from Tsinghua University.

Zilin Li portrait
Researcher 2 reports

Zilin Li

DeepSeek

Researcher affiliated with DeepSeek on Google Scholar and coauthor of the DeepSeek-V2 and DeepSeek-V3 technical reports.

Zoubin Ghahramani portrait
Researcher 2 reports

Zoubin Ghahramani

Google Gemini

VP of Research at Google DeepMind and Professor of Information Engineering at the University of Cambridge, known for work in probabilistic machine learning and Bayesian statistics.

Aaron Johnson portrait
Researcher 1 reports

Aaron Johnson

Cohere

Aaron Johnson is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Abhinav Pandey portrait
Researcher 1 reports

Abhinav Pandey

Meta AI

Abhinav Pandey is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Adam Lerer portrait
Researcher 1 reports

Adam Lerer

NVIDIA

Adam Lerer is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Adi Sinha portrait
Researcher 1 reports

Adi Sinha

NVIDIA

Public report authorship links Adi Sinha to the Nemotron-4 340B Technical Report at NVIDIA.

Adrianna Muras portrait
Researcher 1 reports

Adrianna Muras

Cohere

Adrianna Muras is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Ahu Nguyen portrait
Researcher 1 reports

Ahu Nguyen

Cohere

Ahu Nguyen is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Akshat Shrivastava portrait
Researcher 1 reports

Akshat Shrivastava

Amazon

Akshat Shrivastava is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Akshay Kamath portrait
Researcher 1 reports

Akshay Kamath

NVIDIA

Public report authorship links Akshay Kamath to the Nemotron-4 340B Technical Report at NVIDIA.

Akshay Narayan Kini portrait
Researcher 1 reports

Akshay Narayan Kini

NVIDIA

Public report authorship links Akshay Narayan Kini to the Nemotron-4 340B Technical Report at NVIDIA.

Alan Chung portrait
Researcher 1 reports

Alan Chung

Google Gemini

Alan Chung is listed as an author of the Google technical report PaLM 2 Technical Report.

Alexey Pugachev portrait
Researcher 1 reports

Alexey Pugachev

NVIDIA

Alexey Pugachev is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Alexis Potapczuk portrait
Researcher 1 reports

Alexis Potapczuk

Cohere

Alexis Potapczuk is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Allison LaPrell portrait
Researcher 1 reports

Allison LaPrell

Amazon

Allison LaPrell is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Alya Gafni portrait
Researcher 1 reports

Alya Gafni

NVIDIA

Public report authorship links Alya Gafni to the Nemotron-4 340B Technical Report at NVIDIA.

Amanda Zhou portrait
Researcher 1 reports

Amanda Zhou

Google Gemini

Amanda Zhou is listed as an author of the Google technical report PaLM 2 Technical Report.

Aman Gupta portrait
Researcher 1 reports

Aman Gupta

Cohere

Aman Gupta is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Amy Yang portrait
Researcher 1 reports

Amy Yang

Meta AI

Amy Yang is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Ana Constant portrait
Researcher 1 reports

Ana Constant

Google Gemini

Ana Constant is listed as an author of the Google technical report PaLM 2 Technical Report.

Andrew Austin portrait
Researcher 1 reports

Andrew Austin

Google Gemini

Andrew Austin is listed as an author of the Google technical report PaLM 2 Technical Report.

Andrew Drozdov portrait
Researcher 1 reports

Andrew Drozdov

Ai2

Public report authorship links Andrew Drozdov to the Ai2 report Open Instruct: A Simple Method for Aligning Language Models with Human Preferences.

Andrew M. Dai portrait
Researcher 1 reports

Andrew M. Dai

Cohere

Andrew M. Dai is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Andrew Sohn portrait
Researcher 1 reports

Andrew Sohn

NVIDIA

Andrew Sohn is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Andy Pillai portrait
Researcher 1 reports

Andy Pillai

Google Gemini

Andy Pillai is listed as an author of the Google technical report PaLM 2 Technical Report.

Angelica Chen portrait
Researcher 1 reports

Angelica Chen

Amazon

Angelica Chen is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Anirban Ghosh portrait
Researcher 1 reports

Anirban Ghosh

Meta AI

Anirban Ghosh is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Ankush Goyal portrait
Researcher 1 reports

Ankush Goyal

Meta AI

Ankush Goyal is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Anna Blusys portrait
Researcher 1 reports

Anna Blusys

Cohere

Anna Blusys is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Anna Shlyakhter portrait
Researcher 1 reports

Anna Shlyakhter

NVIDIA

Anna Shlyakhter is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Anuj Sharma portrait
Researcher 1 reports

Anuj Sharma

Meta AI

Anuj Sharma is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Anupama Mahapatra portrait
Researcher 1 reports

Anupama Mahapatra

Amazon

Anupama Mahapatra is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Ashish Mitra portrait
Researcher 1 reports

Ashish Mitra

Meta AI

Ashish Mitra is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Ashwin Kumar portrait
Researcher 1 reports

Ashwin Kumar

NVIDIA

Public report authorship links Ashwin Kumar to the Nemotron-4 340B Technical Report at NVIDIA.

Aswath Kumar portrait
Researcher 1 reports

Aswath Kumar

NVIDIA

Public report authorship links Aswath Kumar to the Nemotron-4 340B Technical Report at NVIDIA.

Bahareh H. Hajishirzi portrait
Researcher 1 reports

Bahareh H. Hajishirzi

Cohere

Bahareh H. Hajishirzi is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Baojun Wang portrait
Researcher 1 reports

Baojun Wang

Z.ai

Public report authorship links Baojun Wang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Bartosz Kozdron portrait
Researcher 1 reports

Bartosz Kozdron

NVIDIA

Bartosz Kozdron is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Bartosz Ludwiczuk portrait
Researcher 1 reports

Bartosz Ludwiczuk

Cohere

Bartosz Ludwiczuk is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Bayan Bruss portrait
Researcher 1 reports

Bayan Bruss

Cohere

Bayan Bruss is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Benoit Lalloz portrait
Researcher 1 reports

Benoit Lalloz

Cohere

Benoit Lalloz is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Bhagyashree Agrawal portrait
Researcher 1 reports

Bhagyashree Agrawal

NVIDIA

Public report authorship links Bhagyashree Agrawal to the Nemotron-4 340B Technical Report at NVIDIA.

Bharath Ramavarapu portrait
Researcher 1 reports

Bharath Ramavarapu

NVIDIA

Public report authorship links Bharath Ramavarapu to the Nemotron-4 340B Technical Report at NVIDIA.

Bhargav Vaidya portrait
Researcher 1 reports

Bhargav Vaidya

NVIDIA

Bhargav Vaidya is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Biao Gong portrait
Researcher 1 reports

Biao Gong

Z.ai

Public report authorship links Biao Gong to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Biao Gong portrait
Researcher 1 reports

Biao Gong

Amazon

Biao Gong is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Bikash Koley portrait
Researcher 1 reports

Bikash Koley

NVIDIA

Bikash Koley is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Bingchen Feng portrait
Researcher 1 reports

Bingchen Feng

Cohere

Bingchen Feng is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Błażej Miasojedow portrait
Researcher 1 reports

Błażej Miasojedow

Google Gemini

Błażej Miasojedow is listed as an author of the Google technical report Gemma: Open Models Based on Gemini Research and Technology.

Bobby Nanduri portrait
Researcher 1 reports

Bobby Nanduri

Amazon

Bobby Nanduri is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Bofei Gao portrait
Researcher 1 reports

Bofei Gao

Z.ai

Public report authorship links Bofei Gao to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Brandon McKinzie portrait
Researcher 1 reports

Brandon McKinzie

Google Gemini

Brandon McKinzie is listed as an author of the Google technical report PaLM 2 Technical Report.

Buro Sim portrait
Researcher 1 reports

Buro Sim

Meta AI

Buro Sim is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Can Xu portrait
Researcher 1 reports

Can Xu

Z.ai

Public report authorship links Can Xu to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Cesi Chen portrait
Researcher 1 reports

Cesi Chen

Cohere

Cesi Chen is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Chenguang Wang portrait
Researcher 1 reports

Chenguang Wang

Amazon

Chenguang Wang is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Cheng Xing portrait
Researcher 1 reports

Cheng Xing

Cohere

Cheng Xing is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Chen Xing portrait
Researcher 1 reports

Chen Xing

Meta AI

Chen Xing is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Chong Wang portrait
Researcher 1 reports

Chong Wang

Google Gemini

Chong Wang is listed as an author of the Google technical report PaLM 2 Technical Report.

Christian R. A. Azevedo portrait
Researcher 1 reports

Christian R. A. Azevedo

Google Gemini

Christian R. A. Azevedo is listed as an author of the Google technical report PaLM 2 Technical Report.

Christian Wolf portrait
Researcher 1 reports

Christian Wolf

Mistral AI

Christian Wolf is listed as an author of the Mistral AI technical report Mistral Small 3.1 Technical Report.

Danielle Epstein portrait
Researcher 1 reports

Danielle Epstein

Google Gemini

Public report authorship links Danielle Epstein to the Google report PaLM: Scaling Language Modeling with Pathways.

Daniel M. Ziegler portrait
Researcher 1 reports

Daniel M. Ziegler

OpenAI

Daniel M. Ziegler is listed as an author of the OpenAI technical report Language Models are Few-Shot Learners.

Danqi Zhang portrait
Researcher 1 reports

Danqi Zhang

Cohere

Danqi Zhang is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Darren Kong portrait
Researcher 1 reports

Darren Kong

Cohere

Darren Kong is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

David Graham portrait
Researcher 1 reports

David Graham

Ai2

Public sources identify David Graham as a member of Ai2's team and a coauthor of OLMo 3.

David Gritz portrait
Researcher 1 reports

David Gritz

Cohere

David Gritz is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

David Li portrait
Researcher 1 reports

David Li

Google Gemini

David Li is listed as an author of the Google technical report PaLM 2 Technical Report.

David Yu portrait
Researcher 1 reports

David Yu

Cohere

David Yu is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

De-Chang Chen portrait
Researcher 1 reports

De-Chang Chen

Google Gemini

De-Chang Chen is listed as an author of the Google technical report PaLM 2 Technical Report.

Deli Zhao portrait
Researcher 1 reports

Deli Zhao

Z.ai

Public report authorship links Deli Zhao to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Denny Zhou portrait
Researcher 1 reports

Denny Zhou

Amazon

Denny Zhou is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Dimitris Papailiopoulos portrait
Researcher 1 reports

Dimitris Papailiopoulos

NVIDIA

Dimitris Papailiopoulos is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Dirk van Strien portrait
Researcher 1 reports

Dirk van Strien

Ai2

Dirk van Strien is listed as an author of the Ai2 technical report OLMo: Accelerating the Science of Language Models.

Elsie Wang portrait
Researcher 1 reports

Elsie Wang

Google Gemini

Elsie Wang is listed as an author of the Google technical report PaLM 2 Technical Report.

Eric Huo portrait
Researcher 1 reports

Eric Huo

Cohere

Eric Huo is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Esteban Real portrait
Researcher 1 reports

Esteban Real

Amazon

Esteban Real is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Ezekiel Wong portrait
Researcher 1 reports

Ezekiel Wong

Cohere

Ezekiel Wong is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Fang Liu portrait
Researcher 1 reports

Fang Liu

Cohere

Fang Liu is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Florian Bressand portrait
Researcher 1 reports

Florian Bressand

Mistral AI

Florian Bressand is listed as an author of the Mistral AI technical report Mistral 7B.

Gabriel Georgiou portrait
Researcher 1 reports

Gabriel Georgiou

Meta AI

Gabriel Georgiou is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Gabriel Gladstone portrait
Researcher 1 reports

Gabriel Gladstone

Cohere

Gabriel Gladstone is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Gang Li portrait
Researcher 1 reports

Gang Li

Google Gemini

Gang Li is listed as an author of the Google technical report PaLM 2 Technical Report.

George Thomas portrait
Researcher 1 reports

George Thomas

Cohere

George Thomas is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Geunwoo Kim portrait
Researcher 1 reports

Geunwoo Kim

Cohere

Geunwoo Kim is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Guanyu Feng portrait
Researcher 1 reports

Guanyu Feng

Z.ai

Public report authorship links Guanyu Feng to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Guillem Cieslak portrait
Researcher 1 reports

Guillem Cieslak

Meta AI

Guillem Cieslak is listed as an author of the Meta AI technical report Llama 2: Open Foundation and Fine-Tuned Chat Models.

Hai Hu portrait
Researcher 1 reports

Hai Hu

Cohere

Hai Hu is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Hanjie Chen portrait
Researcher 1 reports

Hanjie Chen

Ai2

Hanjie Chen is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.

Hanlin Zhang portrait
Researcher 1 reports

Hanlin Zhang

Z.ai

Public report authorship links Hanlin Zhang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Han Wang portrait
Researcher 1 reports

Han Wang

Cohere

Han Wang is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Han Wei portrait
Researcher 1 reports

Han Wei

MiniMax

Han Wei is listed as an author of the MiniMax technical report MiniMax-01: Scaling Foundation Models with Lightning Attention.

Han Xu portrait
Researcher 1 reports

Han Xu

Z.ai

Public report authorship links Han Xu to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Hao Peng portrait
Researcher 1 reports

Hao Peng

Cohere

Hao Peng is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Haotian Tang portrait
Researcher 1 reports

Haotian Tang

NVIDIA

Public report authorship links Haotian Tang to the Nemotron-4 340B Technical Report at NVIDIA.

Hariharan Subramoni portrait
Researcher 1 reports

Hariharan Subramoni

NVIDIA

Hariharan Subramoni is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Haroon Khan portrait
Researcher 1 reports

Haroon Khan

Google Gemini

Haroon Khan is listed as an author of the Google technical report PaLM 2 Technical Report.

Harshita Diddee portrait
Researcher 1 reports

Harshita Diddee

Cohere

Harshita Diddee is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Heiga Zen portrait
Researcher 1 reports

Heiga Zen

Google Gemini

Heiga Zen is listed as an author of the Google technical report PaLM 2 Technical Report.

Hongbo Zhang portrait
Researcher 1 reports

Hongbo Zhang

Z.ai

Public report authorship links Hongbo Zhang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Hongwei Cao portrait
Researcher 1 reports

Hongwei Cao

Z.ai

Public report authorship links Hongwei Cao to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Hongxing He portrait
Researcher 1 reports

Hongxing He

Z.ai

Public report authorship links Hongxing He to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Hongyang Li portrait
Researcher 1 reports

Hongyang Li

MiniMax

Hongyang Li is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.

Howie Liu portrait
Researcher 1 reports

Howie Liu

Google Gemini

Howie Liu is listed as an author of the Google technical report PaLM 2 Technical Report.

Hugo Javaux portrait
Researcher 1 reports

Hugo Javaux

Mistral AI

Hugo Javaux is listed as an author of the Mistral AI technical report Mistral Small 3.1 Technical Report.

Hung-Cheng Chen portrait
Researcher 1 reports

Hung-Cheng Chen

Google Gemini

Hung-Cheng Chen is listed as an author of the Google technical report PaLM 2 Technical Report.

Hu Zhang portrait
Researcher 1 reports

Hu Zhang

Amazon

Hu Zhang is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Hyunewoo Cho portrait
Researcher 1 reports

Hyunewoo Cho

Ai2

Hyunewoo Cho is listed as an author of the Ai2 technical report OLMo: Accelerating the Science of Language Models.

Imene Kerboua portrait
Researcher 1 reports

Imene Kerboua

Cohere

Imene Kerboua is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Irwan Bello portrait
Researcher 1 reports

Irwan Bello

OpenAI

Irwan Bello is listed as an author of the OpenAI technical report GPT-4 Technical Report.

Jack Rae portrait
Researcher 1 reports

Jack Rae

OpenAI

Jack Rae is listed as an author of the OpenAI technical report GPT-4 Technical Report.

Jackson Wu portrait
Researcher 1 reports

Jackson Wu

Cohere

Jackson Wu is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Jacky Peng portrait
Researcher 1 reports

Jacky Peng

NVIDIA

Public report authorship links Jacky Peng to the Nemotron-4 340B Technical Report at NVIDIA.

Jacqueline Hardin portrait
Researcher 1 reports

Jacqueline Hardin

Google Gemini

Jacqueline Hardin is listed as an author of the Google technical report PaLM 2 Technical Report.

Jane Qian portrait
Researcher 1 reports

Jane Qian

Google Gemini

Jane Qian is listed as an author of the Google technical report PaLM 2 Technical Report.

Jarett Gray portrait
Researcher 1 reports

Jarett Gray

Amazon

Jarett Gray is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Jasdeep Singh Pannu portrait
Researcher 1 reports

Jasdeep Singh Pannu

NVIDIA

Jasdeep Singh Pannu is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Jasjeet Sekhon portrait
Researcher 1 reports

Jasjeet Sekhon

Cohere

Jasjeet Sekhon is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Jason Li portrait
Researcher 1 reports

Jason Li

Cohere

Jason Li is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Jason Wei portrait
Researcher 1 reports

Jason Wei

Amazon

Jason Wei is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Jason Wu portrait
Researcher 1 reports

Jason Wu

Google Gemini

Jason Wu is listed as an author of the Google technical report PaLM 2 Technical Report.

Jason Wu portrait
Researcher 1 reports

Jason Wu

Ai2

Public report authorship links Jason Wu to the Ai2 report Open Instruct: A Simple Method for Aligning Language Models with Human Preferences.

Jeremy Dwivedi-Yu portrait
Researcher 1 reports

Jeremy Dwivedi-Yu

Ai2

Jeremy Dwivedi-Yu is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.

Jerry Wang portrait
Researcher 1 reports

Jerry Wang

Cohere

Jerry Wang is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Jiahao Zhou portrait
Researcher 1 reports

Jiahao Zhou

Z.ai

Public report authorship links Jiahao Zhou to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Jiahui Li portrait
Researcher 1 reports

Jiahui Li

Cohere

Jiahui Li is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Jianguo Zhang portrait
Researcher 1 reports

Jianguo Zhang

Z.ai

Public report authorship links Jianguo Zhang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Jianli Zhang portrait
Researcher 1 reports

Jianli Zhang

Z.ai

Public report authorship links Jianli Zhang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Jianshu Chen portrait
Researcher 1 reports

Jianshu Chen

Amazon

Jianshu Chen is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Jianshu Chen portrait
Researcher 1 reports

Jianshu Chen

Cohere

Jianshu Chen is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Jian Xie portrait
Researcher 1 reports

Jian Xie

Baichuan

Jian Xie is listed as an author of the Baichuan technical report Baichuan 2: Open Large-scale Language Models.

Jianyu Wang portrait
Researcher 1 reports

Jianyu Wang

NVIDIA

Jianyu Wang is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Jia Song portrait
Researcher 1 reports

Jia Song

Cohere

Jia Song is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Jiaxiang Guo portrait
Researcher 1 reports

Jiaxiang Guo

Google Gemini

Public report authorship links Jiaxiang Guo to the Google report RecurrentGemma: Moving Past Transformers for Efficient Open Language Models.

Jie Chen portrait
Researcher 1 reports

Jie Chen

MiniMax

Jie Chen is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.

Jiefeng Cheng portrait
Researcher 1 reports

Jiefeng Cheng

Z.ai

Public report authorship links Jiefeng Cheng to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Jie Fu portrait
Researcher 1 reports

Jie Fu

Z.ai

Public report authorship links Jie Fu to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Ji Ma portrait
Researcher 1 reports

Ji Ma

Amazon

Ji Ma is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Ji Ma portrait
Researcher 1 reports

Ji Ma

Google Gemini

Ji Ma is listed as an author of the Google technical report PaLM 2 Technical Report.

Jimmy Lee portrait
Researcher 1 reports

Jimmy Lee

Google Gemini

Public report authorship links Jimmy Lee to the Google report Gemma 2: Improving Open Language Models at a Practical Size.

Jincheng Han portrait
Researcher 1 reports

Jincheng Han

MiniMax

Jincheng Han is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.

Jingkuang Wang portrait
Researcher 1 reports

Jingkuang Wang

Z.ai

Public report authorship links Jingkuang Wang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Jingwen Leng portrait
Researcher 1 reports

Jingwen Leng

Z.ai

Public report authorship links Jingwen Leng to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Jingyu Hua portrait
Researcher 1 reports

Jingyu Hua

NVIDIA

Public report authorship links Jingyu Hua to the Nemotron-4 340B Technical Report at NVIDIA.

Jingyun Jiang portrait
Researcher 1 reports

Jingyun Jiang

MiniMax

Jingyun Jiang is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.

Jinmian Zhang portrait
Researcher 1 reports

Jinmian Zhang

MiniMax

Jinmian Zhang is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.

Jiuyang Tang portrait
Researcher 1 reports

Jiuyang Tang

Z.ai

Public report authorship links Jiuyang Tang to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Jiwei Li portrait
Researcher 1 reports

Jiwei Li

Z.ai

Public report authorship links Jiwei Li to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Ji Xu portrait
Researcher 1 reports

Ji Xu

Google Gemini

Ji Xu is listed as an author of the Google technical report PaLM 2 Technical Report.

Joe Hall portrait
Researcher 1 reports

Joe Hall

Meta AI

Joe Hall is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Joel Ruben Antony portrait
Researcher 1 reports

Joel Ruben Antony

Google Gemini

Joel Ruben Antony is listed as an author of the Google technical report PaLM 2 Technical Report.

John Gagneur portrait
Researcher 1 reports

John Gagneur

Cohere

John Gagneur is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

John Lee portrait
Researcher 1 reports

John Lee

Meta AI

John Lee is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Jonathan Branson portrait
Researcher 1 reports

Jonathan Branson

Cohere

Jonathan Branson is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Jonathan Lee portrait
Researcher 1 reports

Jonathan Lee

Meta AI

Jonathan Lee is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Josh Gorlin portrait
Researcher 1 reports

Josh Gorlin

Google Gemini

Josh Gorlin is listed as an author of the Google technical report PaLM 2 Technical Report.

Joshua Stoddard portrait
Researcher 1 reports

Joshua Stoddard

Amazon

Joshua Stoddard is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Julien Mouchere portrait
Researcher 1 reports

Julien Mouchere

Mistral AI

Julien Mouchere is listed as an author of the Mistral AI technical report Mistral Small 3.1 Technical Report.

Junlin Han portrait
Researcher 1 reports

Junlin Han

Z.ai

Public report authorship links Junlin Han to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Jun Yin portrait
Researcher 1 reports

Jun Yin

Cohere

Jun Yin is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Jun Zhang portrait
Researcher 1 reports

Jun Zhang

Z.ai

Public report authorship links Jun Zhang to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Junzheng Wang portrait
Researcher 1 reports

Junzheng Wang

Google Gemini

Junzheng Wang is listed as an author of the Google technical report PaLM 2 Technical Report.

Kailash Karthikeyan portrait
Researcher 1 reports

Kailash Karthikeyan

NVIDIA

Public report authorship links Kailash Karthikeyan to the Nemotron-4 340B Technical Report at NVIDIA.

Kaiyan Li portrait
Researcher 1 reports

Kaiyan Li

Z.ai

Public report authorship links Kaiyan Li to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Kaiyou Yang portrait
Researcher 1 reports

Kaiyou Yang

Z.ai

Public report authorship links Kaiyou Yang to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Kangning Liu portrait
Researcher 1 reports

Kangning Liu

MiniMax

Kangning Liu is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.

Karan Bhatia portrait
Researcher 1 reports

Karan Bhatia

Cohere

Karan Bhatia is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Karthik Ramavarmaraju portrait
Researcher 1 reports

Karthik Ramavarmaraju

NVIDIA

Public report authorship links Karthik Ramavarmaraju to the Nemotron-4 340B Technical Report at NVIDIA.

Katherine Hou portrait
Researcher 1 reports

Katherine Hou

Google Gemini

Katherine Hou is listed as an author of the Google technical report PaLM 2 Technical Report.

Katie Mayer portrait
Researcher 1 reports

Katie Mayer

OpenAI

Katie Mayer is listed as an author of the OpenAI technical report GPT-4 Technical Report.

Kaushal K. Kishore portrait
Researcher 1 reports

Kaushal K. Kishore

Google Gemini

Kaushal K. Kishore is listed as an author of the Google technical report PaLM 2 Technical Report.

Kaushik Roy portrait
Researcher 1 reports

Kaushik Roy

Google Gemini

Public report authorship links Kaushik Roy to the Google technical report Gemma 2: Improving Open Language Models at a Practical Size.

Kaushik Sivakumar portrait
Researcher 1 reports

Kaushik Sivakumar

Google Gemini

Kaushik Sivakumar is listed as an author of the Google technical report PaLM 2 Technical Report.

Ke Sun portrait
Researcher 1 reports

Ke Sun

Z.ai

Ke Sun is listed as an author of the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Kevin Lin portrait
Researcher 1 reports

Kevin Lin

Cohere

Kevin Lin is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Khaled Saeed portrait
Researcher 1 reports

Khaled Saeed

Ai2

Public report authorship links Khaled Saeed to the Ai2 technical report OLMoE: Open Mixture-of-Experts Language Models.

Khe Chai Sim portrait
Researcher 1 reports

Khe Chai Sim

Google Gemini

Khe Chai Sim is listed as an author of the Google technical report PaLM 2 Technical Report.

Kris Cao portrait
Researcher 1 reports

Kris Cao

Google Gemini

Kris Cao is listed as an author of the Google technical report PaLM 2 Technical Report.

Kun Li portrait
Researcher 1 reports

Kun Li

Cohere

Kun Li is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Lakshminarasimhan Sundararaman portrait
Researcher 1 reports

Lakshminarasimhan Sundararaman

NVIDIA

Public report authorship links Lakshminarasimhan Sundararaman to the Nemotron-4 340B Technical Report at NVIDIA.

Laura Martin portrait
Researcher 1 reports

Laura Martin

Cohere

Laura Martin is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Le Hou portrait
Researcher 1 reports

Le Hou

Meta AI

Le Hou is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Leyang Cui portrait
Researcher 1 reports

Leyang Cui

Cohere

Leyang Cui is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Lia Lee portrait
Researcher 1 reports

Lia Lee

Cohere

Lia Lee is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Liam Dean portrait
Researcher 1 reports

Liam Dean

Google Gemini

Liam Dean is listed as an author of the Google technical report PaLM 2 Technical Report.

Lianghao Deng portrait
Researcher 1 reports

Lianghao Deng

Alibaba Qwen

Lianghao Deng is listed as an author of the Qwen technical report Qwen3 Technical Report.

Lilian Lee portrait
Researcher 1 reports

Lilian Lee

Google Gemini

Lilian Lee is listed as an author of the Google technical report PaLM 2 Technical Report.

Lokesh Ramamoorthy portrait
Researcher 1 reports

Lokesh Ramamoorthy

NVIDIA

Public report authorship links Lokesh Ramamoorthy to the Nemotron-4 340B Technical Report at NVIDIA.

Longjie Ye portrait
Researcher 1 reports

Longjie Ye

Z.ai

Public report authorship links Longjie Ye to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Madhura Dhuri portrait
Researcher 1 reports

Madhura Dhuri

NVIDIA

Public report authorship links Madhura Dhuri to the Nemotron-4 340B Technical Report at NVIDIA.

Mahyar Babaei portrait
Researcher 1 reports

Mahyar Babaei

Cohere

Mahyar Babaei is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Manav Mathur portrait
Researcher 1 reports

Manav Mathur

Meta AI

Manav Mathur is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Manoj Kumar portrait
Researcher 1 reports

Manoj Kumar

Google Gemini

Public report authorship links Manoj Kumar to the Google technical report RecurrentGemma: Moving Past Transformers for Efficient Open Language Models.

Martin Connor portrait
Researcher 1 reports

Martin Connor

Google Gemini

Martin Connor is listed as an author of the Google technical report PaLM 2 Technical Report.

Marvin Polius portrait
Researcher 1 reports

Marvin Polius

Google Gemini

Marvin Polius is listed as an author of the Google technical report PaLM 2 Technical Report.

Matt Jordan portrait
Researcher 1 reports

Matt Jordan

Amazon

Matt Jordan is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Matt Wood portrait
Researcher 1 reports

Matt Wood

Amazon

Matt Wood is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Maxim Bluvshtein portrait
Researcher 1 reports

Maxim Bluvshtein

NVIDIA

Maxim Bluvshtein is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Maxim Krikun portrait
Researcher 1 reports

Maxim Krikun

Amazon

Maxim Krikun is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Maxwell Pirtle portrait
Researcher 1 reports

Maxwell Pirtle

Amazon

Maxwell Pirtle is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Maxwell Roberts portrait
Researcher 1 reports

Maxwell Roberts

Ai2

Maxwell Roberts is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.

Mayank Bhargava portrait
Researcher 1 reports

Mayank Bhargava

NVIDIA

Public report authorship links Mayank Bhargava to the Nemotron-4 340B Technical Report at NVIDIA.

Meriem Belkada portrait
Researcher 1 reports

Meriem Belkada

Amazon

Meriem Belkada is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Michael Chen portrait
Researcher 1 reports

Michael Chen

Cohere

Michael Chen is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Michael Flaherty portrait
Researcher 1 reports

Michael Flaherty

NVIDIA

Public report authorship links Michael Flaherty to the Nemotron-4 15B Technical Report at NVIDIA.

Michael Tan portrait
Researcher 1 reports

Michael Tan

Meta AI

Michael Tan is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Michael Tufano portrait
Researcher 1 reports

Michael Tufano

Ai2

Michael Tufano is listed as an author of the Ai2 technical report OLMo: Accelerating the Science of Language Models.

Mike Shumate portrait
Researcher 1 reports

Mike Shumate

Amazon

Mike Shumate is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Mikhail Miller portrait
Researcher 1 reports

Mikhail Miller

Amazon

Mikhail Miller is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Mikhail Ponomarev portrait
Researcher 1 reports

Mikhail Ponomarev

Amazon

Mikhail Ponomarev is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Mingfeng Wang portrait
Researcher 1 reports

Mingfeng Wang

Amazon

Mingfeng Wang is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Mingjie Sun portrait
Researcher 1 reports

Mingjie Sun

Amazon

Mingjie Sun is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Mingmei Zhang portrait
Researcher 1 reports

Mingmei Zhang

Z.ai

Public report authorship links Mingmei Zhang to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Mingxing Zhang portrait
Researcher 1 reports

Mingxing Zhang

Amazon

Mingxing Zhang is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Mingyang Wang portrait
Researcher 1 reports

Mingyang Wang

Z.ai

Public report authorship links Mingyang Wang to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Mingyang Wu portrait
Researcher 1 reports

Mingyang Wu

Z.ai

Public report authorship links Mingyang Wu to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Ming Yin portrait
Researcher 1 reports

Ming Yin

Ai2

Ming Yin is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.

Mitesh Bharadwaj portrait
Researcher 1 reports

Mitesh Bharadwaj

NVIDIA

Mitesh Bharadwaj is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Mitsuru Yamakawa portrait
Researcher 1 reports

Mitsuru Yamakawa

Cohere

Mitsuru Yamakawa is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Mohan Kumar portrait
Researcher 1 reports

Mohan Kumar

NVIDIA

Mohan Kumar is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Mohan Kumar Ramaswamy portrait
Researcher 1 reports

Mohan Kumar Ramaswamy

NVIDIA

Mohan Kumar Ramaswamy is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Mohit Bansal portrait
Researcher 1 reports

Mohit Bansal

NVIDIA

Public report authorship links Mohit Bansal to the Nemotron-4 15B Technical Report at NVIDIA.

Mona Ranade portrait
Researcher 1 reports

Mona Ranade

Google Gemini

Mona Ranade is listed as an author of the Google technical report PaLM 2 Technical Report.

Murali Emani portrait
Researcher 1 reports

Murali Emani

Amazon

Murali Emani is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Murray Raum portrait
Researcher 1 reports

Murray Raum

Google Gemini

Murray Raum is listed as an author of the Google technical report PaLM 2 Technical Report.

Musab Beshir portrait
Researcher 1 reports

Musab Beshir

Cohere

Musab Beshir is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Naman Jain portrait
Researcher 1 reports

Naman Jain

NVIDIA

Public report authorship links Naman Jain to the Nemotron-4 340B Technical Report at NVIDIA.

Nan Du portrait
Researcher 1 reports

Nan Du

Google Gemini

Nan Du is listed as an author of the Google technical report PaLM 2 Technical Report.

Nan Duan portrait
Researcher 1 reports

Nan Duan

Z.ai

Public report authorship links Nan Duan to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Natalia Slyusarenko portrait
Researcher 1 reports

Natalia Slyusarenko

Cohere

Natalia Slyusarenko is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Nathan Knight portrait
Researcher 1 reports

Nathan Knight

NVIDIA

Nathan Knight is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

N. D. R. Chinnakonduru portrait
Researcher 1 reports

N. D. R. Chinnakonduru

Google Gemini

N. D. R. Chinnakonduru is listed as an author of the Google technical report PaLM 2 Technical Report.

Neil Bhatnagar portrait
Researcher 1 reports

Neil Bhatnagar

NVIDIA

Public report authorship links Neil Bhatnagar to the Nemotron-4 340B Technical Report at NVIDIA.

Nesreen K. Ahmed portrait
Researcher 1 reports

Nesreen K. Ahmed

NVIDIA

Public report authorship links Nesreen K. Ahmed to the Nemotron-4 340B Technical Report at NVIDIA.

Nicholas Tomlin portrait
Researcher 1 reports

Nicholas Tomlin

Cohere

Nicholas Tomlin is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Nicola Rossi portrait
Researcher 1 reports

Nicola Rossi

Cohere

Nicola Rossi is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Nicolas Uzondu portrait
Researcher 1 reports

Nicolas Uzondu

Google Gemini

Nicolas Uzondu is listed as an author of the Google technical report PaLM 2 Technical Report.

Nikita Gokarn portrait
Researcher 1 reports

Nikita Gokarn

Meta AI

Nikita Gokarn is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Nikita Pavlichenko portrait
Researcher 1 reports

Nikita Pavlichenko

Amazon

Nikita Pavlichenko is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Ning Ding portrait
Researcher 1 reports

Ning Ding

Z.ai

Public report authorship links Ning Ding to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Nino Vieillard portrait
Researcher 1 reports

Nino Vieillard

Ai2

Public report authorship links Nino Vieillard to the Ai2 technical report Open Instruct: A Simple Method for Aligning Language Models with Human Preferences.

Nithin Raveendran portrait
Researcher 1 reports

Nithin Raveendran

Google Gemini

Nithin Raveendran is listed as an author of the Google technical report PaLM 2 Technical Report.

Nitin Suresh portrait
Researcher 1 reports

Nitin Suresh

Cohere

Nitin Suresh is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Noah Constant portrait
Researcher 1 reports

Noah Constant

Google Gemini

Noah Constant is listed as an author of the Google technical report PaLM 2 Technical Report.

Noam Toister portrait
Researcher 1 reports

Noam Toister

Cohere

Noam Toister is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Noel Nabeshima portrait
Researcher 1 reports

Noel Nabeshima

Ai2

Noel Nabeshima is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.

Oliver Firat portrait
Researcher 1 reports

Oliver Firat

Google Gemini

Oliver Firat is listed as an author of the Google technical report PaLM 2 Technical Report.

Omer Goldman portrait
Researcher 1 reports

Omer Goldman

Cohere

Omer Goldman is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Osama Ramadan portrait
Researcher 1 reports

Osama Ramadan

Ai2

Public report authorship links Osama Ramadan to the Ai2 technical report Open Instruct: A Simple Method for Aligning Language Models with Human Preferences.

Pablo Salinas portrait
Researcher 1 reports

Pablo Salinas

Cohere

Pablo Salinas is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Parham Alaei portrait
Researcher 1 reports

Parham Alaei

Cohere

Parham Alaei is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Parth Chadha portrait
Researcher 1 reports

Parth Chadha

NVIDIA

Parth Chadha is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Patricia Lue portrait
Researcher 1 reports

Patricia Lue

OpenAI

Patricia Lue is listed as an author of the OpenAI technical report GPT-4 Technical Report.

Pavan K. Murali portrait
Researcher 1 reports

Pavan K. Murali

NVIDIA

Pavan K. Murali is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Peilin Dong portrait
Researcher 1 reports

Peilin Dong

Meta AI

Peilin Dong is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Pengyang Wang portrait
Researcher 1 reports

Pengyang Wang

MiniMax

Pengyang Wang is listed as an author of the MiniMax technical report MiniMax-01: Scaling Foundation Models with Lightning Attention.

P. Lu portrait
Researcher 1 reports

P. Lu

Cohere

P. Lu is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Pramod Kandala portrait
Researcher 1 reports

Pramod Kandala

NVIDIA

Public report authorship links Pramod Kandala to the Nemotron-4 340B Technical Report at NVIDIA.

Prasann Singhal portrait
Researcher 1 reports

Prasann Singhal

NVIDIA

Public report authorship links Prasann Singhal to the Nemotron-4 340B Technical Report at NVIDIA.

Prashanth Rajput portrait
Researcher 1 reports

Prashanth Rajput

NVIDIA

Public report authorship links Prashanth Rajput to the Nemotron-4 340B Technical Report at NVIDIA.

Prathyusha Kamesetty portrait
Researcher 1 reports

Prathyusha Kamesetty

NVIDIA

Public report authorship links Prathyusha Kamesetty to the Nemotron-4 15B Technical Report at NVIDIA.

Pratyush Patil portrait
Researcher 1 reports

Pratyush Patil

NVIDIA

Pratyush Patil is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Purnendu Patil portrait
Researcher 1 reports

Purnendu Patil

NVIDIA

Purnendu Patil is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Puyang Huang portrait
Researcher 1 reports

Puyang Huang

Z.ai

Puyang Huang is listed as an author of the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Qian Liu portrait
Researcher 1 reports

Qian Liu

Cohere

Qian Liu is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Qian Liu portrait
Researcher 1 reports

Qian Liu

Amazon

Qian Liu is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Qian Liu portrait
Researcher 1 reports

Qian Liu

Z.ai

Public report authorship links Qian Liu to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Qi Dou portrait
Researcher 1 reports

Qi Dou

MiniMax

Qi Dou is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.

Qiming Zhang portrait
Researcher 1 reports

Qiming Zhang

Z.ai

Public report authorship links Qiming Zhang to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Qinglin Lu portrait
Researcher 1 reports

Qinglin Lu

Z.ai

Public report authorship links Qinglin Lu to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Qingquan Li portrait
Researcher 1 reports

Qingquan Li

Z.ai

Public report authorship links Qingquan Li to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Qingyun Wu portrait
Researcher 1 reports

Qingyun Wu

Amazon

Qingyun Wu is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Qiqi Wang portrait
Researcher 1 reports

Qiqi Wang

Z.ai

Public report authorship links Qiqi Wang to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Quanting Du portrait
Researcher 1 reports

Quanting Du

Cohere

Quanting Du is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Raghavendra Amin portrait
Researcher 1 reports

Raghavendra Amin

NVIDIA

Raghavendra Amin is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Raghavendra Kokku portrait
Researcher 1 reports

Raghavendra Kokku

NVIDIA

Public report authorship links Raghavendra Kokku to the Nemotron-4 340B Technical Report at NVIDIA.

Raghu Ganti portrait
Researcher 1 reports

Raghu Ganti

Cohere

Raghu Ganti is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Rajan Chari portrait
Researcher 1 reports

Rajan Chari

Amazon

Rajan Chari is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Rajkumar Krishnakumar portrait
Researcher 1 reports

Rajkumar Krishnakumar

Google Gemini

Rajkumar Krishnakumar is listed as an author of the Google technical report PaLM 2 Technical Report.

Ram Rajagopal portrait
Researcher 1 reports

Ram Rajagopal

NVIDIA

Public report authorship links Ram Rajagopal to the Nemotron-4 340B Technical Report at NVIDIA.

Rebecca Cohen portrait
Researcher 1 reports

Rebecca Cohen

Google Gemini

Rebecca Cohen is listed as an author of the Google technical report PaLM 2 Technical Report.

Renjie Pi portrait
Researcher 1 reports

Renjie Pi

NVIDIA

Public report authorship links Renjie Pi to the Nemotron-4 340B Technical Report at NVIDIA.

Robert Berry portrait
Researcher 1 reports

Robert Berry

Ai2

Public sources list Robert Berry on the Ai2 team directory and among the authors of the OLMo 3 technical report.

Rodrigo Gomes portrait
Researcher 1 reports

Rodrigo Gomes

Google Gemini

Rodrigo Gomes is listed as an author of the Google technical report Gemma: Open Models Based on Gemini Research and Technology.

Rohan Ringwala portrait
Researcher 1 reports

Rohan Ringwala

NVIDIA

Rohan Ringwala is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Ronan Rozen portrait
Researcher 1 reports

Ronan Rozen

NVIDIA

Public report authorship links Ronan Rozen to the Nemotron-4 340B Technical Report at NVIDIA.

Roy Bar-Haim portrait
Researcher 1 reports

Roy Bar-Haim

Cohere

Roy Bar-Haim is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Rui Hu portrait
Researcher 1 reports

Rui Hu

Z.ai

Public report authorship links Rui Hu to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Rui Wang portrait
Researcher 1 reports

Rui Wang

Z.ai

Public report authorship links Rui Wang to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Ruoming Pang portrait
Researcher 1 reports

Ruoming Pang

Google Gemini

Ruoming Pang is listed as an author of the Google technical report RecurrentGemma: Moving Past Transformers for Efficient Open Language Models.

Rushil Anirudh portrait
Researcher 1 reports

Rushil Anirudh

NVIDIA

Rushil Anirudh is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Ryan Ann Molander portrait
Researcher 1 reports

Ryan Ann Molander

Meta AI

Ryan Ann Molander is listed as an author of the Meta AI technical report Llama 2: Open Foundation and Fine-Tuned Chat Models.

Ryan Lowe portrait
Researcher 1 reports

Ryan Lowe

OpenAI

Ryan Lowe is listed as an author of the OpenAI technical report GPT-4 Technical Report.

Ryan Zhao portrait
Researcher 1 reports

Ryan Zhao

Google Gemini

Ryan Zhao is listed as an author of the Google technical report PaLM 2 Technical Report.

Said X. Wang portrait
Researcher 1 reports

Said X. Wang

NVIDIA

Said X. Wang is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Sajjad A. Mozaffari portrait
Researcher 1 reports

Sajjad A. Mozaffari

Amazon

Sajjad A. Mozaffari is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Salman Khan portrait
Researcher 1 reports

Salman Khan

NVIDIA

Salman Khan is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Salvatore Romano portrait
Researcher 1 reports

Salvatore Romano

Cohere

Salvatore Romano is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Sameer Khan portrait
Researcher 1 reports

Sameer Khan

Cohere

Sameer Khan is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Sameer Singhal portrait
Researcher 1 reports

Sameer Singhal

NVIDIA

Sameer Singhal is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Sam Power portrait
Researcher 1 reports

Sam Power

Cohere

Sam Power is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Samuel Albanie portrait
Researcher 1 reports

Samuel Albanie

NVIDIA

Samuel Albanie is listed as an author of the NVIDIA technical report NVIDIA Nemotron 3: Efficient and Open Intelligence.

Sangeetha Abdu Jyothi portrait
Researcher 1 reports

Sangeetha Abdu Jyothi

Amazon

Sangeetha Abdu Jyothi is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Sarah Laszlo portrait
Researcher 1 reports

Sarah Laszlo

Cohere

Sarah Laszlo is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Saurabh Dolgall portrait
Researcher 1 reports

Saurabh Dolgall

NVIDIA

Saurabh Dolgall is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Saurav Muralidharan portrait
Researcher 1 reports

Saurav Muralidharan

NVIDIA

Public report authorship links Saurav Muralidharan to the Nemotron-4 15B Technical Report at NVIDIA.

Scott Mayer McKinney portrait
Researcher 1 reports

Scott Mayer McKinney

OpenAI

Scott Mayer McKinney is listed as an author of the OpenAI technical report GPT-4 Technical Report.

Sean Foo portrait
Researcher 1 reports

Sean Foo

NVIDIA

Public report authorship links Sean Foo to the Nemotron-4 340B Technical Report at NVIDIA.

Sebastian Lee portrait
Researcher 1 reports

Sebastian Lee

Google Gemini

Sebastian Lee is listed as an author of the Google technical report PaLM: Scaling Language Modeling with Pathways.

Sebastian Norgaard portrait
Researcher 1 reports

Sebastian Norgaard

NVIDIA

Public report authorship links Sebastian Norgaard to the Nemotron-4 340B Technical Report at NVIDIA.

Sha Cai portrait
Researcher 1 reports

Sha Cai

Z.ai

Public report authorship links Sha Cai to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Shen Gao portrait
Researcher 1 reports

Shen Gao

Z.ai

Public report authorship links Shen Gao to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Shibo Hao portrait
Researcher 1 reports

Shibo Hao

Google Gemini

Public report authorship links Shibo Hao to the Google technical report RecurrentGemma: Moving Past Transformers for Efficient Open Language Models.

Shi Dong portrait
Researcher 1 reports

Shi Dong

NVIDIA

Shi Dong is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Shijie Geng portrait
Researcher 1 reports

Shijie Geng

Z.ai

Public report authorship links Shijie Geng to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Shikib Rahman portrait
Researcher 1 reports

Shikib Rahman

Cohere

Shikib Rahman is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Shiqi Wang portrait
Researcher 1 reports

Shiqi Wang

Z.ai

Public report authorship links Shiqi Wang to the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Shital Shah portrait
Researcher 1 reports

Shital Shah

NVIDIA

Shital Shah is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Shiyue Zhang portrait
Researcher 1 reports

Shiyue Zhang

Meta AI

Shiyue Zhang is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Shourya Khare portrait
Researcher 1 reports

Shourya Khare

Google Gemini

Shourya Khare is listed as an author of the Google technical report PaLM 2 Technical Report.

Shreyas Paliwal portrait
Researcher 1 reports

Shreyas Paliwal

NVIDIA

Public report authorship links Shreyas Paliwal to the Nemotron-4 340B Technical Report at NVIDIA.

Shuai Wang portrait
Researcher 1 reports

Shuai Wang

Cohere

Shuai Wang is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Shuangfei Zhai portrait
Researcher 1 reports

Shuangfei Zhai

Amazon

Shuangfei Zhai is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Shuliang Wang portrait
Researcher 1 reports

Shuliang Wang

MiniMax

Shuliang Wang is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.

Shuo Wang portrait
Researcher 1 reports

Shuo Wang

Z.ai

Public report authorship links Shuo Wang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Shuyan Zhou portrait
Researcher 1 reports

Shuyan Zhou

Cohere

Shuyan Zhou is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Shyam Mohan Srinivasan portrait
Researcher 1 reports

Shyam Mohan Srinivasan

NVIDIA

Public report authorship links Shyam Mohan Srinivasan to the Nemotron-4 340B Technical Report at NVIDIA.

Siddharth Jain portrait
Researcher 1 reports

Siddharth Jain

NVIDIA

Public report authorship links Siddharth Jain to the Nemotron-4 340B Technical Report at NVIDIA.

Sidharth Karamcheti portrait
Researcher 1 reports

Sidharth Karamcheti

Mistral AI

Sidharth Karamcheti is listed as an author of the Mistral AI technical report Mistral Small 3.1 Technical Report.

Siliang Tang portrait
Researcher 1 reports

Siliang Tang

MiniMax

Siliang Tang is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.

Siyuan Yin portrait
Researcher 1 reports

Siyuan Yin

Z.ai

Siyuan Yin is listed as an author of the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Songfang Huang portrait
Researcher 1 reports

Songfang Huang

Z.ai

Public report authorship links Songfang Huang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Souradip Chakraborty portrait
Researcher 1 reports

Souradip Chakraborty

Amazon

Souradip Chakraborty is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Spike Gururangan portrait
Researcher 1 reports

Spike Gururangan

Meta AI

Spike Gururangan is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Steve Dowling portrait
Researcher 1 reports

Steve Dowling

OpenAI

Steve Dowling is listed as an author of the OpenAI technical report GPT-4 Technical Report.

Sujith Ravi portrait
Researcher 1 reports

Sujith Ravi

Google Gemini

Sujith Ravi is listed as an author of the Google technical report PaLM 2 Technical Report.

Sukannya Purkayastha portrait
Researcher 1 reports

Sukannya Purkayastha

Amazon

Sukannya Purkayastha is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Sundararajan Srinivasan portrait
Researcher 1 reports

Sundararajan Srinivasan

NVIDIA

Sundararajan Srinivasan is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Taira Anderson portrait
Researcher 1 reports

Taira Anderson

Ai2

Taira Anderson is a program manager supporting Ai2's multimodal teams and is listed as an author on the Olmo 3 report.

Tanay Datta portrait
Researcher 1 reports

Tanay Datta

NVIDIA

Public report authorship links Tanay Datta to the Nemotron-4 340B Technical Report at NVIDIA.

Tanmay Shah portrait
Researcher 1 reports

Tanmay Shah

Amazon

Tanmay Shah is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Tao Ge portrait
Researcher 1 reports

Tao Ge

Z.ai

Public report authorship links Tao Ge to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Tao Ge portrait
Researcher 1 reports

Tao Ge

Amazon

Tao Ge is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Tarek Eloundou Nekoul portrait
Researcher 1 reports

Tarek Eloundou Nekoul

Ai2

Public report authorship links Tarek Eloundou Nekoul to the Ai2 report OLMoE: Open Mixture-of-Experts Language Models.

Teng Xu portrait
Researcher 1 reports

Teng Xu

Z.ai

Public report authorship links Teng Xu to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Theo Garcin portrait
Researcher 1 reports

Theo Garcin

Mistral AI

Theo Garcin is listed as an author of the Mistral AI technical report Mistral Small 3.1 Technical Report.

Thomas Lavril portrait
Researcher 1 reports

Thomas Lavril

Mistral AI

Thomas Lavril is listed as an author of the Mistral AI technical report Mistral 7B.

Thomas Scialom portrait
Researcher 1 reports

Thomas Scialom

Ai2

Public report authorship links Thomas Scialom to the Ai2 report Open Instruct: A Simple Method for Aligning Language Models with Human Preferences.

Tianhao Xu portrait
Researcher 1 reports

Tianhao Xu

NVIDIA

Tianhao Xu is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Tianmin Shu portrait
Researcher 1 reports

Tianmin Shu

Cohere

Tianmin Shu is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Tianshu Wang portrait
Researcher 1 reports

Tianshu Wang

Z.ai

Public report authorship links Tianshu Wang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Tianyi Zhang portrait
Researcher 1 reports

Tianyi Zhang

Meta AI

Tianyi Zhang is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Tianyi Zhang portrait
Researcher 1 reports

Tianyi Zhang

Ai2

Public report authorship links Tianyi Zhang to the Ai2 report Open Instruct: A Simple Method for Aligning Language Models with Human Preferences.

Tielong Chen portrait
Researcher 1 reports

Tielong Chen

Z.ai

Public report authorship links Tielong Chen to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Tim Dodds portrait
Researcher 1 reports

Tim Dodds

Ai2

Public report authorship links Tim Dodds to the Ai2 report Open Instruct: A Simple Method for Aligning Language Models with Human Preferences.

Ting Chen portrait
Researcher 1 reports

Ting Chen

Google Gemini

Ting Chen is listed as an author of the Google technical report PaLM 2 Technical Report.

Tolly Powell portrait
Researcher 1 reports

Tolly Powell

OpenAI

Tolly Powell is listed as an author of the OpenAI technical report GPT-4 Technical Report.

Tong Wang portrait
Researcher 1 reports

Tong Wang

NVIDIA

Public report authorship links Tong Wang to the Nemotron-4 340B Technical Report at NVIDIA.

Tyler A. Chang portrait
Researcher 1 reports

Tyler A. Chang

Ai2

Public report authorship links Tyler A. Chang to the Ai2 report OLMoE: Open Mixture-of-Experts Language Models.

Upal Ayyalasomayajula portrait
Researcher 1 reports

Upal Ayyalasomayajula

NVIDIA

Upal Ayyalasomayajula is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Valentin Nechifor portrait
Researcher 1 reports

Valentin Nechifor

NVIDIA

Public report authorship links Valentin Nechifor to the Nemotron-4 340B Technical Report at NVIDIA.

Vartika Bhandari portrait
Researcher 1 reports

Vartika Bhandari

NVIDIA

Vartika Bhandari is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Venkata Pingali portrait
Researcher 1 reports

Venkata Pingali

NVIDIA

Public report authorship links Venkata Pingali to the Nemotron-4 340B Technical Report at NVIDIA.

Venkatesh Korthikanti portrait
Researcher 1 reports

Venkatesh Korthikanti

NVIDIA

Public report authorship links Venkatesh Korthikanti to the Nemotron-4 340B Technical Report at NVIDIA.

Vikas Chandra portrait
Researcher 1 reports

Vikas Chandra

NVIDIA

Vikas Chandra is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Vikas Ours portrait
Researcher 1 reports

Vikas Ours

NVIDIA

Vikas Ours is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Vineet Gupta portrait
Researcher 1 reports

Vineet Gupta

NVIDIA

Public report authorship links Vineet Gupta to the Nemotron-4 340B Technical Report at NVIDIA.

Viraj Bagal portrait
Researcher 1 reports

Viraj Bagal

Ai2

Public report authorship links Viraj Bagal to the Ai2 report OLMoE: Open Mixture-of-Experts Language Models.

Vishakha Gupta portrait
Researcher 1 reports

Vishakha Gupta

NVIDIA

Vishakha Gupta is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Wei Dai portrait
Researcher 1 reports

Wei Dai

Google Gemini

Wei Dai is listed as an author of the Google technical report PaLM 2 Technical Report.

Wei Li portrait
Researcher 1 reports

Wei Li

Google Gemini

Wei Li is listed as an author of the Google technical report PaLM 2 Technical Report.

Wenhui Huang portrait
Researcher 1 reports

Wenhui Huang

MiniMax

Wenhui Huang is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.

Wenping Wang portrait
Researcher 1 reports

Wenping Wang

Z.ai

Public report authorship links Wenping Wang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Wenxuan Tan portrait
Researcher 1 reports

Wenxuan Tan

Z.ai

Public report authorship links Wenxuan Tan to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Wenyi Hong portrait
Researcher 1 reports

Wenyi Hong

Z.ai

Public report authorship links Wenyi Hong to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Wenyuan Han portrait
Researcher 1 reports

Wenyuan Han

Z.ai

Public report authorship links Wenyuan Han to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Wojciech Zaremba portrait
Researcher 1 reports

Wojciech Zaremba

OpenAI

Wojciech Zaremba is listed as an author of the OpenAI technical report GPT-4 Technical Report.

Xiangyong Deng portrait
Researcher 1 reports

Xiangyong Deng

Z.ai

Public report authorship links Xiangyong Deng to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Xiao Feng portrait
Researcher 1 reports

Xiao Feng

Z.ai

Public report authorship links Xiao Feng to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Xiaoming Zhang portrait
Researcher 1 reports

Xiaoming Zhang

MiniMax

Xiaoming Zhang is listed as an author of the MiniMax technical report MiniMax-01: Scaling Foundation Models with Lightning Attention.

Xiaosi Yang portrait
Researcher 1 reports

Xiaosi Yang

NVIDIA

Xiaosi Yang is listed as an author of the NVIDIA technical report Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models.

Xiaoyang Liu portrait
Researcher 1 reports

Xiaoyang Liu

NVIDIA

Public report authorship links Xiaoyang Liu to the Nemotron-4 340B Technical Report at NVIDIA.

Xiaoze Liu portrait
Researcher 1 reports

Xiaoze Liu

Z.ai

Public report authorship links Xiaoze Liu to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Xiaozhi Wang portrait
Researcher 1 reports

Xiaozhi Wang

Z.ai

Public report authorship links Xiaozhi Wang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Xinggong Zhang portrait
Researcher 1 reports

Xinggong Zhang

Z.ai

Public report authorship links Xinggong Zhang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Xin Jiang portrait
Researcher 1 reports

Xin Jiang

Google Gemini

Xin Jiang is listed as an author of the Google technical report PaLM 2 Technical Report.

Xinlong Wang portrait
Researcher 1 reports

Xinlong Wang

Z.ai

Public report authorship links Xinlong Wang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Xintao Wang portrait
Researcher 1 reports

Xintao Wang

MiniMax

Xintao Wang is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.

Xinyu Xu portrait
Researcher 1 reports

Xinyu Xu

Z.ai

Public report authorship links Xinyu Xu to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Xuesong Jin portrait
Researcher 1 reports

Xuesong Jin

Z.ai

Public report authorship links Xuesong Jin to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Xu Ming portrait
Researcher 1 reports

Xu Ming

Z.ai

Xu Ming is listed as an author of the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Xun Gao portrait
Researcher 1 reports

Xun Gao

Cohere

Xun Gao is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Xupeng Miao portrait
Researcher 1 reports

Xupeng Miao

Z.ai

Public report authorship links Xupeng Miao to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yadong Yin portrait
Researcher 1 reports

Yadong Yin

Z.ai

Public report authorship links Yadong Yin to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yajun Wang portrait
Researcher 1 reports

Yajun Wang

Z.ai

Public report authorship links Yajun Wang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yang Gao portrait
Researcher 1 reports

Yang Gao

Z.ai

Public report authorship links Yang Gao to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yang Liu portrait
Researcher 1 reports

Yang Liu

Z.ai

Public report authorship links Yang Liu to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yangyou Huang portrait
Researcher 1 reports

Yangyou Huang

NVIDIA

Yangyou Huang is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Yanhui Guo portrait
Researcher 1 reports

Yanhui Guo

Z.ai

Public report authorship links Yanhui Guo to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yann Barbaron portrait
Researcher 1 reports

Yann Barbaron

Mistral AI

Yann Barbaron is listed as an author of the Mistral AI technical report Mixtral of Experts.

Yann Dubois portrait
Researcher 1 reports

Yann Dubois

Ai2

Public report authorship links Yann Dubois to the Ai2 report Open Instruct: A Simple Method for Aligning Language Models with Human Preferences.

Yan Zhou portrait
Researcher 1 reports

Yan Zhou

Z.ai

Public report authorship links Yan Zhou to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yao Wu portrait
Researcher 1 reports

Yao Wu

Google Gemini

Yao Wu is listed as an author of the Google technical report PaLM 2 Technical Report.

Yaqin Fan portrait
Researcher 1 reports

Yaqin Fan

Z.ai

Public report authorship links Yaqin Fan to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yash Sharma portrait
Researcher 1 reports

Yash Sharma

Meta AI

Yash Sharma is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Yecong Wei portrait
Researcher 1 reports

Yecong Wei

Z.ai

Public report authorship links Yecong Wei to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yejin Choi portrait
Researcher 1 reports

Yejin Choi

NVIDIA

Public report authorship links Yejin Choi to the Nemotron-4 15B Technical Report at NVIDIA.

Yejin Choi portrait
Researcher 1 reports

Yejin Choi

Cohere

Yejin Choi is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Yelin Zhou portrait
Researcher 1 reports

Yelin Zhou

Google Gemini

Yelin Zhou is listed as an author of the Google technical report PaLM 2 Technical Report.

Yeqi Bai portrait
Researcher 1 reports

Yeqi Bai

Z.ai

Public report authorship links Yeqi Bai to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yevgeny Kuratov portrait
Researcher 1 reports

Yevgeny Kuratov

Amazon

Yevgeny Kuratov is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Yibo Xu portrait
Researcher 1 reports

Yibo Xu

Z.ai

Public report authorship links Yibo Xu to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yicheng Jin portrait
Researcher 1 reports

Yicheng Jin

Z.ai

Yicheng Jin is listed as an author of the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yicheng Wang portrait
Researcher 1 reports

Yicheng Wang

Z.ai

Public report authorship links Yicheng Wang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yichen Wang portrait
Researcher 1 reports

Yichen Wang

Z.ai

Yichen Wang is listed as an author of the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yichen Zhu portrait
Researcher 1 reports

Yichen Zhu

MiniMax

Yichen Zhu is listed as an author of the MiniMax technical report MiniMax-01: Scaling Foundation Models with Lightning Attention.

Yi Cui portrait
Researcher 1 reports

Yi Cui

Google Gemini

Yi Cui is listed as an author of the Google technical report PaLM 2 Technical Report.

Yifan Li portrait
Researcher 1 reports

Yifan Li

MiniMax

Yifan Li is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.

Yijia Shao portrait
Researcher 1 reports

Yijia Shao

Cohere

Yijia Shao is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Yile Wang portrait
Researcher 1 reports

Yile Wang

Cohere

Yile Wang is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Yile Wang portrait
Researcher 1 reports

Yile Wang

Z.ai

Public report authorship links Yile Wang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yimeng Zhu portrait
Researcher 1 reports

Yimeng Zhu

Z.ai

Public report authorship links Yimeng Zhu to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yiming Li portrait
Researcher 1 reports

Yiming Li

Z.ai

Public report authorship links Yiming Li to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yingbo Zhou portrait
Researcher 1 reports

Yingbo Zhou

Amazon

Yingbo Zhou is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Yingqing He portrait
Researcher 1 reports

Yingqing He

MiniMax

Yingqing He is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.

Yinqiang Zheng portrait
Researcher 1 reports

Yinqiang Zheng

MiniMax

Yinqiang Zheng is listed as an author of the MiniMax technical report MiniMax-01: Scaling Foundation Models with Lightning Attention.

Yiran Fang portrait
Researcher 1 reports

Yiran Fang

MiniMax

Yiran Fang is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.

Yixing Liao portrait
Researcher 1 reports

Yixing Liao

Amazon

Yixing Liao is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Yizhu Jiao portrait
Researcher 1 reports

Yizhu Jiao

Ai2

Yizhu Jiao is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.

Yongbin Li portrait
Researcher 1 reports

Yongbin Li

Z.ai

Yongbin Li is listed as an author of the Z.ai technical report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yongdagger Lee portrait
Researcher 1 reports

Yongdagger Lee

NVIDIA

Yongdagger Lee is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Yonghong Tian portrait
Researcher 1 reports

Yonghong Tian

MiniMax

Yonghong Tian is listed as an author of the MiniMax technical report MiniMax-01: Scaling Foundation Models with Lightning Attention.

Yong Wang portrait
Researcher 1 reports

Yong Wang

Z.ai

Public report authorship links Yong Wang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yongxin Guo portrait
Researcher 1 reports

Yongxin Guo

Z.ai

Public report authorship links Yongxin Guo to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yoni Halpern portrait
Researcher 1 reports

Yoni Halpern

Amazon

Yoni Halpern is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Youjia Zhang portrait
Researcher 1 reports

Youjia Zhang

MiniMax

Youjia Zhang is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.

Yuan Gu portrait
Researcher 1 reports

Yuan Gu

Z.ai

Public report authorship links Yuan Gu to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yubo Miao portrait
Researcher 1 reports

Yubo Miao

Z.ai

Public report authorship links Yubo Miao to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yu Cheng portrait
Researcher 1 reports

Yu Cheng

Z.ai

Public report authorship links Yu Cheng to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yucheng Zheng portrait
Researcher 1 reports

Yucheng Zheng

Z.ai

Public report authorship links Yucheng Zheng to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yudong Chen portrait
Researcher 1 reports

Yudong Chen

Z.ai

Public report authorship links Yudong Chen to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yudong Hao portrait
Researcher 1 reports

Yudong Hao

NVIDIA

Yudong Hao is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Yue Zhou portrait
Researcher 1 reports

Yue Zhou

Z.ai

Public report authorship links Yue Zhou to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yujiu Yang portrait
Researcher 1 reports

Yujiu Yang

Cohere

Yujiu Yang is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Yukun Zhu portrait
Researcher 1 reports

Yukun Zhu

Z.ai

Public report authorship links Yukun Zhu to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yuliya Simachev portrait
Researcher 1 reports

Yuliya Simachev

NVIDIA

Yuliya Simachev is listed as an author of the NVIDIA technical report Nemotron-4 340B Technical Report.

Yunan Hu portrait
Researcher 1 reports

Yunan Hu

Z.ai

Public report authorship links Yunan Hu to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yunjia Xi portrait
Researcher 1 reports

Yunjia Xi

Z.ai

Public report authorship links Yunjia Xi to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yunyang Zhao portrait
Researcher 1 reports

Yunyang Zhao

NVIDIA

Public report authorship links Yunyang Zhao to the Nemotron-4 340B Technical Report at NVIDIA.

Yuqi Zhou portrait
Researcher 1 reports

Yuqi Zhou

Google Gemini

Public report authorship links Yuqi Zhou to the Google report PaLM: Scaling Language Modeling with Pathways.

Yury Malkov portrait
Researcher 1 reports

Yury Malkov

Google Gemini

Yury Malkov is listed as an author of the Google technical report PaLM 2 Technical Report.

Yury Malkov portrait
Researcher 1 reports

Yury Malkov

Amazon

Yury Malkov is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Yu Wang portrait
Researcher 1 reports

Yu Wang

Z.ai

Public report authorship links Yu Wang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yuxiang Luo portrait
Researcher 1 reports

Yuxiang Luo

DeepSeek

Yuxiang Luo is listed as an author of the DeepSeek technical report DeepSeek-V3 Technical Report.

Yuxi Li portrait
Researcher 1 reports

Yuxi Li

Cohere

Yuxi Li is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Yuxiong He portrait
Researcher 1 reports

Yuxiong He

NVIDIA

Public report authorship links Yuxiong He to the Nemotron-4 340B Technical Report at NVIDIA.

Yu Zhang portrait
Researcher 1 reports

Yu Zhang

Z.ai

Public report authorship links Yu Zhang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Yuzheng Zhuang portrait
Researcher 1 reports

Yuzheng Zhuang

Cohere

Yuzheng Zhuang is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Zachary Yung portrait
Researcher 1 reports

Zachary Yung

Google Gemini

Zachary Yung is listed as an author of the Google technical report PaLM 2 Technical Report.

Zefeng Wang portrait
Researcher 1 reports

Zefeng Wang

Z.ai

Public report authorship links Zefeng Wang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Zehua Wang portrait
Researcher 1 reports

Zehua Wang

NVIDIA

Public report authorship links Zehua Wang to the Nemotron-4 340B Technical Report at NVIDIA.

Zhangliang Chen portrait
Researcher 1 reports

Zhangliang Chen

Z.ai

Public report authorship links Zhangliang Chen to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Zhaocheng Fang portrait
Researcher 1 reports

Zhaocheng Fang

Z.ai

Public report authorship links Zhaocheng Fang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Zhenhai Zhu portrait
Researcher 1 reports

Zhenhai Zhu

Cohere

Zhenhai Zhu is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Zhenhuan Yang portrait
Researcher 1 reports

Zhenhuan Yang

Z.ai

Public report authorship links Zhenhuan Yang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Zhen Zhang portrait
Researcher 1 reports

Zhen Zhang

Amazon

Zhen Zhang is listed as an author of the Amazon technical report Amazon Nova Premier Technical Report.

Zheyu Song portrait
Researcher 1 reports

Zheyu Song

Z.ai

Public report authorship links Zheyu Song to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Zhibin Chen portrait
Researcher 1 reports

Zhibin Chen

Z.ai

Public report authorship links Zhibin Chen to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Zhiming Yang portrait
Researcher 1 reports

Zhiming Yang

Meta AI

Zhiming Yang is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Zhiqiang Zhang portrait
Researcher 1 reports

Zhiqiang Zhang

Z.ai

Public report authorship links Zhiqiang Zhang to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Zhiwen Fan portrait
Researcher 1 reports

Zhiwen Fan

Cohere

Zhiwen Fan is listed as an author of the Cohere technical report Command A: An Enterprise-Ready Large Language Model.

Zhiyao Li portrait
Researcher 1 reports

Zhiyao Li

Z.ai

Public report authorship links Zhiyao Li to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Zhuohan Li portrait
Researcher 1 reports

Zhuohan Li

Ai2

Public report authorship links Zhuohan Li to the Ai2 report OLMoE: Open Mixture-of-Experts Language Models.

Zihang Li portrait
Researcher 1 reports

Zihang Li

Meta AI

Zihang Li is listed as an author of the Meta AI technical report The Llama 3 Herd of Models.

Zihan Qiao portrait
Researcher 1 reports

Zihan Qiao

MiniMax

Zihan Qiao is listed as an author of MiniMax-Text-01, a MiniMax technical report in the LLMpeople catalog.

Ziyang Song portrait
Researcher 1 reports

Ziyang Song

DeepSeek

Ziyang Song is listed as an author of the DeepSeek technical report DeepSeek-V3 Technical Report.

Ziyi Yang portrait
Researcher 1 reports

Ziyi Yang

Ai2

Ziyi Yang is listed as an author of the Ai2 technical report Tulu 2: Demystifying the Effectiveness of RLHF and Reinforcement Learning with Human Feedback.

Z. Olivia Li portrait
Researcher 1 reports

Z. Olivia Li

Google Gemini

Z. Olivia Li is listed as an author of the Google technical report PaLM 2 Technical Report.

Zongze Kou portrait
Researcher 1 reports

Zongze Kou

Z.ai

Public report authorship links Zongze Kou to the Z.ai report GLM-5: Thinking, Coding, and Agentic Intelligence.

Chengen Huang portrait
Researcher 1 reports

Chengen Huang

Alibaba Qwen

Research scientist in Tongyi Lab whose official profile highlights coding large language models and AI agents.

Gao Liu portrait
Researcher 1 reports

Gao Liu

Alibaba Qwen

Research scientist in Tongyi Lab whose official profile highlights post-training, reinforcement learning, reasoning, and AI agents.

Jialong Tang portrait
Researcher 1 reports

Jialong Tang

Alibaba Qwen

Qwen researcher whose work focuses on alignment, reasoning, and multilingual large language models.

Jing Zhou portrait
Researcher 1 reports

Jing Zhou

Alibaba Qwen

Research scientist in Tongyi Lab whose official profile highlights post-training and multimodal large language models.

Luo Ji portrait
Researcher 1 reports

Luo Ji

Alibaba Qwen

Leader of Qwen's reasoning and agent team.

Shixuan Liu portrait
Researcher 1 reports

Shixuan Liu

Alibaba Qwen

Research scientist in Tongyi Lab whose official profile highlights post-training, large language models, and AI agents.

Tianhang Zhu portrait
Researcher 1 reports

Tianhang Zhu

Alibaba Qwen

Research scientist in Tongyi Lab whose official profile highlights post-training, reasoning, and large multimodal models.

Xingxuan Zhang portrait
Researcher 1 reports

Xingxuan Zhang

Alibaba Qwen

Research scientist in Tongyi Lab whose official profile highlights reasoning, large-scale reinforcement learning, and multilingual large language models.

Yang Yao portrait
Researcher 1 reports

Yang Yao

Alibaba Qwen

Research scientist in Tongyi Lab whose official profile highlights post-training, efficient reasoning, and multilingual large language models.

Zheng Yuan portrait
Researcher 1 reports

Zheng Yuan

Alibaba Qwen

Research scientist in Tongyi Lab whose official profile highlights multimodal reasoning, agentic reinforcement learning, and post-training for large language models.

Wei Wang portrait
Researcher 3 reports

Wei Wang

Moonshot AI / Alibaba Qwen

Public report authorship links Wei Wang to Qwen Technical Report, Qwen-Image-2.0 Technical Report, and Kimi K2.5.

Bailey Kuehl portrait
Researcher 2 reports

Bailey Kuehl

Ai2

Ai2's public team directory lists Bailey Kuehl, and recent public papers credit Kuehl on OLMo Hybrid and scientific-literature work.

Christopher Wilhelm portrait
Researcher 2 reports

Christopher Wilhelm

Ai2

Research scientist at Ai2 working in NLP and machine learning, with interests in efficient training, pruning, and sparsity.

DJ Matusz portrait
Researcher 1 reports

DJ Matusz

Ai2

DJ Matusz is a named author on the OLMo Hybrid report and Lambda's MFU optimization white paper.

Xingxing Zhang portrait
Researcher 1 reports

Xingxing Zhang

Microsoft

Xingxing Zhang is a researcher at Microsoft Research and a co-author of the BitNet b1.58 2B4T Technical Report.

Xun Wu portrait
Researcher 1 reports

Xun Wu

Microsoft

Xun Wu is listed as a coauthor of the arXiv paper "Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity," with Microsoft Research shown as the first affiliation.

Yingbo Hao portrait
Researcher 1 reports

Yingbo Hao

Microsoft

Public Microsoft-linked sources identify Yingbo Hao as a coauthor on recent efficient language-model technical reports.

Aakanksha Naik portrait
Researcher 1 reports

Aakanksha Naik

Google Gemini

Research scientist at Google DeepMind working on natural language understanding, generation, question answering, and evaluation, with current interests in multilinguality, attribution, and agent evaluation.

Aalok Mehta portrait
Researcher 1 reports

Aalok Mehta

OpenAI

Research engineer on OpenAI's safety systems team. Previously studied at Carnegie Mellon University and is interested in safety, alignment, and evaluation of language models.

Abhimanyu Dubey portrait
Researcher 1 reports

Abhimanyu Dubey

Meta AI

Research scientist focused on parallel and distributed machine learning systems.

Abhinav Sethy portrait
Researcher 1 reports

Abhinav Sethy

Cohere

Cohere researcher working on speech, multilinguality, multilingual speech, and speech translation, and a co-author of the Command A technical report.

Adam Perelman portrait
Researcher 1 reports

Adam Perelman

OpenAI

Software engineer at OpenAI; previously worked at Palantir.

Adrian Barbu portrait
Researcher 1 reports

Adrian Barbu

Google Gemini

Researcher working on large language model training and evaluation, including PaLM 2.

Adrian Mollov portrait
Researcher 1 reports

Adrian Mollov

Google Gemini

Adrian Mollov is a machine learning engineer and researcher interested in generative models, large language models, reinforcement learning, and scalable multimodal systems; his profile notes experience at Google DeepMind.

Aimin Yang portrait
Researcher 1 reports

Aimin Yang

Moonshot AI

Aimin Yang is a Moonshot AI contributor and co-author of Kimi k1.5: Scaling Reinforcement Learning with LLMs.

Aishwarya Agrawal portrait
Researcher 1 reports

Aishwarya Agrawal

Amazon

Assistant professor at the Mila-Quebec AI Institute and the University of Montreal, and Canada CIFAR AI Chair; her research focuses on vision-and-language and multimodal representation learning.

Akila Welihinda portrait
Researcher 1 reports

Akila Welihinda

OpenAI

Product professional at OpenAI focused on models, products, and agents; previously led AI transformation and platform efforts at HubSpot.

Alekh Agarwal portrait
Researcher 1 reports

Alekh Agarwal

Cohere

Principal scientist at Cohere and adjunct associate professor at Columbia University working on reinforcement learning, active learning, and representation learning.

Alex Berard portrait
Researcher 1 reports

Alex Berard

Google Gemini

Research scientist at Google Research working on machine learning and natural language processing.

Alex Passos portrait
Researcher 1 reports

Alex Passos

OpenAI

OpenAI contributor credited on the GPT-4 technical report.

Aliaksei Severyn portrait
Researcher 1 reports

Aliaksei Severyn

Cohere

Senior research scientist at Cohere Labs researching multilingual and multimodal large language models.

Alice Meyers portrait
Researcher 1 reports

Alice Meyers

Databricks

Researcher working on open foundation models and evaluation, including DBRX.

Ali Kamali portrait
Researcher 1 reports

Ali Kamali

OpenAI

Senior Software Engineer at OpenAI. Previously studied computer science at the University of Waterloo.

Ali Shoeybi portrait
Researcher 1 reports

Ali Shoeybi

NVIDIA

Vice president of applied deep learning research at NVIDIA, leading work on foundation models, large language models, computer vision, robotics, and autonomous driving.

Allison Moreira portrait
Researcher 1 reports

Allison Moreira

Google Gemini

Google researcher whose publications include the PaLM 2 technical report and related work on multilingual language models and dialogue systems.

Alvin Wang portrait
Researcher 1 reports

Alvin Wang

OpenAI

Alvin Wang is a software engineer at OpenAI focused on speech and audio. His official profile lists contributions to GPT-4.1, the audio models in the API, Sora, GPT-4, DALL-E 3, and Whisper.

Aman Rangapur portrait
Researcher 1 reports

Aman Rangapur

Ai2

Research engineer at Ai2 on the OLMo team, focused on reliable and interpretable AI, transformers, retrieval, and autonomous agents.

Amin Tootoonchian portrait
Researcher 1 reports

Amin Tootoonchian

OpenAI

Computer systems researcher and GPT-4 coauthor whose public work centers on networking and large-scale distributed systems.

Andrew Healey portrait
Researcher 1 reports

Andrew Healey

Google Gemini

AI researcher at Google DeepMind with interests in natural language processing, machine learning, and neuroscience.

Andrew Kondrich portrait
Researcher 1 reports

Andrew Kondrich

OpenAI

Member of Technical Staff at OpenAI. Previously worked at Scale AI and studied computer science at Stanford University.

Andrew Mayne portrait
Researcher 1 reports

Andrew Mayne

OpenAI

Author and communicator who led science communication work at OpenAI and helped present GPT-4; later continued working on AI communication and startups.

Andrew Peng portrait
Researcher 1 reports

Andrew Peng

OpenAI

Software engineer at OpenAI whose official profile focuses on multimodal models and AI agents.

Andrey Mishchenko portrait
Researcher 1 reports

Andrey Mishchenko

OpenAI

Researcher at OpenAI working on post-training and reasoning, after earlier research on self-supervised learning, reinforcement learning, and robotics.

Andy Sonneveld portrait
Researcher 1 reports

Andy Sonneveld

Google Gemini

Researcher working on large language model training and evaluation, including PaLM 2.

Anfeng Li portrait
Researcher 1 reports

Anfeng Li

Alibaba Qwen

Anfeng Li works in AI model research at Alibaba DAMO Academy and is a co-author of the Qwen3 Technical Report.

Anna Petrovic portrait
Researcher 1 reports

Anna Petrovic

Google Gemini

Google researcher whose publications include the PaLM 2 technical report.

Anthony Hartshorn portrait
Researcher 1 reports

Anthony Hartshorn

Meta AI

Machine learning engineer at Meta working on Llama post-training, with interests in reinforcement learning, generative AI, digital humans, AI at the edge, and ML systems.

Antoine Bosselut portrait
Researcher 1 reports

Antoine Bosselut

Google Gemini

Antoine Bosselut is an assistant professor in the Language Technologies Institute at Carnegie Mellon University. His research develops language technologies that align with people's social and cultural values and stay robust in realistic settings.

Aparna Cholakkal Hombali portrait
Researcher 1 reports

Aparna Cholakkal Hombali

Cohere

Research scientist at Cohere working on machine learning, computer vision, and explainable AI, including the Command A model family.

Archit Narang portrait
Researcher 1 reports

Archit Narang

Meta AI

Machine learning researcher at Meta AI and visiting scientist at the University of Illinois Urbana-Champaign focused on language models and multimodal learning.

Arian Hosseini portrait
Researcher 1 reports

Arian Hosseini

Cohere

Research scientist at Cohere working on language modeling, reasoning, and vision.

Ari Morcos portrait
Researcher 1 reports

Ari Morcos

NVIDIA

Research scientist at NVIDIA working on foundation models, large-scale training, and model capabilities and safety.

Arun Vijayvergiya portrait
Researcher 1 reports

Arun Vijayvergiya

OpenAI

Research scientist at OpenAI whose official profile highlights multimodal speech.

Aryo Pradipta Gema portrait
Researcher 1 reports

Aryo Pradipta Gema

Ai2

Research engineer at Ai2 focused on post-training and data for open language models.

A. X. Liu portrait
Researcher 1 reports

A. X. Liu

DeepSeek

Researcher affiliated with DeepSeek AI on Google Scholar and co-author of the DeepSeek LLM Technical Report.

Balazs Farkas portrait
Researcher 1 reports

Balazs Farkas

NVIDIA

NVIDIA researcher with a Google Scholar profile that includes the Nemotron-4 340B technical report.

Baoyuan Zhang portrait
Researcher 1 reports

Baoyuan Zhang

DeepSeek

DeepSeek researcher and coauthor of the DeepSeek LLM Technical Report.

Behdad Tavakoli portrait
Researcher 1 reports

Behdad Tavakoli

Google Gemini

Research scientist at Google DeepMind working on large-scale language models, multimodal learning, and efficient deep learning systems.

Ben Chess portrait
Researcher 1 reports

Ben Chess

OpenAI

Product leader at OpenAI working on post-training, voice, and applied reasoning, after earlier product work on GPT-4V and developer products.

Benjamin Chess portrait
Researcher 1 reports

Benjamin Chess

OpenAI

Engineer and infrastructure specialist, formerly OpenAI's supercomputing lead, who co-authored OpenAI scaling work and later joined Magic.

Benjamin Sokolowsky portrait
Researcher 1 reports

Benjamin Sokolowsky

OpenAI

Researcher at OpenAI working on multimodal models, reinforcement learning, and AI safety.

Bhagyashree N. Patil portrait
Researcher 1 reports

Bhagyashree N. Patil

Cerebras Systems

Bhagyashree N. Patil is a research scientist at Cohere whose public profile lists prior work at Cerebras Systems, Brown NLP, and IIT Bombay.

Bhavana Dalvi portrait
Researcher 1 reports

Bhavana Dalvi

Ai2

Researcher whose public work spans NLP, reasoning, and evaluation; arXiv author results include OLMo and related language model work.

Billie Jonn portrait
Researcher 1 reports

Billie Jonn

OpenAI

OpenAI finance and accounting professional who was publicly credited on the GPT-4 technical report.

Binh Tang portrait
Researcher 1 reports

Binh Tang

Meta AI

Binh Tang is an AI researcher at Meta. His public homepage highlights work in machine learning, natural language processing, and large-scale AI systems.

Bob McGrew portrait
Researcher 1 reports

Bob McGrew

OpenAI

Research leader and GPT-4 coauthor who joined OpenAI in 2017 and later served as chief research officer after earlier engineering leadership work at Palantir.

Bob Rotsted portrait
Researcher 1 reports

Bob Rotsted

OpenAI

Member of Technical Staff at OpenAI with a background in machine learning, computer vision, robotics, and cyber-physical systems.

Bochao Wu portrait
Researcher 1 reports

Bochao Wu

DeepSeek

Research scientist at DeepSeek with public work on large language models and reasoning.

Bogdan Mazoure portrait
Researcher 1 reports

Bogdan Mazoure

Cohere

Research scientist at Cohere working on machine learning and reinforcement learning.

Boris Power portrait
Researcher 1 reports

Boris Power

OpenAI

Research scientist at OpenAI designing language environments and agents; holds a PhD from Oxford and previously built peer-to-peer applications.

Brandon Houghton portrait
Researcher 1 reports

Brandon Houghton

OpenAI

Strategy and operations leader at OpenAI whose official profile highlights interests in learning, biology, and language.

Brendan Bosma portrait
Researcher 1 reports

Brendan Bosma

Google Gemini

Research engineer at Google DeepMind and PhD candidate at UC Berkeley working on multimodal understanding, reasoning, and agentic systems.

Brian Hawkins portrait
Researcher 1 reports

Brian Hawkins

Meta AI

Research scientist at Meta working on post-training, alignment, and reinforcement learning for language and multimodal models.

Brittany Carey portrait
Researcher 1 reports

Brittany Carey

OpenAI

Research scientist at OpenAI whose official profile highlights multimodal speech and audio.

Brooke Chan portrait
Researcher 1 reports

Brooke Chan

OpenAI

Software engineer at OpenAI. Previously studied electrical engineering and computer science at the University of California, Berkeley.

Casey Chu portrait
Researcher 1 reports

Casey Chu

OpenAI

OpenAI contributor credited on the GPT-4 technical report.

Chak Ming Li portrait
Researcher 1 reports

Chak Ming Li

OpenAI

Researcher focused on speech AI, computer vision, and audio-visual learning. He has worked at OpenAI and lists his publications and projects on his personal website.

Charles Sutton portrait
Researcher 1 reports

Charles Sutton

Google Gemini

Professor at Carnegie Mellon University whose public research spans machine learning and natural language processing; arXiv author results include the PaLM paper.

Che Chang portrait
Researcher 1 reports

Che Chang

OpenAI

Research scientist at OpenAI whose official profile highlights multimodal reasoning and agents.

Chelsea Carlson portrait
Researcher 1 reports

Chelsea Carlson

OpenAI

Research engineer at OpenAI working on AI agents and coding systems. Previously worked on data and product efforts at Stripe and studied computer science and philosophy at Harvard University.

Chengda Lu portrait
Researcher 1 reports

Chengda Lu

DeepSeek

Research scientist at DeepSeek with public work on language models and multimodal learning.

Chester Cho portrait
Researcher 1 reports

Chester Cho

OpenAI

Researcher at OpenAI working on reasoning, multimodal systems, and products that make AI more useful in everyday life.

Chris Carothers portrait
Researcher 1 reports

Chris Carothers

Cerebras Systems

Principal engineer at Cerebras working on systems for training large language models.

Christian Gibson portrait
Researcher 1 reports

Christian Gibson

OpenAI

Software engineer at OpenAI whose official profile focuses on speech and audio.

CJ Weinmann portrait
Researcher 1 reports

CJ Weinmann

OpenAI

Member of Technical Staff at OpenAI working on reliability engineering. Previously worked on software in cloud, AI, and hardware, and earned a BSE in computer systems engineering from Arizona State University.

Colin Kasson portrait
Researcher 1 reports

Colin Kasson

Amazon

Applied scientist at Amazon working on large language models, responsible AI, and evaluation for foundation model systems.

Colton Wei portrait
Researcher 1 reports

Colton Wei

Google Gemini

Google researcher whose publications include the PaLM 2 technical report.

Connor Rees portrait
Researcher 1 reports

Connor Rees

Meta AI

Research scientist at Meta working on reinforcement learning and generative models. His homepage highlights robotics, computer vision, and simulation, alongside work related to Llama 3.

Cory Decareaux portrait
Researcher 1 reports

Cory Decareaux

OpenAI

Engineering manager at OpenAI with a background in privacy and security engineering across biotech, payments, and advertising.

Crystal Nam portrait
Researcher 1 reports

Crystal Nam

Ai2

Member of Ai2's data team interested in data management systems and workflow tooling for research and model development.

Daisy Han portrait
Researcher 1 reports

Daisy Han

Google Gemini

Research scientist at Google Research focused on natural language processing and machine learning.

Daniel Kokotajlo portrait
Researcher 1 reports

Daniel Kokotajlo

OpenAI

Former OpenAI governance researcher and GPT-4 coauthor known for AI forecasting and alignment writing.

Daniel Selsam portrait
Researcher 1 reports

Daniel Selsam

OpenAI

Research scientist at OpenAI focused on machine learning and theorem proving.

Dave Willner portrait
Researcher 1 reports

Dave Willner

OpenAI

Investor and advisor at OpenAI; previously on the management team at Stripe.

David Farhi portrait
Researcher 1 reports

David Farhi

OpenAI

OpenAI contributor credited for business strategy and investor relations support on GPT-4.

David Golovin portrait
Researcher 1 reports

David Golovin

Google Gemini

Principal Scientist at Google DeepMind focused on machine learning systems and AI research.

Dawei Lu portrait
Researcher 1 reports

Dawei Lu

Cohere

Research scientist at Cohere whose public OpenReview profile lists interests in natural language processing, efficiency, and large language models.

Denny Jin portrait
Researcher 1 reports

Denny Jin

OpenAI

Researcher at OpenAI working on multimodal systems, post-training, and evaluations, with earlier work across 3D, computer vision, and embodied AI.

Denny Zhou portrait
Researcher 1 reports

Denny Zhou

Google Gemini

Research scientist at Google Research working on large language models, natural language processing, and multi-agent systems.

Dmitry Yarats portrait
Researcher 1 reports

Dmitry Yarats

NVIDIA

Research scientist at NVIDIA and assistant professor at NYU, working on machine learning, deep learning, and robotics.

Dustin Schwenk portrait
Researcher 1 reports

Dustin Schwenk

Ai2

Research scientist at Ai2 whose work focuses on open language models and multimodal systems.

Eli Collins portrait
Researcher 1 reports

Eli Collins

Google Gemini

Vice President of Product at Google DeepMind and head of the Google DeepMind product team.

Elie Georges portrait
Researcher 1 reports

Elie Georges

OpenAI

Works at OpenAI.

Elizabeth Proehl portrait
Researcher 1 reports

Elizabeth Proehl

OpenAI

OpenAI staff contributor whose public research profile covers public policy and frontier evaluations at OpenAI, following earlier operations leadership at OpenResearch.

Emilia Antonucci portrait
Researcher 1 reports

Emilia Antonucci

Google Gemini

Google researcher whose public profile lists work across machine learning, natural language processing, and generative AI.

Emilie Morand portrait
Researcher 1 reports

Emilie Morand

Cohere

Senior machine learning researcher at Cohere working on pretraining, post-training, and safety for large language models, according to her public OpenReview profile.

Emily Schwenk portrait
Researcher 1 reports

Emily Schwenk

Ai2

Final-year PhD candidate at the University of Washington whose public research focuses on evaluation, uncertainty, and multimodal grounding in language models; her arXiv author results include OLMo and OLMES papers.

Emma Hughes portrait
Researcher 1 reports

Emma Hughes

Google Gemini

Researcher working on open language models and multimodal systems, including Gemma.

Emy Parparita portrait
Researcher 1 reports

Emy Parparita

OpenAI

Research scientist at OpenAI focused on machine learning, multimodal world models, and representation learning.

Eric Rehm portrait
Researcher 1 reports

Eric Rehm

Technology Innovation Institute

Principal researcher in the Technology Innovation Institute's AI Cross-Center Unit focused on language model architectures, long-context LLMs, and vision-language models.

Erik Lindholm portrait
Researcher 1 reports

Erik Lindholm

NVIDIA

Researcher at NVIDIA working on computer architecture, machine learning, and large language models, including Nemotron-4.

Fahimeh Saleh portrait
Researcher 1 reports

Fahimeh Saleh

Cohere

Research scientist at Cohere working on reasoning, simulation, and agentic AI.

Feliciano Giannone portrait
Researcher 1 reports

Feliciano Giannone

Google Gemini

Research scientist at Google DeepMind in Cambridge working on large language models and generative AI.

Felipe Petroski Such portrait
Researcher 1 reports

Felipe Petroski Such

OpenAI

Member of Technical Staff at OpenAI focused on data quality, data curation, and training methods for frontier models.

Feng Hu portrait
Researcher 1 reports

Feng Hu

Alibaba Qwen

Researcher affiliated with Alibaba Group on Google Scholar and coauthor of the Qwen3 technical report.

Fnu Ruchi portrait
Researcher 1 reports

Fnu Ruchi

Amazon

Principal applied scientist in Amazon AGI whose publications include the Amazon Nova Premier technical report.

Fotis Chantzis portrait
Researcher 1 reports

Fotis Chantzis

OpenAI

Security engineer at OpenAI with a background in network and information security, including prior work at Nmap, Mayo Clinic, and CENSUS.

Francis Real portrait
Researcher 1 reports

Francis Real

OpenAI

Software engineer at OpenAI whose official profile focuses on data and evaluations for language models.

Gabriel Bernadett-Shapiro portrait
Researcher 1 reports

Gabriel Bernadett-Shapiro

OpenAI

Gabriel Bernadett-Shapiro is an OpenAI contributor listed on the OpenAI GPT-4 contributions page.

Gabriel Goh portrait
Researcher 1 reports

Gabriel Goh

OpenAI

Research scientist and writer at OpenAI who works on the societal impact and interpretability of advanced AI systems.

Gautier Izacard portrait
Researcher 1 reports

Gautier Izacard

Meta AI

Researcher whose public work focuses on retrieval, question answering, and large language models; arXiv author results include the LLaMA paper.

Gregory Piché portrait
Researcher 1 reports

Gregory Piché

Cohere

Researcher working on enterprise language models and multilingual systems, including Cohere Command A and Aya.

Guillem Cucurull portrait
Researcher 1 reports

Guillem Cucurull

Google Gemini

Research scientist at Google DeepMind focused on machine learning, optimization, computer vision, and AI for science.

Haiming Bao portrait
Researcher 1 reports

Haiming Bao

OpenAI

Haiming Bao is an OpenAI contributor listed on the OpenAI o1 contributions page.

Hamza Nassar portrait
Researcher 1 reports

Hamza Nassar

Cohere

Researcher working on enterprise and multilingual language models, including Cohere Command A.

Harish P V portrait
Researcher 1 reports

Harish P V

NVIDIA

Researcher at NVIDIA working on computer vision, machine learning, and 3D vision, including Nemotron-4.

Heather Schmidt portrait
Researcher 1 reports

Heather Schmidt

OpenAI

Product manager at OpenAI working on bringing AI models into products, after earlier product and go-to-market work at Spotify and Soundtrap.

Hobin Ryu portrait
Researcher 1 reports

Hobin Ryu

Meta AI

Research scientist working on generative AI and multimodal intelligence.

Hongrae Lee portrait
Researcher 1 reports

Hongrae Lee

Google Gemini

Google researcher whose public profile lists work across data management, distributed systems, data mining, human-computer interaction, and natural language processing.

Ishita Dasgupta portrait
Researcher 1 reports

Ishita Dasgupta

Meta AI

Research scientist at Meta FAIR working on language modeling, generalization, reasoning, and AI agents.

Jade Leung portrait
Researcher 1 reports

Jade Leung

OpenAI

Jade Leung leads OpenAI's work on frontier model policy, governance, and partnerships. She was formerly a Governance and Public Policy fellow at the University of Oxford.

Jakub Pachocki portrait
Researcher 1 reports

Jakub Pachocki

OpenAI

Chief Scientist at OpenAI. Previously served as Director of Research at OpenAI and studied computer science at the University of Warsaw.

James Pennington portrait
Researcher 1 reports

James Pennington

Google Gemini

Research scientist at Google DeepMind working on large language models, multimodal models, and language design.

Janny Hong portrait
Researcher 1 reports

Janny Hong

Cohere

Research scientist at Cohere with research interests in computer vision, multimodal learning, and model editing.

Jared Chow portrait
Researcher 1 reports

Jared Chow

Google Gemini

Google researcher whose publications include the PaLM 2 technical report.

Jared Quincy Davis portrait
Researcher 1 reports

Jared Quincy Davis

Cohere

Research scientist at Cohere working on language models, post-training, and evaluation.

Jasmine Hall portrait
Researcher 1 reports

Jasmine Hall

Google Gemini

Staff research engineer at Google DeepMind working on language models and technical infrastructure.

Jason Chen portrait
Researcher 1 reports

Jason Chen

OpenAI

Software engineer at OpenAI whose official profile highlights reinforcement learning, robotics, and AI for health.

Jason Lee portrait
Researcher 1 reports

Jason Lee

Cohere

Researcher working on large language models and enterprise NLP systems, including Cohere Command A.

Jeff Belgum portrait
Researcher 1 reports

Jeff Belgum

OpenAI

Jeff Belgum is an OpenAI contributor listed on the DALL·E 2 preview contributions page.

Jeffrey Ritter portrait
Researcher 1 reports

Jeffrey Ritter

Google Gemini

Staff software engineer at Google DeepMind interested in systems and applied machine learning.

Jeremiah Currier portrait
Researcher 1 reports

Jeremiah Currier

OpenAI

Founding engineer at Galactica and former staff software engineer at OpenAI.

Jesse Han portrait
Researcher 1 reports

Jesse Han

OpenAI

Research scientist at OpenAI whose official profile focuses on multimodal speech.

Jian Cai portrait
Researcher 1 reports

Jian Cai

NVIDIA

Researcher working on large language models and multimodal systems, including NVIDIA Nemotron-4.

Jiawei Wang portrait
Researcher 1 reports

Jiawei Wang

DeepSeek

Research scientist at DeepSeek with public GitHub projects spanning language models and AI systems.

Ji Lin portrait
Researcher 1 reports

Ji Lin

Google Gemini

Research scientist at Google working on efficient AI models for mobile devices and ML systems, with interests in systems, algorithms, and co-design.

Jingchang Chen portrait
Researcher 1 reports

Jingchang Chen

DeepSeek

Researcher at DeepSeek focused on large language models, multimodal systems, and AI agents.

João Monteiro portrait
Researcher 1 reports

João Monteiro

Cohere

Research scientist at Cohere Labs and technical lead of the multimodal team.

Joel Parish portrait
Researcher 1 reports

Joel Parish

OpenAI

OpenAI contributor focused on security applications of large language models; public author and event pages identify Joel Parish as a security staff member at OpenAI.

Joe Palermo portrait
Researcher 1 reports

Joe Palermo

OpenAI

AI research engineer at OpenAI focused on reinforcement learning, and co-author of the GPT-4 Technical Report.

Jonathan Tow portrait
Researcher 1 reports

Jonathan Tow

Cohere

Jonathan Tow is a research scientist at Cohere whose public profile highlights work on efficient large language model adaptation and finetuning. He is known as the lead author of QLoRA and works on practical methods for training and aligning language models.

Jonathan Ward portrait
Researcher 1 reports

Jonathan Ward

OpenAI

Research scientist at OpenAI working on general intelligence; previously studied at Princeton University and Williams College.

Jong Wook Kim portrait
Researcher 1 reports

Jong Wook Kim

OpenAI

Member of technical staff at OpenAI who worked on CLIP and Whisper; previously a PhD student in computer science at NYU.

Jordan Sitkin portrait
Researcher 1 reports

Jordan Sitkin

OpenAI

Founding member of OpenAI's alignment team. Public profile notes prior software engineering work at Google.

Joshua A. Gonzalez portrait
Researcher 1 reports

Joshua A. Gonzalez

Google Gemini

Researcher working on large language model training and evaluation, including PaLM 2.

Joshua Gross portrait
Researcher 1 reports

Joshua Gross

OpenAI

Product and engineering leader who worked on GPT-4, ChatGPT Enterprise, and advanced data analysis at OpenAI. He later became CTO of Khan Academy.

Joshua Maynez portrait
Researcher 1 reports

Joshua Maynez

Google Gemini

Research scientist at Google working on making language models more factual and controllable.

Justin Gu portrait
Researcher 1 reports

Justin Gu

Google Gemini

Research scientist at Google DeepMind working on natural language processing and machine learning systems.

Justin Wang portrait
Researcher 1 reports

Justin Wang

Google Gemini

Google researcher whose publications include the PaLM 2 technical report.

Kai Hu portrait
Researcher 1 reports

Kai Hu

DeepSeek

Research scientist at DeepSeek with public scholarly work on large language models and reasoning.

Kai Xiao portrait
Researcher 1 reports

Kai Xiao

01.AI

Senior research scientist and research lead at 01.AI; previously worked at Microsoft on language and search systems.

Karan Ahuja portrait
Researcher 1 reports

Karan Ahuja

Google Gemini

Research scientist at Google DeepMind in Seattle.

Karen Simonyan portrait
Researcher 1 reports

Karen Simonyan

NVIDIA

Research scientist at NVIDIA working on computer vision, deep learning, and reinforcement learning.

Karishma Prasad portrait
Researcher 1 reports

Karishma Prasad

Ai2

Research engineer at Ai2 working on open language models, machine learning systems, and evaluation.

Kasra Abbasi portrait
Researcher 1 reports

Kasra Abbasi

Snowflake

Researcher working on enterprise large language models, including Snowflake Arctic.

Katherine Hui-Ling Chiu portrait
Researcher 1 reports

Katherine Hui-Ling Chiu

Google Gemini

Research engineer at Google DeepMind whose research interests include computer vision, geometric deep learning, and generative models.

Kautuva Huang portrait
Researcher 1 reports

Kautuva Huang

Google Gemini

Research scientist at Google DeepMind working on natural language generation, multimodality, and on-device machine learning.

Kelsey Kuan portrait
Researcher 1 reports

Kelsey Kuan

Meta AI

Machine learning researcher at Meta focused on Llama model adaptation, post-training, and data-centric learning.

Kendra Rimbach portrait
Researcher 1 reports

Kendra Rimbach

OpenAI

Staff Research Engineer at OpenAI working on language modeling and reasoning.

Kenny Hsu portrait
Researcher 1 reports

Kenny Hsu

OpenAI

Research scientist at OpenAI focused on multimodal language and large language model alignment.

Kevin Gu portrait
Researcher 1 reports

Kevin Gu

Ai2

Research scientist at Ai2 working on large language models, machine learning with human feedback, and related topics; previously at Stanford and MIT.

Kevin Stone portrait
Researcher 1 reports

Kevin Stone

Meta AI

Research scientist at Meta FAIR working on multimodal and speech-language AI; his personal site highlights projects including the Large Concept Model family.

Kuai Yu portrait
Researcher 1 reports

Kuai Yu

DeepSeek

Research scientist at DeepSeek with public work on large language models and reasoning.

Kyle Kosic portrait
Researcher 1 reports

Kyle Kosic

OpenAI

Member of technical staff at OpenAI.

Lama Ahmad portrait
Researcher 1 reports

Lama Ahmad

OpenAI

Vice President of Global Affairs at OpenAI and a co-author of the GPT-4 Technical Report.

Lauren Workman portrait
Researcher 1 reports

Lauren Workman

OpenAI

Lauren Workman is a research scientist at OpenAI working on deep learning, robotics, and multimodal AI. Her official profile lists contributions across GPT-4.1, o3 and o4-mini, GPT-4o, robotics, and DALL-E 3.

Leandro Almeida portrait
Researcher 1 reports

Leandro Almeida

NVIDIA

Research scientist at NVIDIA working on machine learning, natural language processing, and vision-language models, including the NVLM project.

Leonard Berrada portrait
Researcher 1 reports

Leonard Berrada

Google Gemini

Research scientist at Google DeepMind in London working on efficient, robust, and reasoning-capable language models. His homepage also notes interests in language model alignment and a mathematics background from Cambridge.

Lifu Tu portrait
Researcher 1 reports

Lifu Tu

Google Gemini

Research scientist at Google Research working on large language models, natural language understanding, and reasoning.

Lilit Yasakova portrait
Researcher 1 reports

Lilit Yasakova

Amazon

Researcher with public work on fairness, robustness, and language technologies. Public profiles connect her research to both UNC Charlotte and Amazon.

Luca Muttini portrait
Researcher 1 reports

Luca Muttini

Amazon

Applied scientist at Amazon working on large language models, multimodal AI, and audio generation.

Lucas Atkins portrait
Researcher 1 reports

Lucas Atkins

Cohere

Research scientist at Cohere working on machine learning and natural language processing, including the Command A model family.

Łukasz Kaiser portrait
Researcher 1 reports

Łukasz Kaiser

OpenAI

Research scientist and engineer whose public work spans deep learning, language models, and efficient inference.

Maddie Simens portrait
Researcher 1 reports

Maddie Simens

OpenAI

Member of Technical Staff at OpenAI with interests spanning AI for education, machine learning, and human-computer interaction.

Marc Greenberg portrait
Researcher 1 reports

Marc Greenberg

Amazon

Researcher working on foundation models and agentic systems, including Amazon Nova Premier.

Mario Saltarelli portrait
Researcher 1 reports

Mario Saltarelli

OpenAI

Product designer at OpenAI whose official profile explores human-computer interaction.

Marta Garnelo portrait
Researcher 1 reports

Marta Garnelo

Google Gemini

Research scientist at Google DeepMind whose public profiles describe work in machine learning and generative modeling, with a personal homepage covering neural networks, variational inference, and program synthesis.

Matthew Botvinick portrait
Researcher 1 reports

Matthew Botvinick

Google Gemini

Matthew Botvinick is a Google DeepMind scientist whose public profile focuses on neuroscience-inspired AI, cognitive science, and machine learning.

Matthias Lampe portrait
Researcher 1 reports

Matthias Lampe

Google Gemini

Research scientist at Google working on large-scale language models and natural language processing.

Matt Knight portrait
Researcher 1 reports

Matt Knight

OpenAI

Technical and creative leader who has worked across games, startups, and AI products. He has held engineering and product roles at OpenAI and previously worked in the Los Angeles games industry.

Matt Wiethoff portrait
Researcher 1 reports

Matt Wiethoff

OpenAI

Member of technical staff at OpenAI whose official profile focuses on deep learning, robotics, and speech.

Melanie Subbiah portrait
Researcher 1 reports

Melanie Subbiah

OpenAI

Melanie Subbiah is an independent AI researcher based in New York City. Her work focuses on language and multimodal AI, especially commonsense reasoning and communicative behavior.

Mia Xu Chen portrait
Researcher 1 reports

Mia Xu Chen

Google Gemini

Research scientist at Google Research working on machine learning and natural language processing.

Michael Kasner portrait
Researcher 1 reports

Michael Kasner

Google Gemini

Research scientist at Google DeepMind working on AI for language.

Michael Schmitz portrait
Researcher 1 reports

Michael Schmitz

Ai2

Research scientist at Ai2 whose interests include natural language processing, machine learning, and computer vision.

Miguel Larrauri Iñigo portrait
Researcher 1 reports

Miguel Larrauri Iñigo

Google Gemini

Research engineer at Google DeepMind working on large language models and generative AI.

Mike Ranzinger portrait
Researcher 1 reports

Mike Ranzinger

Cohere

Research scientist at Cohere focused on language modeling, post-training, and multimodality.

Mikhail Chernyshev portrait
Researcher 1 reports

Mikhail Chernyshev

Meta AI

Mikhail Chernyshev is a systems engineer focused on efficient distributed systems for large-scale deep learning, spanning compiler stacks, networking, operating systems, and hardware.

Mirko Tuset portrait
Researcher 1 reports

Mirko Tuset

Amazon

Applied scientist in Amazon AGI whose publications include the Amazon Nova Premier technical report.

Mohammad Shoeybi portrait
Researcher 1 reports

Mohammad Shoeybi

Amazon

Senior principal scientist at Amazon AGI focusing on efficient training and inference of foundation models, large language models, and multimodal models.

Morgan Grafstein portrait
Researcher 1 reports

Morgan Grafstein

OpenAI

Morgan Grafstein is a software engineer and artist at OpenAI. His personal site says his work spans hardware, software, and rendering.

Mourad Si Bac portrait
Researcher 1 reports

Mourad Si Bac

NVIDIA

Director of engineering at NVIDIA working on machine learning, computer vision, and agentic AI systems.

Mrinal Kalakrishnan portrait
Researcher 1 reports

Mrinal Kalakrishnan

Google Gemini

Research scientist at Google DeepMind working on robot learning, real-world robotics, and reinforcement learning.

Mukul Agrawal portrait
Researcher 1 reports

Mukul Agrawal

NVIDIA

Research scientist at NVIDIA working on training large language and multimodal models.

Mustafa Hajij portrait
Researcher 1 reports

Mustafa Hajij

Ai2

Mustafa Hajij is a research scientist at Ai2 and an adjunct professor in the Department of Computer Science at the University of Southern Maine. His research spans graph machine learning, geometric learning, and applied mathematics.

Namit Katariya portrait
Researcher 1 reports

Namit Katariya

Meta AI

Namit Katariya is a computer science PhD student at Stanford University. His interests include large-scale machine learning, natural language processing, ranking, and recommendation systems.

Neeraj Gaur portrait
Researcher 1 reports

Neeraj Gaur

Google Gemini

Staff software engineer at Google specializing in machine learning model development and performance optimization for TPUs and other accelerators.

Nenad Tomasev portrait
Researcher 1 reports

Nenad Tomasev

Google Gemini

Research scientist and research director at Google DeepMind working on machine learning for health, sustainability, and human-centered AI.

Ngoc-Quan Pham portrait
Researcher 1 reports

Ngoc-Quan Pham

Cohere

Ngoc-Quan Pham is a research scientist at Cohere whose work spans large language models, natural language generation and understanding, multimodal learning, continual learning, and transfer learning.

Niket Tandon portrait
Researcher 1 reports

Niket Tandon

Snowflake

Niket Tandon is a Research Scientist at Snowflake AI Research whose work focuses on natural language processing, reasoning, and language models.

Nikhil Chandra portrait
Researcher 1 reports

Nikhil Chandra

NVIDIA

Researcher at NVIDIA working on computer vision and machine learning, including large language model work such as Nemotron-4.

Niko Felix portrait
Researcher 1 reports

Niko Felix

OpenAI

Engineer at OpenAI whose official profile focuses on compute systems and inference.

Nikolas Nado portrait
Researcher 1 reports

Nikolas Nado

Google Gemini

Research Scientist at Google DeepMind focused on post-training large language models and evaluation.

Nikolas Tezak portrait
Researcher 1 reports

Nikolas Tezak

OpenAI

OpenAI contributor credited on the GPT-4 technical report; his personal website describes interests in machine learning, software engineering, and data visualization.

Nikolay Bashlykov portrait
Researcher 1 reports

Nikolay Bashlykov

Meta AI

Research engineer whose public work spans language models and multimodal systems and whose arXiv author listing includes the Llama 2 paper.

Nishank Kedia portrait
Researcher 1 reports

Nishank Kedia

NVIDIA

NVIDIA researcher with a Google Scholar profile that includes the Nemotron-4 340B technical report.

Nitish Shirish Keskar portrait
Researcher 1 reports

Nitish Shirish Keskar

OpenAI

Member of Technical Staff at OpenAI and co-author of the GPT-4 Technical Report.

Noah Shazeer portrait
Researcher 1 reports

Noah Shazeer

Google Gemini

Google researcher and engineer best known as a co-inventor of the Transformer and a contributor to systems including T5, LaMDA, and Gemini.

Oleg Boiko portrait
Researcher 1 reports

Oleg Boiko

OpenAI

Oleg Boiko is an OpenAI contributor listed on the OpenAI GPT-4 contributions page.

Oleg Murk portrait
Researcher 1 reports

Oleg Murk

OpenAI

Research scientist at OpenAI and contributor to the o1 reasoning launch.

Pablo Foradada portrait
Researcher 1 reports

Pablo Foradada

Technology Innovation Institute

Researcher in the Technology Innovation Institute's AI Cross-Center Unit focused on large language models, multimodality, and visual representation learning.

Parker Schuh portrait
Researcher 1 reports

Parker Schuh

Google Gemini

Researcher working on large language model scaling and evaluation, including Google PaLM.

Paul McMillan portrait
Researcher 1 reports

Paul McMillan

OpenAI

Software engineer at OpenAI whose official profile focuses on multimodal models.

Pavankumar Balachandran portrait
Researcher 1 reports

Pavankumar Balachandran

NVIDIA

Research scientist at NVIDIA working on large language model training and machine learning systems.

Petar Veličković portrait
Researcher 1 reports

Petar Veličković

Google Gemini

Petar Veličković is a Senior Staff Research Scientist at Google DeepMind and an affiliated lecturer at Cambridge. His research explores building general-purpose reasoning agents by combining inference, search, and learning.

Peter Albert portrait
Researcher 1 reports

Peter Albert

Meta AI

Postdoctoral researcher at FAIR with interests in multilingual and multimodal language models, including work associated with Llama 2.

Peter Hoeschele portrait
Researcher 1 reports

Peter Hoeschele

OpenAI

OpenAI infrastructure leader focused on industrial compute, large-scale AI systems, and strategic buildout of compute and energy capacity.

J. Michael Pokorny portrait
Researcher 1 reports

J. Michael Pokorny

OpenAI

J. Michael Pokorny is a software engineer at OpenAI focused on inference. His official profile lists contributions to GPT-4.1, the Realtime API, API cost optimizations, GPT-4o, and gpt-oss.

Praneet Dutta portrait
Researcher 1 reports

Praneet Dutta

Cohere

Praneet Dutta is a researcher affiliated with Cohere whose publications focus on large language models and related model development work.

Pratik Chaudhari portrait
Researcher 1 reports

Pratik Chaudhari

NVIDIA

Associate Professor at the University of Pennsylvania working on machine learning, optimization, and large-scale AI systems.

Praveen Kumar Challa portrait
Researcher 1 reports

Praveen Kumar Challa

Cerebras Systems

Praveen Kumar Challa is a software engineer at Cerebras Systems whose interests include machine learning, large language models, and efficient systems.

P. S. Huang portrait
Researcher 1 reports

P. S. Huang

Cohere

Researcher currently at UC Berkeley whose homepage notes prior work at Cohere on pretraining and post-training for large language models.

Puja Pudipeddi portrait
Researcher 1 reports

Puja Pudipeddi

Google Gemini

Research scientist at Google working on speech, language, and multimodal machine learning systems.

Qiming Yuan portrait
Researcher 1 reports

Qiming Yuan

OpenAI

Scientist and engineer at OpenAI working to build AGI that benefits all of humanity.

Quoc Le portrait
Researcher 1 reports

Quoc Le

Google Gemini

Research scientist at Google Research known for large-scale deep learning and language models, whose arXiv author listing includes the PaLM 2 technical report.

Rachel Lim portrait
Researcher 1 reports

Rachel Lim

OpenAI

Rachel Lim is a technical staff manager at OpenAI focused on speech and audio. Her official profile lists contributions across GPT-4.1, ChatGPT search, deep research, the audio models in the API, GPT-4o mini, GPT-4.5, and GPT-4.

Radek Strope portrait
Researcher 1 reports

Radek Strope

Google Gemini

Research scientist at Google DeepMind interested in large language models, efficient transformers, AI for coding, and AI for materials (AI4Mat).

Rajeev Nayak portrait
Researcher 1 reports

Rajeev Nayak

OpenAI

Research scientist at OpenAI working across computer vision, robotics, and language.

Ramesh Nallapati portrait
Researcher 1 reports

Ramesh Nallapati

Google Gemini

Research scientist at Google Research focused on machine learning and natural language processing, including text generation and summarization.

Rami Globerson portrait
Researcher 1 reports

Rami Globerson

Google Gemini

Research scientist at Google Research and professor at Tel Aviv University whose work spans machine learning, optimization, and natural language processing.

Rashmi Gangadhar portrait
Researcher 1 reports

Rashmi Gangadhar

NVIDIA

Research scientist at NVIDIA working on large language models and natural language processing.

Rewon Child portrait
Researcher 1 reports

Rewon Child

OpenAI

Rewon Child is a machine learning researcher whose public homepage notes previous work at OpenAI and current work on developer tools. His public research profile highlights sparse transformers and generative modeling.

Rishabh Joshi portrait
Researcher 1 reports

Rishabh Joshi

Google Gemini

Google researcher whose publications include the PaLM 2 technical report.

Robin Sikand portrait
Researcher 1 reports

Robin Sikand

Google Gemini

Robin Sikand is a machine learning researcher at Google working mostly on large language models.

Rosie Campbell portrait
Researcher 1 reports

Rosie Campbell

OpenAI

DPhil student in robotics and AI at the University of Oxford and Rhodes Scholar interested in the safe deployment of autonomous systems.

Ruby Chen portrait
Researcher 1 reports

Ruby Chen

OpenAI

Software engineer at OpenAI whose official profile focuses on speech and audio.

Ruibin Chen portrait
Researcher 1 reports

Ruibin Chen

Tencent Hunyuan

Researcher at Tencent Hunyuan with public publications on large language models, multimodal systems, and efficient post-training, including Hunyuan-Large, LLaVA-Video-R1, and Rest-R1.

Ryan Greene portrait
Researcher 1 reports

Ryan Greene

OpenAI

Chief Global Affairs Officer at OpenAI.

Sam Manning portrait
Researcher 1 reports

Sam Manning

OpenAI

Systems engineer at OpenAI working on production AI systems.

Sam Skjonsberg portrait
Researcher 1 reports

Sam Skjonsberg

Ai2

Research engineer at Ai2 working on open language models, post-training, evaluations, and data.

Samuel Wolrich portrait
Researcher 1 reports

Samuel Wolrich

OpenAI

PhD student in EECS at UC Berkeley advised by David Wagner and Jacob Steinhardt, with research focused on trustworthy machine learning and AI safety.

Sana Saxena portrait
Researcher 1 reports

Sana Saxena

Meta AI

Research scientist at Meta working on large language models, foundation models, and reasoning.

Sarah Collier portrait
Researcher 1 reports

Sarah Collier

Google Gemini

Research scientist at Google DeepMind in Mountain View focused on AI red teaming, post-deployment risk management, and language model evaluation. She previously worked in cyber policy and software engineering and earned a PhD from the NYU Center for Data Science.

Sarah Shoker portrait
Researcher 1 reports

Sarah Shoker

OpenAI

Researcher whose public work focuses on AI safety, alignment, and governance.

Sasha Tsvyashchenko portrait
Researcher 1 reports

Sasha Tsvyashchenko

Google Gemini

Research scientist at Google DeepMind working on large language models and artificial intelligence, based on his public OpenReview profile.

Sean K. R. Tan portrait
Researcher 1 reports

Sean K. R. Tan

Meta AI

Incoming assistant professor in Carnegie Mellon's School of Computer Science and current research scientist at Meta AI, working on long-context language models and reasoning.

Sean Narenthiran portrait
Researcher 1 reports

Sean Narenthiran

Cohere

Researcher at Cohere working on machine learning and natural language processing, including the Command A model family.

Shanghang Zhang portrait
Researcher 1 reports

Shanghang Zhang

01.AI

Associate Professor at Peking University and Principal Researcher at 01.AI, working on computer vision and embodied AI, with work that includes Yi-Lightning.

Shawn Jain portrait
Researcher 1 reports

Shawn Jain

OpenAI

Researcher at OpenAI focused on reasoning, agency, and post-training to make AI systems more useful.

Sheila Dunning portrait
Researcher 1 reports

Sheila Dunning

OpenAI

Sheila Dunning is an operations leader at OpenAI whose official profile focuses on helping research, governance, and safety teams work effectively. Her public contributions page lists work across GPT-4.1, the Preparedness Framework, safety and policy efforts, and GPT-4.

Shuang Luo portrait
Researcher 1 reports

Shuang Luo

Alibaba Qwen

Researcher affiliated with Alibaba Group on Google Scholar and coauthor of the Qwen3 technical report.

Sibylle Hess portrait
Researcher 1 reports

Sibylle Hess

Google Gemini

Senior research scientist at Google DeepMind in Zurich working on the Gemini team.

Sinan Tan portrait
Researcher 1 reports

Sinan Tan

Alibaba Qwen

Research scientist on the Alibaba Qwen Team and co-author of the Qwen Technical Report.

Siva Reddy portrait
Researcher 1 reports

Siva Reddy

Snowflake

Researcher whose public work focuses on natural language processing, reasoning, and machine learning.

Soroush GholamZadeh portrait
Researcher 1 reports

Soroush GholamZadeh

Mistral AI

Research scientist at Mistral AI.

Spencer Kinney portrait
Researcher 1 reports

Spencer Kinney

Ai2

Research scientist at Ai2 whose research interests include language grounding and visual understanding.

Sreekanth K. Mandava portrait
Researcher 1 reports

Sreekanth K. Mandava

NVIDIA

Research scientist at NVIDIA with interests in machine learning systems, deep learning, and distributed systems.

Stéphane Albinet portrait
Researcher 1 reports

Stéphane Albinet

Google Gemini

Staff research engineer at Google DeepMind in London working on the Gemini team.

Stephanie Lin portrait
Researcher 1 reports

Stephanie Lin

OpenAI

Research scientist at OpenAI working on reasoning, code generation, and interpretability.

Sujith Raghuraman portrait
Researcher 1 reports

Sujith Raghuraman

Google Gemini

Research scientist at Google DeepMind working on machine learning and large language models.

Sumanth Doddapaneni portrait
Researcher 1 reports

Sumanth Doddapaneni

Databricks

Member of technical staff at Databricks working on open source language models.

Szymon Sidor portrait
Researcher 1 reports

Szymon Sidor

OpenAI

Engineer at OpenAI focused on software and hardware systems.

Tabarak Khan portrait
Researcher 1 reports

Tabarak Khan

OpenAI

Member of technical staff at OpenAI and co-author of the GPT-4 Technical Report.

Tarun Gogineni portrait
Researcher 1 reports

Tarun Gogineni

OpenAI

Research scientist at OpenAI working on AI safety; previously a graduate student at UC Berkeley studying machine learning and neural signal processing.

Taylor Dai portrait
Researcher 1 reports

Taylor Dai

Cohere

Research scientist at Cohere focused on long-context language models and efficient training.

Teddy Lee portrait
Researcher 1 reports

Teddy Lee

OpenAI

Member of technical staff at OpenAI whose official profile highlights multimodal models, audio, and video.

Theresa Lopez portrait
Researcher 1 reports

Theresa Lopez

OpenAI

Software engineer at OpenAI.

Thibault Biervoye portrait
Researcher 1 reports

Thibault Biervoye

Google Gemini

Thibault Biervoye is a Research Scientist at Google DeepMind focused on large language models, memory-augmented transformers, and knowledge transfer.

Thibaut Lavril portrait
Researcher 1 reports

Thibaut Lavril

Meta AI

Research scientist at Meta working on multimodal foundation models and open-source AI. His homepage says he leads multimodal efforts at Meta AI and contributes to open-source model releases.

Thomas Cho portrait
Researcher 1 reports

Thomas Cho

NVIDIA

NVIDIA researcher with a Google Scholar profile that includes the Nemotron-4 340B technical report.

Tianyi Zhang portrait
Researcher 1 reports

Tianyi Zhang

Cohere

Researcher at Cohere in Toronto working on large language models.

Timothy I. Wang portrait
Researcher 1 reports

Timothy I. Wang

Google Gemini

Machine learning researcher and engineer working on large-scale AI systems, and a co-author of the PaLM 2 technical report.

Tobias Fevry portrait
Researcher 1 reports

Tobias Fevry

Amazon

Senior applied scientist at Amazon AGI focused on multimodal deep learning and computer vision, especially video generation, diffusion, and representation learning.

Toki Sherbakov portrait
Researcher 1 reports

Toki Sherbakov

OpenAI

OpenAI researcher and software engineer based in San Francisco, and co-author of the GPT-4 Technical Report.

Tom Bosc portrait
Researcher 1 reports

Tom Bosc

Mistral AI

Research scientist at Mistral AI working on efficient training and low-latency inference for large models.

Tomer Ronen portrait
Researcher 1 reports

Tomer Ronen

Cohere

Tomer Ronen is a research scientist at Cohere Labs whose work focuses on efficient training and fine-tuning of large language models, natural language processing, and applications in genomics and biology.

Tongzheng Ren portrait
Researcher 1 reports

Tongzheng Ren

DeepSeek

Research scientist at DeepSeek working on deep learning, large language models, multimodal systems, and trustworthy machine learning.

Ty Dunn portrait
Researcher 1 reports

Ty Dunn

Ai2

Research scientist at Ai2 working on efficient and transparent large language models.

Valerie Balcom portrait
Researcher 1 reports

Valerie Balcom

OpenAI

Senior researcher at OpenAI whose official profile focuses on AI robustness and the future of compute.

Victoria McGough portrait
Researcher 1 reports

Victoria McGough

Google Gemini

Google researcher whose publications include the Gemma 2 technical report.

William Zhuk portrait
Researcher 1 reports

William Zhuk

OpenAI

William Zhuk is an OpenAI contributor credited on the GPT-4 Technical Report and listed on the OpenAI GPT-4 contributions page.

Xiaokang Zhang portrait
Researcher 1 reports

Xiaokang Zhang

DeepSeek

Researcher and co-author of the DeepSeek-V3 Technical Report.

Xiao Wang portrait
Researcher 1 reports

Xiao Wang

Moonshot AI

Xiao Wang is a Moonshot AI contributor and co-author of Kimi k1.5: Scaling Reinforcement Learning with LLMs.

Xinyuan Li portrait
Researcher 1 reports

Xinyuan Li

DeepSeek

Developer and researcher with public GitHub activity on DeepSeek-AI repositories including DeepSeek-V3 and DeepEP, plus coauthor credit on the DeepSeek-V3 technical report.

Xinyu Zhou portrait
Researcher 1 reports

Xinyu Zhou

01.AI

Researcher working on language model efficiency and reasoning, including Yi-Lightning.

Xi Victoria Lin portrait
Researcher 1 reports

Xi Victoria Lin

Cohere

Research scientist at Cohere and associate at Harvard Business School whose work focuses on natural language processing and multimodal machine learning.

Xixuan Song portrait
Researcher 1 reports

Xixuan Song

Moonshot AI

Xixuan Song is a Moonshot AI contributor and co-author of Kimi k1.5: Scaling Reinforcement Learning with LLMs.

Yaliang Li portrait
Researcher 1 reports

Yaliang Li

01.AI

Senior research scientist and research lead at 01.AI; previously a senior principal researcher and research manager at Microsoft Research.

Yanfei Huang portrait
Researcher 1 reports

Yanfei Huang

Google Gemini

Software engineer at Google DeepMind interested in machine learning, applied mathematics, and scalable systems.

Yifan Shi portrait
Researcher 1 reports

Yifan Shi

DeepSeek

Member of Technical Staff at DeepSeek.

Yilong Zhao portrait
Researcher 1 reports

Yilong Zhao

DeepSeek

Researcher affiliated with DeepSeek on Google Scholar and coauthor of the DeepSeek-V2 technical report.

Yingtao Zhang portrait
Researcher 1 reports

Yingtao Zhang

Moonshot AI

Yingtao Zhang is a Moonshot AI contributor and co-author of Kimi k1.5: Scaling Reinforcement Learning with LLMs.

Ying Zhang portrait
Researcher 1 reports

Ying Zhang

NVIDIA

Ying Zhang is a senior applied research scientist on NVIDIA's foundation model team.

Yining Wang portrait
Researcher 1 reports

Yining Wang

Cohere

Researcher working on large language models and enterprise NLP systems, including Cohere Command A.

Yiping Cao portrait
Researcher 1 reports

Yiping Cao

Google Gemini

Researcher working on large language model training and evaluation, including PaLM 2.

Yixin Dong portrait
Researcher 1 reports

Yixin Dong

DeepSeek

Final-year PhD student at Peking University and research intern at DeepSeek AI, focusing on machine learning and large language models.

Yi Yu portrait
Researcher 1 reports

Yi Yu

DeepSeek

Researcher affiliated with DeepSeek-AI on Google Scholar and coauthor of the DeepSeek-V3 technical report.

Youlong Cheng portrait
Researcher 1 reports

Youlong Cheng

Google Gemini

Software engineer at Google DeepMind and PhD student at UCLA focusing on efficient large language models.

Yuanzhong Xu portrait
Researcher 1 reports

Yuanzhong Xu

Google Gemini

Research scientist at Google working on large language models, AI agents, systems for machine learning, and optimization.

Yuchen He portrait
Researcher 1 reports

Yuchen He

OpenAI

Yuchen He is a technical staff manager at OpenAI focused on speech and audio. His official profile lists contributions across GPT-4.1, the audio models in the API, Sora, GPT-4, DALL-E 3, and Whisper.

Yue Wu portrait
Researcher 1 reports

Yue Wu

Google Gemini

Research scientist at Google Research focused on language and multimodal models, with interests spanning natural language processing, computer vision, and speech.

Yulia Tsvetkov portrait
Researcher 1 reports

Yulia Tsvetkov

Ai2

Associate professor at Carnegie Mellon University whose public research spans NLP and machine learning and whose arXiv author listing includes OLMo and related open language model work.

Yuqing Cao portrait
Researcher 1 reports

Yuqing Cao

Google Gemini

Research scientist at Google DeepMind working on post-training, alignment, and multilinguality.

Yury Zemlyanskiy portrait
Researcher 1 reports

Yury Zemlyanskiy

Google Gemini

Staff Research Scientist at Google DeepMind working on foundation models for language, speech, and multimodal systems.

Yutao Zhou portrait
Researcher 1 reports

Yutao Zhou

Moonshot AI

Applied research scientist at Moonshot AI Labs working on large language model reasoning and multimodal systems.

Yu Wu portrait
Researcher 1 reports

Yu Wu

DeepSeek

AI researcher at DeepSeek focused on reasoning and large language models, according to his public homepage and his author credits on DeepSeek papers.

Zejiang Shen portrait
Researcher 1 reports

Zejiang Shen

Ai2

Zejiang Shen is a research scientist at the Allen Institute for AI whose interests include representation learning, generative AI, and multimodal learning.

Zekun Wang portrait
Researcher 1 reports

Zekun Wang

Alibaba Qwen

Research Scientist at Alibaba Group, Qwen Team.

Zhaofeng Wu portrait
Researcher 1 reports

Zhaofeng Wu

Ai2

PhD student in computer science at Stanford University working on natural language processing and machine learning.

Zhibin Gou portrait
Researcher 1 reports

Zhibin Gou

DeepSeek

Researcher and co-author of the DeepSeek-V3 Technical Report.

Zhifu Wu portrait
Researcher 1 reports

Zhifu Wu

DeepSeek

DeepSeek researcher and coauthor of the DeepSeek-V3 technical report.

Zhigang Yan portrait
Researcher 1 reports

Zhigang Yan

DeepSeek

Researcher affiliated with DeepSeek AI on Google Scholar and coauthor of the DeepSeek-V3 technical report.

Zhiyue Fan portrait
Researcher 1 reports

Zhiyue Fan

Cohere

Member of technical staff at Cohere Labs working on reinforcement learning and post-training for language models and agents.

Zora Tung portrait
Researcher 1 reports

Zora Tung

Google Gemini

Zora Tung is a Research Scientist at Google DeepMind.