Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

OpenAI's GPT-4 contributions page credits Ben Wang as attention architecture lead for long context. Public profiles identify him as a University of Pennsylvania undergraduate and an OpenAI researcher from 2021 to 2022.

Public report authorship links Jason Wei to the Gemma 3n Technical Report at Google.

Zhifeng Chen's public homepage describes him as a distinguished software engineer at Google Brain focused on large-scale computer systems and machine learning applications.

James Manyika is a Google leader whose public work focuses on research, technology, and society.

Public report authorship links Brennan Saeta to the Gemma 2: Improving Open Language Models at a Practical Size at Google.

Research Director at Google working on machine learning, production systems, and sociotechnical AI.

Google Gemini report author listed on Gemini, Gemini 1.5, RecurrentGemma, and CodeGemma technical reports, with report-backed work on multimodal models, long-context models, efficient architectures, and code models.

Rohan Anil is a research scientist at Google DeepMind. His public homepage highlights work on large language models, efficient machine learning systems, and multimodal AI.

Research scientist at Google DeepMind in London working on agentic reasoning, efficient inference, and large-scale post-training, with a background in high-dimensional statistics and theory.

DeepMind researcher working on machine learning, computer vision, and structured learning from video and language.

Jiahui Yu is a Research Lead at OpenAI leading the Perception team. His homepage notes prior co-leadership on Gemini Multimodal at Google DeepMind and work on deep learning and high-performance computing.

VP of Research at Google DeepMind working on robotics and embodied intelligence, with expertise in machine learning, reinforcement learning, neuroscience, and computer vision.

Radu Soricut is a Distinguished Scientist at Google DeepMind working on natural language processing and machine learning, with earlier Google Research and Google Translate work.

Rishabh Singh is a research scientist at Google DeepMind working on human-centered AI, programming systems, and AI for software and problem solving. His work spans program synthesis, code intelligence, education, and interactive AI systems.

Research scientist at Google DeepMind in Mountain View working on machine learning, reinforcement learning, and robotics.

Mandy Guo is listed as an author of the Google technical report Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context.

Senior Staff Research Scientist at Google DeepMind working on language modeling, speech recognition, machine translation, and multimodal understanding.

Kevin Robinson is a research engineer at Google Research working on evaluations of language models and NLP systems. His Google Research profile says he previously worked as a special education teacher, a software engineer building visualization and analytics systems, and a researcher in K12 computer science education.

Research scientist and engineer focused on machine learning, computer vision, and natural language processing.

Senior Staff Research Scientist at Google DeepMind working on machine learning, with a focus on efficient inference and training algorithms for large language and vision-language models.

Senior Staff Research Scientist at Google DeepMind and CTO of the Gemini app, with work spanning speech, language, vision, and large-scale AI systems.

VP at Google DeepMind working on deep learning, computer vision, and language understanding.

Distinguished Scientist at Google Research and one of the inventors of the transformer architecture; his work also includes language models, speech recognition, and multi-agent reinforcement learning.

Member of Technical Staff at Google DeepMind working on machine learning, natural language processing, and large language models.

Canonical link

Ben Wang

Jason Wei

Zhifeng Chen

James Manyika

Brennan Saeta

D. Sculley

Donald W. McFadden

Rohan Anil

Sebastian Borgeaud

Jean-Baptiste Alayrac

Jiahui Yu

Raia Hadsell

Radu Soricut

Rishabh Singh

Andrew M. Dai

Mandy Guo

Melvin Johnson

Kevin Robinson

Mohammad Norouzi

Hanie Sedghi

Vincent Vanhoucke

Quoc V. Le

Noam Shazeer

Yifeng Lu