Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Multimodal Models report from Google Gemini with 60 connected researchers in the LLMpeople atlas.
Connected researchers
Ben Wang
Google Gemini / OpenAI
OpenAI's GPT-4 contributions page credits Ben Wang as attention architecture lead for long context. Public profiles identify him as a University of Pennsylvania undergraduate and an OpenAI researcher from 2021 to 2022.
Jason Wei
Google Gemini / OpenAI
Public report authorship links Jason Wei to Google's Gemma 3n Technical Report.
Zhifeng Chen
Google Gemini / Z.ai
Zhifeng Chen's public homepage describes him as a distinguished software engineer at Google Brain focused on large-scale computer systems and machine learning applications.
James Manyika
Google Gemini
James Manyika is a Google leader whose public work focuses on research, technology, and society.
Brennan Saeta
Google Gemini
Public report authorship links Brennan Saeta to Google's technical report Gemma 2: Improving Open Language Models at a Practical Size.
D. Sculley
Google Gemini
Research Director at Google working on machine learning, production systems, and sociotechnical AI.
Donald W. McFadden
Google Gemini
Google Gemini report author listed on Gemini, Gemini 1.5, RecurrentGemma, and CodeGemma technical reports, with report-backed work on multimodal models, long-context models, efficient architectures, and code models.
Rohan Anil
Google Gemini
Rohan Anil is a research scientist at Google DeepMind. His public homepage highlights work on large language models, efficient machine learning systems, and multimodal AI.
Sebastian Borgeaud
Google Gemini
Research scientist at Google DeepMind in London working on agentic reasoning, efficient inference, and large-scale post-training, with a background in high-dimensional statistics and theory.
Jean-Baptiste Alayrac
Google Gemini / Meta AI
DeepMind researcher working on machine learning, computer vision, and structured learning from video and language.
Jiahui Yu
Google Gemini
Jiahui Yu is a Research Lead at OpenAI leading the Perception team. His homepage notes prior co-leadership on Gemini Multimodal at Google DeepMind and work on deep learning and high-performance computing.
Raia Hadsell
Google Gemini
VP of Research at Google DeepMind working on robotics and embodied intelligence, with expertise in machine learning, reinforcement learning, neuroscience, and computer vision.
Radu Soricut
Google Gemini
Radu Soricut is a Distinguished Scientist at Google DeepMind working on natural language processing and machine learning, with earlier Google Research and Google Translate work.
Rishabh Singh
Google Gemini
Rishabh Singh is a research scientist at Google DeepMind working on human-centered AI, programming systems, and AI for software and problem solving. His work spans program synthesis, code intelligence, education, and interactive AI systems.
Andrew M. Dai
Google Gemini
Research scientist at Google DeepMind in Mountain View working on machine learning, reinforcement learning, and robotics.
Mandy Guo
Google Gemini
Mandy Guo is listed as an author of the Google technical report Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context.
Melvin Johnson
Google Gemini
Senior Staff Research Scientist at Google DeepMind working on language modeling, speech recognition, machine translation, and multimodal understanding.
Kevin Robinson
Google Gemini
Kevin Robinson is a research engineer at Google Research working on evaluations of language models and NLP systems. His Google Research profile says he previously worked as a special education teacher, a software engineer building visualization and analytics systems, and a researcher in K-12 computer science education.
Mohammad Norouzi
Google Gemini
Research scientist and engineer focused on machine learning, computer vision, and natural language processing.
Hanie Sedghi
Google Gemini
Senior Staff Research Scientist at Google DeepMind working on machine learning, with a focus on efficient inference and training algorithms for large language and vision-language models.
Vincent Vanhoucke
Google Gemini
Senior Staff Research Scientist at Google DeepMind and CTO of the Gemini app, with work spanning speech, language, vision, and large-scale AI systems.
Quoc V. Le
Google Gemini
VP at Google DeepMind working on deep learning, computer vision, and language understanding.
Noam Shazeer
Google Gemini
Distinguished Scientist at Google Research and one of the inventors of the transformer architecture; his work also includes language models, speech recognition, and multi-agent reinforcement learning.
Yifeng Lu
Google Gemini
Member of Technical Staff at Google DeepMind working on machine learning, natural language processing, and large language models.