LLMpeople
Public Atlas: People first, reports as evidence, organizations as context.

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Multimodal Models report from Google Gemini with 60 connected researchers in the LLMpeople atlas.

Google Gemini · 2024-03-08 · 60 researchers
Field: Multimodal Models
Organization: Google Gemini
arXiv: 2403.05530

Canonical link

https://arxiv.org/abs/2403.05530

Connected researchers

Researcher 2 reports

Ben Wang

Google Gemini / OpenAI

OpenAI's GPT-4 contributions page credits Ben Wang as attention architecture lead for long context. Public profiles identify him as a University of Pennsylvania undergraduate and an OpenAI researcher from 2021 to 2022.

Researcher 4 reports

Jason Wei

Google Gemini / OpenAI

Public report authorship links Jason Wei to the Gemma 3n Technical Report at Google.

United States
Researcher 8 reports

Zhifeng Chen

Google Gemini / Z.ai

Zhifeng Chen's public homepage describes him as a distinguished software engineer at Google Brain focused on large-scale computer systems and machine learning applications.

Researcher 2 reports

James Manyika

Google Gemini

James Manyika is a Google leader whose public work focuses on research, technology, and society.

Researcher 4 reports

Brennan Saeta

Google Gemini

Public report authorship links Brennan Saeta to the Google report "Gemma 2: Improving Open Language Models at a Practical Size."

Researcher 3 reports

D. Sculley

Google Gemini

Research Director at Google working on machine learning, production systems, and sociotechnical AI.

Researcher 4 reports

Donald W. McFadden

Google Gemini

Google Gemini report author listed on Gemini, Gemini 1.5, RecurrentGemma, and CodeGemma technical reports, with report-backed work on multimodal models, long-context models, efficient architectures, and code models.

Researcher 3 reports

Rohan Anil

Google Gemini

Rohan Anil is a research scientist at Google DeepMind. His public homepage highlights work on large language models, efficient machine learning systems, and multimodal AI.

Researcher 3 reports

Sebastian Borgeaud

Google Gemini

Research scientist at Google DeepMind in London working on agentic reasoning, efficient inference, and large-scale post-training, with a background in high-dimensional statistics and theory.

Researcher 5 reports

Jean-Baptiste Alayrac

Google Gemini / Meta AI

DeepMind researcher working on machine learning, computer vision, and structured learning from video and language.

United Kingdom
Researcher 8 reports

Jiahui Yu

Google Gemini

Jiahui Yu is a Research Lead at OpenAI leading the Perception team. His homepage notes prior co-leadership on Gemini Multimodal at Google DeepMind and work on deep learning and high-performance computing.

Researcher 2 reports

Raia Hadsell

Google Gemini

VP of Research at Google DeepMind working on robotics and embodied intelligence, with expertise in machine learning, reinforcement learning, neuroscience, and computer vision.

Researcher 4 reports

Radu Soricut

Google Gemini

Radu Soricut is a Distinguished Scientist at Google DeepMind working on natural language processing and machine learning, with earlier Google Research and Google Translate work.

United States
Researcher 1 report

Rishabh Singh

Google Gemini

Rishabh Singh is a research scientist at Google DeepMind working on human-centered AI, programming systems, and AI for software and problem solving. His work spans program synthesis, code intelligence, education, and interactive AI systems.

United States
Researcher 3 reports

Andrew M. Dai

Google Gemini

Research scientist at Google DeepMind in Mountain View working on machine learning, reinforcement learning, and robotics.

Researcher 1 report

Mandy Guo

Google Gemini

Mandy Guo is listed as an author of the Google technical report Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context.

Researcher 2 reports

Melvin Johnson

Google Gemini

Senior Staff Research Scientist at Google DeepMind working on language modeling, speech recognition, machine translation, and multimodal understanding.

Researcher 2 reports

Kevin Robinson

Google Gemini

Kevin Robinson is a research engineer at Google Research working on evaluations of language models and NLP systems. His Google Research profile says he previously worked as a special education teacher, a software engineer building visualization and analytics systems, and a researcher in K12 computer science education.

Researcher 1 report

Mohammad Norouzi

Google Gemini

Research scientist and engineer focused on machine learning, computer vision, and natural language processing.

Researcher 1 report

Hanie Sedghi

Google Gemini

Senior Staff Research Scientist at Google DeepMind working on machine learning, with a focus on efficient inference and training algorithms for large language and vision-language models.

Researcher 3 reports

Vincent Vanhoucke

Google Gemini

Senior Staff Research Scientist at Google DeepMind and CTO of the Gemini app, with work spanning speech, language, vision, and large-scale AI systems.

Researcher 2 reports

Quoc V. Le

Google Gemini

VP at Google DeepMind working on deep learning, computer vision, and language understanding.

Researcher 2 reports

Noam Shazeer

Google Gemini

Distinguished Scientist at Google Research and one of the inventors of the transformer architecture; his work also includes language models, speech recognition, and multi-agent reinforcement learning.

Researcher 3 reports

Yifeng Lu

Google Gemini

Member of Technical Staff at Google DeepMind working on machine learning, natural language processing, and large language models.

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.
