LLMpeople
Home People Organizations Reports Fields Schools
Public Atlas People first, reports as evidence, organizations as context.

Atlas / Reports / Detail

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Multimodal Models report from Google Gemini with 60 connected researchers in the LLMpeople atlas.

Google GeminiUndated60 researchers
Field
Multimodal Models
Organization
Google Gemini
arXiv
2403.05530

Canonical link

https://arxiv.org/abs/2403.05530

Connected researchers

Researcher 3 reports

Radu Soricut

Google Gemini

Research scientist focused on machine learning and natural language understanding, with work spanning machine translation, semantic parsing, and large-scale language modeling.

Google Gemini
Researcher 1 reports

Aakanksha Chowdhery

Google Gemini

Aakanksha Chowdhery is a machine learning researcher based in New York City. She works on large-scale machine learning across pre-training, post-training, inference, and system efficiency, and is known for contributions such as PaLM, Pathways, and Gemini.

Google Gemini
Researcher 1 reports

Mohammad Norouzi

Google Gemini

Research scientist and engineer focused on machine learning, computer vision, and natural language processing.

Google Gemini
Researcher 1 reports

Yuhuai Wu

Google Gemini

Research scientist working on large language models, reasoning, agents, and reinforcement learning.

Google Gemini
Researcher 3 reports

Jiahui Yu

Google Gemini

Jiahui Yu is a research scientist at Google DeepMind working on multimodal learning and large language models.

Google Gemini
Researcher 1 reports

Matthias Minderer

Google Gemini

Research Scientist at Google DeepMind in London working on large multimodal models, evaluation, agents, and computer vision; he completed a PhD at the University of Tuebingen and MPI for Intelligent Systems.

Google Gemini
Researcher 1 reports

David Silver

Google Gemini

Computer scientist and reinforcement learning researcher, Professor at University College London, and former Principal Research Scientist at DeepMind.

Google Gemini
Researcher 1 reports

Hanie Sedghi

Google Gemini

Senior Staff Research Scientist at Google DeepMind working on machine learning, with a focus on efficient inference and training algorithms for large language and vision-language models.

Google Gemini
Researcher 1 reports

Rishabh Singh

Google Gemini

Rishabh Singh is a research scientist at Google DeepMind working on human-centered AI, programming systems, and AI for software and problem solving. His work spans program synthesis, code intelligence, education, and interactive AI systems.

Google Gemini
United States
Researcher 1 reports

Vedant Misra

Google Gemini

Research engineer focused on frontier multimodal AI systems; a founding member of Google's Gemini core team who previously helped start OpenAI's multimodal team.

Google Gemini
Researcher 3 reports

Jean-Baptiste Alayrac

Google Gemini / Meta AI

Research scientist at Meta working on multimodal machine learning and AI. Previously worked on multimodal learning at Google DeepMind and earned a PhD in computer vision and machine learning from Ecole des Ponts ParisTech.

Google GeminiMeta AI
Researcher 3 reports

Yifeng Lu

Google Gemini

Member of Technical Staff at Google DeepMind working on machine learning, natural language processing, and large language models.

Google Gemini
Researcher 1 reports

Raia Hadsell

Google Gemini

VP of Research at Google DeepMind working on robotics and embodied intelligence, with expertise in machine learning, reinforcement learning, neuroscience, and computer vision.

Google Gemini
Researcher 2 reports

Noam Shazeer

Google Gemini

Distinguished Scientist at Google Research and one of the inventors of the transformer architecture; his work also includes language models, speech recognition, and multi-agent reinforcement learning.

Google Gemini
Researcher 1 reports

Aitor Lewkowycz

Google Gemini

Research scientist at Google DeepMind interested in large language models and mathematical reasoning. He earned a Ph.D. in mathematics from Columbia University.

Google Gemini
Researcher 1 reports

Yun-Hsuan Sung

Google Gemini

Yun-Hsuan Sung is a machine learning researcher focused on multimodal learning, robotics, and representation learning.

Google Gemini
Researcher 6 reports

Zhifeng Chen

Google Gemini / Z.ai

Distinguished software engineer at Google Brain focused on large-scale computer systems and machine learning applications.

Google GeminiZ.ai
Researcher 2 reports

Aäron van den Oord

Google Gemini

Aäron van den Oord is a Google DeepMind researcher known for generative and sequence-model research.

Google Gemini
Researcher 1 reports

HyoukJoong Lee

Google Gemini

HyoukJoong Lee is a research scientist at Google DeepMind. His public work includes long-context and multimodal model research, including Gemini 1.5 and Gemini Diffusion.

Google Gemini
Researcher 1 reports

Alexander Vladymyrov

Google Gemini

Senior Staff Research Scientist at Google DeepMind working on AI for science, imaging, geometry processing, and differentiable simulation; previously at Adobe and NVIDIA.

Google Gemini
Researcher 1 reports

Alex Beutel

Google Gemini

Research scientist at Google with public work on machine learning systems, recommendation, fairness, and safety.

Google Gemini
Researcher 1 reports

Branislav Kveton

Google Gemini

Staff research scientist at Google DeepMind and associate professor at Purdue University working on sequential decision making, machine learning, and algorithms.

Google Gemini
Researcher 1 reports

C. Le Lan

Google Gemini

Research scientist at Google DeepMind.

Google Gemini
Researcher 1 reports

Clement Farabet

Google Gemini

Research scientist at Google DeepMind whose public profile also lists prior AI infrastructure leadership at NVIDIA, founding Mesosphere, and earlier research roles at FAIR.

Google Gemini

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.

Privacy · Terms