LLMpeople

PaliGemma: A versatile 3B VLM for transfer

A Vision-Language Models report from Google Gemini, with 14 connected researchers in the LLMpeople atlas.

Google Gemini · 2024-07-10 · 14 researchers
Field: Vision-Language Models
Organization: Google Gemini
arXiv: 2407.07726

Canonical link

https://arxiv.org/abs/2407.07726

Connected researchers

Researcher 2 reports

Nan Ding

Google Gemini

Researcher at Google Research whose public work includes multimodal and vision-language modeling, with arXiv publications tied to PaliGemma and related transfer work.

Researcher 5 reports

Yonghui Wu

Google Gemini

Google researcher whose official profile says he joined Google in September 2008 and has been with Google Brain since January 2015, with research interests spanning information retrieval, machine learning, machine translation, and natural language processing.

Researcher 3 reports

Koray Kavukcuoglu

Google Gemini

Chief Technology Officer at Google DeepMind, with work spanning machine learning and reinforcement learning.

Researcher 8 reports

Jiahui Yu

Google Gemini

Jiahui Yu is a Research Lead at OpenAI leading the Perception team. His homepage notes prior co-leadership on Gemini Multimodal at Google DeepMind and work on deep learning and high-performance computing.

Researcher 4 reports

Radu Soricut

Google Gemini

Radu Soricut is a Distinguished Scientist at Google DeepMind working on natural language processing and machine learning, with earlier Google Research and Google Translate work.

Researcher 2 reports

Matthieu Devin

Google Gemini

Research scientist at Google DeepMind based in Paris, focused on deep learning and computer vision.

Researcher 2 reports

Nikolay Savinov

Google Gemini

Research scientist at Google DeepMind on the Gemini team, working on multimodal AI.

Researcher 1 report

Leonardo Beyer

Google Gemini

Leonardo Beyer is a research scientist at Google DeepMind. His public homepage highlights work across representation learning, multimodal models, and large-scale machine learning systems.

Researcher 2 reports

Xiaohua Zhai

Google Gemini

Xiaohua Zhai is a researcher on the Google Research team in Zurich whose work focuses on large multimodal models and efficient deep learning.

Researcher 1 report

William Kolesnikov

Google Gemini

Staff software engineer at Google DeepMind working on post-training, alignment, multimodal models, and data filtering. He previously worked on hardware and software co-design for machine learning.

Researcher 2 reports

Siyuan Li

Google Gemini / NVIDIA

Siyuan Li is a research scientist at NVIDIA working on large language models, multimodal foundation models, and reinforcement learning. His homepage says he received a PhD in computer science from the University of Toronto in 2024 and previously worked at Meta AI, Microsoft Research, and Mila.

Researcher 1 report

Xinyi Chen

Google Gemini

Xinyi Chen is a PhD candidate in computer science at Princeton University and concurrently a research scientist at Google DeepMind. Her public homepage says she works at the intersection of machine learning, optimization, and dynamical systems, focusing on robust and efficient methods for sequential decision-making and control, and that she previously completed undergraduate studies in mathematics at Princeton.

Researcher 1 report

Xiuye Gu

Google Gemini

Xiuye Gu is a researcher whose public work focuses on vision-language modeling and machine learning systems.

Researcher 1 report

Maxwell Collins

Google Gemini

Maxwell Collins is a Research Scientist at Google DeepMind.


LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.
