Atlas / Reports / Detail
PaliGemma 2: A Family of Versatile VLMs for Transfer
Vision-Language Models report from Google Gemini with 13 connected researchers in the LLMpeople atlas.
Connected researchers
Andrea Steiner
Google Gemini
Research scientist at Google DeepMind working on multimodal generative models, visual generation, and image editing; previously completed a PhD at TU Munich.
Jiahui Yu
Google Gemini
Jiahui Yu is a research scientist at Google DeepMind working on multimodal learning and large language models.
Lucas Beyer
Google Gemini
Lucas Beyer is an ML researcher at Google DeepMind in Zurich. His public homepage highlights prior work at Google Brain and a PhD at ETH Zurich.
Antonio Torralba
Google Gemini
Antonio Torralba is the Delta Electronics Professor in the EECS Department at MIT and a member of CSAIL whose research focuses on computer vision, visual learning, and scene understanding.
Xiaohua Zhai
Google Gemini
Xiaohua Zhai is a researcher on the Google Research team in Zurich whose work focuses on large multimodal models and efficient deep learning.
Alexander Kolesnikov
Google Gemini
Alexander Kolesnikov is a Research Scientist at Google DeepMind exploring multimodal general intelligence.
Andrea Dafoe
Google Gemini
Andrea Dafoe is a senior research scientist at Google DeepMind whose work focuses on frontier AI risks, international governance, and the societal impacts of advanced AI.
Bilal Mustafa
Google Gemini
Senior research scientist at Google DeepMind.
Olivier Henaff
Google Gemini
Research scientist at Google DeepMind working on deep learning, reinforcement learning, self-supervised learning, and robotics.
Rohit Saxena
Google Gemini
Rohit Saxena is a Research Scientist at Google DeepMind working on visual perception, multimodal learning, and language understanding.
Andrew Shen
Google Gemini
Profile still being enriched.
Matias Mazzocconi
Google Gemini
Profile still being enriched.
Mikhail Ryabinin
Google Gemini
Profile still being enriched.