Atlas / Reports / Detail
PaliGemma 2: A Family of Versatile VLMs for Transfer
Vision-Language Models report from Google Gemini with 13 connected researchers in the LLMpeople atlas.
Connected researchers
Jiahui Yu
Google Gemini
Jiahui Yu is a Research Lead at OpenAI leading the Perception team. His homepage notes prior co-leadership on Gemini Multimodal at Google DeepMind and work on deep learning and high-performance computing.
Lucas Beyer
Google Gemini
Lucas Beyer is an ML researcher at Google DeepMind in Zurich. His public homepage highlights prior work at Google Brain and a PhD at ETH Zurich.
Xiaohua Zhai
Google Gemini
Xiaohua Zhai is a researcher on the Google Research team in Zurich whose work focuses on large multimodal models and efficient deep learning.
Antonio Torralba
Google Gemini
Antonio Torralba is the Delta Electronics Professor in the EECS Department at MIT and a member of CSAIL whose research focuses on computer vision, visual learning, and scene understanding.
Mikhail Ryabinin
Google Gemini
Mikhail Ryabinin is listed as an author of the Google technical report PaliGemma 2: A Family of Versatile VLMs for Transfer.
Matias Mazzocconi
Google Gemini
Matias Mazzocconi is listed as an author of the Google technical report PaliGemma 2: A Family of Versatile VLMs for Transfer.
Andrew Shen
Google Gemini
Andrew Shen is listed as an author of the Google technical report PaliGemma 2: A Family of Versatile VLMs for Transfer.
Bilal Mustafa
Google Gemini
Senior research scientist at Google DeepMind.
Alexander Kolesnikov
Google Gemini
Alexander Kolesnikov is a Research Scientist at Google DeepMind exploring multimodal general intelligence.
Olivier Henaff
Google Gemini
Research scientist at Google DeepMind working on deep learning, reinforcement learning, self-supervised learning, and robotics.
Andrea Steiner
Google Gemini
Research scientist at Google DeepMind working on multimodal generative models, visual generation, and image editing; previously completed a PhD at TU Munich.
Rohit Saxena
Google Gemini
Rohit Saxena is a Research Scientist at Google DeepMind working on visual perception, multimodal learning, and language understanding.
Andrea Dafoe
Google Gemini
Andrea Dafoe is a senior research scientist at Google DeepMind whose work focuses on frontier AI risks, international governance, and the societal impacts of advanced AI.