Atlas / Reports / Detail
PaLM-E: An Embodied Multimodal Language Model
Multimodal Language Models report from Google Gemini with 24 connected researchers in the LLMpeople atlas.
Connected researchers
Jiahui Yu
Google Gemini
Jiahui Yu is a Research Lead at OpenAI leading the Perception team. His homepage notes prior co-leadership on Gemini Multimodal at Google DeepMind and work on deep learning and high-performance computing.
Yao Lu
DeepSeek / Google Gemini
Yao Lu is listed as an author of the Google technical report Gemini Robotics: Bringing AI into the Physical World.
Fei Xia
Google Gemini / Mistral AI
Senior Staff Research Scientist and Tech Lead Manager at Google DeepMind Robotics, focused on embodied agents and foundation models for robot decision-making.
Fei Xia
Google Gemini
Research scientist at Google DeepMind working on robotics and embodied intelligence. His research spans robot learning, navigation, manipulation, and multimodal agents.
Andy Zeng
Google Gemini
Andy Zeng is a Research Scientist at Google DeepMind. His public research interests include robot learning, computer vision, graphics, and personalized 3D content generation.
Yao Lu
Google Gemini
Yao Lu is a research scientist at NVIDIA Research working on embodied AI, foundation models, and computer vision. His public homepage says he previously worked at Google DeepMind and Boston University and earned a Ph.D. from Carnegie Mellon University.
Nicolas Heess
Google Gemini
Nicolas Heess is a research scientist at Google DeepMind whose work focuses on machine learning, reinforcement learning, and robotics.
Danny Driess
Google Gemini
Danny Driess is a research scientist at Google DeepMind whose work focuses on general AI, robot learning, and multimodal foundation models.
Louis Borry
Google Gemini
Louis Borry is a PhD student at Google DeepMind working on embodied language models and grounded language understanding.
Yiwen Wang
Google Gemini
Yiwen Wang is listed as an author of the Google technical report PaLM-E: An Embodied Multimodal Language Model.
Thomas Blankevoort
Google Gemini
Thomas Blankevoort is a Research Scientist at Google DeepMind whose work focuses on efficient neural networks and machine learning systems.
Jose A. Arenas
Google Gemini
Staff software engineer at Google focused on machine learning and systems.
Marc G. Bellemare
Google Gemini
Principal research scientist at Google DeepMind and professor of computer science at McGill University.
Daria Buchsbaum
Google Gemini
Daria Buchsbaum is a PhD student at Georgia Tech and a Research Scientist Intern at Google DeepMind.
Sora Tokumine
Google Gemini
Sora Tokumine is listed as an author of the Google technical report PaLM-E: An Embodied Multimodal Language Model.
Yusuke M. Asano
Google Gemini
Research scientist at Google whose work spans computer vision, multimodal learning, and large embodied models, including PaLM-E.
Teddy Karrer
Google Gemini
Teddy Karrer is a research scientist working on embodied AI, multimodal reasoning, and machine learning for interactive systems. His public profile highlights robotics, decision making, and intelligent agents.
Montserrat Gonzalez Arenas
Google Gemini
Montserrat Gonzalez Arenas is a research engineer at Google Research whose public work focuses on robot learning and mobile manipulation, including robotic table wiping, waste sorting, and RT-Trajectory for robot task generalization.
Mikel Arza
Google Gemini
Research scientist at Google DeepMind focused on robotics and machine learning, especially reinforcement learning and language models.
Scott Reed
Google Gemini
Research scientist at Google DeepMind working on language, vision, action, and robotics; previously on the Google Brain team and a co-creator of the first text-to-image GAN.
Qiaozi Gao
Google Gemini
Qiaozi Gao is a Stanford PhD student whose work spans vision and language, machine learning, and robotics, with research internships at Google and Google DeepMind.
Cengiz Oztireli
Google Gemini
Senior staff research scientist at Google DeepMind and affiliated lecturer at Cambridge working on computer vision, machine learning, and computer graphics.
Zhe Lin
Google Gemini
Zhe Lin is listed as an author of the Google technical report PaLM-E: An Embodied Multimodal Language Model.
Johnny Mao
Google Gemini
Senior research scientist at Google DeepMind working on machine learning.