
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Multimodal Language Models report from Apple with 31 connected researchers in the LLMpeople atlas.

Apple · 2024-03-14 · 31 researchers
Field: Multimodal Language Models
Organization: Apple
arXiv: 2403.09611

Canonical link

https://arxiv.org/abs/2403.09611

Connected researchers

Researcher · 1 report

Brandon McKinzie

Apple

Senior research scientist at Apple working on large multimodal foundation models, with prior work on large language models at MosaicML.

Researcher · 2 reports

Zhe Gan

Apple

Machine learning researcher at Apple working on large multimodal foundation models, video generation, and vision-language systems.

Researcher · 2 reports

Jean-Philippe Fauconnier

Apple

Research scientist at Apple Foundation Models working on generative AI, large language models, and multimodal models.

Researcher · 2 reports

Sam Dodge

Apple

Sam Dodge is an Apple AI/ML-affiliated researcher. The linked arXiv paper lists him as a coauthor of MM1 and shows his affiliation as Apple AI/ML in Cupertino, California.

Researcher · 2 reports

Philipp Dufter

Apple

Research scientist at Apple Foundation Models with interests in natural language processing, structured generation, controllable generation, and algorithmic efficiency.

Researcher · 2 reports

Bowen Zhang

Apple

Research scientist at Apple working on large language models, vision-language models, and model scaling.

Researcher · 2 reports

Dhruti Shah

Apple

Researcher working on machine learning, vision and language, computer vision, diffusion, and generative AI.

Researcher · 2 reports

Xianzhi Du

Apple

Research scientist at Apple working on language and vision-language modeling, AI agents, and post-training.

Researcher · 1 report

Futang Peng

Apple

Research scientist at Apple focusing on understanding and generating text and images.

Researcher · 2 reports

Haotian Zhang

Apple

Haotian Zhang is a research scientist on Apple AI/ML's Visual Intelligence team. According to his homepage, he works on embodied agents that understand the world from 2D and 3D image data as well as natural language. The homepage also lists prior internships at Microsoft Research and Azure AI, a PhD in electrical and computer engineering from the University of Washington (2022), master's degrees from Washington, and a bachelor's degree from Shanghai Jiao Tong University.

Researcher · 3 reports

Floris Weers

Apple

Research scientist at Apple working on efficient and multilingual language modeling, speech and language systems, and large language models.

Researcher · 1 report

Anton Belyi

Apple

Research scientist at Apple and adjunct professor at MIPT working in computer vision, image processing, and machine learning.

Researcher · 1 report

Hongyu He

Apple

Research scientist at Apple focused on computer vision, machine learning, and multimodal understanding.

Researcher · 1 report

Max Schwarzer

Apple

Max Schwarzer is a reinforcement learning researcher whose work focuses on scaling and sample-efficient RL. He completed a PhD at Mila, later interned in Apple's machine learning research group, and was an author on Apple's MM1 multimodal pre-training report.

Researcher · 3 reports

Tom Gunter

Apple

Research scientist at Apple Intelligence working on computer vision, machine learning, and natural language processing.

Researcher · 3 reports

Xiang Kong

Apple

Xiang Kong's public homepage says he is a machine learning researcher at Apple and that he received his PhD from the School of Computer Science at Carnegie Mellon University.

Researcher · 1 report

Nan Du

Apple

Nan Du works on large language models, mixture-of-experts methods, few-shot learning, and natural language processing. Her public OpenReview profile lists Apple AIML as her current affiliation and Google Brain as an earlier role.

Researcher · 1 report

Sam Wiseman

Apple

Sam Wiseman is an assistant professor of computer science at New York University whose research focuses on natural language processing and machine learning, including controllable generation, summarization, and learning from human feedback.

Researcher · 3 reports

Mark Lee

Apple

Co-author of MM1, which studies multimodal LLM pre-training.

Researcher · 2 reports

Zirui Wang

Apple

Senior researcher at Apple working on large models, multimodal learning, and speech processing, according to his personal site.

Researcher · 3 reports

Ruoming Pang

Apple

Public profiles and publications link Ruoming Pang to speech, language, and multimodal model research. Reuters reported on February 25, 2026 that OpenAI hired him from Meta.

Researcher · 2 reports

Peter Grasch

Apple

Research scientist at Apple focused on state-of-the-art machine learning and computer vision methods.

Researcher · 1 report

Alexander Toshev

Apple

Computer vision and machine learning scientist at Apple whose work includes multimodal understanding and robotics, following earlier leadership roles at Google.

Researcher · 2 reports

Yinfei Yang

Apple

Research scientist at Apple focused on natural language processing and machine learning.


LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.
