MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Senior research scientist at Apple working on large multimodal foundation models, with prior work on large language models at MosaicML.

Machine learning researcher at Apple working on large multimodal foundation models, video generation, and vision-language systems.

Research scientist at Apple Foundation Models working on generative AI, large language models, and multimodal models.

Sam Dodge is an Apple AI/ML-affiliated researcher. The linked arXiv paper lists him as a coauthor of MM1 and shows his affiliation as Apple AI/ML in Cupertino, California.

Research scientist at Apple Foundation Models with interests in natural language processing, structured generation, controllable generation, and algorithmic efficiency.

Research scientist at Apple working on large language models, vision-language models, and model scaling.

Researcher working on machine learning, vision and language, computer vision, diffusion, and generative AI.

Research scientist at Apple working on language and vision-language modeling, AI agents, and post-training.

Research scientist at Apple focusing on understanding and generating text and images.

Haotian Zhang is a research scientist on Apple AI/ML's Visual Intelligence team. His homepage says he works on embodied agents that understand the world from 2D and 3D image data as well as natural language. According to the same page, he previously interned at Microsoft Research and Azure AI and completed a PhD in electrical and computer engineering at the University of Washington in 2022, after earning master's degrees at Washington and a bachelor's degree at Shanghai Jiao Tong University.

Research scientist at Apple working on efficient and multilingual language modeling, speech and language systems, and large language models.

Research scientist at Apple and adjunct professor at MIPT working in computer vision, image processing, and machine learning.

Research scientist at Apple focused on computer vision, machine learning, and multimodal understanding.

Max Schwarzer is a reinforcement learning researcher whose work focuses on scaling and sample-efficient RL. He completed a PhD at Mila, later interned in Apple's machine learning research group, and was an author on Apple's MM1 multimodal pre-training report.

Research scientist at Apple Intelligence working on computer vision, machine learning, and natural language processing.

Xiang Kong's public homepage says he is a machine learning researcher at Apple and that he received his PhD from the School of Computer Science at Carnegie Mellon University.

Nan Du works on large language models, mixture-of-experts methods, few-shot learning, and natural language processing. Du's public OpenReview profile lists Apple AIML as the current affiliation and Google Brain as an earlier role.

Sam Wiseman is an assistant professor of computer science at New York University whose research focuses on natural language processing and machine learning, including controllable generation, summarization, and learning from human feedback.

Co-author of MM1, which studies multimodal LLM pre-training.

Senior researcher at Apple working on large models, multimodal learning, and speech processing, according to his personal site.

Public profiles and publications link Ruoming Pang to speech, language, and multimodal model research. Reuters reported on February 25, 2026, that OpenAI hired him from Meta.

Research scientist at Apple focused on state-of-the-art machine learning and computer vision methods.

Computer vision and machine learning scientist at Apple whose work includes multimodal understanding and robotics, following earlier leadership roles at Google.

Research scientist at Apple focused on natural language processing and machine learning.

Brandon McKinzie

Zhe Gan

Jean-Philippe Fauconnier

Sam Dodge

Philipp Dufter

Bowen Zhang

Dhruti Shah

Xianzhi Du

Futang Peng

Haotian Zhang

Floris Weers

Anton Belyi

Hongyu He

Max Schwarzer

Tom Gunter

Xiang Kong

Nan Du

Sam Wiseman

Mark Lee

Zirui Wang

Ruoming Pang

Peter Grasch

Alexander Toshev

Yinfei Yang