SPIrit-LM: Interleaved Spoken and Written Language Model

Jade Copet is listed as an author of the Meta AI technical report SPIrit-LM: Interleaved Spoken and Written Language Model.

Research scientist at Meta AI working on generative AI, multimodal learning, and speech and audio generation. His public homepage notes earlier research at Bar-Ilan University before joining Meta in Menlo Park.

Tu Anh Nguyen is a research scientist at Meta working on speech and audio generation. He is also a PhD candidate at Mila and the Universite de Montreal, advised by Yoshua Bengio and Abdelrahman Mohamed, with interests in audio language models, speech generation, and efficient inference.

Yossi Adi is a computer scientist at the Hebrew University of Jerusalem and a research scientist at Meta FAIR. His research focuses on speech, audio, and language modeling, including spoken language models and machine learning methods for speech applications.

Máté Gergely Csajbók is listed as an author of the Meta AI technical report SPIrit-LM: Interleaved Spoken and Written Language Model.

Research scientist at Meta whose OpenReview profile describes work on multilingual language and speech models, along with data and inference optimization.

Research scientist at Meta working on spoken language technology and multimodal language models.

Wenyi Wang is listed as an author of the Meta AI technical report SPIrit-LM: Interleaved Spoken and Written Language Model.

Research scientist at Meta whose public work spans speech and audio-language modeling; arXiv author results include SPIrit-LM.

Research scientist working on natural language processing, with public work spanning speech and language modeling such as VoxPopuli, pGSLM, and SPIrit-LM.

Research scientist at Meta FAIR working on speech and audio foundation models. His research covers self-supervised learning, spoken language modeling, and multimodal audio-language systems.

Abdelrahman Mohamed is a professor at the University of Toronto and a Canada CIFAR AI Chair whose work spans speech, audio, and language modeling. His public profile highlights speech recognition, representation learning, and multimodal foundation models.

Research scientist and professor working across Meta, NYU, and EHESS on speech, language, and cognitive science. His work studies how humans and machines acquire language and how spoken and written models can be aligned.

Laurent Sagot is listed as an author of the Meta AI technical report SPIrit-LM: Interleaved Spoken and Written Language Model.

Canonical link

Jade Copet

Felix Kreuk

Tu Anh Nguyen

Yossi Adi

Máté Gergely Csajbók

Kushal Lakhotia

Ariel Noy

Wenyi Wang

Alexei Baevski

Morgane Riviere

Wei-Ning Hsu

Abdelrahman Mohamed

Emmanuel Dupoux

Laurent Sagot