LLMpeople
Home People Organizations Reports Fields Schools
Public Atlas People first, reports as evidence, organizations as context.

Atlas / Reports / Detail

VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers

Speech and Audio Models report from Microsoft with 9 connected researchers in the LLMpeople atlas.

Microsoft2024-06-089 researchers
Field
Speech and Audio Models
Organization
Microsoft
arXiv
2406.05370

Canonical link

https://arxiv.org/abs/2406.05370

Connected researchers

Organization-authored report.

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.

Privacy ยท Terms