LLMpeople
Home People Organizations Reports Fields Schools
Public Atlas People first, reports as evidence, organizations as context.

Atlas / Reports / Detail

Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models

Audio Language Models

Qwen2023-11-138 researchers
Field
Audio Language Models
Organization
Qwen
arXiv
2311.07919

Canonical link

https://arxiv.org/abs/2311.07919

Connected researchers

Profile Reports

Chao Zhang

Qwen

Chao Zhang is an applied scientist in the Alibaba Foundation Model team. His public profile notes a PhD in computer science from the University of Illinois Urbana-Champaign and research interests in NLP, large language models, reasoning, and multimodal generation.

Qwen
Unknown 2
Profile Reports

Yeyun Gong

Qwen

Yeyun Gong is a researcher and engineering leader focused on multimodal large language models, grounding, and large-scale knowledge systems. His homepage lists selected work including Qwen2-Audio.

Qwen
Unknown 2
Profile Reports

Jie Tang

OpenAI / Qwen

OpenAI contributor credited on the GPT-4 Technical Report; previously a Dropbox engineer and a Ph.D. student at UC Berkeley focused on machine learning and robotics.

OpenAIQwen
Unknown 3
Profile Reports

Mingyang Shang

Qwen

Research intern at Alibaba Group focused on multimodal understanding and generation, large multimodal models, and reinforcement learning; coauthor of Qwen2-Audio.

Qwen
Unknown 2
Profile Reports

Yaqi Wang

Qwen

Research scientist in Tongyi Lab and technical lead of Qwen2-Audio, with public work on audio-language models.

Qwen
Unknown 2
Profile Reports

Yushi Hu

Qwen

Yushi Hu is a senior research engineer at Shanghai AI Laboratory and a founding member of OpenMMLab. Public arXiv records also list him as a coauthor of Qwen2-Audio.

Qwen
Unknown 2
Profile Reports

Hongyin Luo

Qwen

Researcher whose arXiv author results include Qwen-Audio and related audio-language modeling work.

Qwen
Unknown 1
Profile Reports

Qingqing Zheng

Qwen

Co-author of the Qwen-Audio technical report on unified large-scale audio-language models.

Qwen
Unknown 1

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.