LLMpeople
Public Atlas: People first, reports as evidence, organizations as context.

Qwen2-Audio Technical Report

Qwen, 2024-07-14, 26 researchers
Field: Audio Language Models
Organization: Qwen
arXiv: 2407.10759

Canonical link

https://arxiv.org/abs/2407.10759

Connected researchers

Nan Duan

Qwen

Nan Duan is head of foundation model post-training at Qwen and a vice president at Alibaba Group. His public profile highlights work on foundation models and natural language processing, after earlier research leadership at Microsoft Research.

Unknown 1

Chao Zhang

Qwen

Chao Zhang is an applied scientist in the Alibaba Foundation Model team. His public profile notes a PhD in computer science from the University of Illinois Urbana-Champaign and research interests in NLP, large language models, reasoning, and multimodal generation.

Unknown 2

Zhengyuan Liu

Qwen

Zhengyuan Liu is a research scientist at Alibaba Group and a PhD student at the National University of Singapore. His public profile highlights work in natural language processing, vision-language models, and grounding.

Unknown 1

Zhen Ye

Qwen

Zhen Ye is a researcher in the Qwen team at Alibaba Cloud. His public profile notes a PhD in computer science from the University of Massachusetts Amherst and research interests in natural language understanding, generation, and reasoning.

Unknown 1

Yongqiang Wang

Qwen

Research scientist at Alibaba working on speech processing, multimodal learning, natural language processing, and efficient human-computer interaction.

Unknown 1

Jiaqi Wang

Qwen

Jiaqi Wang works on machine learning, multimodal large language models, and AI for healthcare. Public profiles connect him to the Qwen2-Audio technical report.

Unknown 1

Shen Gao

Qwen

Shen Gao is a PhD student at Zhejiang University working on multimedia and large language models. His public profiles connect him to Qwen2-Audio and related multimodal systems including OmniParser.

Unknown 1

Weiqiang Wang

Qwen

Weiqiang Wang is a PhD student working on multimedia and multimodal AI. Public profiles connect him to the Qwen2-Audio technical report and related research.

Unknown 1

Tianyu Liu

Moonshot AI / Qwen

Principal scientist at Moonshot AI working on multimodal large models.

Unknown 2

Yeyun Gong

Qwen

Yeyun Gong is a researcher and engineering leader focused on multimodal large language models, grounding, and large-scale knowledge systems. His homepage lists selected work including Qwen2-Audio.

Unknown 2

Jie Tang

OpenAI / Qwen

OpenAI contributor credited on the GPT-4 Technical Report; previously a Dropbox engineer and a Ph.D. student at UC Berkeley focused on machine learning and robotics.

Unknown 3

Hongning Wang

Qwen

Associate professor at the University of Virginia and Qwen contributor whose research focuses on personalization and recommender systems, online advertising, and AI systems.

Unknown 2

Xian-Sheng Hua

Qwen

Xian-Sheng Hua is a computer vision and multimodal AI researcher known for work in visual recognition, multimedia understanding, and large AI systems. Public profiles tie him to Alibaba DAMO Academy and related academic service roles.

Unknown 1

Jingren Zhou

MiniMax / Moonshot AI / Qwen / Z.ai

Alibaba senior technology leader and researcher associated with Qwen. Public profiles list him with Alibaba Group, and official Alibaba Cloud coverage identifies him as a chief technology officer leading large-model work.

Unknown 17

Xiaoyong Du

Qwen

Xiaoyong Du works on multimodal large language models and language agents, with public profile text highlighting omni models, visual agents, and GUI agents. His homepage explicitly identifies him with Qwen.

Unknown 1

An Yang

Qwen

Alibaba researcher working on large language models and multimodal pretraining; public research profiles connect An Yang to Qwen-related work and earlier study at Peking University.

Unknown 5

Yuanzhi Zhu

Qwen

Yuanzhi Zhu is a Qwen researcher whose public work includes multimodal and audio-language models.

Unknown 3

Shijie Wang

Qwen

Senior research scientist in Tongyi Lab whose official profile highlights post-training, AI for science, evaluation and alignment, multimodal reasoning, and large language model reasoning.

Unknown 3

Mingyang Shang

Qwen

Research intern at Alibaba Group focused on multimodal understanding and generation, large multimodal models, and reinforcement learning; coauthor of Qwen2-Audio.

Unknown 2

Qingyang Zhang

Qwen

Second-year PhD student at Peking University focused on audio-language foundation models, trustworthy AI, and embodied AI; coauthor of Qwen2-Audio.

Unknown 2

Yaqi Wang

Qwen

Research scientist in Tongyi Lab and technical lead of Qwen2-Audio, with public work on audio-language models.

Unknown 2

Yinghao Li

Qwen

Machine learning engineer and researcher interested in large language models and multimodal audio-language systems; coauthor of Qwen2-Audio.

Unknown 2

Yongqi Wang

Qwen

Research scientist in Tongyi Lab whose public profile highlights work on speech processing, machine learning, and multimodal large language models.

Unknown 2

Yushi Hu

Qwen

Yushi Hu is a senior research engineer at Shanghai AI Laboratory and a founding member of OpenMMLab. Public arXiv records also list him as a coauthor of Qwen2-Audio.

Unknown 2

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.