LLMpeople
Home People Organizations Reports Fields Schools
Public Atlas People first, reports as evidence, organizations as context.

Atlas / Reports / Detail

Qwen2-Audio Technical Report

Audio Language Models report from Alibaba Qwen with 26 connected researchers in the LLMpeople atlas.

Alibaba Qwen2024-07-1426 researchers
Field
Audio Language Models
Organization
Alibaba Qwen
arXiv
2407.10759

Canonical link

https://arxiv.org/abs/2407.10759

Connected researchers

Jie Tang portrait
Researcher 3 reports

Jie Tang

OpenAI / Alibaba Qwen

Computer scientist and engineer credited on OpenAI's GPT-4 public contributions page; OpenAI's 2016 team update says he previously led Dropbox's core file sync team after earlier work in Pieter Abbeel's Berkeley robotics lab.

OpenAIAlibaba Qwen
Shijie Wang portrait
Researcher 3 reports

Shijie Wang

Alibaba Qwen

Senior research scientist in Tongyi Lab whose official profile highlights post-training, AI for science, evaluation and alignment, multimodal reasoning, and large language model reasoning.

Alibaba Qwen
An Yang portrait
Researcher 9 reports

An Yang

Alibaba Qwen

Alibaba researcher working on large language models and multimodal pretraining; public research profiles connect An Yang to Qwen-related work and earlier study at Peking University.

Alibaba Qwen
Jingren Zhou portrait
Researcher 23 reports

Jingren Zhou

MiniMax / Moonshot AI

Jingren Zhou is Chief Technology Officer of Alibaba Cloud. Public speaker biographies describe him as a computer scientist and entrepreneur whose work includes large-scale AI and cloud systems.

MiniMaxMoonshot AIAlibaba Qwen
China
Tianyu Liu portrait
Researcher 3 reports

Tianyu Liu

Moonshot AI / Alibaba Qwen

Tianyu Liu is a researcher at Kimi working on coding and agents. He previously worked on Qwen at Alibaba and was a founding member of Tencent Hunyuan, and he earned a PhD in natural language processing from Peking University.

Moonshot AIAlibaba Qwen
Yuanzhi Zhu portrait
Researcher 3 reports

Yuanzhi Zhu

Alibaba Qwen

Yuanzhi Zhu is a Qwen researcher whose public work includes multimodal and audio-language models.

Alibaba Qwen
Qingyang Zhang portrait
Researcher 2 reports

Qingyang Zhang

Alibaba Qwen

Second-year PhD student at Peking University focused on audio-language foundation models, trustworthy AI, and embodied AI; coauthor of Qwen2-Audio.

Alibaba Qwen
Zhen Ye portrait
Researcher 1 reports

Zhen Ye

Alibaba Qwen

Zhen Ye is a researcher in the Qwen team at Alibaba Cloud. His public profile notes a PhD in computer science from the University of Massachusetts Amherst and research interests in natural language understanding, generation, and reasoning.

Alibaba Qwen
Hong Kong 2 likes
Yeyun Gong portrait
Researcher 2 reports

Yeyun Gong

Alibaba Qwen

Yeyun Gong is a researcher and engineering leader focused on multimodal large language models, grounding, and large-scale knowledge systems. His homepage lists selected work including Qwen2-Audio.

Alibaba Qwen
Yongqi Wang portrait
Researcher 3 reports

Yongqi Wang

Alibaba Qwen

Research scientist in Tongyi Lab whose public profile highlights work on speech processing, machine learning, and multimodal large language models.

Alibaba Qwen
Mingyang Shang portrait
Researcher 2 reports

Mingyang Shang

Alibaba Qwen

Research intern at Alibaba Group focused on multimodal understanding and generation, large multimodal models, and reinforcement learning; coauthor of Qwen2-Audio.

Alibaba Qwen
Yaqi Wang portrait
Researcher 2 reports

Yaqi Wang

Alibaba Qwen

Research scientist in Tongyi Lab and technical lead of Qwen2-Audio, with public work on audio-language models.

Alibaba Qwen
Shen Gao portrait
Researcher 1 reports

Shen Gao

Alibaba Qwen

Shen Gao is a PhD student at Zhejiang University working on multimedia and large language models. His public profiles connect him to Qwen2-Audio and related multimodal systems including OmniParser.

Alibaba Qwen
Yinghao Li portrait
Researcher 2 reports

Yinghao Li

Alibaba Qwen

Machine learning engineer and researcher interested in large language models and multimodal audio-language systems; coauthor of Qwen2-Audio.

Alibaba Qwen
Nan Duan portrait
Researcher 1 reports

Nan Duan

Alibaba Qwen

Vice President of JD.COM and Deputy Director of JD Future Academy, leading foundation model research across language, audio, vision, and embodied AI; previously Technical Fellow at StepFun and Senior Principal Researcher at Microsoft Research Asia.

Alibaba Qwen
China 5 likes
Weiqiang Wang portrait
Researcher 1 reports

Weiqiang Wang

Alibaba Qwen

Weiqiang Wang is a PhD student working on multimedia and multimodal AI. Public profiles connect him to the Qwen2-Audio technical report and related research.

Alibaba Qwen
Mengzhe Chen portrait
Researcher 1 reports

Mengzhe Chen

Alibaba Qwen

Research assistant at CUHK-Shenzhen focused on multimodal learning, efficient adaptation, alignment, and reinforcement learning; coauthor of Qwen2-Audio.

Alibaba Qwen
Zhengyuan Liu portrait
Researcher 1 reports

Zhengyuan Liu

Alibaba Qwen

Zhengyuan Liu is a research scientist at Alibaba Group and a PhD student at the National University of Singapore. His public profile highlights work in natural language processing, vision-language models, and grounding.

Alibaba Qwen
Yongqiang Wang portrait
Researcher 1 reports

Yongqiang Wang

Alibaba Qwen

Research scientist at Alibaba working on speech processing, multimodal learning, natural language processing, and efficient human-computer interaction.

Alibaba Qwen
Yushi Hu portrait
Researcher 2 reports

Yushi Hu

Alibaba Qwen

Yushi Hu is a senior research engineer at Shanghai AI Laboratory and a founding member of OpenMMLab. Public arXiv records also list him as a coauthor of Qwen2-Audio.

Alibaba Qwen
Jiaqi Wang portrait
Researcher 1 reports

Jiaqi Wang

Alibaba Qwen

Jiaqi Wang works on machine learning, multimodal large language models, and AI for healthcare. Public profiles connect him to the Qwen2-Audio technical report.

Alibaba Qwen
Hongning Wang portrait
Researcher 2 reports

Hongning Wang

Alibaba Qwen

Associate professor at the University of Virginia and Qwen contributor whose research focuses on personalization and recommender systems, online advertising, and AI systems.

Alibaba Qwen
Chao Zhang portrait
Researcher 2 reports

Chao Zhang

Alibaba Qwen

Chao Zhang is an applied scientist in the Alibaba Foundation Model team. His public profile notes a PhD in computer science from the University of Illinois Urbana-Champaign and research interests in NLP, large language models, reasoning, and multimodal generation.

Alibaba Qwen
Zejun Ma portrait
Researcher 1 reports

Zejun Ma

Alibaba Qwen

PhD student at The Chinese University of Hong Kong focused on speech language understanding, audio-language multimodal learning, and efficient model adaptation; coauthor of Qwen2-Audio.

Alibaba Qwen

LLMpeople is a public atlas for discovering frontier AI researchers with context, provenance, and respect.

Privacy ยท Terms