Qwen3-Omni Technical Report

Alibaba Qwen report author whose DBLP profile identifies an Alibaba Group affiliation and Qwen technical report authorship.

Jingren Zhou is Chief Technology Officer of Alibaba Cloud. Public speaker biographies describe him as a computer scientist and entrepreneur whose work includes large-scale AI and cloud systems.

Qinyu Chen is listed as an author of the DeepSeek technical report DeepSeek-V3 Technical Report.

Yuanzhi Zhu is a Qwen researcher whose public work includes multimodal and audio-language models.

Mingkun Yang works on multimodal large language models, embodied AI, and robotics. His public profile says he is a postdoc at Zhejiang University and a research scientist at Qwen.

Research scientist in Tongyi Lab whose public homepage and OpenReview profile describe work on large language models, multimodal learning, and visual grounding. His public profiles also list affiliations with Alibaba Group and East China Normal University.

Researcher at Alibaba Group working on multimodal large language models; public profile and publication context connect Hang Zhang to the Qwen2-VL technical report.

Second-year PhD student at Peking University focused on audio-language foundation models, trustworthy AI, and embodied AI; coauthor of Qwen2-Audio.

Machine learning engineer and researcher interested in large language models and multimodal audio-language systems; coauthor of Qwen2-Audio.

Associate professor at the University of Virginia and Qwen contributor whose research focuses on personalization and recommender systems, online advertising, and AI systems.

Yale Song is an assistant professor in artificial intelligence at Yonsei University and is also affiliated with the Stanford AI Lab while working part-time with Adobe Research.

Researcher whose public homepage focuses on computer vision, multimodal foundation models, and embodied AI; publication context connects Shilong Liu to the Qwen2.5-Omni technical report.

Linjie Li is a research scientist at Alibaba Group and a contributor to the Qwen2.5-Omni Technical Report.

Research engineer at Alibaba Group working on audio and multimodal foundation models, multimodal RL, and speech processing; coauthor of Qwen2.5-Omni.

Jianwei Niu is a tenure-track research assistant professor in the School of Data Science at Lingnan University, Hong Kong. His research focuses on multimodal learning, computer vision, and embodied AI.

Researcher working on multimodal foundation models, including Qwen3-Omni and related speech-language systems.

Canonical link

Peng Wang

Jingren Zhou

Qinyu Chen

Yuanzhi Zhu

Mingkun Yang

Jiabo Ye

Hang Zhang

Qingyang Zhang

Yinghao Li

Hongning Wang

Yale Song

Shilong Liu

Linjie Li

Xiaoyu Hu

Jianwei Niu

Kexuan Wei