Atlas / Reports / Detail
Qwen3-Omni Technical Report
Multimodal Models
Connected researchers
Jiabo Ye
Qwen
Research scientist in Tongyi Lab whose public homepage and OpenReview profile describe work on large language models, multimodal learning, and visual grounding. His public profiles also list affiliations with Alibaba Group and East China Normal University.
Yale Song
Qwen
Yale Song is an assistant professor in artificial intelligence at Yonsei University and is also affiliated with the Stanford AI Lab while working part-time with Adobe Research.
Mingkun Yang
Qwen
Mingkun Yang works on multimodal large language models, embodied AI, and robotics. His public profile says he is a postdoc at Zhejiang University and a research scientist at Qwen.
Hongning Wang
Qwen
Associate professor at the University of Virginia and Qwen contributor whose research focuses on personalization and recommender systems, online advertising, and AI systems.
Shilong Liu
Qwen
Researcher whose public homepage focuses on computer vision, multimodal foundation models, and embodied AI; publication context connects Shilong Liu to the Qwen2.5-Omni technical report.
Jianwei Niu
Qwen
Jianwei Niu is a tenure-track research assistant professor in the School of Data Science at Lingnan University, Hong Kong. His research focuses on multimodal learning, computer vision, and embodied AI.
Jingren Zhou
MiniMax / Moonshot AI / Qwen / Z.ai
Alibaba senior technology leader and researcher associated with Qwen. Public profiles list him with Alibaba Group, and official Alibaba Cloud coverage identifies him as a chief technology officer leading large-model work.
Yuanzhi Zhu
Qwen
Yuanzhi Zhu is a Qwen researcher whose public work includes multimodal and audio-language models.
Peng Wang
Qwen
Researcher affiliated with the Qwen team at Alibaba Group on Google Scholar and coauthor of the Qwen and Qwen3 technical reports.
Qinyu Chen
DeepSeek / Qwen
Research scientist at Qwen, Alibaba, whose public OpenReview profile lists work on vision-language models and large language models.
Hang Zhang
Qwen
Researcher at Alibaba Group working on multimodal large language models; public profile and publication context connect Hang Zhang to the Qwen2-VL technical report.
Linjie Li
Qwen
Linjie Li is a research scientist at Alibaba Group and a contributor to the Qwen2.5-Omni Technical Report.
Qingyang Zhang
Qwen
Second-year PhD student at Peking University focused on audio-language foundation models, trustworthy AI, and embodied AI; coauthor of Qwen2-Audio.
Xiaoyu Hu
Qwen
Research engineer at Alibaba Group working on audio and multimodal foundation models, multimodal RL, and speech processing; coauthor of Qwen2.5-Omni.
Yinghao Li
Qwen
Machine learning engineer and researcher interested in large language models and multimodal audio-language systems; coauthor of Qwen2-Audio.
Kexuan Wei
Qwen
Researcher working on multimodal foundation models, including Qwen3-Omni and related speech-language systems.