Step-Audio 2: Cascaded Multimodal Large Language Models with Versatile Speech Capabilities
Audio Language Models report from Stepfun with 10 connected researchers in the LLMpeople atlas.
Connected researchers
Can Cui
Stepfun
Researcher at Stepfun and coauthor of Step-Audio 2: Cascaded Multimodal Large Language Models with Versatile Speech Capabilities.
Xiao Ma
Stepfun
Researcher at Stepfun and coauthor of Step-Audio 2: Cascaded Multimodal Large Language Models with Versatile Speech Capabilities.
Zeyi Yan
Stepfun
Researcher at Stepfun and coauthor of Step-Audio 2: Cascaded Multimodal Large Language Models with Versatile Speech Capabilities.
Siyao Wang
Stepfun
Researcher at Stepfun and coauthor of Step-Audio 2: Cascaded Multimodal Large Language Models with Versatile Speech Capabilities.
Jiale Zhuang
Stepfun
Researcher at Stepfun and coauthor of Step-Audio 2: Cascaded Multimodal Large Language Models with Versatile Speech Capabilities.
Yu Guo
Stepfun
Researcher at Stepfun and coauthor of Step-Audio 2: Cascaded Multimodal Large Language Models with Versatile Speech Capabilities.
Yizhou Zou
Stepfun
Researcher at StepFun AI working on speech, language, and multimodal learning, including Step-Audio 2.
Huan Yang
Stepfun
Researcher at Stepfun and coauthor of Step-Audio 2: Cascaded Multimodal Large Language Models with Versatile Speech Capabilities.
Ruiqi Song
Stepfun
Researcher at Stepfun and coauthor of Step-Audio 2: Cascaded Multimodal Large Language Models with Versatile Speech Capabilities.
Hui Yu
Stepfun
Researcher at Stepfun and coauthor of Step-Audio 2: Cascaded Multimodal Large Language Models with Versatile Speech Capabilities.