Step Audio Model Ranks Among the Top Three Globally, Setting a New High for Chinese Large Models in Speech Perception
The speech generation model StepAudio2.5TTS from Chinese company StepXingchen has entered the top three globally in the Artificial Analysis Speech Arena Leaderboard, becoming the highest-ranked Chinese large model product on the list. The ranking uses a blind-test Elo scoring system, where users evaluate speech perception without knowing the model's identity, highlighting its genuine speech synthesis capabilities.