MOSS-TTSD Makes a Stunning Open Source Debut: A Million Hours of Training Creates a New King in AI Podcasts
Tsinghua & partners open-sourced MOSS-TTSD, a bilingual speech model based on Qwen3-1.7B. Features XY-Tokenizer for 1kbps low-bitrate quality, zero-shot cloning, and 960s generation. Outperforms MoonCast in Chinese metrics.....