MOSS-TTSD
A dialogue speech synthesis model that supports both Chinese and English.
Tags: Productivity, Speech Synthesis, Podcast Production
MOSS-TTSD is an open-source bilingual (Chinese and English) dialogue synthesis model that generates natural, expressive speech. It converts dialogue scripts into high-quality speech, which makes it suitable for podcast production and AI dialogue applications. Key features include zero-shot voice cloning and long-duration speech generation with a high degree of expressiveness and realism. The model is trained on large-scale language and speech data, which supports the naturalness and accuracy of the generated speech. It is fully open source and may be used commercially.
MOSS-TTSD Visits Over Time
Monthly Visits: 479,936,721
Bounce Rate: 36.14%
Pages per Visit: 6.1
Visit Duration: 00:06:28