MOSS-TTSD is an open-source bilingual dialogue synthesis model that supports natural and expressive speech generation. It can convert dialogue scripts into high-quality speech, suitable for podcast production and AI dialogue applications. The model's features include zero-shot voice cloning and long-duration speech generation, with a high level of expressiveness and realism. MOSS-TTSD is trained on large-scale language and speech data, ensuring the naturalness and accuracy of the generated speech. This technology is suitable for commercial use and is completely open source.