On August 23, Meta announced the open-source release of SeamlessM4T, a large-scale model capable of translating multiple voices and languages. SeamlessM4T supports translation across 100 languages and voices, enabling multi-modal translation including speech-to-text, speech-to-speech, text-to-speech, and text-to-text. This model integrates previously released translation models by Meta such as NLLB and MMS, and has been trained using 270,000 hours of aligned voice-text data, making it the largest and most comprehensive open-source translation model to date.
The World's Largest Open Source Translation Model! Produced by Meta, Supports 100 Languages and Voices!

AIGC开放社区公众号
This article is from AIbase Daily
Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.