Pushing the Limits of Zero-shot End-to-End Speech Translation
? Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Port of OpenAI's Whisper model in C/C++
?? - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
A generative speech model for daily dialogue.
?AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
《The Way to Go》中文译本,中文正式名《Go 入门指南》
A libre lightweight streaming front-end for Android.
?掘金翻译计划,可能是世界最大最好的英译中技术社区,最懂读者和译者的翻译平台:
Instant voice cloning by MIT and MyShell. Audio foundation model.