Training code for FAcodec presented in NaturalSpeech3
Clone a voice in 5 seconds to generate arbitrary speech in real-time
1 min of voice data can also be used to train a good TTS model! (few-shot voice cloning)
Meteor, the JavaScript App Platform
A deep learning toolkit for Text-to-Speech, battle-tested in research and production
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more model architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference
Instant voice cloning by MIT and MyShell. Audio foundation model.
A cloud-native Go microservices framework with a CLI tool for productivity.
Easily train a good VC model with voice data <= 10 mins!
Cross-platform, customizable ML solutions for live and streaming media.