Speech recognition toolkit for the Arduino
Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
ComfyUI docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience.
Open Source Computer Vision Library
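Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.
Just 1 minute of voice data is enough to train a good TTS model (few-shot voice cloning).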
Port of OpenAI's Whisper model in C/C++
A deep learning toolkit for Text-to-Speech, battle-tested in research and production
Learn how to design, develop, deploy and iterate on production-grade ML applications.
A generative speech model for daily dialogue.
AI voice cloning: Clone a voice in 5 seconds to generate arbitrary speech in real time