Vietnamese speech recognition using WaveNet
Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
ComfyUI docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience.
Open Source Computer Vision Library
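Dive into Deep Learning: written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.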
Apache Superset is a Data Visualization and Data Exploration Platform
Just 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
The 30 Days of Python programming challenge is a step-by-step guide to learning the Python programming language in 30 days. The challenge may take more than 100 days; follow your own pace. These videos may help too: https://www.youtube.com/channel/UC7PNRuno1rzYPb1xLa4yktw
Port of OpenAI's Whisper model in C/C++
A deep learning toolkit for Text-to-Speech, battle-tested in research and production
Learn how to design, develop, deploy and iterate on production-grade ML applications.