ICASSP 2023 Accepted
? Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
real time face swap and one-click video deepfake with only a single image
??? 60+ Implementations/tutorials of deep learning papers with side-by-side notes ?; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), ? reinforcement learning (ppo, dqn), capsnet, distillation, ... ?
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Port of OpenAI's Whisper model in C/C++
?? - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
12 Weeks, 24 Lessons, AI for All!
A generative speech model for daily dialogue.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
?AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time