DoyenTalker
DoyenTalker uses deep learning to generate personalized avatar videos that speak user-provided text in a specified voice. It uses Coqui TTS for text-to-speech generation, combined with face rendering and animation techniques, to produce a video in which the given avatar articulates the speech.
audio-driven-talking-face, coqui-tts, lipsync, pytorch, talking-face, talking-face-generation, talking-head, tts, wav2lip-gan
Created: 2024-07-11T14:41:03
Updated: 2025-02-14T07:31:30
Stars: 12
Stars Increase: 0