VideoChat
Public实时语音交互数字人,支持端到端语音方案(GLM-4-Voice - THG)和级联方案(ASR-LLM-TTS-THG)。可自定义形象与音色,无须训练,支持音色克隆,首包延迟低至3s。Real-time voice interactive digital human, supporting end-to-end voice solutions (GLM-4-Voice - THG) and cascaded solutions (ASR-LLM-TTS-THG). Customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s.
asrdialogue-systemsdigital-humanend-to-endgradio-python-applip-syncmultimodal-large-language-modelsmusetalkreal-timestreaming
Creat:2024-10-18T15:11:02
Update:2025-03-23T07:11:47
https://www.modelscope.cn/studios/AI-ModelScope/video_chat
1.0K
Stars
1
Stars Increase