SynthAVSR
PublicThis repository contains the development of SynthAVSR, the first Audiovisual Speech Recognition (AVSR) system tailored for the Spanish and Catalan languages. Based on the AV-HuBERT (Audio-Visual Hidden Unit BERT) model, SynthAVSR leverages synthetic audiovisual data to bridge the gap in speech recognition technology for these languages.
Creat:2024-10-28T01:14:53
Update:2025-01-20T18:13:43
0
Stars
0
Stars Increase