SynthAVSR

Public

This repository hosts the development of SynthAVSR, the first audiovisual speech recognition (AVSR) system tailored to Spanish and Catalan. Built on the AV-HuBERT (Audio-Visual Hidden Unit BERT) model, SynthAVSR uses synthetic audiovisual data to narrow the gap in speech recognition technology for these languages.

Created: 2024-10-28T01:14:53
Updated: 2025-01-20T18:13:43
Stars: 0
Stars Increase: 0

Related projects