A speech dialect classification model based on the Whisper-small architecture, specifically designed to identify 8 Chinese dialect variants, including Jianghuai dialect, Jiao-Liao Mandarin, Ji-Lu Mandarin, Lan-Yin Mandarin, Mandarin, Southwestern Mandarin, Zhongyuan Mandarin, and Cantonese. This model is trained on the Common Voice 11.0 dataset and has significant value in speech recognition.
Audio Processing
TransformersChinese