Acoustic and language models for minorised languages.
? Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Port of OpenAI's Whisper model in C/C++
?? - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
A generative speech model for daily dialogue.
?AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
Instant voice cloning by MIT and MyShell. Audio foundation model.
? Industrial-strength Natural Language Processing (NLP) in Python
SoftVC VITS Singing Voice Conversion