KaniTTS is a fast, high-fidelity Arabic text-to-speech model optimized for real-time conversational AI applications. It uses a two-stage pipeline that pairs a large language model with an efficient audio codec to deliver both fast synthesis and high audio quality. Intended uses include conversational AI, accessibility tools, and research.
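The two-stage flow described above can be sketched in a few lines of Python. This is only a toy illustration of the data flow and not KaniTTS's actual code or API: the function names, `CODEBOOK_SIZE`, and `SAMPLES_PER_TOKEN` are all hypothetical, and both stages are simple stand-ins for what would be neural networks in a real system.

```python
from typing import List

CODEBOOK_SIZE = 1024    # hypothetical size of the codec's token vocabulary
SAMPLES_PER_TOKEN = 4   # hypothetical number of waveform samples per token


def text_to_audio_tokens(text: str) -> List[int]:
    """Stage 1 stand-in: a real system would run a language model here.

    This toy version maps each UTF-8 byte of the input to one pseudo-token.
    """
    data = text.encode("utf-8")
    return [(byte * 31 + i) % CODEBOOK_SIZE for i, byte in enumerate(data)]


def decode_audio_tokens(tokens: List[int]) -> List[float]:
    """Stage 2 stand-in: a real codec decoder would be a neural network.

    This toy version maps each token to a constant amplitude in [-1, 1]
    and repeats it SAMPLES_PER_TOKEN times.
    """
    waveform: List[float] = []
    for token in tokens:
        amplitude = (token / (CODEBOOK_SIZE - 1)) * 2.0 - 1.0
        waveform.extend([amplitude] * SAMPLES_PER_TOKEN)
    return waveform


def synthesize(text: str) -> List[float]:
    """Chain the two stages: text -> discrete audio tokens -> waveform."""
    tokens = text_to_audio_tokens(text)
    return decode_audio_tokens(tokens)


if __name__ == "__main__":
    audio = synthesize("مرحبا")
    print(f"generated {len(audio)} samples")
```

The point of the split is that the language model only has to predict a short sequence of discrete tokens, while the codec handles the expansion to a full-rate waveform. That division of work is one common way such pipelines keep latency low.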
Audio Processing
Transformers
Arabic