GenMedLabs
XTTS v2 GGUF is a memory-efficient text-to-speech system optimized for mobile devices. It uses a C++ inference engine to achieve ultra-low memory usage and fast loading.
amenIKh
An XTTS v2 text-to-speech model fine-tuned on a custom Tunisian dataset
OmarSamir
A text-to-speech (TTS) model designed specifically for Egyptian Arabic, built on the XTTS v2 architecture
suhaibrashid17
A TTS model supporting Urdu text-to-speech and voice cloning
shadialhakimi
ⓍTTS-v2 is an advanced voice generation model that supports 17 languages. It can clone voices and achieve cross-lingual voice synthesis with just a 6-second audio clip.
Xerror
A multilingual speech synthesis model fine-tuned with voice samples of C-3PO from 'Star Wars', supporting 17 languages while retaining the character's iconic voice and sarcastic style
Abhinay45
A fine-tuned version of the XTTS v2 model developed by Coqui-AI, optimized for Hindi speech and supporting voice cloning and multilingual speech generation.
UNRN
ⓍTTS is a voice generation model that can clone a voice with just 6 seconds of audio and apply it to different languages, supporting Argentinian-accented Spanish.
AOLCDROM
A Hindi text-to-speech model fine-tuned from a fork of Coqui TTS, supporting speech synthesis in Hindi and in English with a Hindi accent
marianbasti
ⓍTTS is a speech generation model that can clone voices with just 6 seconds of audio and apply them to different languages, without needing hours of training data.
Borcherding
A multilingual text-to-speech model fine-tuned with C-3PO's voice from Star Wars, featuring sarcastic tone and emotional expression
NeuroDonu
An XTTS text-to-speech model trained on a 40-hour dataset. It supports Russian and is suited to speech synthesis tasks in the legal domain.
reach-vb
ⓍTTS is an advanced voice generation model that achieves cross-lingual voice cloning with just 6 seconds of audio, supporting 16 languages.
coqui
ⓍTTS is a revolutionary voice generation model that achieves cross-lingual voice cloning with just a 6-second audio clip, supporting 17 languages.
ⓍTTS is a voice generation model that can clone voices and apply them to different languages with just a 6-second audio clip.
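Several of the descriptions above mention cloning a voice from a clip of about 6 seconds. The following is a minimal sketch of what that workflow might look like in Python, assuming the Coqui `TTS` package is installed and that the base model is published under the name `tts_models/multilingual/multi-dataset/xtts_v2`. The helper names `wav_duration_seconds` and `clone_voice` are hypothetical, and treating 6 seconds as a hard minimum is an assumption made for illustration, not something the listings state.

```python
import wave

# Assumption: treat the "6-second clip" from the descriptions as a minimum length.
MIN_REFERENCE_SECONDS = 6.0


def wav_duration_seconds(path: str) -> float:
    """Return the duration of a PCM WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def clone_voice(text: str, reference_wav: str, language: str, out_path: str) -> str:
    """Synthesize `text` in the voice of `reference_wav` (hypothetical helper)."""
    duration = wav_duration_seconds(reference_wav)
    if duration < MIN_REFERENCE_SECONDS:
        raise ValueError(
            f"reference clip is {duration:.1f}s; "
            f"expected at least {MIN_REFERENCE_SECONDS:.0f}s"
        )

    # Imported lazily so the duration check works without the TTS package.
    from TTS.api import TTS

    # Assumed model name for the base XTTS v2 checkpoint.
    tts = TTS("tts_models/multilingual/multi-dataset/xtts_v2")
    tts.tts_to_file(
        text=text,
        speaker_wav=reference_wav,  # reference clip of the voice to clone
        language=language,          # target language code, e.g. "en"
        file_path=out_path,
    )
    return out_path
```

A call such as `clone_voice("Hello there.", "reference.wav", "en", "output.wav")` would download the model on first use. The fine-tuned variants listed above would likely need their own checkpoints loaded instead of the base model name.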