On August 20, 2025, ElevenLabs, a leading global AI voice technology company, officially announced the launch of its latest Eleven v3 Alpha API, offering developers a groundbreaking Text-to-Speech (TTS) tool.
The Eleven v3 Alpha API is hailed as "the most expressive text-to-speech model on Earth." Its core advantage lies in supporting over 70 languages and generating natural, smooth, and emotionally rich voice outputs.
This API introduces a new Dialogue Mode, allowing developers to create multi-character dialogue scenarios. It supports an unlimited number of virtual characters and can handle changes in tone, emotional fluctuations, and natural interruptions in conversations. This feature makes it particularly suitable for creating multi-character interactive audio content, such as audiobooks, interactive game narratives, and multimedia projects.
In addition, the Eleven v3 Alpha API also supports advanced Audio Tags functionality. Developers can precisely control the tone, emotion, and rhythm of the voice by inserting tags like [happy], [whispering], or [sighs] into the script. This technological breakthrough enables AI voice to not only "speak" but also "perform," providing users with a more realistic and immersive auditory experience. For example, developers can easily achieve dynamic voice generation ranging from dramatic monologues to light-hearted conversations.