ElevenLabs, a global leader in AI voice technology, has officially released its latest text-to-speech model, Eleven v3 (alpha), which the company describes as its most expressive AI voice model to date. The release not only improves the naturalness and emotional range of synthesized speech but also gives creators and developers more powerful tools for producing videos, audiobooks, and multimedia applications.
Technical Breakthrough: More Natural Conversations and Emotional Expression
Eleven v3 introduces a new architecture that enables a deeper understanding of text semantics, significantly improving the expressiveness of the generated voice. Compared to previous models, v3 supports over 70 languages and can handle multi-character dialogue, simulating the tone shifts, emotional fluctuations, and interruptions of real conversation. With the new audio-tag feature, users can insert tags such as [sad], [angry], [whispers], or [laughs] directly into the text to precisely control emotional delivery and non-verbal reactions like laughter or sighs. This fine-grained control gives creators unprecedented flexibility, making it particularly well suited to film dubbing, audiobook production, and game voice design.
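To make the audio-tag idea concrete, here is a minimal sketch of what a request with inline tags might look like. The `/v1/text-to-speech/{voice_id}` endpoint is ElevenLabs' existing TTS API, but the `"eleven_v3"` model ID and the voice ID are assumptions: the public v3 API had not shipped at the time of writing, so treat this as illustrative, not authoritative.

```python
import json

# Base URL of ElevenLabs' existing text-to-speech REST API.
API_BASE = "https://api.elevenlabs.io/v1"

def build_tts_request(voice_id: str, text: str, model_id: str = "eleven_v3"):
    """Return (url, body) for a TTS call whose text carries inline audio tags.

    NOTE: "eleven_v3" is an assumed model ID for the alpha model; the
    actual identifier may differ once the public v3 API launches.
    """
    url = f"{API_BASE}/text-to-speech/{voice_id}"
    payload = {
        "text": text,        # audio tags are embedded directly in the text
        "model_id": model_id,
    }
    return url, json.dumps(payload)

# Tags like [whispers] and [laughs] sit inline, right where the
# emotion or non-verbal reaction should occur.
url, body = build_tts_request(
    "my-voice-id",  # hypothetical voice ID
    "[whispers] I wasn't expecting you. [laughs] Come in, quickly!",
)
```

The key design point is that emotional control lives in the text itself rather than in separate parameters, so a script or audiobook manuscript can be annotated in place.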
Applications: Empowering Creators and Developers
ElevenLabs emphasizes that the v3 model is designed for content creators and media-tool developers. Whether for engaging video narration, emotionally rich audiobooks, or interactive media tools, v3's high expressiveness can significantly improve the user experience. The model also supports up to 32 distinct speakers, providing strong support for multi-speaker dialogue and making v3 well suited to education, entertainment, and enterprise applications such as AI customer-service centers.
Alpha Testing and Discounts: Good News for Developers and Creators
Eleven v3 is now in public alpha testing, with an 80% discount available throughout June to encourage users to try its features. ElevenLabs also announced that a public API for v3 will launch soon; developers can request early access through the sales team. For real-time and conversational use cases, ElevenLabs recommends sticking with the v2.5 Turbo or Flash models for now, as a real-time version of v3 is still in development and is expected to further broaden v3's range of applications.
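An application following this guidance might route requests by latency requirement: expressive v3 for offline rendering, Turbo or Flash for live conversation. The sketch below encodes that routing; the model IDs (`eleven_v3`, `eleven_turbo_v2_5`, `eleven_flash_v2_5`) are assumptions based on ElevenLabs' public naming conventions, not confirmed identifiers.

```python
def pick_model(realtime: bool, prefer_lowest_latency: bool = False) -> str:
    """Choose a model ID per ElevenLabs' stated guidance.

    Model IDs here are assumed names: "eleven_v3" in particular is
    hypothetical until the public v3 API ships.
    """
    if not realtime:
        return "eleven_v3"          # most expressive; no real-time variant yet
    if prefer_lowest_latency:
        return "eleven_flash_v2_5"  # Flash: lowest latency
    return "eleven_turbo_v2_5"      # Turbo: balanced speed and quality
```

For example, an audiobook renderer would call `pick_model(realtime=False)`, while a live voice agent would call `pick_model(realtime=True, prefer_lowest_latency=True)`.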
Industry Impact: Leading the New Trend in AI Voice Technology
As AI voice technology develops rapidly, the release of v3 undoubtedly intensifies industry competition. ElevenLabs already holds a strong position in audiobooks, dubbing, and AI customer service thanks to its high-precision voice cloning and text-to-speech technology. v3 further consolidates that lead, standing out in multi-language support and emotional expression compared with competing voice offerings from OpenAI and Google (such as Gemini 2.0). Users on the X platform have already called v3 the "ultimate text-to-speech model," a sign of its influence.
ElevenLabs says v3 is just one step on its technical roadmap: it will continue to optimize performance, release low-latency versions to support real-time applications, and further expand language support and scenario coverage. AIbase believes the release of v3 marks not only a technical breakthrough for ElevenLabs in AI voice but also new possibilities for content creation and human-computer interaction. As the technology becomes widespread, AI voice is expected to become a core driving force in digital content creation.
AIbase will continue to follow the latest developments of ElevenLabs and AI voice technology to bring you cutting-edge information.