Alibaba Launches Powerful Text-to-Speech Model Qwen3-TTS, 49 Voices Meet Your Voice Needs!

AIbase基地

Published inAI News · 5 min read · Dec 11, 2025

125

Alibaba Qwen released the new generation of text-to-speech large model, Qwen3-TTS, which is now freely available to developers worldwide through the Qwen API. The model offers 49 multi-character voice options, supports 10 major languages and 10 Chinese dialects, and the official claims that its average word error rate (WER) on the MiniMax TTS multilingual test set is better than MiniMax and ElevenLabs, with a level of naturalness approaching that of real people.

49 Voice Options Ready to Use

- Character Library: Includes gender, age, region, and character settings - "Coquettish and funny Moutu", "Strict Teacher Mo Teacher", "Wisdom Elder Cang Mingzi", etc., can be switched with one click

- Scenario Adaptation: Podcasts, audiobooks, game NPCs, and smart customer service can switch voices in seconds without additional training

10 Languages and 10 Dialects, Leading WER Across Languages

- Major Languages: Covering 10 languages including Chinese, English, German, Italian, and French

- Dialect List: Including Mandarin, Cantonese, Sichuan dialect, etc., 10 dialects retain authentic accents and intonation

- Objective Metrics: The average WER on the MiniMax TTS multilingual test set is lower than ElevenLabs, with a synthesis accuracy increase of about 12%

Rhythm and Speed: Text-Driven, Naturalness Close to Real People

- Adaptive Speed: Automatically adjusts speed and pauses based on the text's emotion

- Rhythm Model: Predicts stress and intonation at the syllable level, with a MOS score of 4.6, close to real people's 4.8

- Real-Time Streaming: First packet delay <300ms, suitable for live dubbing and dialogue scenarios

Free Access & Business-Friendly

- API Pricing: Currently free and no call limit

- Licensing Terms: Default support for commercial use, no additional licensing fees required

- Integration Example: A single HTTPS request can be integrated, completing voice broadcasting with 10 lines of code

Next Step: Dialect Cloning + Edge Deployment

Alibaba revealed that in Q1 2025, it will launch the "Dialect Voice Cloning" feature, allowing a 5-second audio clip to recreate regional accents; in Q2, it will release an edge box version, supporting offline local network deployment, targeting scenarios such as smart scenic spots and in-car voice systems.

Editor's Note

When text-to-speech technology has reached the stage where "voice is a character," Qwen3-TTS differentiates itself with 49 character settings, 10 dialects, and free APIs: voices can be switched instantly without training, and WER metrics directly compete with international paid engines. For applications that rely heavily on voice and style, such as podcasts, games, and customer service, this effectively brings the cost of "voice actors + post-production" close to zero.

AlibabaTongyiQianwen Qwen3-TTS LargeLanguageModelforSpeechSynthesis Multi-roleVoiceTones

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

NVIDIA Releases Nemotron 3 Embed Series 8B Version, Tops RTEB Retrieval Benchmark

NVIDIA released the Nemotron3Embed embedding model series for production-grade RAG, agent retrieval, code search, and memory. The 8B model ranks first on the RTEB benchmark, making it the top open-source embedding model. The series includes three checkpoints: accuracy-focused 8B-BF16, lightweight 1B-BF16, and 1B-NVFP4 4-bit optimized for Blackwell architecture. All models use bidirectional attention.....

Jul 17, 2026

190

AI Daily: Open Source Model Kimi K3 Makes Its Debut; Google Vids Introduces Gemini Omni Model; Zhipu AI Aims for $1 Billion ARR

Welcome to the [AI Daily] segment! Here is your guide to exploring the world of artificial intelligence every day. Every day, we present you with the latest content in the AI field, focusing on developers, helping you grasp technological trends and understand innovative AI product applications. Discover new AI products: https://app.aibase.com/zh1, 2.8 trillion parameters, 1 million token context, KimiK3 pushes the ceiling of open source large models to the highest globally. This article introduces the KimiK3 model released by Moonshot AI.

Jul 17, 2026

240

MiaoDa 3.5 Global Launch: iOS No-Code Packaging and Multi-Platform Backend Sharing. Baidu is Further Lowering the Barrier to App Development

On July 16 at WAIC Shanghai, Baidu AI Cloud's no-code platform Miaoda released version 3.5, evolving from 'one-sentence app creation' to 'simpler and more comprehensive,' further lowering development barriers. It has served 35 million users, created 3.5 million commercial apps, with nearly 200,000 daily active users.....

Jul 17, 2026

170

Roblox Launches AI Creation Tool Build, Supporting Text-Generated Games and 3D Scenes

Roblox launched AI creation tool Build and upgraded its Studio, using AI to lower the development barrier. Users can generate editable game content via text prompts. The feature begins testing on July 28, deepening the 'user-generated content' philosophy. The platform has 132 million daily active users. Build is a mobile-first tool, enabling creation anytime, anywhere.....

Jul 17, 2026

220

2.8 Trillion Parameters, 1 Million Word Context Kimi K3 Raises the Global Ceiling of Open-Source Large Models to the Highest Level

Ahead of the 2026 World AI Conference, Moonshot AI launched Kimi K3, a 2.8-trillion-parameter open-source model, now the world's largest open-source model, surpassing closed-source rivals in size for the first time. It's not just about scale—it marks a significant open-source breakthrough.....

Jul 17, 2026

420

Resume Lost in the Sea? Qwen Breaks Down Writing Resumes, Creating PPTs, and Filtering Dirty Data into a Replicable AI Office Workflow

Wuhan hosted an AI job-seeking workshop to address common pain points like ignored resumes, report struggles, and messy spreadsheets. Through hands-on practice, participants learned resume diagnosis, business report writing, and sales data analysis, turning theory into practical skills.....

Jul 16, 2026

210

AI Daily: MiniMax Code 2.0 Desktop Version Released; Kimi K3 Model Teaser Video Leaked; Tongyi Qianwen Officially Integrated into Apple Ecosystem

Welcome to the [AI Daily] column! This is your guide to exploring the world of artificial intelligence every day. Every day, we present you with the latest content in the AI field, focusing on developers, helping you understand technology trends and innovative AI product applications. Click to learn more about new AI products: https://app.aibase.com/zh1. Lingguang App - The 'Lingguang Circle' community is being renewed: launching hot lists, follow functions, PC support for importing documents and audio/video materials. The Lingguang App has upgraded the functions of the 'Lingguang Circle' community, adding hot lists and editorial selections.

Jul 16, 2026

670

Kimi K3 Model Warm-up Video Leaks, Multiple Comparisons Directly Targeting Claude Fable5, Initiating a Challenge

Moonshot released a teaser hinting at the imminent launch of Kimi K3, with a fleeting “3” fueling speculation. An anonymous Kivine model surfaced on Arena.ai, suspected to be Kimi K3, and a comparison test with the Claude model has been leaked online.....

Jul 16, 2026

420

MiniMax Releases Code2.0 Desktop Version: Comprehensive Reconstruction of the Underlying Architecture, Native Integration with Multi-source Financial Data

MiniMax launches Code 2.0 desktop, rebuilt on Pi Agent for faster startup and more stable long tasks. Improves chart loading, zoom, and download. Preview panel enables direct file selection, editing, and saving, achieving a seamless task-to-delivery loop.....

Jul 16, 2026

270

From the 70s to the 05s: Sharing the Same Qwen AI Class: These Job Seekers All Want to Learn New Skills

At 49, veteran Wan Xiaoming left his 18-year state-owned job, facing career confusion yet refusing to give up. He dedicates 4 hours daily to AI and self-media, using new tools to keep pace with the times and seek fresh paths with a young mindset, embodying the proactive spirit of middle-aged strivers.....

Jul 16, 2026

210

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Ranking Monitor

AI Conversation Insight

GEO Promotion Link Detection

Website AI Friendliness Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

LLM API Proxy Checker

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

Alibaba Launches Powerful Text-to-Speech Model Qwen3-TTS, 49 Voices Meet Your Voice Needs!

AIbase基地

This article is from AIbase Daily

AI News Recommendations

NVIDIA Releases Nemotron 3 Embed Series 8B Version, Tops RTEB Retrieval Benchmark

AI Daily: Open Source Model Kimi K3 Makes Its Debut; Google Vids Introduces Gemini Omni Model; Zhipu AI Aims for $1 Billion ARR

MiaoDa 3.5 Global Launch: iOS No-Code Packaging and Multi-Platform Backend Sharing. Baidu is Further Lowering the Barrier to App Development

Roblox Launches AI Creation Tool Build, Supporting Text-Generated Games and 3D Scenes

2.8 Trillion Parameters, 1 Million Word Context Kimi K3 Raises the Global Ceiling of Open-Source Large Models to the Highest Level

Resume Lost in the Sea? Qwen Breaks Down Writing Resumes, Creating PPTs, and Filtering Dirty Data into a Replicable AI Office Workflow

AI Daily: MiniMax Code 2.0 Desktop Version Released; Kimi K3 Model Teaser Video Leaked; Tongyi Qianwen Officially Integrated into Apple Ecosystem

Kimi K3 Model Warm-up Video Leaks, Multiple Comparisons Directly Targeting Claude Fable5, Initiating a Challenge

MiniMax Releases Code2.0 Desktop Version: Comprehensive Reconstruction of the Underlying Architecture, Native Integration with Multi-source Financial Data

From the 70s to the 05s: Sharing the Same Qwen AI Class: These Job Seekers All Want to Learn New Skills

AI News Recommendations

NVIDIA Releases Nemotron 3 Embed Series 8B Version, Tops RTEB Retrieval Benchmark

AI Daily: Open Source Model Kimi K3 Makes Its Debut; Google Vids Introduces Gemini Omni Model; Zhipu AI Aims for $1 Billion ARR

MiaoDa 3.5 Global Launch: iOS No-Code Packaging and Multi-Platform Backend Sharing. Baidu is Further Lowering the Barrier to App Development

Roblox Launches AI Creation Tool Build, Supporting Text-Generated Games and 3D Scenes

2.8 Trillion Parameters, 1 Million Word Context Kimi K3 Raises the Global Ceiling of Open-Source Large Models to the Highest Level

Resume Lost in the Sea? Qwen Breaks Down Writing Resumes, Creating PPTs, and Filtering Dirty Data into a Replicable AI Office Workflow

AI Daily: MiniMax Code 2.0 Desktop Version Released; Kimi K3 Model Teaser Video Leaked; Tongyi Qianwen Officially Integrated into Apple Ecosystem

Kimi K3 Model Warm-up Video Leaks, Multiple Comparisons Directly Targeting Claude Fable5, Initiating a Challenge

MiniMax Releases Code2.0 Desktop Version: Comprehensive Reconstruction of the Underlying Architecture, Native Integration with Multi-source Financial Data

From the 70s to the 05s: Sharing the Same Qwen AI Class: These Job Seekers All Want to Learn New Skills