Qwen3-TTS Upgrade: Diverse Voices Make Text-to-Speech More Natural

AIbase基地

Published in AI News · 4 min read · Dec 11, 2025

The Qwen3-TTS speech synthesis model has recently received a comprehensive upgrade and is drawing attention in the speech synthesis field for its performance. The new version supports more voices, languages, and dialects, and it improves the naturalness and stability of generated speech. Users can access it through the Qwen API.

Qwen3-TTS now offers more than 49 high-quality voices covering different genders, ages, and regional characteristics, so users can find a suitable voice for a wide range of scenarios. Examples include Mota, who is cute and playful; Xiaoye Xing, who gives a sense of companionship; and Mo Teacher, who is strict. This wider selection makes synthesized speech more expressive and better at conveying emotion.

Qwen3-TTS has also expanded its support for languages and dialects. The model supports ten major languages, including Chinese, English, German, and French, and its average word error rate (WER) in multilingual testing is lower than that of many comparable products. It can also generate speech in dialects such as Mandarin, Cantonese, and Min Nan, reproducing local accents and the character of each dialect for a broader range of users.
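
For readers unfamiliar with the metric, here is a minimal sketch of how WER is commonly computed: the word-level edit distance between a reference text and a transcript, divided by the number of reference words. For a TTS model the transcript would typically come from running a speech recognizer on the synthesized audio. The function name and the whitespace tokenization are assumptions for illustration; the article does not describe how the Qwen3-TTS figures were measured, and languages written without spaces, such as Chinese, are usually scored per character instead.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()  # assumption: whitespace-separated words
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing from hypothesis
                curr[j - 1] + 1,     # extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# One word dropped out of six reference words, so the rate is 1/6 (about 0.167).
print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

A lower value means the synthesized speech was transcribed back more faithfully, which is why a lower WER counts as better.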

On naturalness, Qwen3-TTS has become much better at adapting to the text: it adjusts speaking rate and intonation to fit the content, producing speech that comes close to a real human voice. The result is a more natural, smoother listening experience.

For developers, Qwen3-TTS provides a simple, easy-to-use API that can be integrated quickly. A few lines of code are enough to generate high-quality synthesized speech. This lowers the barrier to entry and makes advanced speech synthesis available to more people.
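
As a rough illustration of what "a few lines of code" might look like, the sketch below only assembles a JSON request body; it does not call any service. Every field name, the model identifier, and the voice identifier are hypothetical placeholders chosen for this example, not values confirmed by the article. The real request format is defined in the official API documentation.

```python
import json


def build_tts_request(model: str, text: str, voice: str, language: str) -> str:
    """Assemble a JSON body for a hypothetical text-to-speech request."""
    if not text.strip():
        raise ValueError("text must not be empty")
    payload = {
        "model": model,            # hypothetical field name
        "input": {
            "text": text,          # the text to synthesize
            "voice": voice,        # hypothetical voice identifier
            "language": language,  # hypothetical field name
        },
    }
    # ensure_ascii=False keeps non-English text readable in the body.
    return json.dumps(payload, ensure_ascii=False)


# Placeholder values only; substitute the identifiers from the official docs.
body = build_tts_request("example-tts-model", "你好，世界", "example-voice", "Chinese")
print(body)
```

Keeping the request construction in one small function makes it easy to swap in the documented field names once they are known.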

Qwen3-TTS API Documentation:

https://help.aliyun.com/zh/model-studio/multi-round-conversation?spm=a2c4g.11186623.help-menu-2400256.d_0_1_1.49445002U6gJoz

Key Points:
🌟 Qwen3-TTS now offers more than 49 high-quality voices, with diverse characters to suit different needs.
🌍 It supports 10 major languages and multiple dialects, reproducing local accents and characteristics.
🎤 Speech naturalness has improved, with output close to real human speech.


This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to the world of artificial intelligence. Every day, we bring you hot topics in the AI field with a focus on developers, helping you follow technical trends and learn about innovative AI product applications.

Created by the AIbase Daily Team


