Qwen-TTS, a text-to-speech model, has recently received a major update. Its latest version is now available through the Qwen API, bringing users a richer text-to-speech experience.
This update adds support for three Chinese dialects: Beijing dialect, Shanghai dialect, and Sichuan dialect, further broadening the model's range of applications. Trained on a large-scale corpus of more than 3 million hours of speech, the model achieves human-level naturalness and expressiveness in synthesis. Qwen-TTS not only synthesizes speech accurately, but also automatically adjusts intonation, rhythm, and emotional coloring according to the input text, making the generated speech more natural and expressive.
Currently, Qwen-TTS supports seven Chinese- and English-capable voices, including standard voices such as Cherry and Ethan, as well as dialect-specific voices such as Dylan (Beijing dialect), Jada (Shanghai dialect), and Sunny (Sichuan dialect). Users can choose the voice that best fits their text-to-speech needs.
In practical use, Qwen-TTS performs well across a range of material, producing natural, fluent speech whether the text describes everyday scenes or expresses complex emotions. For example, synthesizing text about childhood games with the Beijing dialect voice Dylan yields speech full of childlike playfulness and energy, while synthesizing everyday dialogue with the Shanghai dialect voice Jada conveys an authentic Shanghai flavor.
The Qwen-TTS development team stated that they will continue to optimize the model's performance and plan to add more languages and voice styles to meet users' increasingly diverse needs. They also provide a convenient API, making it easy for developers to integrate Qwen-TTS into their own applications.
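To illustrate how such an integration might look, here is a minimal sketch that assembles a synthesis request for the qwen-tts model. This is a hypothetical example: the payload shape, the `build_tts_request` helper, and the voice-to-dialect mapping below are assumptions for illustration only; the model name and voice names come from the announcement, but the actual request format should be taken from the Model Studio documentation linked below.

```python
# Hypothetical sketch of preparing a Qwen-TTS synthesis request.
# The payload structure here is an assumption for illustration;
# consult the official Model Studio docs for the real interface.

# Voices named in the announcement (the full list has seven).
VOICES = {
    "Cherry": "standard",
    "Ethan": "standard",
    "Dylan": "Beijing dialect",
    "Jada": "Shanghai dialect",
    "Sunny": "Sichuan dialect",
}

def build_tts_request(text: str, voice: str = "Cherry") -> dict:
    """Assemble an illustrative request payload for a qwen-tts call."""
    if voice not in VOICES:
        raise ValueError(f"unknown voice: {voice!r}")
    return {
        "model": "qwen-tts",  # model name from the announcement
        "input": {"text": text, "voice": voice},  # assumed payload shape
    }

if __name__ == "__main__":
    # Request Beijing-dialect synthesis with the Dylan voice.
    payload = build_tts_request("小时候我们在胡同里玩儿", voice="Dylan")
    print(payload)
```

Validating the voice name locally, as the helper does, lets an application fail fast before sending a request with an unsupported voice to the service.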
Model Studio: https://help.aliyun.com/zh/model-studio/qwen-tts