Google Releases Its Strongest TTS Model, Supporting Nearly 70 Languages

AIbase基地

Published inAI News · 3 min read · Apr 16, 2026

Google has officially launched the new text-to-speech model Gemini-TTS in the Gemini 3.1 series. The official positioning is direct and confident: "The most expressive text-to-speech solution to date."

The core breakthrough of this model lies in truly giving developers control over speech. Previously, TTS products often generated voices that were monotonous, with flat intonation, rigid rhythm, and shallow emotion. Gemini-TTS, however, supports direct control over the emotion, rhythm, and style of the voice through prompts—whether it's a narration requiring a deep and solemn tone or a conversation needing a relaxed and natural feel, pauses and emotional fluctuations can be precisely controlled by describing them in language. The naturalness and delicacy of the listening experience have taken a significant step forward compared to previous similar products.

In terms of multilingual support, Gemini-TTS covers approximately 70 languages, including mainstream languages such as Mandarin Chinese, English, Spanish, and Japanese. More conveniently, the model can automatically recognize the language of the input text without requiring developers to manually annotate it, directly generating voice output in the corresponding language. For enterprises serving global users, this means a single API can handle multilingual content voice requirements, with audiobooks, podcasts, customer service robots, and educational applications all directly benefiting from this feature.

Google also emphasized the collaborative capabilities of Gemini-TTS with other audio models in the same series. In real-time conversations, voice translation, and multimodal interaction scenarios, the system can finely adjust voice output through text prompts and audio tags while maintaining low latency, making AI sound more like real human communication in practical applications such as phone calls, meetings, and navigation.

Gemini-TTS AINeologism BrandProductTerms Text-to-Speech

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

DoorDash Launches Ask DoorDash AI Chatbot, Supporting Text and Photo-Based Cross-Modal Ordering

In June 2026, DoorDash launched 'Ask DoorDash,' an AI chatbot enabling text/photo-based ordering, replacing traditional scrolling. It features multimodal parsing and context understanding for vague searches, marking a shift toward conversational, multimodal AI in lifestyle platforms.....

Jun 12, 2026

280

Apple Developer Ecosystem Upgrade: Xcode 27 Native Integration of Gemini AI, Programming Camp Gains a Strong Ally

Apple integrates Google Gemini natively in Xcode 27 Beta, making it the third built-in AI coding agent after OpenAI Codex and Anthropic Claude Agent. This update offers developers more diverse smart coding options, boosting efficiency and interaction, marking a deep AI-driven transformation in Apple's development ecosystem.....

Jun 11, 2026

200

Expanding to More Countries: Google Chrome Browser Version of Gemini Still Excluded from the EU

Google expands its AI assistant Gemini beyond Chrome's built-in chatbot to Latin America, Africa, and the Middle East for desktop and iOS users, but the EU is excluded again.....

Jun 11, 2026

320

Crackdown on Rumors: Li Auto Sues a Media Company for Mass Defamation Using AI

Li Auto's legal department has taken legal action against a Jiangxi-based cultural media company for using AI to generate false content and maliciously defame the brand, filing a report with public security authorities. The involved entity has been investigated, penalized, and publicly apologized. Li Auto emphasizes distinguishing normal supervision from malicious rumor-mongering, resolutely protecting corporate legal rights.....

Jun 11, 2026

240

Google Releases New NotebookLM: Comprehensive Upgrades to Gemini 3.5 Flash and Introduces Standalone Cloud Computer and AI Agent

Google announced on June 10, 2026, an upgrade to NotebookLM, integrating the Gemini 3.5 Flash model and coding tool Antigravity. The key enhancement is a dedicated cloud computer per user notebook, enabling code writing, real-time execution, and AI agent deep research for complex academic and engineering projects.....

Jun 11, 2026

640

Google Enters the Smart Glasses Market: Powered by Gemini, Competing with Ray-Ban This Autumn

Google announced smart glasses developed in partnership with Warby Parker and Gentle Monster, powered by the Gemini large model, set to launch this fall. Targeting Meta and Ray-Ban's industry-leading position, this marks Google's latest AI hardware endeavor, positioning smart glasses as the next battleground for tech giants.....

Jun 11, 2026

180

Xiaomi Open-Sources Terminal AI Coding Assistant MiMo Code with Free Top-Grade Multimodal Model Built-In

Xiaomi's tech team open-sourced AI coding assistant MiMo Code V0.1.0, based on OpenCode with MIT license, enabling free modification and commercial integration. It features 'model-agent synergy optimization' and a unique persistent memory system to address AI coding tools' forgetfulness, aiming for self-evolution.....

Jun 11, 2026

460

Google Releases DiffusionGemma: Trying to Speed Up AI Inference Using Text Diffusion Architecture

Google released the open-source experimental model DiffusionGemma on June 10, using a text diffusion architecture to achieve up to 4x faster text generation on dedicated GPUs compared to traditional autoregressive models, aiming to boost AI efficiency, though the company remains cautious.....

Jun 11, 2026

190

Cold and Restrained: Foreign Media Tests Apple iOS 27 New Siri AI

Apple's new Siri AI in iOS27 exhibits a uniquely cool demeanor, offering extremely concise responses without small talk, in stark contrast to the verbose friendliness of Google Gemini and ChatGPT. For instance, when asked 'How's it going?', Siri avoids chitchat, focusing on direct information. This restrained, rational design stands out in the anthropomorphic AI market.....

Jun 11, 2026

140

Search is about to change! Google will offer AI Mode interactive chart features for free to all users

Google announced that its "AI Mode Interactive Visualized Chart" feature will be available for free to all search users this summer. Previously, the feature was only available to AI Mode Pro and Ultra subscribers. It is based on Gemini's interactive image technology, which can generate dynamic simulations and models to help users better understand new concepts and improve learning efficiency.

Jun 10, 2026

240

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Ranking Monitor

AI Conversation Insight

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

LLM API Proxy Checker

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

Google Releases Its Strongest TTS Model, Supporting Nearly 70 Languages

AIbase基地

This article is from AIbase Daily

AI News Recommendations

DoorDash Launches Ask DoorDash AI Chatbot, Supporting Text and Photo-Based Cross-Modal Ordering

Apple Developer Ecosystem Upgrade: Xcode 27 Native Integration of Gemini AI, Programming Camp Gains a Strong Ally

Expanding to More Countries: Google Chrome Browser Version of Gemini Still Excluded from the EU

Crackdown on Rumors: Li Auto Sues a Media Company for Mass Defamation Using AI

Google Releases New NotebookLM: Comprehensive Upgrades to Gemini 3.5 Flash and Introduces Standalone Cloud Computer and AI Agent

Google Enters the Smart Glasses Market: Powered by Gemini, Competing with Ray-Ban This Autumn

Xiaomi Open-Sources Terminal AI Coding Assistant MiMo Code with Free Top-Grade Multimodal Model Built-In

Google Releases DiffusionGemma: Trying to Speed Up AI Inference Using Text Diffusion Architecture

Cold and Restrained: Foreign Media Tests Apple iOS 27 New Siri AI

Search is about to change! Google will offer AI Mode interactive chart features for free to all users

AI News Recommendations

DoorDash Launches Ask DoorDash AI Chatbot, Supporting Text and Photo-Based Cross-Modal Ordering

Apple Developer Ecosystem Upgrade: Xcode 27 Native Integration of Gemini AI, Programming Camp Gains a Strong Ally

Expanding to More Countries: Google Chrome Browser Version of Gemini Still Excluded from the EU

Crackdown on Rumors: Li Auto Sues a Media Company for Mass Defamation Using AI

Google Releases New NotebookLM: Comprehensive Upgrades to Gemini 3.5 Flash and Introduces Standalone Cloud Computer and AI Agent

Google Enters the Smart Glasses Market: Powered by Gemini, Competing with Ray-Ban This Autumn

Xiaomi Open-Sources Terminal AI Coding Assistant MiMo Code with Free Top-Grade Multimodal Model Built-In

Google Releases DiffusionGemma: Trying to Speed Up AI Inference Using Text Diffusion Architecture

Cold and Restrained: Foreign Media Tests Apple iOS 27 New Siri AI

Search is about to change! Google will offer AI Mode interactive chart features for free to all users