Say Goodbye to Translation Gimmick: Gemini 3.5 Real-Time Speech Translation Model Officially Released

AIbase基地

Published inAI News · 4 min read · Jun 10, 2026

Transcending language barriers is undergoing a technological revolution. Recently, Google has launched the new audio model Gemini 3.5 Live Translate, aiming to break down geographical and cultural barriers in language communication through advanced real-time speech-to-speech technology. The model is now integrated into core product ecosystems such as Google AI Studio, Google Translate, and Google Meet.

The core breakthrough of Gemini 3.5 Live Translate lies in its pursuit of "naturalness." Unlike traditional translation tools that offer a lagging experience where one speaks and the other translates alternately, this model can achieve near-real-time simultaneous interpretation. While continuously generating translations, it can accurately capture and restore the original tone, rhythm, and pitch of the speaker. By cleverly balancing the relationship between "waiting for more context to improve accuracy" and "real-time output to maintain synchronization," Gemini 3.5 reduces communication delay to just a few seconds, significantly reducing awkward pauses in conversations.

In terms of application scenarios, Google has given the model high flexibility. It supports automatic recognition and mutual translation of over 70 languages, without requiring users to manually perform tedious language configurations. Even in noisy or complex acoustic environments, the model maintains stable performance. For developers, Google has opened the Gemini Live API, making it easy to embed speech interpretation capabilities into multilingual phone calls, online education, and live commentary scenarios. Currently, the travel platform Grab has been the first to trial it, verifying the model's excellent performance in translation quality and low latency when handling millions of real-time driver-rider communications each month.

For enterprise collaboration, Gemini 3.5 Live Translate will comprehensively reshape the translation experience in Google Meet. In the future, the number of supported language pairs in meetings will expand from a limited number to over 2000, completely moving away from the single "English-centric" model. Additionally, for mobile users, the Google Translate app, which already supports real-time translation via earphones, has added a "speaker listening mode," allowing users to discreetly and privately receive translations through the phone speaker in public places where wearing earphones is inconvenient.

While pursuing technological efficiency, Google has also not overlooked security and compliance. All audio content generated by the Gemini series models includes a SynthID digital watermark, which can identify the AI-generated nature in an imperceptible way, effectively preventing risks of misinformation and misuse. As Gemini 3.5 Live Translate gradually expands, real-time communication across language barriers is transforming from a science fiction concept into an achievable reality.

Google Collaborates with NVIDIA to Release Open-Source Model DiffusionGemma: Introduces Diffusion Mechanism, Speeds Up Single-Card Inference by Four Times

In June 2026, Google released the open-source language model DiffusionGemma, which introduces image AI diffusion mechanisms into text generation, breaking the traditional autoregressive paradigm. It iteratively optimizes from random noise and outputs 256 tokens in parallel. Optimized by NVIDIA, it achieves nearly four times faster speed than similar traditional models in single-GPU single-user mode, with significant performance improvements on H1....

Google Enters the Smart Glasses Market: Powered by Gemini, Competing with Ray-Ban This Autumn

Google announced smart glasses developed in partnership with Warby Parker and Gentle Monster, powered by the Gemini large model, set to launch this fall. Targeting Meta and Ray-Ban's industry-leading position, this marks Google's latest AI hardware endeavor, positioning smart glasses as the next battleground for tech giants.....

Baidu Gaokao Service Upgrade: Launches AI Major Selection Report and Introduces Real Expert Review Mechanism

On June 10, Baidu launched an upgraded college entrance exam service, introducing a free 'AI Volunteer Report' with expert endorsement. Using Wenxin Assistant for multi-round conversations, it collects student info, integrates historical score lines, university data, and career prospects to generate personalized reports, certified by senior consultants, addressing key challenges in application decisions.....

Didi Fully Integrates with WeChat AI Ecosystem, Entering the Era of Natural Language Commands

Didi's core ride-hailing service is now fully integrated into WeChat's AI ecosystem, allowing users to book rides via natural language commands within WeChat without switching to the Didi app. This leverages Tencent's Hunyuan large model combined with Didi's big data and dispatch technology for automatic destination recognition and one-click ordering, breaking traditional app barriers.....

Search is about to change! Google will offer AI Mode interactive chart features for free to all users

Google announced that its "AI Mode Interactive Visualized Chart" feature will be available for free to all search users this summer. Previously, the feature was only available to AI Mode Pro and Ultra subscribers. It is based on Gemini's interactive image technology, which can generate dynamic simulations and models to help users better understand new concepts and improve learning efficiency.

Google lowers Google AI Plus monthly fee to $4.99, doubling storage space to 400GB

Google announced that the monthly fee for the AI subscription plan Google AI Plus has been reduced from $7.99 to $4.99, and the cloud storage space has been doubled from 200GB to 400GB. The plan, launched in January, is available for individuals and students, and includes AI features such as video generation. The storage update will be rolled out globally in the coming days.

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Ranking Monitor

AI Conversation Insight

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

LLM API Proxy Checker

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

Say Goodbye to Translation Gimmick: Gemini 3.5 Real-Time Speech Translation Model Officially Released

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Google Collaborates with NVIDIA to Release Open-Source Model DiffusionGemma: Introduces Diffusion Mechanism, Speeds Up Single-Card Inference by Four Times

Google Enters the Smart Glasses Market: Powered by Gemini, Competing with Ray-Ban This Autumn

Google Releases DiffusionGemma: Trying to Speed Up AI Inference Using Text Diffusion Architecture

Baidu Gaokao Service Upgrade: Launches AI Major Selection Report and Introduces Real Expert Review Mechanism

Didi Fully Integrates with WeChat AI Ecosystem, Entering the Era of Natural Language Commands

Search is about to change! Google will offer AI Mode interactive chart features for free to all users

iFlytek Xinghuo Medical Large Model V3.5 Officially Unveiled

SpaceX AI Satellite Plan Unveiled for the First Time: 70-Meter Wingspan, Computing Power of GB300 in the US and UK

Google lowers Google AI Plus monthly fee to $4.99, doubling storage space to 400GB

3.5 Billion Dollar Entry: Broadcom Joins Forces with Financial Giants to Reshape the AI Computing Infrastructure Landscape

AI News Recommendations

Google Collaborates with NVIDIA to Release Open-Source Model DiffusionGemma: Introduces Diffusion Mechanism, Speeds Up Single-Card Inference by Four Times

Google Enters the Smart Glasses Market: Powered by Gemini, Competing with Ray-Ban This Autumn

Google Releases DiffusionGemma: Trying to Speed Up AI Inference Using Text Diffusion Architecture

Baidu Gaokao Service Upgrade: Launches AI Major Selection Report and Introduces Real Expert Review Mechanism

Didi Fully Integrates with WeChat AI Ecosystem, Entering the Era of Natural Language Commands

Search is about to change! Google will offer AI Mode interactive chart features for free to all users

iFlytek Xinghuo Medical Large Model V3.5 Officially Unveiled

SpaceX AI Satellite Plan Unveiled for the First Time: 70-Meter Wingspan, Computing Power of GB300 in the US and UK

Google lowers Google AI Plus monthly fee to $4.99, doubling storage space to 400GB

3.5 Billion Dollar Entry: Broadcom Joins Forces with Financial Giants to Reshape the AI Computing Infrastructure Landscape