Alibaba Launches New Speech Model Bai Ling: Multilingual and Emotional Switching from a 3-Second Recording

AIbase基地

Published in AI News · 4 min read · Dec 15, 2025

Alibaba's Tongyi large model team announced that its "Bai Ling" series of speech models has received a major upgrade and is now officially open-sourced. Given just three seconds of recorded audio, the two updated models can switch among up to nine languages and eighteen dialects, including Mandarin, Cantonese, Japanese, and English. They can also simulate emotions such as happiness and anger.

The Fun-CosyVoice3 model improves significantly in this upgrade. First-packet latency has been reduced by 50%, and the accuracy of mixed Chinese-English speech has improved substantially. Voice cloning has also been enhanced: users can replicate a voice and synthesize new speech from an audio clip of three seconds or longer. Together, these changes make scenarios such as real-time voice assistants, live-stream dubbing, and accessibility reading more efficient and convenient.
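As a minimal sketch of the three-second requirement, the check below uses only Python's standard `wave` module to confirm that a reference recording is long enough before it is passed to a cloning model. The names `clip_duration_seconds`, `is_long_enough`, `write_silence`, and `MIN_PROMPT_SECONDS`, and the choice to validate uncompressed WAV input, are illustrative assumptions and are not part of the Fun-CosyVoice3 API.

```python
import wave

# Assumption: the "three seconds or longer" figure from the announcement,
# used here as a hypothetical pre-flight threshold.
MIN_PROMPT_SECONDS = 3.0


def clip_duration_seconds(path: str) -> float:
    """Return the duration of an uncompressed WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def is_long_enough(path: str, minimum: float = MIN_PROMPT_SECONDS) -> bool:
    """True if the clip meets the minimum reference length."""
    return clip_duration_seconds(path) >= minimum


def write_silence(path: str, seconds: float, rate: int = 16000) -> None:
    """Write a mono 16-bit silent WAV file, for demonstration only."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * int(seconds * rate))
```

A caller might run `is_long_enough("reference.wav")` and ask the user to record again when it returns `False`; the file name here is a placeholder.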

The Fun-ASR model has also been improved and now achieves 93% accuracy in noisy environments. It supports recognition of song lyrics and rap, handles freely mixed speech in multiple languages, and covers a range of Chinese dialects and accents. In streaming recognition, first-character latency has been reduced to 160 milliseconds, which makes voice interaction noticeably more fluid.
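To make the first-character latency figure concrete, here is a minimal sketch of how the time to the first partial result of any streaming source could be measured. `first_item_latency` and `fake_streaming_recognizer` are hypothetical names; the fake recognizer is a stand-in that simply sleeps before yielding text, and it does not call Fun-ASR.

```python
import time
from typing import Iterable, Iterator, Tuple, TypeVar

T = TypeVar("T")


def first_item_latency(stream: Iterable[T]) -> Tuple[float, T]:
    """Return (seconds until the first item arrives, the first item)."""
    start = time.perf_counter()
    iterator = iter(stream)
    first = next(iterator)  # raises StopIteration if the stream is empty
    return time.perf_counter() - start, first


def fake_streaming_recognizer(delay_s: float = 0.05) -> Iterator[str]:
    """Hypothetical stand-in for a streaming recognizer; not Fun-ASR."""
    time.sleep(delay_s)  # simulated time to the first partial result
    yield "hel"
    yield "hello"


if __name__ == "__main__":
    latency, text = first_item_latency(fake_streaming_recognizer())
    print(f"first partial result {text!r} after {latency * 1000:.0f} ms")
```

With a real streaming recognizer in place of the stand-in, the measured value would also include audio capture and network time, so it is not directly comparable to the 160 ms figure quoted above.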

Both models also support local deployment and secondary development, so developers can customize them to their own needs. The open-source repository has been published, and users can try out both speech models on the relevant platforms, which should further promote the use of voice technology across fields.

GitHub: https://github.com/FunAudioLLM/CosyVoice

Key points:
🌐 **Multilingual Support**: Switch among nine languages and eighteen dialects from just three seconds of audio.
⚙️ **Technology Upgrade**: First-packet latency reduced by 50% and accuracy improved, making voice interaction smoother.
📦 **Open Source**: Both models support local deployment and secondary development, making personalized applications easier to build.

This article is from AIbase Daily
