Alibaba Tongyi Fun-ASR Speech Model Upgrade, Recognition Rate in Vertical Fields Surpasses 15% Improvement

AIbase基地

Published inAI News · 4 min read · Aug 23, 2025

119

AliTongyi officially launched the new generation end-to-end speech recognition large model Fun-ASR. This model achieves a breakthrough in speech recognition accuracy of over 15% in vertical industry scenarios such as home decoration and insurance by enhancing context awareness and high-precision transcription capabilities. Test data shows that the accuracy in the insurance industry has increased by 18% compared to the previous generation, while home decoration and livestock sectors have seen increases of 15%-20%.

As a speech recognition algorithm driven by large language models, Fun-ASR adopts self-developed speech algorithms and Qwen3 supervised fine-tuning technology, combining cutting-edge model architectures and text modal alignment technology. While maintaining advantages in language processing, it integrates a RAG retrieval enhancement solution, supporting the import of over 1000 custom hot words. This feature can automatically match domain-specific hot words, historical documents, and context records in audio, significantly optimizing keyword recognition performance in specific scenarios.

AliTongyi's new generation speech model Fun-ASR evolves again, with vertical field recognition accuracy improved by more than 15%

To address pain points such as noise interference, language confusion, and generation hallucination in speech recognition, the development team has innovatively introduced reinforcement learning (RL) technology, reducing recognition errors through dynamic optimization strategies, thereby substantially improving system stability and reliability. Notably, the model performs better than similar products in recognizing dialects such as Sichuan dialect, Cantonese, and Hokkien, and adapts to complex acoustic environments such as far-field pickup and near-field noise reduction, covering diverse scenarios like meeting rooms, workstations, supermarkets, and outdoor areas.

In terms of training data, Fun-ASR is built on hundreds of millions of hours of audio data, deeply integrating professional terminology libraries from more than ten fields such as the internet, technology, livestock, and automobiles. This data advantage enables it to demonstrate significant advantages in vertical industry recognition, for example, accurately identifying key commands in animal sounds and environmental noise in the livestock industry.

The AliTongyi technology team stated that the evolution of Fun-ASR marks the deep penetration of speech recognition technology from general scenarios to specialized and scenario-based applications. As the model is deployed in more industries, its dynamic hot word updates and multimodal interaction capabilities will further drive innovation in speech interaction efficiency.

Fun-ASR AliGenie Qwen3 RAGRetrievalAugmentedGeneration

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Three-Year Delayed Long Article: Former OpenAI Security VP Wang Li Analyzes Scaling Laws: Your Model May Have Been Trained on the Wrong Data

Lilian Weng returns with a deep dive into scaling laws, arguing the industry consensus may be reversed: from Kaplan to Chinchilla, the mainstream data allocation might not be optimal. It examines compute, model size, and data quantity trade-offs, implying the billions-invested path requires reconsideration, prompting a re-evaluation of pretraining recipes.....

Jun 26, 2026

260

China's Large Models Continue to Evolve: Kimi Aims for the Top Global Tier, Next-Generation K3 is About to Launch

Moonshot AI revealed at the AWS Summit that Kimi's overseas paying users and API revenue grew 400%, covering over 200 countries and regions, and spanning industries like internet, finance, manufacturing, education, and healthcare. The company emphasizes its R&D-first strategy.....

Jun 26, 2026

290

China's Rising Star in 3D Generation: Yinguo Technology Secures Hundreds of Millions in Funding, Technical Strength Attracts Attention from NVIDIA

Yingmo Technology raised hundreds of millions in funding led by Cathay Capital, highlighting investor interest in 3D generation. The post-00s team targets a new phase of 'world models,' advancing AI from understanding to creation, driving usable 3D digital asset deployment, and showcasing global competitiveness.....

Jun 25, 2026

290

New Milestone in the Evolution of AI Agents: Qwen-AgentWorld Released with Native Language World Model

Qwen releases the world's first native language world model, Qwen-AgentWorld. Its core breakthrough lies in achieving unified control across multiple complex environments, breaking the limitations of only handling conversations or text, and marking a crucial step in AI's evolution into agents.

Jun 24, 2026

370

Aliyun QoderWork Launches Peak-Valley Token: Use Qwen3.7-Max During Off-Peak Hours at Up to 20% Discount

Alibaba Cloud's QoderWork introduces "Peak-Valley Token" pricing, guiding users to utilize off-peak nighttime (22:00 to 8:00 next day) idle computing power, with tasks automatically enjoying up to 80% discount. Main models like Qwen3.7-Max benefit. This model finely configures AI resources, significantly reducing large model application costs for enterprises and developers.....

Jun 24, 2026

410

New Trends in Smart Healthcare: China Unicom and Yu Yue Medical Join Forces to Equip Health Devices with an AI Brain

China Unicom and Yuyue Medical are deepening cooperation, extending from 5G production lines to full-chain co-creation of AI wearable devices. Leveraging Unicom's computing cloud platform, they are reshaping device perception and driving the evolution of medical devices, marking the shift of smart healthcare into a deeper phase.....

Jun 24, 2026

260

Free Volunteer Application Agent Launches: Qwen's Downgrade Impact, Thousands of Institutions Face a Crucial Test

Big players entering Jiangsu's 2026 college application market bring changes. Qianwen App's free full-cycle AI agent disrupts the billion-yuan industry, seen as internet traffic's vertical attack, reshaping the ecosystem and driving technology democratization.....

Jun 24, 2026

370

Capital Backs Again: Yinge Technology Completes a Several Billion Yuan Funding Round to Enter the 3D Generation Large Model Market

As generative AI advances, 3D content production gains value. Yinmo Technology, focused on 3D generation models, raised hundreds of millions of yuan led by Cathay Capital and Shanghai Guotou Pioneer, with existing investors participating, and Lighthouse Capital as FA. Its tech potential has long drawn wide investor attention.....

Jun 24, 2026

180

Yingmou Technology Secures Hundreds of Millions in New Round of Funding, Unveils Rodin Gen-2.5, a Million-Face-Level 3D Large Model

Yingmou Technology has completed a funding round worth hundreds of millions of yuan, led by Kaifu Fund and Shanghai GuoTou Daxiang. The funds will be used for 3D large model research and development and global commercialization, accelerating the implementation of game and e-commerce scenarios. The core product Hyper3D has been upgraded, with 80% of revenue coming from overseas, serving clients such as ByteDance and Unity.

Jun 23, 2026

280

AI Daily: Volcano Engine launches Doubao Seedance 2.5 and other models; Shengshu Vidu Q3 launched on Huawei Cloud; BaiChuan Intelligence releases M4 model

Welcome to the [AI Daily] section! This is your guide to exploring the world of artificial intelligence every day. Every day, we bring you the latest content in the AI field, focusing on developers, helping you understand technical trends and innovative AI product applications. Discover new AI products: https://app.aibase.com/zh1. Volcano Engine launches Doubao 2.1 Pro: Confirm free daily functions, will launch a professional office mode. The Doubao large model has undergone a major upgrade at the Volcano Engine FORCE Original Power conference, releasing several new versions.

Jun 23, 2026

2.8k

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Ranking Monitor

AI Conversation Insight

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

LLM API Proxy Checker

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

Alibaba Tongyi Fun-ASR Speech Model Upgrade, Recognition Rate in Vertical Fields Surpasses 15% Improvement

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Three-Year Delayed Long Article: Former OpenAI Security VP Wang Li Analyzes Scaling Laws: Your Model May Have Been Trained on the Wrong Data

China's Large Models Continue to Evolve: Kimi Aims for the Top Global Tier, Next-Generation K3 is About to Launch

China's Rising Star in 3D Generation: Yinguo Technology Secures Hundreds of Millions in Funding, Technical Strength Attracts Attention from NVIDIA

New Milestone in the Evolution of AI Agents: Qwen-AgentWorld Released with Native Language World Model

Aliyun QoderWork Launches Peak-Valley Token: Use Qwen3.7-Max During Off-Peak Hours at Up to 20% Discount

New Trends in Smart Healthcare: China Unicom and Yu Yue Medical Join Forces to Equip Health Devices with an AI Brain

Free Volunteer Application Agent Launches: Qwen's Downgrade Impact, Thousands of Institutions Face a Crucial Test

Capital Backs Again: Yinge Technology Completes a Several Billion Yuan Funding Round to Enter the 3D Generation Large Model Market

Yingmou Technology Secures Hundreds of Millions in New Round of Funding, Unveils Rodin Gen-2.5, a Million-Face-Level 3D Large Model

AI Daily: Volcano Engine launches Doubao Seedance 2.5 and other models; Shengshu Vidu Q3 launched on Huawei Cloud; BaiChuan Intelligence releases M4 model

AI News Recommendations

Three-Year Delayed Long Article: Former OpenAI Security VP Wang Li Analyzes Scaling Laws: Your Model May Have Been Trained on the Wrong Data

China's Large Models Continue to Evolve: Kimi Aims for the Top Global Tier, Next-Generation K3 is About to Launch

China's Rising Star in 3D Generation: Yinguo Technology Secures Hundreds of Millions in Funding, Technical Strength Attracts Attention from NVIDIA

New Milestone in the Evolution of AI Agents: Qwen-AgentWorld Released with Native Language World Model

Aliyun QoderWork Launches Peak-Valley Token: Use Qwen3.7-Max During Off-Peak Hours at Up to 20% Discount

New Trends in Smart Healthcare: China Unicom and Yu Yue Medical Join Forces to Equip Health Devices with an AI Brain

Free Volunteer Application Agent Launches: Qwen's Downgrade Impact, Thousands of Institutions Face a Crucial Test

Capital Backs Again: Yinge Technology Completes a Several Billion Yuan Funding Round to Enter the 3D Generation Large Model Market

Yingmou Technology Secures Hundreds of Millions in New Round of Funding, Unveils Rodin Gen-2.5, a Million-Face-Level 3D Large Model

AI Daily: Volcano Engine launches Doubao Seedance 2.5 and other models; Shengshu Vidu Q3 launched on Huawei Cloud; BaiChuan Intelligence releases M4 model