AIbase December 9 report: Alibaba's Qwen team today released Qwen3-Omni-Flash-2025-12-01, its new-generation omni-modal large model. The model accepts seamless input of text, images, audio, and video, and generates high-quality text and natural speech as real-time streaming responses. The team claims its voice output approaches human-level naturalness.

Technical Breakthrough: Real-time Streaming Multi-modal Interaction
Qwen3-Omni-Flash adopts a real-time streaming architecture, enabling seamless input and synchronized output of text, images, audio, and video. The model supports interaction in 119 text languages, 19 speech recognition languages, and 10 speech synthesis languages, ensuring accurate responses across multilingual scenarios.
Personalized Experience: System Prompt Customization Opened
The new version fully opens system prompt customization, letting users finely control the model's behavior, including setting character styles such as "sweet girl" or "domineering woman" and adjusting preferences for colloquial phrasing and response length. The model adaptively adjusts speaking speed, pauses, and rhythm to match the text content.
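To make the customization concrete, here is a minimal sketch of how a system prompt could pin down persona, verbosity, and speech style in an OpenAI-compatible chat request. The model identifier, field names, and persona wording are assumptions for illustration; only the feature itself (system-prompt control of character style, colloquialism, and response length) comes from the announcement.

```python
def build_request(user_text: str) -> dict:
    """Build a chat request whose system prompt fixes persona and style.

    The "qwen3-omni-flash" model name and the payload shape are assumed
    (OpenAI-compatible), not taken from official API docs.
    """
    system_prompt = (
        "You are a warm, upbeat assistant. "            # character style
        "Answer colloquially and keep replies short. "  # colloquialism / length
        "When speaking, use a brisk pace with natural pauses."  # speech rhythm
    )
    return {
        "model": "qwen3-omni-flash",  # assumed model identifier
        "stream": True,               # real-time streaming responses
        "messages": [
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": user_text},
        ],
    }

request = build_request("Introduce yourself in one sentence.")
```

The key point is that persona and pacing live entirely in the system message, so switching "characters" is a one-string change rather than a model swap.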

Performance Improvement: Comprehensive Benchmark Advancement
Official data shows the new model gaining 5.6 points on logical reasoning (ZebraLogic), 9.3 points on code generation (LiveCodeBench-v6), and 4.7 points on multi-disciplinary visual question answering (MMMU), demonstrating strong multi-modal understanding and analytical capability.
Market Deployment: API Now Available, Affordable Pricing
Qwen3-Omni-Flash is now available via API, with input priced at 1 yuan per million tokens and output at 3 yuan per million tokens. The model is also integrated into Qwen Chat, where a demo supports uploading a 30-second video and generating live on-screen narration in real time.
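The announced pricing makes per-request costs easy to estimate. The sketch below uses only the published rates (1 yuan per million input tokens, 3 yuan per million output tokens); the sample token counts are illustrative, not measured.

```python
# Announced API rates (yuan per million tokens).
INPUT_YUAN_PER_M = 1.0
OUTPUT_YUAN_PER_M = 3.0

def cost_yuan(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one session at the announced rates."""
    return (input_tokens * INPUT_YUAN_PER_M
            + output_tokens * OUTPUT_YUAN_PER_M) / 1_000_000

# Example: a session with 120k input tokens and 40k output tokens.
print(round(cost_yuan(120_000, 40_000), 3))  # → 0.24
```

At these rates even a token-heavy video-narration session costs a fraction of a yuan, which is the basis for the cost argument in the next section.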
Industry Significance: Multi-modal Enters the "Personality" Stage
While most multi-modal models are still competing on how many images they can understand, Alibaba has turned "real-time streaming + personality" directly into an API. For voice- and style-heavy scenarios such as live streaming, short video, and virtual meetings, this pushes the cost of "voice actors + post-production narration" toward zero.