Alibaba recently released Qwen3-Omni, a new series in its Tongyi family of multimodal pretrained large models. The model's key feature is its ability to process multiple types of information, including audio, video, and text, in a manner comparable to human perception. This is not only a notable advance in AI technology but also opens up new possibilities for future application scenarios.

According to the release, Qwen3-Omni achieved state-of-the-art (SOTA) results on 22 of 36 audio and audio-visual benchmarks, and led all open-source models on 32 of them. In speech recognition and audio understanding in particular, its capabilities are reported to be comparable to Google's Gemini 2.5 Pro, laying a solid foundation for applications that demand high-quality audio processing.

[Image: Tongyi Qwen. Image source note: the image was generated by AI.]

Qwen3-Omni's design is distinctive: from the outset it was trained with a multimodal mix of "listening," "speaking," and "writing," mimicking how an infant perceives the world through multiple senses at once. By combining unimodal and cross-modal data, this training approach lets the model excel at audio and video processing while maintaining stable performance on text and images. Alibaba describes this as the first time such a comprehensive training result has been achieved in the industry, reflecting its foresight and innovation in AI.

Looking ahead, Qwen3-Omni is expected to be widely applied in areas such as intelligent customer service, content creation, and voice interaction, offering users smarter and more natural services. As the technology continues to advance, we can expect AI to become more closely woven into daily life, bringing a more convenient experience.
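The article does not include usage details, but for readers who want a sense of how such an audio-capable model might be invoked in a customer-service scenario, here is a minimal sketch assuming Qwen3-Omni is exposed through an OpenAI-compatible chat endpoint (a pattern Alibaba Cloud's DashScope has used for other Qwen models). The base URL, model identifier, environment variable, input file, and audio message format below are all assumptions to be checked against the official documentation, not confirmed details from this article.

```python
# Hedged sketch: sending an audio clip plus a text prompt to a Qwen3-Omni
# model via an OpenAI-compatible endpoint. Endpoint URL, model name, and
# the audio-content message format are assumptions; verify against
# Alibaba Cloud's official docs before relying on this.
import base64
import os

from openai import OpenAI

client = OpenAI(
    api_key=os.environ["DASHSCOPE_API_KEY"],  # assumed env var name
    base_url="https://dashscope.aliyuncs.com/compatible-mode/v1",  # assumed endpoint
)

# Encode a local audio clip so it can be sent inline alongside the prompt.
with open("customer_call.wav", "rb") as f:  # hypothetical input file
    audio_b64 = base64.b64encode(f.read()).decode("utf-8")

response = client.chat.completions.create(
    model="qwen3-omni",  # assumed model identifier
    messages=[
        {
            "role": "user",
            "content": [
                {
                    "type": "text",
                    "text": "Transcribe this call and summarize the customer's request.",
                },
                {
                    "type": "input_audio",
                    "input_audio": {"data": audio_b64, "format": "wav"},
                },
            ],
        }
    ],
)
print(response.choices[0].message.content)
```

If the model is served this way, the same message structure would extend to video or image inputs; the single request combining audio and text is what distinguishes an omni-modal model from a pipeline of separate speech-to-text and language models.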

Alibaba's innovation marks a new step in the development of multimodal AI and offers a fresh reference benchmark for technology companies worldwide.