Alibaba Cloud has released Qwen3-Omni, billed as the world's first natively end-to-end multi-modal AI model, and has open-sourced it. Qwen3-Omni handles text, image, audio, and video input and provides real-time streaming output, responding quickly in either text or natural speech.
Qwen3-Omni demonstrates strong cross-modal performance across domains. Its text-centric early pre-training followed by mixed multi-modal training gives it robust capabilities in every modality: it is especially strong on audio and video tasks while maintaining high quality on text and images. Across 36 audio and audio-visual benchmarks, Qwen3-Omni achieves state-of-the-art results on 22, and in areas such as automatic speech recognition and audio understanding its performance is comparable to industry peers like Gemini 2.5 Pro.
Qwen3-Omni supports 119 text languages, 19 speech input languages, and 10 speech output languages, including English, Chinese, French, and German, making it well suited to a global user base. Its architecture combines a Mixture-of-Experts (MoE) design with AuT pre-training for strong general representations, while a multi-codebook design keeps real-time audio and video interaction low-latency, supporting smooth, natural dialogue.
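For a sense of how mixed-modality input looks in practice, here is a minimal inference sketch in Python. It assumes the Hugging Face Transformers integration follows the pattern of earlier Qwen-Omni releases: the class names, the `return_audio` flag, and the chat-template message format are assumptions drawn from that pattern, and the checkpoint id comes from the Hugging Face collection linked below. The GitHub README has the authoritative snippet.

```python
import torch
from transformers import Qwen3OmniMoeForConditionalGeneration, Qwen3OmniMoeProcessor

# Assumed class names and checkpoint id, patterned after earlier
# Qwen-Omni releases -- verify against the official README.
MODEL_ID = "Qwen/Qwen3-Omni-30B-A3B-Instruct"

model = Qwen3OmniMoeForConditionalGeneration.from_pretrained(
    MODEL_ID, torch_dtype=torch.bfloat16, device_map="auto"
)
processor = Qwen3OmniMoeProcessor.from_pretrained(MODEL_ID)

# One user turn mixing audio, image, and text, in the chat-template
# format the Qwen-Omni family uses for multimodal messages.
conversation = [
    {
        "role": "user",
        "content": [
            {"type": "audio", "audio": "question.wav"},  # local file or URL
            {"type": "image", "image": "diagram.png"},
            {"type": "text", "text": "Answer the spoken question using the image."},
        ],
    }
]

inputs = processor.apply_chat_template(
    conversation,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

# The Talker head can also stream speech; this sketch skips synthesis
# (return_audio is the flag earlier Qwen-Omni models used for that)
# and decodes only the newly generated text tokens.
output_ids = model.generate(**inputs, max_new_tokens=256, return_audio=False)
new_tokens = output_ids[:, inputs["input_ids"].shape[1]:]
print(processor.batch_decode(new_tokens, skip_special_tokens=True)[0])
```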
Alongside Qwen3-Omni, Alibaba Cloud has also released Qwen3-TTS, a text-to-speech model offering 17 voice options. It performs strongly on multiple evaluation benchmarks, surpassing several competitors, particularly in voice stability and speaker similarity.
Qwen-Image-Edit-2509, another new release, adds multi-image support to image editing and significantly improves editing consistency and quality. Beyond single-image edits, it can combine several input images under one instruction, covering more complex editing needs; a usage sketch follows.
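The sketch below shows what a multi-image edit could look like via diffusers. The pipeline class and argument names are assumptions patterned after the diffusers integration published for the original Qwen-Image-Edit; check the model card on Hugging Face for the exact, current snippet.

```python
import torch
from PIL import Image
from diffusers import QwenImageEditPlusPipeline  # assumed class name

# Assumed pipeline for the 2509 revision -- the original Qwen-Image-Edit
# shipped a QwenImageEditPipeline in diffusers, and this follows that pattern.
pipe = QwenImageEditPlusPipeline.from_pretrained(
    "Qwen/Qwen-Image-Edit-2509", torch_dtype=torch.bfloat16
).to("cuda")

# Multi-image editing: several inputs are combined under one instruction,
# with the model keeping subjects consistent across the composite.
person = Image.open("person.png").convert("RGB")
product = Image.open("product.png").convert("RGB")

result = pipe(
    image=[person, product],
    prompt="The person holds the product in a bright studio photo.",
    num_inference_steps=40,
).images[0]
result.save("edited.png")
```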
GitHub: https://github.com/QwenLM/Qwen3-Omni
Huggingface: https://huggingface.co/collections/Qwen/qwen3-omni-68d100a86cd0906843ceccbe
Key Points:
🌟 Qwen3-Omni is the world's first native end-to-end multi-modal AI model, supporting unified processing of text, images, audio, and video.
🌐 The model supports 119 text languages, 19 speech input languages, and 10 speech output languages, meeting the multilingual needs of global users.
🖼️ The newly released Qwen-Image-Edit-2509 supports multi-image editing, significantly improving editing consistency and quality.