Alibaba Cloud Launches Qwen3-Omni, the World's First All-Modal AI Model, Enabling Unified Processing of Text, Images, Audio, and Video
Alibaba Cloud has released and open-sourced Qwen3-Omni, the world's first native end-to-end all-modal AI model. The model accepts text, image, audio, and video inputs and delivers real-time streaming output with fast response times. Trained through text pre-training and mixed multi-modal training, Qwen3-Omni has strong cross-modal capabilities and performs at an advanced level across multiple fields.