Alibaba has open-sourced its latest model, Qwen3-Next-80B-A3B, marking a significant step forward in the company's generative AI efforts. The model introduces innovations in hybrid attention mechanisms, a highly sparse Mixture-of-Experts (MoE) design, and training methods, delivering substantial performance gains.


Qwen3-Next has 80 billion total parameters, but only 3 billion are activated during inference. Its training cost is roughly 90% lower than that of its predecessor Qwen3-32B, and its inference efficiency is as much as ten times higher, with particularly strong gains on ultra-long texts (over 32K tokens). As a result, Qwen3-Next matches or even surpasses Alibaba's flagship model Qwen3-235B on instruction following and long-context tasks, and outperforms Google's latest Gemini-2.5-Flash reasoning model.

The core innovation of this model is its hybrid attention architecture, which combines Gated DeltaNet (a linear-attention variant) with gated standard attention layers. This design addresses the weaknesses of conventional attention mechanisms in long contexts, preserving speed while strengthening in-context learning. The model also uses a highly sparse MoE structure, maximizing resource utilization during training without compromising performance.
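The routing idea behind a highly sparse MoE layer can be sketched as follows. This is a toy illustration of top-k expert routing in general, not Qwen's actual implementation; the function name, dimensions, and expert count here are all hypothetical:

```python
import numpy as np

def topk_moe_forward(x, gate_w, experts, k=2):
    """Toy top-k MoE layer: route a token vector to the k highest-scoring
    experts and mix their outputs by normalized gate weights.
    `experts` is a list of dense weight matrices, one per expert."""
    logits = x @ gate_w                        # router scores, one per expert
    topk = np.argsort(logits)[-k:]             # indices of the k best experts
    probs = np.exp(logits[topk] - logits[topk].max())
    probs /= probs.sum()                       # softmax over the selected experts only
    # Only the k chosen experts run; the rest stay idle -- that is the sparsity.
    return sum(p * (x @ experts[i]) for p, i in zip(probs, topk))

rng = np.random.default_rng(0)
d, n_experts = 8, 16
x = rng.standard_normal(d)
gate_w = rng.standard_normal((d, n_experts))
experts = [rng.standard_normal((d, d)) for _ in range(n_experts)]
y = topk_moe_forward(x, gate_w, experts, k=2)  # only 2 of 16 experts compute
```

Because per-token compute scales with k rather than with the total expert count, a model can hold very large total parameters while activating only a small fraction per token, which is how an 80B-parameter model can run with ~3B active parameters.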

In addition, Qwen3-Next introduces a multi-token prediction (MTP) mechanism, which improves its performance in speculative decoding. In pre-training, Qwen3-Next is markedly more efficient than Qwen3-32B: its training cost is only 9.3% of Qwen3-32B's, yet it achieves better results. In inference, Qwen3-Next delivers 7x higher throughput than Qwen3-32B on long texts, and maintains a 10x speed advantage at even longer context lengths.
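The speculative-decoding loop that multi-token prediction accelerates can be sketched like this. It is a greedy toy version with stand-in "models" (plain functions over integer tokens); the real acceptance rule and the MTP head itself are more involved:

```python
def speculative_decode_step(draft_next, target_next, prefix, k=4):
    """One greedy speculative-decoding step (illustrative only):
    a cheap draft model proposes k tokens, the target model checks them
    and keeps the longest agreeing prefix, plus one corrected token
    at the first divergence."""
    # Draft phase: propose k tokens autoregressively with the cheap model.
    proposal, ctx = [], list(prefix)
    for _ in range(k):
        t = draft_next(ctx)
        proposal.append(t)
        ctx.append(t)
    # Verify phase: the target model scores the proposed positions
    # (in a real system, all k positions in a single forward pass).
    accepted, ctx = [], list(prefix)
    for t in proposal:
        if target_next(ctx) == t:
            accepted.append(t)           # draft agreed with target: keep it
            ctx.append(t)
        else:
            accepted.append(target_next(ctx))  # target's own correction
            break
    return accepted

# Toy "models": the next token is (last + 1) mod 10; the draft goes wrong
# once the context grows past 4 tokens.
target = lambda ctx: (ctx[-1] + 1) % 10
draft = lambda ctx: (ctx[-1] + 1) % 10 if len(ctx) < 5 else 0
out = speculative_decode_step(draft, target, [1, 2], k=4)  # -> [3, 4, 5, 6]
```

The payoff is that each verified proposal yields several tokens for roughly the cost of one large-model forward pass; a dedicated MTP head makes the drafts agree with the main model more often, raising the acceptance rate.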


Beyond its technical breakthroughs, Alibaba's new model has drawn wide attention and praise, especially among developers and researchers. In both technological innovation and market competitiveness, Qwen3-Next underscores Alibaba's growing leadership in the field of artificial intelligence.

Online experience: https://chat.qwen.ai/

Open source address: https://huggingface.co/collections/Qwen/qwen3-next-68c25fd6838e585db8eeea9d

Key points:

🌟 The Qwen3-Next-80B-A3B model has 80 billion total parameters but activates only 3 billion per token, cutting training cost by about 90% and improving inference efficiency roughly tenfold.

🔍 The new model adopts a hybrid attention architecture (Gated DeltaNet plus gated attention) and a multi-token prediction mechanism, significantly enhancing long-context processing.

🚀 In inference speed, Qwen3-Next excels in ultra-long-text scenarios, with throughput 7 to 10 times that of Qwen3-32B.