The Alibaba Qwen team has just dropped a bombshell on global developers: the upcoming Qwen3-Next-80B-A3B-Instruct model upends the traditional logic of large models. Behind the seemingly contradictory numbers lies a striking technical breakthrough: the model carries 80 billion parameters in total, but only about 3 billion are activated for any given token, like a super sports car firing only a fraction of its cylinders yet running up to ten times faster.

Just hours ago, the Hugging Face Transformers library quietly merged the related pull request into its main branch. This seemingly routine action means that countless AI developers around the world are about to gain an unprecedented computing tool, and an efficiency revolution in open-source AI is about to begin.

This new model inherits the iconic A3B design of the Qwen3 series (roughly 3 billion activated parameters), but achieves a qualitative leap in scale. While traditional large models are still struggling with massive parameter counts and high computational costs, the Qwen team has chosen a more refined path. They adopted a Mixture-of-Experts (MoE) architecture, like building a highly specialized team inside the model: for each input, only the most suitable experts are activated to handle the task, while the others remain on standby.
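The routing idea behind sparse activation can be illustrated with a toy sketch: a small router scores every expert for each token, but only the top-k experts actually run. This is a minimal illustration of the general MoE technique, not the actual Qwen3-Next implementation; the expert count and top-k value here are invented for the example.

```python
# Toy sketch of sparse Mixture-of-Experts (MoE) routing.
# Illustrative only -- not the real Qwen3-Next architecture.
import math
import random

NUM_EXPERTS = 8   # hypothetical expert count
TOP_K = 2         # experts activated per token

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def route(token_features, router_weights):
    """Score every expert, but activate only the TOP_K best ones."""
    logits = [sum(w * f for w, f in zip(row, token_features))
              for row in router_weights]
    probs = softmax(logits)
    # Keep the top-k experts; the rest stay idle, so most parameters
    # contribute no compute for this token.
    top = sorted(range(NUM_EXPERTS), key=lambda i: probs[i], reverse=True)[:TOP_K]
    # Renormalize the gate weights over the chosen experts.
    total = sum(probs[i] for i in top)
    return [(i, probs[i] / total) for i in top]

random.seed(0)
features = [random.random() for _ in range(4)]
weights = [[random.random() for _ in range(4)] for _ in range(NUM_EXPERTS)]
selected = route(features, weights)
print(selected)  # two (expert_index, gate_weight) pairs whose gates sum to 1.0
```

The model's full capacity lives in all the experts combined, but each token only pays the compute cost of the few experts chosen for it.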


The results are striking. When processing contexts longer than 32K tokens, the new model's inference throughput reaches more than 10 times that of Qwen3-32B. Users are already eager to share their test experiences: many say that compared with the earlier Qwen3-30B-A3B series, the new model not only keeps inference fast, but also shows a richer knowledge base and a stronger ability to handle complex tasks.

In code generation, the model has shown impressive performance. Developers have found that it achieves industry-leading results with minimal computational resources, handling everything from complex algorithm implementation to multi-language code conversion with ease. Mathematical reasoning and multilingual translation are also strengths, making it a genuinely versatile intelligent assistant.

Even more exciting is the sharp reduction in training costs. According to the team, training the new model costs less than one-tenth as much as Qwen3-32B, which means more research institutions and small and medium-sized enterprises will have the chance to train and customize large models. The deep optimization of sparse activation not only cuts resource consumption but also improves the model's generalization and instruction-following performance, bringing AI closer to real application needs.
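A rough back-of-envelope calculation shows why sparse activation changes the cost picture. Assuming, as a common approximation, that training compute per token scales with the number of activated parameters (about 6 × N FLOPs per token), the parameter counts here are illustrative and the estimate ignores many real-world factors:

```python
# Back-of-envelope: training FLOPs per token scale roughly with the number
# of *activated* parameters (~6 * N_active, a standard approximation).
# Illustrative assumption only -- not official cost figures.
def flops_per_token(active_params):
    return 6 * active_params

dense_32b = flops_per_token(32e9)   # Qwen3-32B: all 32B parameters active
sparse_a3b = flops_per_token(3e9)   # A3B: ~3B of 80B parameters active

ratio = dense_32b / sparse_a3b
print(f"per-token compute ratio: ~{ratio:.1f}x")  # ~10.7x in favor of A3B
```

Under this simplification, an order-of-magnitude per-token compute gap is consistent with the reported cost reduction, even before counting the engineering optimizations the team applied on top.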

The open-source community's reaction has been enthusiastic. Developers across technology forums have voiced their anticipation, and what they value is not just the larger parameter scale but the design concept itself: many total parameters, few activated ones. This design lets both edge devices and cloud deployments enjoy top-tier large-model capability, genuinely advancing the democratization of AI technology.

The Qwen team continues to uphold its open-source spirit, and the new model will be fully open to global developers. The Instruct variant makes it easy to handle practical scenarios such as dialogue systems and tool calls. Improvements in output formatting and structural accuracy also lay a solid foundation for future multimodal expansion; whether for image description or document analysis, the model shows great potential.
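For the tool-call scenario, the conversation is typically assembled as a list of role-tagged messages plus a tool schema. The sketch below uses the widely adopted OpenAI-style layout as a generic pattern; the exact schema expected by the Qwen3-Next Instruct chat template may differ, and the `get_weather` tool is a hypothetical example.

```python
# Generic, OpenAI-style message layout for a tool-calling conversation.
# Illustrative pattern only; the model's actual chat template may differ.
import json

tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",  # hypothetical tool
        "description": "Look up current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "What's the weather in Hangzhou?"},
    # The model replies with a structured tool call; the caller executes it
    # and feeds the result back as a "tool" message:
    {"role": "assistant", "tool_calls": [{
        "type": "function",
        "function": {"name": "get_weather",
                     "arguments": json.dumps({"city": "Hangzhou"})},
    }]},
    {"role": "tool", "content": json.dumps({"temp_c": 22, "sky": "clear"})},
]

print(len(messages), messages[-1]["role"])  # 4 tool
```

An Instruct-tuned model is trained to emit the structured call and then incorporate the tool's returned result into its final answer, which is what makes this loop practical for dialogue systems.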

The qualitative leap in long-sequence processing capability makes this model more proficient in handling complex tasks. Compared to its predecessors, it not only achieves breakthroughs in efficiency, but also performs excellently in accuracy and stability. This comprehensive improvement is expected to completely reshape the landscape of local AI deployment, allowing more application scenarios to benefit from the intelligent services provided by large models.

This technological breakthrough has a profound impact on the entire AI industry. In resource-limited environments, high throughput at low cost will accelerate the adoption of AI on mobile devices and among small and medium-sized enterprises. When computational efficiency is no longer a constraint, we can expect more innovative applications to emerge, and AI will truly enter every household, becoming an indispensable smart partner in people's daily lives and work.

As the official release date of the model approaches, AI developers around the world are holding their breath. This is not just the release of a new model, but also an important upgrade to the open-source AI ecosystem. In this efficiency revolution, every participant will become a beneficiary, and the boundaries of AI technology will be pushed to new heights again.