The AntBelle large model team recently announced the open-source release of its new efficient inference model, Ring-mini-sparse-2.0-exp. Built on the Ling 2.0 architecture and optimized for long-sequence decoding, the model adopts an innovative sparse attention mechanism.

The new architecture integrates a high-sparsity-ratio Mixture of Experts (MoE) structure with sparse attention, aiming to improve performance in complex long-sequence reasoning scenarios.
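To make the "high-sparsity MoE" idea concrete, here is a minimal sketch of top-k expert routing, where each token activates only a small fraction of the experts. The router, dimensions, and expert count here are illustrative assumptions, not the Ling 2.0 design.

```python
import torch
import torch.nn.functional as F

def sparse_moe_layer(x, expert_weights, top_k=2):
    """Illustrative top-k MoE routing (assumed, not the released design).

    x: (tokens, d); expert_weights: list of (d, d) matrices, one per expert.
    Each token is routed to its top_k experts; all other experts stay idle,
    which is what makes the layer "sparse".
    """
    num_experts = len(expert_weights)
    # Stand-in random router; in a real model this projection is learned.
    router = torch.randn(x.shape[1], num_experts)
    logits = x @ router                              # (tokens, num_experts)
    probs = F.softmax(logits, dim=-1)
    top_p, top_idx = probs.topk(top_k, dim=-1)       # pick top_k experts per token
    top_p = top_p / top_p.sum(dim=-1, keepdim=True)  # renormalize gate weights
    out = torch.zeros_like(x)
    for slot in range(top_k):
        for e in range(num_experts):
            mask = top_idx[:, slot] == e             # tokens routed to expert e
            if mask.any():
                out[mask] += top_p[mask, slot, None] * (x[mask] @ expert_weights[e])
    return out
```

With, say, 2 of 64 experts active per token, the per-token compute stays close to a small dense model while total parameter count is much larger.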


The team stated that, thanks to deep co-optimization of the architecture and the inference framework, Ring-mini-sparse-2.0-exp achieves nearly three times the long-sequence throughput of its predecessor, Ring-mini-2.0.

Across multiple high-difficulty reasoning benchmarks, the model maintained state-of-the-art (SOTA) performance, demonstrating strong long-context handling and efficient reasoning, and offering the open-source community a new lightweight option.

The Ling 2.0 Sparse architecture targets two core trends in the development of large language models: growing context lengths and test-time scaling. The team drew on the design of Mixture of Block Attention (MoBA), adopting block-wise sparse attention: the input keys and values are partitioned into blocks, and each query head selects its top-k blocks.

Softmax attention is then computed only over the selected blocks, significantly reducing computational cost. The team further combined the MoBA design with Grouped Query Attention (GQA), letting query heads within the same group share the top-k block selection, thereby reducing I/O costs.
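The mechanism described above can be sketched as follows: score each KV block per query (here with the block's mean key, a common gating heuristic), average the scores within each GQA group so all heads in a group pick the same top-k blocks, and run softmax attention over only those blocks. Block scoring, shapes, and parameters are illustrative assumptions, not the released implementation.

```python
import torch

def block_sparse_attention(q, k, v, block_size=32, top_k=2, group_size=2):
    """MoBA-style block-sparse attention with GQA-shared block selection (sketch).

    q, k, v: (num_heads, seq_len, d); seq_len must be divisible by block_size.
    K/V are given per query head here for simplicity.
    """
    num_heads, seq_len, d = q.shape
    num_blocks = seq_len // block_size
    # Partition keys/values into blocks: (H, num_blocks, block_size, d)
    k_blocks = k.view(num_heads, num_blocks, block_size, d)
    v_blocks = v.view(num_heads, num_blocks, block_size, d)
    # Gate: score each block by its mean key (assumed heuristic).
    block_keys = k_blocks.mean(dim=2)                   # (H, num_blocks, d)
    gate = torch.einsum('hqd,hbd->hqb', q, block_keys)  # (H, Q, num_blocks)
    # GQA-style sharing: average gate scores within each head group so every
    # head in the group selects the same blocks, cutting KV I/O.
    gate = gate.view(num_heads // group_size, group_size, seq_len, num_blocks)
    gate = gate.mean(dim=1, keepdim=True).expand(-1, group_size, -1, -1)
    gate = gate.reshape(num_heads, seq_len, num_blocks)
    topk = gate.topk(min(top_k, num_blocks), dim=-1).indices  # (H, Q, top_k)
    out = torch.zeros_like(q)
    scale = d ** -0.5
    for h in range(num_heads):
        for t in range(seq_len):
            blocks = topk[h, t]
            # Softmax attention over the selected blocks only.
            ks = k_blocks[h, blocks].reshape(-1, d)
            vs = v_blocks[h, blocks].reshape(-1, d)
            attn = torch.softmax((q[h, t] @ ks.T) * scale, dim=-1)
            out[h, t] = attn @ vs
    return out
```

With top_k blocks out of seq_len / block_size, attention cost per query drops from O(seq_len) to O(top_k × block_size), which is where the long-sequence throughput gain comes from.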

GitHub: https://github.com/inclusionAI/Ring-V2/tree/main/moba

Key Points:   

🌟 The new model Ring-mini-sparse-2.0-exp excels at long-sequence reasoning, with nearly triple the throughput of its predecessor.   

🔍 The model adopts an innovative sparse attention mechanism, balancing efficient reasoning and context processing capabilities.   

📥 The model is open-sourced on multiple platforms, making it convenient for the community to apply and research.