Ant Group's Bailing large model team recently announced the open-source release of two new efficient reasoning models, Ring-flash-linear-2.0 and Ring-mini-linear-2.0, designed specifically to make deep reasoning more efficient. Alongside the models, the team released two high-performance fused operators developed in-house: an FP8 fused operator and a linear attention inference fused operator. Together, these aim to deliver efficient reasoning with "large parameters, low activation" and support for ultra-long contexts.
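A key ingredient here is the linear attention design reflected in the model names: instead of comparing every new token against all previous tokens (quadratic in sequence length), attention is accumulated in a fixed-size running state, so cost grows linearly and per-token memory stays constant. The sketch below illustrates the general idea in plain PyTorch; it is not the team's fused operator, and the feature map and normalization shown are common illustrative choices, not necessarily those used in the Ring models.

```python
import torch
import torch.nn.functional as F

def causal_linear_attention(q, k, v):
    """Causal linear attention via a fixed-size running state.

    q, k: (seq_len, d_k); v: (seq_len, d_v).
    Illustrative only: real inference kernels fuse these steps,
    and the Ring models' exact formulation is not shown here.
    """
    # A common positive feature map (an assumption, not Ring's choice).
    q, k = F.elu(q) + 1.0, F.elu(k) + 1.0

    seq_len, d_k = q.shape
    d_v = v.shape[-1]
    state = torch.zeros(d_k, d_v)   # running sum of outer(k_t, v_t)
    norm = torch.zeros(d_k)         # running sum of k_t
    out = torch.empty(seq_len, d_v)
    for t in range(seq_len):
        state += torch.outer(k[t], v[t])  # O(d_k * d_v) per step, independent of t
        norm += k[t]
        out[t] = (q[t] @ state) / (q[t] @ norm + 1e-6)
    return out
```

Because each step only updates a fixed-size state, the whole recurrence can be expressed as a single fused kernel at inference time, which is what makes ultra-long contexts practical on fixed memory budgets.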

According to the team, thanks to architectural optimization and the high-performance operators working in concert, the inference cost of the two new models in deep reasoning scenarios is only about one tenth that of dense models of the same scale, and more than 50% lower than that of the earlier Ring series. In practice, this means users can run complex reasoning workloads with significantly less compute.

The advantages of the new models are not limited to lower cost: the training and inference engine operators are closely aligned. This alignment allows long, stable, and efficient optimization during the reinforcement learning phase, which in turn lets the models maintain state-of-the-art (SOTA) results on multiple challenging reasoning benchmarks, giving users stronger tools for complex reasoning tasks.

As open-source projects, Ring-flash-linear-2.0 and Ring-mini-linear-2.0 have been released on multiple platforms, including Hugging Face and ModelScope, where developers can find more information and try the models out.
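For a quick start, models published on Hugging Face are typically loadable through the standard `transformers` API. The snippet below is a minimal sketch; the repository id is an assumption based on the model names, so check the actual model card on Hugging Face or ModelScope before running it.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Repo id assumed from the announced model name; verify on the model card.
model_id = "inclusionAI/Ring-mini-linear-2.0"

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",
    device_map="auto",
    trust_remote_code=True,  # custom linear-attention architecture code
)

messages = [{"role": "user", "content": "Explain linear attention in one paragraph."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
outputs = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```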

With this open-source release, Ant Group's Bailing large model team not only demonstrates its technical strength in the AI field but also gives developers more efficient tools, helping them pursue further breakthroughs in AI research and development.