Recently, the SiliconFlow large-model service platform officially launched Ling-mini-2.0, the latest open-source model from Ant Group's BaiLing team. The model combines near state-of-the-art performance with extremely fast generation, showing that a small model can deliver serious capability.


Ling-mini-2.0 adopts a Mixture-of-Experts (MoE) architecture with 16B total parameters, of which only 1.4B are activated per token during generation, which greatly speeds up decoding. Despite the small activated footprint, the model is reported to match or exceed dense language models under 10B parameters as well as larger MoE models, and it supports a maximum context length of 128K tokens, greatly expanding its range of applications.
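To see why so few parameters are active per token, here is a minimal, generic sketch of top-k expert routing in PyTorch. This is not Ling-mini-2.0's actual implementation, and all layer sizes and expert counts below are invented for illustration: a router scores each token, only the top-k experts run on it, so roughly k/n of the expert parameters are exercised per token.

```python
# Minimal sketch of top-k MoE routing (illustrative only; sizes are made up).
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoE(nn.Module):
    def __init__(self, d_model=512, d_ff=1024, n_experts=32, k=2):
        super().__init__()
        self.k = k
        self.router = nn.Linear(d_model, n_experts)  # scores each token per expert
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):  # x: (n_tokens, d_model)
        logits = self.router(x)
        weights, idx = logits.topk(self.k, dim=-1)   # keep only the k best experts per token
        weights = F.softmax(weights, dim=-1)         # normalize the k routing weights
        out = torch.zeros_like(x)
        for slot in range(self.k):
            for e in range(len(self.experts)):
                mask = idx[:, slot] == e             # tokens whose slot-th choice is expert e
                if mask.any():
                    out[mask] += weights[mask, slot:slot + 1] * self.experts[e](x[mask])
        # Only k of n_experts ran per token, so only ~k/n of expert params were active.
        return out
```

In the same spirit, Ling-mini-2.0's 1.4B-of-16B activation ratio means each token pays roughly the compute cost of a ~1.4B dense model while the full 16B of capacity remains available across tokens.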


In benchmark tests, Ling-mini-2.0 performs strongly on reasoning tasks across multiple domains, including coding, mathematics, and knowledge-intensive reasoning, demonstrating solid all-around reasoning ability. On difficult tasks in particular, it outperforms many models of comparable size.

Ling-mini-2.0 also stands out on generation speed. On question-answering tasks within 2,000 tokens, it generates more than 300 tokens per second, over twice the speed of a traditional 8B dense model. The relative speedup grows with output length, reaching up to 7x.

To make adoption easier, the SiliconFlow platform provides multiple access options and API documentation, and lets developers compare and combine models on the platform to build generative AI applications. Several large-model APIs on the platform are free to use, further lowering the barrier to applying AI technology.
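As a concrete starting point, the sketch below streams a completion through SiliconFlow's OpenAI-compatible chat-completions endpoint. The base URL follows SiliconFlow's public API documentation, but the exact model identifier used here is an assumption; check the platform's model list for the current name.

```python
# Hedged sketch: calling the model via SiliconFlow's OpenAI-compatible API.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.siliconflow.cn/v1",  # SiliconFlow's OpenAI-compatible endpoint
    api_key="YOUR_API_KEY",                    # issued in the platform console
)

response = client.chat.completions.create(
    model="inclusionAI/Ling-mini-2.0",         # assumed model ID; verify on the platform
    messages=[{"role": "user", "content": "Explain MoE activation in one sentence."}],
    max_tokens=256,
    stream=True,                               # stream tokens to observe generation speed
)
for chunk in response:
    delta = chunk.choices[0].delta.content
    if delta:
        print(delta, end="", flush=True)
```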

Key Points:

🧠 Ling-mini-2.0 has 16B total parameters but activates only 1.4B per token, enabling efficient generation.

🚀 The model supports a maximum context length of 128K and demonstrates strong reasoning across coding, math, and knowledge tasks.

💻 The SiliconFlow platform offers multiple access options, making it easy for developers to use a range of large-model APIs.