Microsoft has open-sourced rStar2-Agent, an AI agent reasoning model trained with an innovative agent reinforcement learning approach. Despite having only 14 billion parameters, it achieved 80.6% accuracy on the AIME24 math reasoning benchmark, surpassing the 671-billion-parameter DeepSeek-R1 (79.8%), a result that has prompted a reassessment of the relationship between model size and performance.
rStar2-Agent's strong results are not limited to math reasoning. On the GPQA-Diamond science reasoning benchmark it scored 60.9%, above DeepSeek-V3's 59.1%, and on the BFCL v3 agent tool-use benchmark its task completion rate reached 60.8%, also higher than DeepSeek-V3's 57.6%. Together, these results show that the model generalizes well across task types.
To achieve this breakthrough, Microsoft made innovations in three areas: training infrastructure, the RL algorithm, and the training recipe. First, on the infrastructure side, Microsoft built an efficient, isolated code-execution service that can absorb a large volume of requests during training, supporting up to 45,000 concurrent tool calls per training step with an average latency of only 0.3 seconds. Second, Microsoft proposed GRPO-RoC (Group Relative Policy Optimization with a Resample-on-Correct rollout strategy), which oversamples rollouts and then keeps correct traces with the cleanest tool usage alongside a diverse set of failures, so that noisy tool-call errors do not dominate the training signal (a minimal sketch follows below). Finally, Microsoft designed an efficient training recipe for rStar2-Agent, "non-reasoning fine-tuning + multi-stage reinforcement learning," to ensure that the model improves steadily at each stage.
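To make the Resample-on-Correct idea concrete, here is a minimal Python sketch. The `Rollout` record, its field names, the half-and-half split between correct and incorrect traces, and the standalone advantage function are illustrative assumptions for this sketch, not Microsoft's released implementation.

```python
import random
from dataclasses import dataclass
from typing import List

# Hypothetical rollout record; the field names are illustrative, not Microsoft's code.
@dataclass
class Rollout:
    answer_correct: bool    # did the final answer pass verification?
    tool_error_count: int   # failed or malformed tool calls in the trace
    reward: float           # terminal reward, e.g. 1.0 if correct else 0.0

def resample_on_correct(rollouts: List[Rollout], group_size: int) -> List[Rollout]:
    """Downsample an oversampled rollout group in a Resample-on-Correct style.

    Correct rollouts are preferred in order of fewest tool-call errors, while
    incorrect ones are kept at random so the group still contains diverse
    negative examples for the group-relative baseline.
    """
    correct = sorted((r for r in rollouts if r.answer_correct),
                     key=lambda r: r.tool_error_count)
    incorrect = [r for r in rollouts if not r.answer_correct]
    random.shuffle(incorrect)

    half = group_size // 2
    kept = correct[:half] + incorrect[:group_size - half]
    if len(kept) < group_size:  # top up from whichever pool still has rollouts
        spare = correct[half:] + incorrect[group_size - half:]
        kept += spare[:group_size - len(kept)]
    return kept

def group_relative_advantages(group: List[Rollout]) -> List[float]:
    """GRPO-style advantage: reward centered on (and scaled by) group statistics."""
    rewards = [r.reward for r in group]
    mean = sum(rewards) / len(rewards)
    std = (sum((x - mean) ** 2 for x in rewards) / len(rewards)) ** 0.5
    return [(x - mean) / (std + 1e-6) for x in rewards]

# Example: oversample 16 rollouts, keep a training group of 8.
if __name__ == "__main__":
    sampled = [Rollout(random.random() < 0.4, random.randint(0, 3), 0.0) for _ in range(16)]
    for r in sampled:
        r.reward = 1.0 if r.answer_correct else 0.0
    group = resample_on_correct(sampled, group_size=8)
    print(group_relative_advantages(group))
```

The intent of preferring correct traces with fewer tool-call errors is to keep noisy environment feedback out of the positive training signal while still retaining informative failures for the group-relative baseline.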
This series of technological breakthroughs has enabled rStar2-Agent to stand out in the AI agent field and has opened up new directions for future research and applications of intelligent agents.
Open source address: https://github.com/microsoft/rStar
Key points:
🌟 With only 14 billion parameters, rStar2-Agent reached 80.6% accuracy on the AIME24 math reasoning benchmark, surpassing the 671-billion-parameter DeepSeek-R1.
🔧 Microsoft made innovations in infrastructure, algorithms, and training processes to ensure efficient training and outstanding performance of the model.
📊 rStar2-Agent performs well in science reasoning and tool usage tasks, demonstrating strong generalization capabilities.