Stanford Research Team Releases AgentFlow: A Next-Generation Reinforcement Learning Framework for Modular, Tool-Using AI Agents

AIbase基地

Published in AI News · 4 min read · Oct 9, 2025

A research team from Stanford University recently released AgentFlow, a trainable agent framework that aims to improve AI decision-making through modular design and tool integration. AgentFlow consists of four modules, the Planner, Executor, Verifier, and Generator, which are coordinated through explicit memory. At each step, the Planner proposes a sub-goal and selects an appropriate tool and context, the Executor calls the tool, and the Verifier decides whether to continue. Once the task is complete, the Generator produces the final answer.
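The control flow described above can be sketched in a few lines of Python. This is only an illustrative sketch, not AgentFlow's actual API: the names `Memory`, `Step`, and `run_agent`, the callable signatures, and the `max_turns` budget are all assumptions made for the example, and the toy planner, tool, verifier, and generator stand in for real model calls.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple


@dataclass
class Step:
    sub_goal: str
    tool: str
    result: str


@dataclass
class Memory:
    """Explicit memory read and written by the modules (hypothetical structure)."""
    query: str
    steps: List[Step] = field(default_factory=list)


def run_agent(
    query: str,
    planner: Callable[[Memory], Tuple[str, str]],
    tools: Dict[str, Callable[[str], str]],
    verifier: Callable[[Memory], bool],
    generator: Callable[[Memory], object],
    max_turns: int = 5,
):
    memory = Memory(query=query)
    for _ in range(max_turns):
        sub_goal, tool_name = planner(memory)      # Planner: propose a sub-goal, pick a tool
        result = tools[tool_name](sub_goal)        # Executor: call the selected tool
        memory.steps.append(Step(sub_goal, tool_name, result))
        if verifier(memory):                       # Verifier: True means the task is done
            break
    return generator(memory)                       # Generator: produce the final answer


# Toy stand-ins for the four modules (assumptions for illustration only).
def toy_planner(memory: Memory) -> Tuple[str, str]:
    return (f"look up: {memory.query}", "search")


toy_tools = {"search": lambda goal: "Paris"}
toy_verifier = lambda memory: len(memory.steps) >= 1
toy_generator = lambda memory: memory.steps[-1].result

answer = run_agent("capital of France", toy_planner, toy_tools, toy_verifier, toy_generator)
```

In this sketch the memory is the only channel between modules, which mirrors the article's point that coordination happens through explicit memory rather than hidden state.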

The framework's core innovation is its training method, Flow-GRPO (Flow-based Group Refined Policy Optimization), which turns long-horizon, sparse-reward optimization into tractable single-turn updates. Specifically, Flow-GRPO broadcasts a single verifiable trajectory-level reward to every step, aligning each local decision with global success. It computes importance ratios per token and combines them with PPO-style clipping and a KL penalty to prevent policy drift.
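To make the ingredients of this objective concrete, here is a minimal sketch of a loss with the same shape: one trajectory-level reward per rollout, broadcast to every turn, with a per-token importance ratio, PPO-style clipping, and a KL penalty. It is not the authors' implementation. The group normalization of rewards is an assumption based on the "Group" in the method's name, the KL term is a crude per-token estimate, and the data layout, function names, and the `eps` and `beta` values are hypothetical.

```python
import math
from statistics import mean, pstdev


def group_advantages(rewards):
    """Normalize trajectory-level rewards within a group of rollouts (assumed step)."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + 1e-8) for r in rewards]


def clipped_token_loss(logp_new, logp_old, advantage, eps=0.2):
    """PPO-style clipped surrogate for a single token, returned as a loss."""
    ratio = math.exp(logp_new - logp_old)            # per-token importance ratio
    clipped = max(min(ratio, 1 + eps), 1 - eps)
    return -min(ratio * advantage, clipped * advantage)


def flow_grpo_loss(group, beta=0.01, eps=0.2):
    """group: list of rollouts, each {'reward': float, 'turns': [...]};
    each turn is a list of (logp_new, logp_old, logp_ref) tuples, one per token."""
    advantages = group_advantages([rollout["reward"] for rollout in group])
    losses = []
    for rollout, adv in zip(group, advantages):
        for turn in rollout["turns"]:                # same advantage broadcast to every turn
            for logp_new, logp_old, logp_ref in turn:
                pg = clipped_token_loss(logp_new, logp_old, adv, eps)
                kl = logp_new - logp_ref             # crude per-token KL estimate (assumption)
                losses.append(pg + beta * kl)
    return mean(losses)


# Two toy rollouts: one succeeded (reward 1.0), one failed (reward 0.0).
toy_group = [
    {"reward": 1.0, "turns": [[(0.0, 0.0, 0.0)]]},
    {"reward": 0.0, "turns": [[(0.0, 0.0, 0.0)]]},
]
toy_loss = flow_grpo_loss(toy_group)
```

Because every turn in a rollout receives the same advantage, each turn can be treated as its own single-turn update, which is the simplification the article attributes to Flow-GRPO.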

The research team evaluated AgentFlow on 10 benchmarks covering four task types: knowledge-intensive search, agentic reasoning, math, and science. The 7B model trained with Flow-GRPO showed average gains of 14.9% on search, 14.0% on agentic reasoning, 14.5% on math, and 4.1% on science. According to the team, the model outperformed existing strong baselines and even surpassed GPT-4o.

The study also reported more reliable tool use, with a 28.4% reduction in tool-call errors, and found that planning quality improves with larger turn budgets and model sizes.

The open-source implementation of AgentFlow provides a modular toolkit with quick-start scripts for inference, training, and benchmarking. The project is released under the MIT license, which keeps it open and accessible for further research and development.

Key Points:
🛠️ AgentFlow is a modular AI agent framework, consisting of four modules: Planner, Executor, Verifier, and Generator.
🚀 The Flow-GRPO training method efficiently optimizes the agent's decision-making process, guiding each step with trajectory-level rewards.
📈 Experimental results show that AgentFlow performs well across 10 benchmarks, with an average gain of 14.9% on search tasks, surpassing existing strong baselines.




This article is from AIbase Daily
