Google researchers have proposed a framework called "Reasoning Memory" — a learnable reasoning memory — that aims to let AI agents accumulate knowledge from their own experiences and mistakes, achieving genuine "self-evolution." The work targets a critical flaw in current large language model (LLM)-driven agents and pushes AI toward greater intelligence and autonomy.

The core pain point of current AI agents: unable to "grow" from experience
Although AI agents based on large language models perform well in reasoning and task execution, they generally lack a sustainable learning mechanism. According to AIbase, existing agents do not "evolve" after completing tasks: each execution starts from scratch. This leads to a series of problems, including repeated errors, an inability to accumulate abstract experience, wasted historical data, and limited decision optimization. The deeper reason is that even when memory modules are added, they are mostly limited to simple information caching (such as episodic memory) and lack the ability to generalize, abstract, and reuse experience. As a result, agents cannot form "learnable reasoning memory" and struggle to truly improve themselves.

Explanation of Google's new framework: How Reasoning Memory empowers self-evolution
The Google research team introduced the Reasoning Memory framework, a memory system specifically designed for AI agents, capable of accumulating, generalizing, and reusing reasoning experiences. According to AIbase, the core of this framework is to enable agents to extract abstract knowledge from their interactions, mistakes, and successes, forming learnable "reasoning memory." Specifically:
- Accumulate experience: Agents no longer discard task history but systematically record reasoning processes and results.
- Generalize and abstract: Algorithms convert specific experiences into general rules, avoiding mere episodic storage.
- Reuse and optimize: These memories are called upon in future tasks, adjusting decisions based on past experiences to reduce repeated errors.
This mechanism allows AI agents to "learn from mistakes" like humans, achieving closed-loop self-evolution. Experiments show that agents equipped with this framework significantly improve performance in complex tasks, marking a leap from static execution to dynamic growth.
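The accumulate / generalize / reuse loop described above can be illustrated with a minimal sketch. Note this is a hypothetical illustration, not the paper's actual implementation: all class and method names (`Experience`, `ReasoningMemory`, `record`, `generalize`, `retrieve`) are assumptions introduced here for clarity.

```python
# Hypothetical sketch of a reasoning-memory loop. Names and structure are
# assumptions for illustration, not the API from the Google paper.
from dataclasses import dataclass, field


@dataclass
class Experience:
    task_type: str        # category of task the agent attempted
    reasoning_trace: str  # the strategy or reasoning step taken
    succeeded: bool       # whether that attempt worked


@dataclass
class ReasoningMemory:
    experiences: list = field(default_factory=list)
    rules: dict = field(default_factory=dict)  # task_type -> abstract rules

    def record(self, exp: Experience) -> None:
        """Accumulate: keep the reasoning trace instead of discarding it."""
        self.experiences.append(exp)

    def generalize(self) -> None:
        """Abstract: distill per-task-type rules from successes and failures."""
        for exp in self.experiences:
            rules = self.rules.setdefault(exp.task_type, [])
            tag = "prefer" if exp.succeeded else "avoid"
            rule = f"{tag}: {exp.reasoning_trace}"
            if rule not in rules:  # avoid duplicate rules
                rules.append(rule)

    def retrieve(self, task_type: str) -> list:
        """Reuse: surface relevant rules to condition future decisions."""
        return self.rules.get(task_type, [])


# Usage: the agent logs two attempts, abstracts rules, then reuses them.
memory = ReasoningMemory()
memory.record(Experience("web_search", "query the official docs first", True))
memory.record(Experience("web_search", "trust the first forum hit", False))
memory.generalize()
print(memory.retrieve("web_search"))
```

In a real system, the `generalize` step would be performed by an LLM that rewrites concrete traces into general rules, and `retrieve` would use semantic rather than exact-key matching; this sketch only shows the closed loop of recording, abstracting, and reusing.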

Potential impact: AI agents moving toward a truly autonomous era
AIbase believes this research could reshape the AI application ecosystem. In automated customer service, medical diagnosis, or game AI, for example, agents could continuously optimize their strategies and reduce the need for human intervention. In the long run, it fills the "evolutionary gap" of LLM agents and paves the way for more reliable autonomous systems. Challenges remain, however: memory generalization capabilities and computational costs still need further verification. Google's move strengthens its position at the AI frontier and deserves close industry attention.
Paper URL: https://arxiv.org/pdf/2509.25140
