Alibaba has released WebAgent, an autonomous search AI agent, on GitHub. The agent combines end-to-end information retrieval with multi-step reasoning, so it can actively search, analyze, and make decisions in web environments much as a human would. It is intended to help researchers gather and organize information more efficiently.

The Core Functions of WebAgent

WebAgent is designed to help users quickly find the latest research results in a specific field. When a user submits a query on a topic, WebAgent can proactively access multiple academic databases, filter for relevant literature, and analyze and summarize it according to the user's needs. This speeds up information retrieval and aims to make the resulting research report more comprehensive and accurate.
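The search, filter, and summarize workflow described above can be sketched in a few lines. This is a toy illustration only: the `Document` type, the in-memory `CORPUS`, and the `search`, `summarize`, and `answer` functions are hypothetical names, and keyword overlap stands in for whatever retrieval and summarization WebAgent actually performs.

```python
from dataclasses import dataclass


@dataclass
class Document:
    title: str
    text: str


# Hypothetical in-memory stand-in for the academic databases an agent would query.
CORPUS = [
    Document("Web agents survey", "Web agents browse pages and answer questions."),
    Document("Protein folding", "Structure prediction for proteins."),
    Document("Agent benchmarks", "Benchmarks measure how agents traverse web pages."),
]


def search(query: str, corpus: list[Document]) -> list[Document]:
    """Keep documents that share at least one word with the query."""
    terms = set(query.lower().split())
    hits = []
    for doc in corpus:
        words = set(doc.text.lower().replace(".", "").split())
        if terms & words:
            hits.append(doc)
    return hits


def summarize(docs: list[Document]) -> str:
    """Toy summary: list the titles of the retained documents."""
    return "; ".join(doc.title for doc in docs)


def answer(query: str) -> str:
    """Run the pipeline: retrieve, filter, then summarize."""
    return summarize(search(query, CORPUS))
```

Calling `answer("web agents")` keeps the two documents that mention web agents and drops the unrelated one. A real agent would replace each step with model-driven actions and would typically loop, issuing follow-up searches based on what it has read so far.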

Two Modules with Clear Division of Labor

WebAgent consists of two main modules: WebDancer and WebWalker. WebDancer is an agent training framework focused on improving the ability to search for information on the web, while WebWalker is a benchmark for evaluating how well language models traverse the web.

1. **The Training Process of WebDancer**

  The training process of WebDancer is divided into four stages, running from data construction to model optimization, that gradually produce an agent able to complete complex information retrieval autonomously. First, in the data construction stage, WebDancer generates high-quality training data to address the limitations of traditional datasets. Second, trajectories are sampled from this data to obtain high-quality examples of the agent's behavior. Third, in the supervised fine-tuning (SFT) stage, the agent is initialized on these trajectories so that it adapts to the requirements of information retrieval. Finally, in the reinforcement learning stage, the agent learns to make better decisions in complex tasks by interacting with the environment. The DAPO algorithm (Decoupled Clip and Dynamic Sampling Policy Optimization) used in this stage is intended to improve data efficiency and policy robustness.

2. **Multi-Step Reasoning Capability**

  WebAgent's multi-step reasoning capability lets it integrate viewpoints from different documents and generate a comprehensive research report. This is useful for academic research, market analysis, and similar work.
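The article does not say how DAPO is configured in WebDancer's reinforcement learning stage. As a rough illustration only, the sketch below shows three ideas from the published DAPO description: rewards normalized within a group of rollouts, dynamic sampling that discards groups with identical rewards, and a clipped surrogate with separate lower and upper bounds. The function names and the default clip values are assumptions for illustration, not WebDancer's actual code.

```python
import statistics


def group_advantages(rewards: list[float]) -> list[float]:
    """Normalize rewards within one group of rollouts for the same question."""
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)  # population std; an illustrative choice
    if std == 0.0:
        # Every rollout scored the same, so the group carries no learning signal.
        return [0.0 for _ in rewards]
    return [(r - mean) / std for r in rewards]


def keep_group(rewards: list[float]) -> bool:
    """Dynamic sampling: keep a group only if its rollouts disagree on reward."""
    return len(set(rewards)) > 1


def clipped_term(
    ratio: float,
    advantage: float,
    eps_low: float = 0.2,   # assumed illustrative value
    eps_high: float = 0.28,  # assumed illustrative value
) -> float:
    """Per-token surrogate with decoupled lower and upper clip bounds."""
    clipped = min(max(ratio, 1.0 - eps_low), 1.0 + eps_high)
    # Take the more pessimistic of the unclipped and clipped objectives.
    return min(ratio * advantage, clipped * advantage)
```

With `eps_high` larger than `eps_low`, a token with positive advantage can raise its probability somewhat further before clipping applies, while `keep_group` avoids spending updates on questions where every rollout succeeded or every rollout failed.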

By open-sourcing WebAgent, Alibaba not only provides researchers with a powerful tool but also injects new momentum into research and development in the AI field.

GitHub: https://github.com/Alibaba-NLP/WebAgent