Ali WebShaper Released! GAIA Outperforms Claude 3.5 Sonnet and GPT-4o
Tongyi Lab of Alibaba released the open-source tool WebShaper, adopting an innovative formal-driven information retrieval paradigm. It achieved a score of 60.19 on the GAIA benchmark, surpassing Claude 3.5 Sonnet and GPT-4o. The framework ensures consistency between knowledge structure and reasoning logic through structured data generation methods, significantly enhancing AI's ability to handle complex tasks. As the fourth tool in the WebAgent series, WebShaper has received over 4,000 stars on GitHub and is driving the development of the open-source AI community.