Mind2Web-2

Public

Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge

agents benchmark cua dataset webagent

Created: 2025-06-09T05:27:00

Updated: 2025-06-30T10:40:23

https://osu-nlp-group.github.io/Mind2Web-2/


Related projects

AutoGPT

Hot

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

180179

1 year ago

+59 today

Browser Use

Hot

ai-agents

Make websites accessible for AI agents

73433

1 year ago

+154 today

Autogen

Hot

agentic

A programming framework for agentic AI. PyPi: autogen-agentchat. Discord: https://aka.ms/autogen-discord. Office Hour: https://aka.ms/autogen-officehour

52346

2 years ago

+103 today

Anything Llm

Hot

agent-framework-javascript

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.

51978

1 year ago

+113 today

Ai Agents For Beginners

Hot

agentic-ai

10 Lessons to Get Started Building AI Agents

46259

1 year ago

+233 today

Llama_index

Hot

agents

LlamaIndex is the leading framework for building LLM-powered agents over your data.

45704

1 year ago

+65 today

Agno

Hot

agents

Agno is a lightweight library for building Multimodal Agents. It exposes LLMs as a unified API and gives them superpowers like memory, knowledge, tools and reasoning.

35825

1 year ago

+84 today

CopilotKit

Hot

agent

React UI + elegant infrastructure for AI Copilots, AI chatbots, and in-app AI agents. The Agentic last-mile.

25308

1 year ago

+55 today

Awesome Ai Agents

Hot

agent

A list of AI autonomous agents

24531

1 year ago

+61 today

AgenticSeek

agentic-ai

An open, local Manus AI alternative. Powered by Deepseek R1. No APIs, no $456 monthly bills. Enjoy an AI agent that reasons, codes, and browses with no worries.

24031

1 year ago

+24 today
