Home
Information

AI Dataset Collection

Large-scale datasets and benchmarks for training, evaluating, and testing models to measure

Tools

Intelligent Document Recognition

Comprehensive Text Extraction and Document Processing Solutions for Users

AI Tutorial

Mind2Web-2

Public

Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge

Creat2025-06-09T05:27:00
Update2025-06-30T10:40:23
https://osu-nlp-group.github.io/Mind2Web-2/
85
Stars
1
Stars Increase