AIbase

Mind2Web-2

Public

Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge

Creat2025-06-09T05:27:00
Update2025-06-30T10:40:23
https://osu-nlp-group.github.io/Mind2Web-2/
63
Stars
0
Stars Increase

Related projects