Home
Information

AI Dataset Collection

Large-scale datasets and benchmarks for training, evaluating, and testing models to measure

Tools

Intelligent Document Recognition

Comprehensive Text Extraction and Document Processing Solutions for Users

AI Tutorial

ScienceAgentBench

Public

[ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

Creat2024-10-02T22:38:55
Update2025-03-24T13:32:17
https://osu-nlp-group.github.io/ScienceAgentBench
103
Stars
0
Stars Increase

Related projects