ScienceAgentBench
Public
[ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
ai4science, bioinformatics, chemoinformatics, code-generation, cognitive-neuroscience, geoscience, language-agent, large-language-models, task-automation
Created: 2024-10-02T22:38:55
Updated: 2025-03-24T13:32:17
https://osu-nlp-group.github.io/ScienceAgentBench
Stars: 94
Stars Increase: 0