AIbase

ToolQA

Public

ToolQA is a new dataset for evaluating how well LLMs answer challenging questions using external tools. It offers two difficulty levels (easy and hard) across eight real-life scenarios.

Created: 2023-06-06T15:09:04
Updated: 2025-03-06T13:37:36
https://arxiv.org/pdf/2306.13304.pdf
Stars: 269
Stars increase: 1
