ToolQA

Public

ToolQA is a dataset for evaluating how well LLMs answer challenging questions using external tools. It offers two difficulty levels (easy and hard) across eight real-life scenarios.

Created: 2023-06-06T15:09:04
Updated: 2025-03-06T13:37:36
https://arxiv.org/pdf/2306.13304.pdf
Stars: 277
Stars increase: 0