Home
Information

AI Dataset Collection

Large-scale datasets and benchmarks for training, evaluating, and testing models to measure

Tools

Intelligent Document Recognition

Comprehensive Text Extraction and Document Processing Solutions for Users

AI Tutorial

Sys2Bench

Public

Sys2Bench is a benchmarking suite designed to evaluate reasoning and planning capabilities of large language models across algorithmic, logical, arithmetic, and common-sense reasoning tasks.

Creat2025-02-17T08:04:12
Update2025-03-24T21:20:35
https://arxiv.org/abs/2502.12521
26
Stars
0
Stars Increase

Related projects