Home
Information

AI Dataset Collection

Large-scale datasets and benchmarks for training, evaluating, and testing models to measure

Tools

Intelligent Document Recognition

Comprehensive Text Extraction and Document Processing Solutions for Users

AI Tutorial

wikigame-llm-eval

Public

Companion repo for CLiC-it 2025 paper on WikiGame. Reproducible pipeline to benchmark LLMs on Wikipedia navigation with human baselines.

Creat2025-06-24T20:55:26
Update2025-09-24T21:36:38
1
Stars
0
Stars Increase

Related projects