GoodBadGreedy

Public

The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism

Created: 2024-07-13T16:56:27
Updated: 2025-03-22T09:41:07
https://arxiv.org/abs/2407.10457
Stars: 30
Stars Increase: 0