LLM-Game-Benchmark
PublicEvaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard
Creat:2024-05-20T23:33:24
Update:2025-03-21T04:54:07
17
Stars
0
Stars Increase
Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard