evals
PublicEvaluate and compare AI language models on coding tasks with Evals. Run structured tests, integrate usage rules, and generate detailed reports. ??
Creat:2025-06-26T19:53:34
Update:2025-06-26T19:58:49
https://tonyslebew.github.io
0
Stars
0
Stars Increase