cobbler
Code and data for Koo et al.'s ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"
Topics: bias, bias-detection, evaluation, llm, llm-as-a-judge, llm-as-evaluator, llm-as-judge, llm-evaluation, llms, llms-benchmarking
Created: 2023-08-12T03:45:05
Updated: 2025-03-24T13:53:48
https://minnesotanlp.github.io/cobbler-project-page/
Stars: 20
Stars Increase: 0