mirage-bench
Public用于竞技场 RAG 排行榜 (NAACL'25) 的多语言生成、RAG 评估和替代裁判训练的代码库
anyscale-endpointarenaazure-apiclaude-apicohere-apievaluation-frameworkgemini-apillm-inferenceopenai-apirag
Creat:2024-09-17T21:30:59
Update:2025-04-10T23:55:46
https://mirage-bench.github.io/
9
Stars
0
Stars Increase