MLLM-CompBench
[NeurIPS'24] MLLM-CompBench evaluates the comparative reasoning of multimodal LLMs (MLLMs) with 40K image pairs and questions across 8 dimensions of relative comparison: visual attribute, existence, state, emotion, temporality, spatiality, quantity, and quality. CompBench covers diverse visual domains, including animals, fashion, sports, and scenes.
Topics: benchmark, evaluation-llms, foundation-models, human-annotation, large-language-models, llms, llms-benchmarking, multimodal-deep-learning, multimodal-large-language-models, neurips-2024
Created: 2024-07-24T01:48:04
Updated: 2025-04-16T01:41:53
https://compbench.github.io/
Stars: 40
Stars Increase: 0