SPEC
Public[CVPR 2024] The official implementation of paper "synthesize, diagnose, and optimize: towards fine-grained vision-language understanding"
clipcompositionalitycompostionalcomputer-visioncvpr2024fine-grainedimage-retrievallanguagemultimodalrobustness
Creat:2023-11-27T15:55:15
Update:2025-03-26T22:58:05
https://arxiv.org/abs/2312.00081
45
Stars
0
Stars Increase