No description available
Interpretable Image Detection and Localization based on Multimodal Large Language Models