PAINT
PublicPAINT (Paying Attention to INformed Tokens) is a plug-and-play framework that intervenes in the self-attention of the LLM and selectively boost the visual attention informed tokens to mitigate hallucination of Vision Language Models
Creat:2024-10-02T00:31:56
Update:2024-03-07T11:41:20
https://arxiv.org/abs/2501.12206
9
Stars
0
Stars Increase