LLaVA-STF
PublicThe official implementation of "Learning Compact Vision Tokens for Efficient Large Multimodal Models"
efficient-deep-learningefficient-inferencelarge-multimodal-modelslarge-vision-language-modelsllamallavatoken-fusiontoken-mergingvision-token-merging
Erstellungszeit:2025-05-21T16:11:14
Aktualisierungszeit:2025-06-16T05:37:39
29
Stars
0
Stars Increase