LLaVA-Mini
Public
LLaVA-Mini is a unified large multimodal model (LMM) that efficiently supports the understanding of images, high-resolution images, and videos.
efficient, gpt4o, gpt4v, large-language-models, large-multimodal-models, llama, llava, multimodal, multimodal-large-language-models, video
Created: 2025-01-08T02:37:05
Updated: 2025-03-26T11:12:00
Stars: 515
Stars Increase: 0