multiclass-image-classification-using-multimodal-llms
PublicA comprehensive comparison of multimodal models - llama3.2-vision, minicpm-v, llava-llama3, llava, llava13:b and closed source models for animal classification tasks. This project evaluates various models' performance in classifying 10 different animal species, ranging from common to rare animals.
artificial-intelligencecomputer-visiongeminigoogle-generative-ailarge-language-modelsmachine-learningnatural-language-processingollamaopenaipython
Creat:2024-12-09T03:47:43
Update:2025-02-23T19:28:40
8
Stars
0
Stars Increase