PaliGemma-flickr8k-finetuning
PublicThis repository contains code for fine-tuning Google's PaliGemma vision-language model on the Flickr8k dataset for image captioning tasks
artificial-intelligencecompter-visioncomputer-visiondeep-learningfine-tuningflaxflickr8k-datasetimage-annotationimage-captioningimage-processing
Creat:2025-05-25T22:02:12
Update:2025-05-25T22:55:09
1
Stars
0
Stars Increase