VisionGPT2
Public
Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.
Created: 2023-09-29T03:36:07
Updated: 2025-03-21T05:08:05
https://www.kaggle.com/code/shreydan/visiongpt2-image-captioning-pytorch
Stars: 43
Stars Increase: 0