AIbase
Product LibraryTool NavigationMCP

VisionGPT2

Public

Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.

Creat2023-09-29T03:36:07
Update2025-03-21T05:08:05
https://www.kaggle.com/code/shreydan/visiongpt2-image-captioning-pytorch
43
Stars
0
Stars Increase

Related projects