
vit-gpt2-image-captioning

Public

Fine-tuning an encoder-decoder transformer (ViT-Base-Patch16-224-In21k as the image encoder and DistilGPT2 as the text decoder) for image captioning on the COCO dataset.

Created: 2023-05-11T04:02:32
Updated: 2024-11-29T17:04:38
Stars: 3
Stars increase: 0
