vit-gpt2-image-captioning

Public

Fine-tuning an encoder-decoder transformer (ViT-Base-Patch16-224-In21k and DistilGPT2) for image captioning on the COCO dataset

bert coco-dataset distilbert encoder-decoder gpt-2 image-captioning imagenet pre-trained-language-models pytorch torch

Creat：2023-05-11T04:02:32

Update：2024-11-29T17:04:38

Stars

Stars Increase

Related projects

Transformers

Hot

bert

? Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

153615

2年前

+136today

Qlib

Hot

algorithmic-trading

Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms. including supervised learning, market dynamics modeling, and RL.

34395

8个月前

+85today

Label Studio

annotation

Label Studio is a multi-type data labeling and annotation tool with standardized output format

25742

8个月前

+45today

Haystack

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.

23563

8个月前

+32today

BiT

beam-splitter

[CVPR2023] Blur Interpolation Transformer for Real-World Motion from Blur

18311

8个月前

+4today

Leedl Tutorial

bert

《李宏毅深度学习教程》（李宏毅老师推荐?，苹果书?），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

16083

8个月前

+11today

LaTeX OCR

dataset

pix2tex: Using a ViT to convert images of equations into LaTeX code.

16012

8个月前

+12today

Cvat

annotation

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.

14894

8个月前

+16today

Nlp Tutorial

attention

Natural Language Processing Tutorial for Deep Learning Researchers

14800

1年前

+3today

PaddleNLP

bert

? Easy-to-use and powerful NLP and LLM library with ? Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including ?Text Classification, ? Neural Search, ? Question Answering, ?? Information Extraction, ? Document Intelligence, ? Sentiment Analysis etc.

12872

8个月前

+6today

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Services

AI Model Compatibility Checker

AI Deployment Calculator

vit-gpt2-image-captioning

Related projects

Transformers

Qlib

Label Studio

Haystack

BiT

Leedl Tutorial

LaTeX OCR

Cvat

Nlp Tutorial

PaddleNLP

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Services​

AI Model Compatibility Checker

AI Deployment Calculator

vit-gpt2-image-captioning

Related projects

Transformers

Qlib

Label Studio

Haystack

BiT

Leedl Tutorial

LaTeX OCR

Cvat

Nlp Tutorial

PaddleNLP

GEO Services