clip-synthetic-captions

Public

Tiny-scale experiment showing that CLIP models trained using detailed captions generated by multimodal models (CogVLM and LLaVA 1.5) outperform models trained using the original alt-texts on a range of classification and retrieval tasks.

clip cogvlm llava multimodal synthetic-data vision-language-model

Created: 2024-03-05T19:57:49

Updated: 2024-03-31T02:25:46

Related projects

LLaVA

chatbot

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

24131

1 year ago

+23 today

Sglang

Hot

cuda

SGLang is a fast serving framework for large language models and vision language models.

21068

12 months ago

+144 today

GenSim

clip

Generating Robotic Simulation Tasks via Large Language Models

16296

12 months ago

+6 today

Clip As Service

bert

Scalable embedding, reasoning, ranking for images and sentences with CLIP

12790

12 months ago

Boxmot

boosttrack

BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models

7856

12 months ago

+4 today

X AnyLabeling

Hot

annotation-tool

Effortless data labeling with AI support from Segment Anything and other awesome models.

7283

12 months ago

+95 today

Chinese CLIP

chinese

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

5682

12 months ago

+6 today

Data Juicer

chinese

Data processing for and with foundation models!

5604

12 months ago

+11 today

SUPIR

deep-learning

SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.

5366

12 months ago

+7 today

FunClip

gradio

Open-source, accurate and easy-to-use video speech recognition and clipping tool, with LLM-based AI clipping integrated.

5191

12 months ago

+9 today
