VisualLanguageModel

Public

A custom Vision-Language Model (VLM) built from scratch, using SigLip for contrastive learning and a ViT-based encoder to generate meaningful image captions and semantic descriptions.

contrastive-learning image-captioning kv-cache multimodal-learning pytorch siglip vision-language-model

Creat：2025-03-26T14:32:59

Update：2025-04-06T02:29:57

Stars

Stars Increase

Related projects

Tensorflow

deep-learning

An Open Source Machine Learning Framework for Everyone

192719

2年前

+38today

Stable Diffusion Webui

Hot

Stable Diffusion web UI

158833

1年前

+73today

Transformers

Hot

bert

? Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

153615

2年前

+136today

30 Seconds Of Code

astro

Coding articles to level up your development skills

125990

10个月前

+35today

Comfyui

Hot

ComfyUI docker images for use in GPU cloud and local environments. Includes AI-Dock base for authentication and improved user experience.

96213

10个月前

+526today

Pytorch

Hot

autograd

Tensors and Dynamic neural networks in Python with strong GPU acceleration

95701

10个月前

+88today

Opencv

c-plus-plus

Open Source Computer Vision Library

85202

7年前

+42today

Netdata

alerting

X-Ray Vision for your infrastructure!

76944

10个月前

+31today

D2l Zh

Hot

book

《动手学深度学习》：面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。

74277

10个月前

+60today

Redis

Hot

cache

For developers, who are building real-time data-driven applications, Redis is the preferred, fastest, and most feature-rich cache, data structure server, and document and vector query engine.

72066

7个月前

+60today

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Brand Visibility

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Services

AI Model Compatibility Checker

AI Deployment Calculator

VisualLanguageModel

Related projects

Tensorflow

Stable Diffusion Webui

Transformers

30 Seconds Of Code

Comfyui

Pytorch

Opencv

Netdata

D2l Zh

Redis

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Brand Visibility

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Services​

AI Model Compatibility Checker

AI Deployment Calculator

VisualLanguageModel

Related projects

Tensorflow

Stable Diffusion Webui

Transformers

30 Seconds Of Code

Comfyui

Pytorch

Opencv

Netdata

D2l Zh

Redis

GEO Services