Inkscope-Captions-2B-0526

Public

The Inkscope-Captions-2B-0526 model is a fine-tuned version of Qwen2-VL-2B-Instruct, optimized for image captioning, vision-language understanding, and English-language caption generation. This model was fine-tuned on the conceptual-captions-cc12m-llavanext dataset (first 30k entries) to generate detailed, high-quality captions for images.

captions gguf gradio gradio-interface hugging-face huggingface-transformers ocr-recognition qwen2-vl

Creat：2025-05-29T15:33:54

Update：2025-05-30T14:15:48

https://huggingface.co/prithivMLmods/Inkscope-Captions-2B-0526

Stars

Stars Increase

Related projects

Stable Diffusion Webui

Stable Diffusion web UI

154604

1年前

+45today

Gradio

data-analysis

Build and share delightful machine learning apps, all in Python. ? Star to support our work!

39077

1年前

+21today

HivisionIDPhotos

cnn

??HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

18472

3个月前

+17today

Stable Diffusion Webui Colab

stable diffusion webui colab

15898

3个月前

+1today

Ebook2audiobook

audiobooks

Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!

10805

3个月前

+12today

FunClip

gradio

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

4761

3个月前

+5today

Voice Pro

audiobook

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2E, F5-TTS, CosyVoice), with Whisper audio processing, RVC voice changer, YouTube download, UVR5 vocal isolation, and multilingual translation.

4318

3个月前

+4today

Ask Anything

big-model

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

3269

3个月前

+1today

InternGPT

chatgpt

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)

3216

3个月前

Sidekick

A native macOS app that allows users to chat with a local LLM that can respond with information from files, folders and websites on your Mac without installing any other software. Powered by llama.cpp.

2970

3个月前

+2today

Product Finder

Product Submit

AI Models Finder

MCP Servers

MCP Client

MCP Inspector

Case Tutorials

Latest AI News

AI Daily Brief

Inkscope-Captions-2B-0526

Related projects

Stable Diffusion Webui

Gradio

HivisionIDPhotos

Stable Diffusion Webui Colab

Ebook2audiobook

FunClip

Voice Pro

Ask Anything

InternGPT

Sidekick