VideoGLaMM

Public

[CVPR 2025] A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

Created: 2024-10-31T20:00:44
Updated: 2025-03-24T17:59:00
https://mbzuai-oryx.github.io/VideoGLaMM/
Stars: 90
Stars Increase: 0

Related projects