Video-RAG-master
PublicThis is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"
long-video-understandingmulti-modal-large-language-modelplug-and-playretrieval-augmented-generationtraining-freevideo-large-language-modelsvideo-understanding
Creat:2024-11-19T19:01:10
Update:2025-03-26T14:19:35
https://arxiv.org/pdf/2411.13093
221
Stars
1
Stars Increase