ReVisionLLM
PublicThis is the official implementation of ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos
Creat:2024-11-15T01:08:15
Update:2025-03-26T19:32:25
https://arxiv.org/abs/2411.14901
27
Stars
0
Stars Increase