LSDBench
PublicA benchmark that focuses on the sampling dilemma in long-video tasks. Through well-designed tasks, it evaluates the sampling efficiency of long-video VLMs.
benchmarklong-video-understandingmultimodal-large-language-modelsreasoning-agentsampling-strategiesvideo-understandingvideo-understanding-datasetvision-language-model
Creat:2025-03-28T11:35:04
Update:2025-03-28T14:04:28
https://arxiv.org/abs/2503.12496
17
Stars
0
Stars Increase