mPLUG-2
Public
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
foundation-models, image-retrieval, mllm, mplug, multimodal, multimodal-pretraining, video, video-question-answering, video-retrieval, vqa
Created: 2023-05-22T21:09:51
Updated: 2025-02-16T10:34:54
Stars: 228
Stars Increase: 0