groundingLMM
Public[CVPR 2024 ?] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
Creat:2023-11-03T00:53:47
Update:2025-03-24T16:23:12
https://grounding-anything.com
901
Stars
0
Stars Increase