ml-ferret
End-to-end MLLM, enabling precise referencing and localization.
•Common Product•Programming•Machine Learning•Language Model
ml-ferret is an end-to-end multimodal large language model (MLLM) that accepts various forms of referring input and responds with precisely localized output in multimodal settings. It combines a hybrid region representation with a spatial-aware visual sampler, supporting fine-grained, open-vocabulary referring and localization. The project also includes the GRIT dataset (approximately 1.1 million samples) and the Ferret-Bench evaluation benchmark.
ml-ferret Visit Over Time
Monthly Visits: 492,133,528
Bounce Rate: 36.20%
Pages per Visit: 6.1
Visit Duration: 00:06:33
ml-ferret Alternatives

Language Learning Games — AI text adventure games for language learning
•language learning•AI game
666

InternVL2_5-78B — Advanced multimodal large language model series
•Multimodal•Large Language Model
462