Ferret-UI-Llama8b
A multimodal large language model based on Llama-3-8B, focused on UI tasks.
CommonProductProgrammingMultimodalLarge Language Model
Ferret-UI is the first multimodal large language model (MLLM) centered on user interfaces, specifically designed for gesture expression, localization, and reasoning tasks. Built on Gemma-2B and Llama-3-8B, it is capable of performing complex user interface tasks. This version aligns with Apple's research paper and serves as a powerful tool for image-to-text tasks, excelling in dialogue and text generation.
Ferret-UI-Llama8b Visit Over Time
Monthly Visits
20899836
Bounce Rate
46.04%
Page per Visit
5.2
Visit Duration
00:04:57