uform
PublicPocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and ? video, up to 5x faster than OpenAI CLIP and LLaVA ?? & ??
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and ? video, up to 5x faster than OpenAI CLIP and LLaVA ?? & ??