Liquid AI has recently released the LFM2-VL series of vision-language foundation models, a further step in the push to make multimodal AI lightweight, fast, and deployable directly on devices.

The series comprises two models: LFM2-VL-450M and LFM2-VL-1.6B. The former, with fewer than 500 million parameters, is designed for resource-constrained hardware; the latter has more parameters but remains lightweight enough to run on a single GPU or directly on a device.


LFM2-VL builds on Liquid AI's earlier LFM2 architecture, integrating visual and language processing, supporting multi-resolution image input, and handling both text and images with a high degree of flexibility and compatibility (liquid.ai, VentureBeat). The models deliver up to 2× faster GPU inference while performing well on common benchmarks (VentureBeat, liquid.ai).

For image processing, LFM2-VL accepts images at their native resolution (up to 512×512), avoiding the distortion caused by forced rescaling. Larger images are split into non-overlapping tiles, and a thumbnail is added to provide global context (VentureBeat, liquid.ai). The architecture consists of a language model backbone, a SigLIP2 NaFlex vision encoder, and a multimodal projector. The projector is a two-layer MLP with pixel unshuffle, which reduces the number of image tokens and thereby speeds up processing (VentureBeat, liquid.ai).
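To make the token-reduction idea concrete, here is a minimal sketch of a pixel-unshuffle connector: neighboring image tokens are folded into the channel dimension before a two-layer MLP maps them into the language model's hidden size. The dimensions, patch grid, and class name below are illustrative assumptions, not the released LFM2-VL configuration.

```python
import torch
import torch.nn as nn

class PixelUnshuffleProjector(nn.Module):
    """Illustrative multimodal connector: pixel unshuffle + 2-layer MLP.
    All sizes are hypothetical, chosen only to show the mechanism."""

    def __init__(self, vision_dim=1024, lm_dim=2048, factor=2):
        super().__init__()
        self.factor = factor
        self.mlp = nn.Sequential(
            nn.Linear(vision_dim * factor * factor, lm_dim),
            nn.GELU(),
            nn.Linear(lm_dim, lm_dim),
        )

    def forward(self, tokens, grid_h, grid_w):
        # tokens: (batch, grid_h * grid_w, vision_dim) from the vision encoder
        b, n, c = tokens.shape
        f = self.factor
        x = tokens.view(b, grid_h, grid_w, c)
        # Fold each f x f neighborhood of tokens into one token with f*f*c channels,
        # shrinking the token count by a factor of f*f.
        x = x.view(b, grid_h // f, f, grid_w // f, f, c)
        x = x.permute(0, 1, 3, 2, 4, 5).reshape(b, (grid_h // f) * (grid_w // f), f * f * c)
        return self.mlp(x)

proj = PixelUnshuffleProjector()
vision_tokens = torch.randn(1, 32 * 32, 1024)   # e.g. a 512x512 image -> 32x32 patch grid
lm_tokens = proj(vision_tokens, 32, 32)
print(lm_tokens.shape)  # torch.Size([1, 256, 2048]) -- 4x fewer tokens
```

With a factor of 2, the 1,024 vision tokens collapse to 256 before reaching the language model, which is where most of the inference-speed benefit of this kind of projector comes from.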

As for training data, LFM2-VL was trained on roughly 10 billion multimodal tokens drawn from open-source datasets and the company's own synthetic image data (VentureBeat, liquid.ai). In evaluations, LFM2-VL-1.6B scores strongly on benchmarks such as RealWorldQA (65.23), InfoVQA (58.68), and OCRBench (742), and leads comparable models in inference efficiency (VentureBeat, liquid.ai).

Both models are now available on Hugging Face, along with example fine-tuning code on Colab, and are compatible with the Hugging Face Transformers and TRL libraries. They are released under a new "LFM1.0 license" built on Apache 2.0 principles: academic use is permitted, companies with annual revenue below $10 million may use the models commercially, and enterprises above that threshold must contact Liquid AI for a license (VentureBeat, liquid.ai).
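Since the weights are distributed through Hugging Face, a short inference sketch shows what loading them might look like. This assumes the model exposes the standard Transformers image-text-to-text chat interface; the image URL is a placeholder, and the exact arguments should be checked against the model card.

```python
# Minimal inference sketch, assuming the standard Transformers
# image-text-to-text interface; verify details on the model card.
from transformers import AutoProcessor, AutoModelForImageTextToText
from transformers.image_utils import load_image

model_id = "LiquidAI/LFM2-VL-1.6B"
processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForImageTextToText.from_pretrained(
    model_id, device_map="auto", torch_dtype="bfloat16", trust_remote_code=True
)

image = load_image("https://example.com/sample.jpg")  # placeholder image path/URL
conversation = [
    {"role": "user", "content": [
        {"type": "image", "image": image},
        {"type": "text", "text": "Describe this image in one sentence."},
    ]},
]

inputs = processor.apply_chat_template(
    conversation, add_generation_prompt=True, tokenize=True,
    return_dict=True, return_tensors="pt",
).to(model.device)
output = model.generate(**inputs, max_new_tokens=64)
print(processor.batch_decode(output, skip_special_tokens=True)[0])
```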

Liquid AI's LFM2-VL lineup offers a new path for deploying combined vision-and-text AI on devices such as phones, laptops, and wearables, reducing reliance on the cloud while improving privacy and response speed.

Project: https://huggingface.co/LiquidAI/LFM2-VL-1.6B

Key Points:

  • 🆕 Two Model Sizes: LFM2-VL-450M (for minimal-resource environments) and LFM2-VL-1.6B (more capable yet still lightweight), both suited to on-device deployment.

  • Speed and Efficiency: Up to 2× faster GPU inference while maintaining strong performance on multimodal tasks.

  • Multi-platform Friendly: Released on Hugging Face with tiered licensing, compatible with mainstream development tools, and suitable for academic use and commercial use by small and medium-sized businesses.