Efficient MoE Architecture Reshapes Edge AI. Liquid AI's LFM2-8B-A1B is the first Mixture-of-Experts (MoE) model in its LFM2 series, with 8.3B total parameters but only about 1.5B activated per token. This sparse activation significantly reduces compute while preserving representational capacity, making the model suitable for resource-constrained, on-device scenarios. Unlike traditional cloud-oriented MoE models, the design is optimized for real-time interaction, challenging the industry perception that small-scale MoE is inefficient.


The model is built on the LFM2 hybrid backbone, comprising 18 gated short-convolution blocks and 6 grouped-query attention (GQA) blocks. Except for the first two layers, which remain dense for stability, every layer integrates a sparse MoE feed-forward network with 32 experts, of which only the top 4 are activated per token; a normalized sigmoid router with an adaptive bias handles load balancing. The model supports a 32K context length and covers English, Arabic, Chinese, French, German, Japanese, Korean, and Spanish.
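As a rough illustration of this routing scheme, the sketch below shows a sparse MoE feed-forward block with sigmoid gating, a learned bias used only for expert selection (a common load-balancing device), top-4 selection out of 32 experts, and renormalization of the selected gate values. All names, dimensions, and the expert design are illustrative assumptions, not Liquid AI's implementation.

```python
# Illustrative sketch of a sparse MoE feed-forward block with a normalized
# sigmoid router and top-4 expert selection (assumed shapes; not LFM2's code).
import torch
import torch.nn as nn

class SparseMoE(nn.Module):
    def __init__(self, d_model=2048, d_ff=4096, n_experts=32, top_k=4):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(d_model, n_experts, bias=False)
        # Adaptive bias used only for expert *selection* (load balancing),
        # not for the gate values themselves; updated outside backprop.
        self.expert_bias = nn.Parameter(torch.zeros(n_experts), requires_grad=False)
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, d_ff), nn.SiLU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        ])

    def forward(self, x):                       # x: (tokens, d_model)
        scores = torch.sigmoid(self.router(x))  # sigmoid gating, (tokens, n_experts)
        _, idx = torch.topk(scores + self.expert_bias, self.top_k, dim=-1)
        gates = torch.gather(scores, -1, idx)
        gates = gates / gates.sum(dim=-1, keepdim=True)  # normalize selected gates
        out = torch.zeros_like(x)
        for slot in range(self.top_k):           # only the top-4 experts run per token
            for e in range(len(self.experts)):
                mask = idx[:, slot] == e
                if mask.any():
                    out[mask] += gates[mask, slot, None] * self.experts[e](x[mask])
        return out
```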

Training and Performance: 12T Tokens Forge 3-4B-Level Capabilities. LFM2-8B-A1B reaches 3-4B-level capability through pre-training on approximately 12T tokens, split roughly 55% English, 25% multilingual, and 20% code. Post-training uses Liquid Preference Alignment (a length-normalized DPO/APO-Zero fusion), and mixed BF16/FP8 precision improves training throughput by more than 3x.
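Liquid has not published the exact objective, but a length-normalized DPO term can be sketched as below: per-sequence log-probabilities are divided by response length before forming the usual preference margin against the reference model. Function names, arguments, and the beta value are illustrative assumptions, not the released recipe.

```python
# Hedged sketch of a length-normalized DPO loss (assumed form, not Liquid's exact recipe).
import torch.nn.functional as F

def length_normalized_dpo_loss(logp_chosen, logp_rejected,
                               ref_logp_chosen, ref_logp_rejected,
                               len_chosen, len_rejected, beta=0.1):
    """logp_* are summed token log-probs per sequence; len_* are token counts."""
    # Normalize by response length so longer answers are not implicitly favored.
    pi_margin = logp_chosen / len_chosen - logp_rejected / len_rejected
    ref_margin = ref_logp_chosen / len_chosen - ref_logp_rejected / len_rejected
    # Standard DPO form: maximize the policy's margin relative to the reference model.
    return -F.logsigmoid(beta * (pi_margin - ref_margin)).mean()
```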


In benchmark tests, the model outperforms competitors of similar scale:

  • Knowledge and Instruction Following: MMLU-Pro score of 37.4 (an increase of 11.5 from LFM2-2.6B), IFEval 77.6, Multi-IF 58.2.
  • Mathematical Ability: GSM8K 84.4, GSMPlus 64.8, MATH500 74.2.
  • Multilingual Processing: MGSM 72.4, MMMLU 55.3.
  • Coding and Writing: HumanEval+ 69.5, LiveCodeBench v6 21.0, EQ-Bench 44.2.

Overall, its output quality rivals 3-4B dense models, and it performs well in multi-turn dialogue, creative writing, retrieval-augmented generation (RAG), and tool calling.

Deployment and Integration: 5x Speedup, Compatible with Mainstream Frameworks. LFM2-8B-A1B shows a significant improvement in inference speed on both CPUs and GPUs.

On devices such as the AMD Ryzen AI 9 HX 370 and the Samsung Galaxy S24 Ultra, using custom XNNPACK MoE kernels with int4 weight quantization and int8 dynamic activation quantization, its decode throughput is up to 5x higher than Qwen3-1.7B and IBM Granite 4.0. On the GPU side, integration with vLLM supports FlashInfer and CUDA-graph compilation, enabling efficient operation for both single requests and online batching.
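For GPU serving, the standard vLLM offline API should apply once a build that recognizes the lfm2moe architecture is installed. The snippet below is a minimal sketch; the prompt and sampling settings are arbitrary choices for illustration.

```python
# Minimal vLLM sketch (assumes a vLLM build that already supports lfm2moe).
from vllm import LLM, SamplingParams

llm = LLM(model="LiquidAI/LFM2-8B-A1B")            # pulls weights from Hugging Face
params = SamplingParams(temperature=0.3, max_tokens=256)

outputs = llm.generate(["Summarize the benefits of on-device MoE models."], params)
print(outputs[0].outputs[0].text)
```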

Quantized variants are sized for high-end phones, tablets, and laptops: Q4_0 is approximately 4.7GB and F16 approximately 16.7GB. Supported frameworks include llama.cpp (build b6709 or later for lfm2moe support), ExecuTorch (mobile/embedded CPU), and vLLM (GPU). GGUF quantized files are available on Hugging Face, along with a Colab fine-tuning notebook, for quick development. The model can also be tested on Liquid Playground.
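On CPU, the GGUF files can also be driven from Python through the llama-cpp-python bindings (a separate project that tracks llama.cpp). This is a sketch only: it assumes a bindings build recent enough to include the lfm2moe support added in b6709+, and the local file name is illustrative, so adjust it to the actual Q4_0 download.

```python
# Sketch using the llama-cpp-python bindings (assumes lfm2moe support, i.e. a build
# tracking llama.cpp b6709 or later; the GGUF file name below is illustrative).
from llama_cpp import Llama

llm = Llama(model_path="LFM2-8B-A1B-Q4_0.gguf", n_ctx=4096, n_threads=8)
resp = llm.create_chat_completion(
    messages=[{"role": "user", "content": "List three on-device use cases for a sparse MoE model."}],
    max_tokens=200,
)
print(resp["choices"][0]["message"]["content"])
```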

Open Source and Impact: Promoting AI Accessibility at the Edge. LFM2-8B-A1B is open-sourced under the LFM Open License v1.0 (based on Apache 2.0), with weights and technical details published on Hugging Face (LiquidAI/LFM2-8B-A1B). The release lowers the barrier to AI deployment and brings new momentum to edge computing, from real-time private chat to embedded intelligent systems. AIbase Perspective: as cloud AI costs soar, efficient models like LFM2-8B-A1B are accelerating the trend toward "AI decentralization."

Project: https://huggingface.co/LiquidAI/LFM2-8B-A1B