Google Releases Open-Source Gemma 4 12B Model: Encoder-Free Multimodal AI That Runs Locally on a 16GB Laptop

AIbase

Published in AI News · 3 min read · Jun 4, 2026

Google has officially released Gemma 4 12B, a new open-source large model that marks a breakthrough in edge-side multimodal AI. The model abandons the complex pipeline of traditional multimodal models, which rely on external visual and audio encoders, in favor of a "Unified" encoder-free architecture.

With this design, raw data from four modalities (text, images, audio, and video) is fed directly into a single Transformer backbone for integrated processing. This removes the extra memory usage and latency introduced by external "translation" modules and yields more native cross-modal understanding.
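To make the idea of a single shared input sequence more concrete, here is a minimal sketch in plain Python. It is not Gemma's actual implementation, which the article does not describe. It assumes one common way such a design could work: raw image patches and raw audio frames are each mapped by a simple linear projection into the same embedding size as the text tokens, then concatenated into one sequence for a single backbone. All function names, sizes, and weights below are hypothetical and chosen only for illustration.

```python
from typing import List

Vector = List[float]


def patchify_image(image: List[List[float]], patch: int) -> List[Vector]:
    """Split a 2-D grayscale image into flattened patch x patch blocks."""
    rows, cols = len(image), len(image[0])
    patches = []
    for r in range(0, rows - patch + 1, patch):
        for c in range(0, cols - patch + 1, patch):
            patches.append(
                [image[r + i][c + j] for i in range(patch) for j in range(patch)]
            )
    return patches


def frame_audio(samples: List[float], frame: int) -> List[Vector]:
    """Cut a raw waveform into non-overlapping frames of `frame` samples."""
    return [samples[i:i + frame] for i in range(0, len(samples) - frame + 1, frame)]


def project(vec: Vector, weights: List[Vector]) -> Vector:
    """Linear projection; `weights` holds one row per output dimension."""
    return [sum(w * x for w, x in zip(row, vec)) for row in weights]


def build_sequence(text_embeddings, image, audio, w_img, w_aud, patch, frame):
    """Concatenate text, image, and audio tokens into one sequence.

    Assumption: no separate vision or audio encoder runs first; each
    modality only passes through a single linear projection.
    """
    seq = list(text_embeddings)
    seq += [project(p, w_img) for p in patchify_image(image, patch)]
    seq += [project(f, w_aud) for f in frame_audio(audio, frame)]
    return seq


D_MODEL = 2  # toy embedding size (hypothetical)
text = [[0.1, 0.2], [0.3, 0.4]]                       # 2 text tokens
image = [[float(r * 4 + c) for c in range(4)] for r in range(4)]  # 4x4 image
audio = [0.0, 0.5, 1.0, 0.5, 0.0, -0.5]               # 6 raw samples
w_img = [[0.1] * 4 for _ in range(D_MODEL)]           # 2x2 patch -> D_MODEL
w_aud = [[0.1] * 3 for _ in range(D_MODEL)]           # 3-sample frame -> D_MODEL

sequence = build_sequence(text, image, audio, w_img, w_aud, patch=2, frame=3)
print(len(sequence))  # 2 text tokens + 4 image patches + 2 audio frames = 8
```

In this toy setup every token, whatever its modality, ends up as a vector of the same size, so one backbone can attend over all of them together. A real model would use learned weights and positional information that this sketch leaves out.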

As an edge-side model optimized for consumer hardware, Gemma 4 12B is notably parameter-efficient. In benchmark tests, its scores are close to those of Google's own 26B-scale model, while its memory usage is less than half. The model offers a 256K-token context window, supports more than 140 languages, and includes a Thinking mode for enhanced step-by-step reasoning as well as native Function Calling.

In terms of deployment, the model runs with as little as 16GB of VRAM or unified memory, and with as little as 8GB after 4-bit quantization. Its core goal is efficient local execution on ordinary laptops. The Google AI Edge Gallery has now expanded from mobile devices to the desktop, so macOS users can download and install it to run Gemma 4 12B locally. With the built-in sandboxed Python environment and the Eloquent system for voice interaction, users can execute code, draw charts, and hold smooth voice conversations directly within the chat interface.
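The memory figures above can be sanity-checked with some back-of-the-envelope arithmetic. The sketch below estimates only the space taken by the weights of a 12-billion-parameter model at different precisions. It ignores the KV cache, activations, and runtime overhead, so real usage will be higher. The article does not say which precision corresponds to the 16GB figure; the bit widths listed here are illustrative assumptions, and the helper name is hypothetical.

```python
def weight_memory_gb(num_params: float, bits_per_param: int) -> float:
    """Weights-only footprint in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


PARAMS = 12e9  # 12 billion parameters, as stated for Gemma 4 12B

for bits in (16, 8, 4):
    print(f"{bits}-bit weights: {weight_memory_gb(PARAMS, bits):.1f} GB")
```

Under these assumptions, 4-bit weights come to about 6 GB, which is consistent with the 8GB figure once some headroom for the cache and runtime is added, while 16-bit weights alone (about 24 GB) would not fit in 16GB.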

Industry analysts believe that the release of Gemma 4 12B further accelerates the decentralization of AI. Its high performance density and edge-side compatibility reduce dependence on cloud computing power and pave the way for edge-side multimodal personal assistants that combine low latency with privacy and security.

Gemma 4 12B · AI Neologism · Edge-side Multimodal AI · Unified Architecture

This article is from AIbase Daily

AI News Recommendations

Apple Mac Experiences a Big Surge: 16GB of RAM Can Run Google's Gemma 4 Flagship Model Locally!

The Google AI Edge Gallery app has officially launched on macOS, allowing Mac users to run the Gemma series of AI models offline. This application does not require an internet connection, which can improve response speed and ensure data privacy. Users can perform intelligent conversations, image processing, and semantic understanding offline.

Jun 4, 2026

16GB Memory, Local Instant Response! Google Releases Gemma 4 12B Revolutionary Encoder-Free Architecture Ignites Open Source Community

Google releases a new multimodal model Gemma 4 12B, revolutionizing the traditional architecture by eliminating the separate encoder component, achieving efficient local deployment and inference on consumer-level hardware. This breakthrough significantly reduces the computational complexity of multimodal models, improves processing speed, and marks a new stage in the open source large model ecosystem.

Jun 4, 2026

16GB Memory Runs 12B Multimodal Model Directly! Google AI Edge Gallery Launches on Mac, Bringing a New Surge in Productivity

Google AI Edge Gallery is now available on macOS, allowing Mac users to run generative large models locally without relying on cloud computing power, enabling private chats, image processing, and semantic understanding. Unlike general platforms such as Ollama, this application takes a deep vertical approach, further lowering the barrier for local AI usage.

Jun 4, 2026

Google Launches New Gemma 4 12B Model: Easily Handle Visual and Audio Data Without an Encoder

Google released the Gemma 4 12B multimodal model, which has 12 billion parameters and innovatively eliminates traditional encoders, allowing direct processing of visual and audio data. This model requires only 16GB of VRAM and can run locally on high-end laptops without relying on cloud resources.

Jun 4, 2026

ByteDance Open Sources Bernini Framework: Achieving Perfect Unity in Video Generation and Precise Editing

ByteDance's commercialization tech team open-sourced Bernini, a video generation and editing framework that uses an 'understand first, generate later' collaborative mechanism to address the frame instability and flicker caused by complex instructions. In internal tests, Bernini ranks among top-tier models. Inference code and Bernini-R model access are now open, with the full version upcoming...

Jun 4, 2026

Google Releases Gemma 4 E2B Architecture, Enabling Local AI on Phones to Achieve a Breakthrough

Google DeepMind released the open-source large model Gemma 4. Although the parameter scale remains around 30 billion, the 'intelligent density per parameter' has significantly improved, making its performance comparable to top closed-source models from 1.5 years ago. Its core breakthrough lies in introducing the 'E2B' (parameter offloading) architecture, marking a significant upgrade in the underlying architecture of open-source large models.

Jun 3, 2026

Spending 75 Billion Euros! SoftBank Fully Invests in European Computing Power, Plans to Build a Super Data Center Cluster in France

SoftBank Group announces an investment of up to €75 billion ($87 billion) in France to massively expand data center capacity, targeting an additional 5 GW of computing infrastructure. The project will be phased, with the first phase already initiated, marking a major commitment to global AI infrastructure.....

Jun 2, 2026

Large Model Agents Say Goodbye to Blind Stacking! Chinese University of Hong Kong Team Releases SLIM Framework to Dynamically Manage the Lifecycle of External Skills

The Chinese University of Hong Kong team proposed the SLIM framework to address skill management challenges in evolving large language model agents from conversational to task-oriented capabilities. It enables dynamic skill lifecycle management, breaking the industry's blind skill accumulation, offering a new approach for efficient external ability management.....

Jun 1, 2026

Exceeding GPT-5.5! Domestic AI Large Model MiniMax M3 Officially Released

MiniMax M3, a new open-source model from Xiyu Technology, features cutting-edge programming capabilities, 1M ultra-long context, and native multimodal abilities (image, video input, and desktop operation), making it the first domestic model to integrate these three core features. It leads in multiple metrics on the SWE-Bench programming benchmark.....

Jun 1, 2026

Huawei AI Glasses Officially Launched at 2499 CNY, Self-Developed Chip + Xiaoyi Intelligent Entity Trigger Edge Computing

Huawei AI Glasses were officially launched on June 1st through all channels, with a titanium silver gray color option, priced at 2499 CNY. The product uses lightweight materials and precise stacking technology to compress the temple thickness to 6.25 millimeters, solving the problem of bulkiness in smart glasses, marking a new stage for smart wearable devices toward "Embodied Intelligence".

Jun 1, 2026
