Information

Latest AI News

Explore AI Frontiers, Master Industry Trends

AI Daily Brief

Your Daily AI Brief - Never Miss What's Next

Information

AI Product Finder

Smart Product Discovery - Comprehensive Market Intelligence

AI Product Rankings

AI Product Power Rankings - Performance, Buzz & Trends

AI Product Submit

Submit Your AI Product - Amplify Reach & Drive Growth

Tools

AI Tools Directory

Discover The Best AI Websites & Tools

Information

AI Models Finder

Comprehensive AI Models Collection for All Your Development & Research Needs

Model Providers

Discover Trusted AI Model Partners - Guaranteed Reliable Support

Submit Your Model

Submit Your Model Info & Services - Precision Marketing & User Targeting

Tools

Compare LLMs

Multi-Dimensional Large Model Comparison - Find Your Perfect Match

LLM Cost Calculator

Calculate AI Model Costs Accurately - Optimize Your Budget

LLM Arena

Multi-Model Real-Time Evaluation & Quick Output Comparison

Information

MCP Servers

Discover Popular AI-MCP Services - Find Your Perfect Match Instantly

MCP Client

Easy MCP Client Integration - Access Powerful AI Capabilities

MCP Case Tutorials

Master MCP Usage - From Beginner to Expert

MCP Ranking

Top MCP Service Performance Rankings - Find Your Best Choice

MCP Service Submission

Publish & Promote Your MCP Services

Tools

MCP Playground

Test MCP Services Freely - Quick Online Experience

MCP Inspector

Quick MCP Service Testing - Fast Deployment

GEO Services

Achieve Dominant Visibility in AI Search for Your Business or Brand with GEO Services

Datasets

AI Compute

AI Tutorial

Google Launches EmbeddingGemma: An Efficient Text Embedding Model for Mobile Devices

AIbase基地

Published inAI News · 4 min read · Sep 8, 2025

Latest news from Google's Deep Learning team: they have officially launched EmbeddingGemma, an open-source embedding model designed for mobile devices. With its efficient design of 308 million parameters, EmbeddingGemma has been rated as the best multilingual text embedding model under 500M in the MTEB (Massive Text Embedding Benchmark), demonstrating powerful capabilities such as Retrieval-Augmented Generation (RAG) and semantic search, which can run directly on devices like smartphones without an internet connection.

EmbeddingGemma's superiority lies in its performance that can rival popular models almost twice its size. It is not only small but also flexible, suitable for various scenarios, supporting customizable output dimensions from 768 to 128, and featuring a 2000-context token window, allowing it to run on everyday devices such as smartphones, laptops, and desktops. In addition, it integrates with various popular tools, enabling users to easily collaborate with tools like sentence-transformers, MLX, and Ollama.

EmbeddingGemma performs exceptionally well in building RAG pipelines, capable of generating embeddings for text, converting text into numerical representations to represent its meaning in high-dimensional space. In a RAG pipeline, embeddings are first generated based on user input, and then their similarity with all document embeddings in the system is calculated, retrieving the most relevant passages. This high-quality embedding ensures that the final generated response is accurate and contextually relevant.

In addition, EmbeddingGemma has been carefully designed for speed and resource consumption, offering features such as being compact, fast, and efficient. Its embedding inference time is less than 15 milliseconds, allowing real-time interaction. Its offline functionality ensures the privacy and security of user data, making it particularly suitable for developing mobile device-based applications.

Developers can now use EmbeddingGemma to create personalized chatbots, perform file searches, or quickly fine-tune for specific domains. Whether in offline applications or server-side applications requiring efficient performance, EmbeddingGemma provides an ideal choice.

Official blog: https://developers.googleblog.com/en/introducing-embeddinggemma/

Key points:
🌟 EmbeddingGemma is an open-source embedding model with 308M parameters, specifically designed for mobile devices, and can run without an internet connection.
📱 It supports integration with multiple tools, flexibly adapting to various application scenarios to meet developers' needs.
🔒 Powerful offline functionality ensures user data security, enhancing privacy protection, and providing reliable support for mobile applications.

EmbeddingGemma AITerminology GoogleDeepLearning OpenSourceEmbeddingModel

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Koah Secures $5 Million in Funding, Focused on Introducing Advertising in AI Applications

Startup Koah raises $5M to integrate ads into AI chatbots, aiming to monetize as AI products expand globally.....

Sep 8, 2025

100

Perplexity AI Launches Copyright Revenue Sharing Program, Will Pay News Publishers

Perplexity AI launches a $42.5M publisher revenue-sharing plan, allowing media outlets to earn from content traffic, pioneering direct compensation in AI.....

Aug 26, 2025

Zhipu AI launches revolutionary product AutoGLM 2.0 - One sentence of voice can replace hands to control the entire web

Aug 21, 2025

200

DeepSeek Releases V3.1 Version, Doubling the Context Window to 128K Tokens

DeepSeek released DeepSeek-V3.1, expanding context window to 128k tokens while maintaining API compatibility, marking a milestone in open-source AI technology.....

Aug 19, 2025

450

Kunlun AI Open Sources 'Skywork UniPic 2.0' Model

On the third day of the SkyWork AI Technology Release Week, Kunlun AI Group officially announced the open sourcing of its latest developed 'Skywork UniPic 2.0' model. The release of this unified multimodal model marks another major breakthrough in the field of multimodal artificial intelligence. Skywork UniPic 2.0 is an efficient training and inference framework for unified multimodal modeling, which achieves lightweight generation and editing modules, as well as multi-modal

Aug 13, 2025

170

Claude Sonnet 4 model from Anthropic now supports up to 1 million tokens

Aug 13, 2025

120

LegalZoom Teams Up with OpenAI to Launch New AI Legal Assistant, Helping Users Access Legal Services Conveniently

Aug 6, 2025

100

Google AI Launches MLE-STAR: An Intelligent Machine Learning Engineering Agent to Assist with Automated Tasks

Aug 4, 2025

150

OpenAI Launches New Learning Assistant ChatGPT Study, Targeting Education Sector Users

OpenAI launches ChatGPT Study with interactive prompts and scaffolding responses for systematic learning across subjects. Available to all users, with an Edu version coming soon. Educators see potential to transform teaching methods.....

Jul 30, 2025

120

Unitree Launches the Affordable Humanoid Robot R1, Priced at Just $5,900

Chinese robot manufacturer Unitree has launched the world's first full-size humanoid robot R1 priced below $6,000, at only 39,900 yuan. This 25-kilogram robot features 26 joints and can perform high-difficulty movements such as backflips, equipped with a multi-modal AI system. Compared to similar products, the R1 is only one-third the price of Tesla's Optimus, thanks to China's supply chain advantages and lightweight design. Unitree's move comes as it prepares for an IPO, potentially triggering a price war in the industry. Although the R1 is not yet suitable for home use,

Jul 29, 2025

140

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

Model Providers

Submit Your Model

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Services​

Google Launches EmbeddingGemma: An Efficient Text Embedding Model for Mobile Devices

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Koah Secures $5 Million in Funding, Focused on Introducing Advertising in AI Applications

Perplexity AI Launches Copyright Revenue Sharing Program, Will Pay News Publishers

Zhipu AI launches revolutionary product AutoGLM 2.0 - One sentence of voice can replace hands to control the entire web

DeepSeek Releases V3.1 Version, Doubling the Context Window to 128K Tokens

Kunlun AI Open Sources 'Skywork UniPic 2.0' Model

Claude Sonnet 4 model from Anthropic now supports up to 1 million tokens

LegalZoom Teams Up with OpenAI to Launch New AI Legal Assistant, Helping Users Access Legal Services Conveniently

Google AI Launches MLE-STAR: An Intelligent Machine Learning Engineering Agent to Assist with Automated Tasks

OpenAI Launches New Learning Assistant ChatGPT Study, Targeting Education Sector Users

Unitree Launches the Affordable Humanoid Robot R1, Priced at Just $5,900

GEO Services