Google DeepMind has launched a new robot AI model, Gemini Robotics On-Device, sparking industry discussion. The model marks a breakthrough in robot AI with its fully on-device operation, strong task adaptability, and few-shot learning capability. AIbase has compiled the latest publicly available information for an in-depth look at the model's innovations and its potential impact on the robotics industry.
Fully On-Device Operation: Free from Cloud Constraints
The biggest highlight of Gemini Robotics On-Device is that it runs entirely on the robot's local hardware, with no reliance on cloud computing. This eliminates the latency and connection-stability problems of traditional cloud-dependent robots, making it especially suitable for network-constrained settings such as factories, warehouses, and remote areas. According to Google DeepMind, the model running locally still approaches the performance of the cloud-based Gemini Robotics model, demonstrating strong computational efficiency and reliability.
Multi-Task Capabilities: From Zipping a Jacket to Folding Clothes
The model integrates vision, language, and action control into a single multimodal system. It interprets human intent from natural-language instructions and converts it into precise robotic actions. In demonstrations, the robot completed complex tasks such as zipping a jacket, pouring liquid, and folding clothes, and it also performed well in unfamiliar scenarios, such as assembly on an industrial production line. Google DeepMind reports that the model performs particularly well on dual-arm robots, such as the Franka FR3 and the Apollo humanoid, demonstrating general dexterity and task generalization.
Few-Shot Learning: Get Started with 50-100 Demonstrations
Another major innovation of Gemini Robotics On-Device is few-shot adaptation: developers can adapt the robot to a new task with as few as 50 to 100 task demonstrations. This efficient fine-tuning builds on the model's Gemini 2.0-based architecture, which combines strong visual perception, semantic understanding, and action generation. Google DeepMind has also released the Gemini Robotics SDK, which lets developers test the model in the MuJoCo physics simulator and obtain access through the Trusted Tester program, greatly lowering the barrier to deploying robot AI.
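The 50-to-100-demonstration figure describes imitation-style fine-tuning: a policy that maps observations to actions is learned from a small set of teacher demonstrations. The sketch below is not the Gemini Robotics SDK (whose API is available only to Trusted Testers); it is a minimal, hypothetical behavior-cloning illustration in NumPy, with a linear policy and synthetic data standing in for the actual vision-language-action model.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in for ~100 recorded demonstrations:
# each pairs an observation vector with the teacher's action vector.
n_demos, obs_dim, act_dim = 100, 8, 4
true_W = rng.normal(size=(obs_dim, act_dim))          # unknown "teacher" mapping
observations = rng.normal(size=(n_demos, obs_dim))
actions = observations @ true_W + 0.01 * rng.normal(size=(n_demos, act_dim))

# Behavior cloning in its simplest form: fit a linear policy
# to the demonstrations by least squares.
W, *_ = np.linalg.lstsq(observations, actions, rcond=None)

# The fitted policy can now act on an unseen observation.
new_obs = rng.normal(size=obs_dim)
predicted_action = new_obs @ W
```

In practice, the SDK fine-tunes the full multimodal model rather than a linear map, and candidate policies are evaluated in the MuJoCo simulator before any hardware deployment; the sketch only illustrates why a few dozen demonstrations can suffice when the underlying model already carries strong perception and understanding.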
Industry Prospects: Redefining Robot Applications
The release of Gemini Robotics On-Device marks a new stage for robot AI, moving toward "usability, deployability, and generalizability." Its on-device operation and few-shot learning not only reduce deployment costs for enterprises but also promote the wider adoption of robotics in manufacturing, logistics, security, and other fields. However, the model's generalization ability and safety in complex environments still need further verification. AIbase believes that with continued optimization by Google DeepMind, this technology has the potential to reshape the robotics industry.
Google DeepMind's Gemini Robotics On-Device demonstrates a breakthrough in robot AI through its on-device operation, multi-task capability, and few-shot learning. From zipping a jacket to industrial assembly, the model gives robots unprecedented flexibility and intelligence. With the release of the SDK and further iteration, robots may become indispensable "versatile assistants" across industries.