Recently, Google DeepMind teamed up with Brown University to develop a new technique called "force prompting." It generates realistic motion effects without 3D models or physics engines, marking a significant step forward in AI video generation.
With this technique, users can control AI-generated video content simply by specifying the direction and strength of a force. Force prompting supports both global forces (e.g., wind acting on the whole scene) and local forces (e.g., an impact at a specific point). The forces enter the system as vector fields, which the model converts into natural, fluid motion, greatly enhancing the realism and dynamism of the generated videos.
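The article does not spell out the exact encoding, but a force prompt can be pictured as a dense vector field over the video's spatial grid. Below is a minimal sketch in Python/NumPy under that assumption; the function `make_force_prompt`, the Gaussian falloff, and the first-frame-only local force are illustrative guesses, not the authors' actual parameterization.

```python
import numpy as np

def make_force_prompt(height, width, frames, mode="global",
                      angle_deg=0.0, strength=1.0, point=None, radius=0.1):
    """Build a per-frame 2D vector field encoding a force prompt.

    Hypothetical encoding for illustration only.
    """
    field = np.zeros((frames, height, width, 2), dtype=np.float32)
    direction = strength * np.array(
        [np.cos(np.radians(angle_deg)), np.sin(np.radians(angle_deg))]
    )
    if mode == "global":
        # Global force (e.g., wind): the same vector at every pixel and frame.
        field[...] = direction
    elif mode == "local":
        # Local force (e.g., a point impact): a vector concentrated around a
        # point, with Gaussian falloff, applied on the first frame only.
        ys, xs = np.mgrid[0:height, 0:width]
        cy, cx = point[0] * height, point[1] * width
        dist2 = ((ys - cy) / height) ** 2 + ((xs - cx) / width) ** 2
        field[0] = np.exp(-dist2 / (2 * radius**2))[..., None] * direction
    return field

# A steady 30-degree wind across all 49 frames, and a poke near the center.
wind = make_force_prompt(60, 90, 49, mode="global", angle_deg=30, strength=0.8)
poke = make_force_prompt(60, 90, 49, mode="local", point=(0.5, 0.4))
```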
The research team built on the CogVideoX-5B-I2V video model, adding a ControlNet module to handle the physical control data. The control signal is processed by the model's Transformer architecture, and each generated video consists of 49 frames. Training required only four Nvidia A100 GPUs and took just one day.
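As a rough illustration of the ControlNet pattern, i.e. a trainable side branch whose zero-initialized outputs are added to the activations of a frozen backbone, here is a sketch in PyTorch. The class `ForceControlNet`, the block count, and the dimensions are hypothetical and do not reproduce the paper's actual architecture.

```python
import torch
import torch.nn as nn

class ForceControlNet(nn.Module):
    """ControlNet-style adapter: transformer blocks consume the force field
    and emit per-block residuals for a frozen video backbone."""

    def __init__(self, hidden_dim=1024, force_channels=2, n_blocks=4):
        super().__init__()
        # Embed the raw force vectors into the backbone's hidden size.
        self.force_embed = nn.Linear(force_channels, hidden_dim)
        self.blocks = nn.ModuleList(
            nn.TransformerEncoderLayer(hidden_dim, nhead=8, batch_first=True)
            for _ in range(n_blocks)
        )
        # Zero-initialized projections, so conditioning starts as a no-op.
        self.zero_projs = nn.ModuleList(
            nn.Linear(hidden_dim, hidden_dim) for _ in range(n_blocks)
        )
        for proj in self.zero_projs:
            nn.init.zeros_(proj.weight)
            nn.init.zeros_(proj.bias)

    def forward(self, force_tokens):
        # force_tokens: (batch, seq_len, force_channels), a flattened field.
        h = self.force_embed(force_tokens)
        residuals = []
        for block, proj in zip(self.blocks, self.zero_projs):
            h = block(h)
            residuals.append(proj(h))  # added to the backbone's activations
        return residuals

net = ForceControlNet()
residuals = net(torch.randn(1, 256, 2))  # all zeros before any training
```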
Notably, the training data was entirely synthetic: 15,000 videos of flags waving under different wind conditions, 12,000 videos of rolling spheres, and 11,000 videos of flowers reacting to impacts. These synthetic datasets were enough for the model to learn correct force-motion relationships and to tie them to physical terms such as "wind" or "bubbles" in text prompts.
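One way to picture how such a dataset could be organized: each synthetic clip is rendered under a randomly sampled force condition. The sampler below is a sketch; only the clip counts come from the article, while `CATEGORIES`, `sample_condition`, and the parameter ranges are invented for illustration.

```python
import random

# Hypothetical manifest of the synthetic training categories.
CATEGORIES = {
    "flag_wind":     {"count": 15_000, "force": "global"},
    "rolling_ball":  {"count": 12_000, "force": "local"},
    "flower_impact": {"count": 11_000, "force": "local"},
}

def sample_condition(category, rng=random):
    """Draw a random force condition for one synthetic clip."""
    cond = {
        "category": category,
        "angle_deg": rng.uniform(0, 360),   # force direction
        "strength": rng.uniform(0.1, 1.0),  # normalized force magnitude
    }
    if CATEGORIES[category]["force"] == "local":
        # Local forces also need a point of application (relative coords).
        cond["point"] = (rng.random(), rng.random())
    return cond

manifest = [
    sample_condition(category)
    for category in CATEGORIES
    for _ in range(3)  # a few samples per category for illustration
]
```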
Despite the relatively limited amount of training data, the model demonstrated strong generalization, adapting to new objects, materials, and scenes, and even picking up simple physical rules, such as lighter objects moving farther than heavier ones under the same force.
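That rule follows directly from Newton's second law, a = F/m: under the same constant force, a smaller mass yields a larger acceleration, and hence a larger displacement over the same time. A quick check of the arithmetic:

```python
def displacement(force, mass, t):
    # Constant force from rest: x = 0.5 * (F / m) * t**2.
    return 0.5 * (force / mass) * t**2

# Same force and time, different masses: the lighter object travels farther.
print(displacement(2.0, 0.5, 1.0))  # 2.0 units
print(displacement(2.0, 2.0, 1.0))  # 0.5 units
```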
In user tests, force prompting outperformed baseline models that rely solely on text or motion-path control in both motion matching and realism, and it surpassed PhysDreamer, which is based on real physics simulation. Shortcomings remain in complex scenes, however: smoke sometimes fails to respond correctly to wind, and human arms occasionally move with an unnaturally light, fabric-like quality.
Demis Hassabis, CEO of Google DeepMind, has stated that the next generation of AI video models (such as Veo 3) is gradually coming to understand physical rules, moving beyond text and image processing to begin representing the physical structure of the world. This is seen as an important step toward more general AI, in which future systems could continuously improve through learning from experience in simulated environments.
Project page: https://force-prompting.github.io/
Key points:
🌟 The new technology "force prompting" can generate realistic motion videos without 3D models or physics engines.
⚙️ Users can produce natural, fluid motion simply by specifying the direction and strength of a force.
📈 The model demonstrates strong generalization capabilities, able to adapt to new scenes and objects.