Say Goodbye to Blurriness! NVIDIA Launches ViPE Engine for High-Precision 3D Data in Spatial AI

AIbase基地

Published inAI News · 4 min read · Sep 16, 2025

Recently, NVIDIA, in collaboration with the University of Toronto, the Vector Institute, and the University of Texas at Austin, has released a groundbreaking technology called **ViPE (Video Pose Engine)**. ViPE aims to address key challenges in 3D geometric perception, specifically how to efficiently and accurately extract 3D information from complex natural videos.

Technology Core and Applications

3D geometric perception is essential for various modern technologies such as autonomous driving, virtual reality (VR), and augmented reality (AR). ViPE innovatively extracts intrinsic camera characteristics, motion information, and high-precision depth maps from raw videos quickly, providing a reliable data foundation for these spatial AI systems.

ViPE is highly adaptable and can handle various scenarios and camera types, including dynamic selfies, movie shots, dash cam footage, and pinhole, wide-angle, and 360° panoramic camera models.

Working Principle and Performance

The research team used a hybrid method with multiple constraints to ensure the high accuracy of ViPE:

Bundle Adjustment: Conduct dense bundle adjustment on key frames to estimate camera parameters, pose, and depth maps.
Dense Flow and Sparse Point Constraints: Introduce dense flow constraints from the DROID-SLAM network and sparse point constraints from the cuvslam library to ensure robustness and sub-pixel accuracy.
Depth Regularization: Utilize monocular metric depth networks to address scale ambiguity and consistency issues, generating high-resolution and temporally consistent depth information.

Test results show that ViPE outperforms existing technologies (such as MegaSAM, VGGT, and MASt3R-SLAM) in multiple benchmarks. It not only performs well in pose and intrinsic function accuracy but also runs stably at 3 to 5 frames per second on a single GPU and successfully generates scale-consistent trajectories.

To further advance research in the field of spatial AI, the team also released a large dataset containing approximately 96 million annotated frames, offering valuable resources for future technological exploration. The release of ViPE marks an important advancement in 3D geometric perception technology and lays a solid foundation for future spatial AI applications.

Address: https://research.nvidia.com/labs/toronto-ai/vipe/

New Breakthrough in Industrial Quality Inspection: Hikvision Launches AI Quality Inspector to Accurately Solve Packaging Error-Proofing Issues

Hangzhou Hikvision has launched an AI quality inspection system based on its self-developed "Guanlan" industrial large model, which uses intelligent visual inspection to solve issues of wrong or missing parts in the packaging stage of manufacturing. The system can accurately identify the type and quantity of components, and immediately sound an alarm when abnormalities are detected, significantly improving the efficiency and accuracy of quality inspection.

Google issues warning: Strengthening AI content regulation could lead to the collapse of its search engine business

Google warned that if regulatory authorities impose excessive restrictions on AI content scraping, its search engine business could face a devastating impact. This statement comes in response to proposed new regulations by the UK antitrust authority, which aim to give publishers more control over how Google's AI search functions use their content.

Yuchu Open Sources UnifoLM-VLA-0 Large Model: Injecting Physical Common Sense into General-Purpose Humanoid Robots

Yuchu open sources the UnifoLM-VLA-0 large model, specifically designed for general-purpose humanoid robots, achieving deep integration of vision, language, and action. The model breaks through the limitations of traditional vision-language models by pre-training on robot operation data, advancing the robot's brain from text and image understanding to embodied intelligence with physical common sense.

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Brand Visibility

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Services​

AI Model Compatibility Checker

AI Deployment Calculator

Say Goodbye to Blurriness! NVIDIA Launches ViPE Engine for High-Precision 3D Data in Spatial AI

AIbase基地

Technology Core and Applications

Working Principle and Performance

This article is from AIbase Daily

AI News Recommendations

Game Developers Vote Against: More Than Half of Industry Professionals Are Pessimistic About Generative AI

New Breakthrough in Industrial Quality Inspection: Hikvision Launches AI Quality Inspector to Accurately Solve Packaging Error-Proofing Issues

Google issues warning: Strengthening AI content regulation could lead to the collapse of its search engine business

Haier Smart Home Leads in Intelligent Manufacturing and Wins the IDC China AI Digital Factory Leader Award

Ant Group Launches LingBot-VLA: Dual-Arm Robot Control Enters the Era of Large Models

AI Screening Shows Remarkable Power: A Swedish Study of 100,000 People Confirms a 12% Reduction in Breast Cancer Missed Diagnoses

New Era of Deep Interaction: Samsung Announces Multi-Modal AI Smart Glasses Launch in 2026

Yuchu Open Sources UnifoLM-VLA-0 Large Model: Injecting Physical Common Sense into General-Purpose Humanoid Robots

Tech Giants Compete to Invest in OpenAI, Planning to Raise Up to $60 Billion

Claiming $3 Billion: Anthropic Sued by Music Giants for Allegedly Illegally Downloading 20,000 Songs

AI News Recommendations

Game Developers Vote Against: More Than Half of Industry Professionals Are Pessimistic About Generative AI

New Breakthrough in Industrial Quality Inspection: Hikvision Launches AI Quality Inspector to Accurately Solve Packaging Error-Proofing Issues

Google issues warning: Strengthening AI content regulation could lead to the collapse of its search engine business

Haier Smart Home Leads in Intelligent Manufacturing and Wins the IDC China AI Digital Factory Leader Award

Ant Group Launches LingBot-VLA: Dual-Arm Robot Control Enters the Era of Large Models

AI Screening Shows Remarkable Power: A Swedish Study of 100,000 People Confirms a 12% Reduction in Breast Cancer Missed Diagnoses

New Era of Deep Interaction: Samsung Announces Multi-Modal AI Smart Glasses Launch in 2026

Yuchu Open Sources UnifoLM-VLA-0 Large Model: Injecting Physical Common Sense into General-Purpose Humanoid Robots

Tech Giants Compete to Invest in OpenAI, Planning to Raise Up to $60 Billion

Claiming $3 Billion: Anthropic Sued by Music Giants for Allegedly Illegally Downloading 20,000 Songs

GEO Services