Tencent has officially released HunyuanWorld-Voyager, a video diffusion framework that generates world-consistent 3D point clouds from a single input image, letting users explore scenes immersively along custom camera paths.
According to Tencent, this is the first ultra-long-range world model with native 3D reconstruction capabilities, redefining spatial intelligence for AI-driven VR, gaming, and simulation. The model not only generates accurately aligned depth and RGB video, but its output can also be used directly for high-quality 3D reconstruction without post-processing.
Direct 3D Output: Exports generated RGB-depth video to standard 3D point cloud formats without tools like COLMAP, enabling immediate 3D applications (see the sketch after this list).
Innovative 3D Memory: Introduces a scalable world cache mechanism that keeps geometry consistent along any camera trajectory.
Top Performance: Ranks first on Stanford's WorldScore benchmark and performs strongly on video generation and 3D reconstruction benchmarks.
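To make "direct 3D output" concrete, the sketch below unprojects one aligned RGB-depth frame into a colored point cloud and writes a PLY file that any standard 3D viewer can open. It is a minimal illustration assuming a simple pinhole camera; the intrinsics, function names, and toy data are placeholders, not Voyager's actual export code.

```python
import numpy as np

def unproject_rgbd(rgb, depth, fx, fy, cx, cy):
    """Unproject one aligned RGB-D frame into a colored point cloud.

    rgb:   (H, W, 3) uint8 image
    depth: (H, W) metric depth in meters (illustrative assumption)
    fx, fy, cx, cy: pinhole camera intrinsics
    """
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    z = depth
    x = (u - cx) * z / fx          # back-project pixels into camera space
    y = (v - cy) * z / fy
    pts = np.stack([x, y, z], axis=-1).reshape(-1, 3)
    cols = rgb.reshape(-1, 3)
    mask = pts[:, 2] > 0           # drop pixels with no valid depth
    return pts[mask], cols[mask]

def save_ply(path, pts, cols):
    """Write an ASCII PLY file readable by standard 3D tools."""
    with open(path, "w") as f:
        f.write("ply\nformat ascii 1.0\n")
        f.write(f"element vertex {len(pts)}\n")
        f.write("property float x\nproperty float y\nproperty float z\n")
        f.write("property uchar red\nproperty uchar green\nproperty uchar blue\n")
        f.write("end_header\n")
        for (x, y, z), (r, g, b) in zip(pts, cols):
            f.write(f"{x:.4f} {y:.4f} {z:.4f} {r} {g} {b}\n")

# Toy example: a flat 4x4 gray frame at 2 m depth
rgb = np.full((4, 4, 3), 200, dtype=np.uint8)
depth = np.full((4, 4), 2.0)
pts, cols = unproject_rgbd(rgb, depth, fx=2.0, fy=2.0, cx=2.0, cy=2.0)
save_ply("frame.ply", pts, cols)
```

Because the model emits metric depth aligned to each RGB frame, an unprojection step like this is all that separates the video output from a usable point cloud; no structure-from-motion pass is needed.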
The architecture of HunyuanWorld-Voyager comprises two key components. The first is "World-Consistent Video Diffusion," a unified architecture that generates accurately aligned RGB and depth video sequences conditioned on existing world observations, ensuring global scene consistency. The second is "Long-Range World Exploration," which uses an efficient world cache with point cloud culling and autoregressive inference to support iterative scene expansion, achieving smooth video sampling through context-aware consistency techniques.
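The cache-and-extend idea can be sketched in a few lines. The class below keeps generated points in a voxel hash and culls new points that land in already-occupied voxels; the voxel size, data shapes, and the generation loop outlined in the comments are illustrative assumptions, not Voyager's published implementation.

```python
import numpy as np

class WorldCache:
    """Hypothetical point-cloud world cache with voxel-based culling.

    The idea: keep every generated 3D point in a shared cache and cull
    new points that duplicate regions the cache already covers, so the
    scene geometry stays consistent as the camera keeps moving.
    """

    def __init__(self, voxel_size=0.05):
        self.voxel_size = voxel_size
        self.voxels = {}  # voxel index -> (xyz point, rgb color)

    def add_points(self, pts, cols):
        """Insert points, skipping voxels that are already occupied."""
        keys = np.floor(pts / self.voxel_size).astype(np.int64)
        kept = 0
        for key, p, c in zip(map(tuple, keys), pts, cols):
            if key not in self.voxels:   # the point-cloud culling step
                self.voxels[key] = (p, c)
                kept += 1
        return kept                      # number of genuinely new points

# Autoregressive exploration, in outline:
#   for each segment of the user's camera trajectory:
#     1. project the cached points into the new view as conditioning
#     2. run the RGB-D video diffusion model on that partial view
#     3. unproject the generated depth into 3D and add_points() it
cache = WorldCache(voxel_size=0.1)
pts = np.random.rand(1000, 3) * 5.0           # stand-in for generated geometry
cols = np.random.randint(0, 256, (1000, 3))
print(cache.add_points(pts, cols), "new points cached")
print(cache.add_points(pts, cols), "added on re-insert (all culled)")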
To train HunyuanWorld-Voyager, the research team built a scalable data construction engine: an automated video reconstruction pipeline that estimates camera poses and metric depth for any input video, eliminating manual annotation and enabling the construction of large-scale, diverse training data. Using this pipeline, the team combined real-world footage with Unreal Engine-rendered videos to build a dataset of more than 100,000 video clips.
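In outline, such a pipeline runs every raw clip through pose and depth estimators and keeps only clips whose reconstruction is reliable. The sketch below uses hypothetical placeholder estimators, since the article does not name the specific models the engine uses.

```python
from dataclasses import dataclass
import numpy as np

# Hypothetical placeholder estimators: the article does not name the
# pose tracker or depth model the real pipeline uses.

def estimate_poses(frames):
    """Stand-in for an SfM-style camera tracker: identity poses, fixed score."""
    return [np.eye(4) for _ in frames], 0.9

def estimate_metric_depth(frames, poses):
    """Stand-in for a metric depth estimator aligned to the pose scale."""
    return [np.ones(f.shape[:2]) for f in frames]

@dataclass
class TrainingClip:
    frames: list       # RGB frames
    poses: list        # per-frame 4x4 camera poses
    depths: list       # metric depth maps
    confidence: float  # reconstruction reliability score

def build_dataset(raw_clips, min_confidence=0.8):
    """Automated annotation: estimate poses and depth, keep reliable clips.

    No manual labeling anywhere, which is what makes this kind of
    pipeline scalable to 100,000+ clips.
    """
    dataset = []
    for frames in raw_clips:
        poses, conf = estimate_poses(frames)
        if conf < min_confidence:        # drop clips the tracker can't handle
            continue
        depths = estimate_metric_depth(frames, poses)
        dataset.append(TrainingClip(frames, poses, depths, conf))
    return dataset

# Toy run on two fake 2-frame clips of 8x8 RGB images
clips = [[np.zeros((8, 8, 3), dtype=np.uint8)] * 2 for _ in range(2)]
print(len(build_dataset(clips)), "clips annotated")
```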
In experimental evaluations, HunyuanWorld-Voyager delivered excellent video generation quality. Compared against four open-source camera-controllable video generation methods, it outperformed them on metrics such as PSNR, SSIM, and LPIPS. For scene reconstruction, videos generated by HunyuanWorld-Voyager also yielded better geometric consistency.
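For readers unfamiliar with the metrics, higher PSNR and SSIM and lower LPIPS all indicate generated frames closer to the reference. Here is a generic per-frame computation using scikit-image and the lpips package; it is a sketch of the metrics themselves, not the paper's evaluation harness.

```python
import numpy as np
import torch
import lpips                                    # pip install lpips
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

def frame_metrics(ref, gen, lpips_model):
    """PSNR / SSIM / LPIPS for one pair of uint8 HxWx3 frames.

    Higher PSNR and SSIM are better; lower LPIPS is better.
    """
    psnr = peak_signal_noise_ratio(ref, gen, data_range=255)
    ssim = structural_similarity(ref, gen, channel_axis=2, data_range=255)
    # LPIPS expects NCHW float tensors scaled to [-1, 1]
    to_t = lambda a: torch.from_numpy(a).permute(2, 0, 1)[None].float() / 127.5 - 1.0
    lp = lpips_model(to_t(ref), to_t(gen)).item()
    return psnr, ssim, lp

lpips_model = lpips.LPIPS(net="alex")           # standard AlexNet-based variant
ref = np.random.randint(0, 256, (64, 64, 3), dtype=np.uint8)
noise = np.random.randint(-5, 6, ref.shape)
gen = np.clip(ref.astype(int) + noise, 0, 255).astype(np.uint8)
print(frame_metrics(ref, gen, lpips_model))
```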
Additionally, HunyuanWorld-Voyager achieved the highest score on the static WorldScore benchmark, demonstrating its strength in camera motion control and spatial consistency. This result not only showcases the potential of the HunyuanWorld models but also paves the way for future 3D scene generation technology.
Key Points:
🌍 HunyuanWorld-Voyager generates world-consistent 3D point clouds from a single input image, enabling immersive user exploration.
🎥 The model simultaneously generates precisely aligned depth information and RGB videos, suitable for high-quality 3D reconstruction.
🏆 In multiple tests, HunyuanWorld-Voyager outperformed other models in both video generation quality and scene reconstruction effectiveness.