The Tencent Hunyuan 3D team officially announced the open-source release of WorldCompass, the world's first reinforcement learning (RL) post-training framework for world models. As the official RL extension module for Hunyuan World Model 1.5, the framework aims to significantly improve the accuracy and user experience of world models during interaction.

Current mainstream world models rely mainly on large-scale pre-training, but when facing complex composite action instructions from users, they often misinterpret them or execute them inaccurately. WorldCompass provides a new "compass" for solving this pain point.


By introducing a reinforcement learning mechanism, the framework deeply fine-tunes pre-trained models so that they can more accurately interpret and execute complex action instructions, avoiding the embarrassment of "not understanding" a command. Evaluation data shows that after applying WorldCompass, the open-source SOTA model WorldPlay saw its interaction accuracy (Acc_action) in the most difficult composite-action scenarios rise from about 20% to over 55%, an improvement of more than 35 percentage points.

In addition to enhancing action control, the framework also significantly improved the visual fidelity score (HPSv3), ensuring that the model maintains consistent visual quality during long-range, long-horizon exploration of virtual worlds. The Tencent Hunyuan team stated that the release of WorldCompass marks the formal transition of world models from a purely "pre-training era" to a "reinforcement learning fine-tuning era."
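The announcement does not disclose WorldCompass's actual training objective, but the two reported metrics suggest the general shape of such RL post-training: blend an action-accuracy signal with a visual-fidelity signal into one scalar reward, then use it to weight policy-gradient updates. The sketch below is a hypothetical illustration under that assumption; all function names, weights, and the REINFORCE-style baseline are illustrative, not the framework's real API.

```python
# Hypothetical sketch of reward shaping for RL post-training of a world
# model. Assumptions (not from the announcement): a binary action-accuracy
# signal, a visual-fidelity score in [0, 1] (e.g. a normalized HPSv3-like
# value), fixed blend weights, and a REINFORCE-style baseline.

def combined_reward(action_correct: bool, visual_score: float,
                    w_action: float = 0.7, w_visual: float = 0.3) -> float:
    """Blend instruction-following and visual-quality signals into one reward."""
    return w_action * (1.0 if action_correct else 0.0) + w_visual * visual_score

def advantage(reward: float, baseline: float) -> float:
    """Advantage used to scale log-prob gradients in a policy-gradient update."""
    return reward - baseline

# Example rollout: correct action, visual score 0.8, running baseline 0.5.
r = combined_reward(True, 0.8)   # ≈ 0.94
adv = advantage(r, 0.5)          # ≈ 0.44, positive => reinforce this rollout
```

In a real pipeline the advantage would multiply the log-probability gradient of the generated trajectory; a positive advantage pushes the model toward rollouts that both follow the action instruction and stay visually consistent.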

Currently, the relevant technologies of WorldCompass have been validated in the Hunyuan WorldPlay model. Tencent has fully open-sourced the related code and technical reports, aiming to provide a technical path for global developers to build more intelligent and controllable "generative world simulators."

Key Points

  • 🎯 Precision Control: Overcame the industry challenge of inaccurate execution of complex action instructions by world models, more than doubling accuracy in the hardest scenarios.

  • 🤖 Deep RL Empowerment: Demonstrated the significant fine-tuning potential of reinforcement learning for long-horizon, interactive world models.

  • 🌐 Full-Stack Open Source: Everything from code to technical reports is fully open, helping developers build more immersive interactive virtual environments.

  • 🚀 Era Shift: Moves the focus of world-model development from stacking data to refining interaction logic.