World Models Enter the Era of Fine-Tuning: Tencent Opensources the Reinforcement Learning Post-Training Framework WorldCompass
Tencent's Hunyuan 3D team open-sourced WorldCompass, a reinforcement learning post-training framework designed to enhance world models' accuracy and user experience in interactions by addressing biases in handling complex instructions.....