Recently, a benchmarking platform named RoboChallenge was officially launched, aiming to provide the first large-scale, multi-task, and evaluation standard for robotic tasks performed by real robots in real physical environments.
RoboChallenge was jointly initiated by Dexmal PowerMind and Hugging Face. The core value of this testing platform lies in overcoming challenges in existing robot benchmark tests, such as performance validation in real environments, standardized testing conditions, and publicly accessible testing platforms.
This benchmark test will provide a more reliable and comparable evaluation standard for the practical application of Visual Language Action models (VLAs) in robots, thereby accelerating the deployment and verification process of VLA models from simulated environments to real physical worlds.