Unlocking 3D Vision for Robots: Yueli Lingji Introduces the GeoVLA Framework, Revolutionizing Traditional VLA Models!
The Yueli Lingji team targets a weakness of existing vision-language-action (VLA) models: because they rely on 2D images, their spatial perception is insufficient in complex environments. The team proposes a new solution, the GeoVLA framework, aimed at enhancing robots' ability to judge depth and position in 3D space.