On January 30, following its earlier "three consecutive launches" of a spatial perception model, an embodied large model, and a world model, Ant Lingbo Technology announced the open-source release of its embodied world model LingBot-VA. LingBot-VA introduces a novel autoregressive video-action world-modeling framework that deeply integrates large-scale video generation models with robot control. The model generates the next world state while directly simulating and outputting the corresponding action sequence, enabling robots to "simulate, then act" the way humans do.

In real-robot evaluations, LingBot-VA demonstrated strong adaptability to complex physical interactions. Across six challenging tasks in three categories - long-horizon tasks (making breakfast, picking up screws), high-precision tasks (inserting test tubes, opening packages), and manipulation of deformable and articulated objects (folding clothes, folding pants) - it needed only 30-50 real-robot demonstrations to adapt, and its task success rate was on average 20% higher than the strong industry baseline Pi0.5.

(Figure caption: In real-robot evaluations, LingBot-VA outperformed the industry baseline Pi0.5 on multiple difficult manipulation tasks)

In simulation evaluations, LingBot-VA became the first model to exceed a 90% success rate on RoboTwin 2.0, a high-difficulty dual-arm collaborative manipulation benchmark, and reached an average success rate of 98.5% on the lifelong learning benchmark LIBERO, setting new industry records on both.

(Figure caption: LingBot-VA surpasses the current SOTA on the LIBERO and RoboTwin 2.0 simulation benchmarks)

According to the team, LingBot-VA adopts a Mixture-of-Transformers (MoT) architecture to fuse video processing and action control across modalities. Through a closed-loop simulation mechanism, the model incorporates real-time feedback from the physical world at every generation step, keeping the generated frames and actions consistent with physical reality and enabling robots to complete complex, difficult tasks.
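
As an illustration only (not the released implementation), the minimal PyTorch sketch below shows what an MoT-style video-action block can look like: video and action tokens share one self-attention layer but are routed to modality-specific feed-forward experts, and a toy closed-loop rollout feeds the real observation, rather than the imagined frame, back into the model at each step. All module names, dimensions, and the rollout loop are assumptions.

```python
import torch
import torch.nn as nn


class MoTBlock(nn.Module):
    """Sketch of a Mixture-of-Transformers block: shared self-attention,
    per-modality feed-forward experts (video vs. action tokens)."""

    def __init__(self, dim: int = 256, heads: int = 4):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim)
        self.experts = nn.ModuleDict({
            "video": nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim)),
            "action": nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim)),
        })

    def forward(self, tokens, modality):  # modality: 0 = video token, 1 = action token
        # Shared attention lets video and action tokens exchange information.
        h = self.norm1(tokens)
        attn_out, _ = self.attn(h, h, h, need_weights=False)
        tokens = tokens + attn_out
        # Route each token to the feed-forward expert of its own modality.
        h = self.norm2(tokens)
        out = torch.zeros_like(h)
        for idx, name in enumerate(["video", "action"]):
            sel = modality == idx
            out[sel] = self.experts[name](h[sel])
        return tokens + out


# Toy closed-loop rollout: at each step the model produces next-frame tokens
# and an action chunk; after the chunk is executed, the *observed* frame
# (not the imagined one) is encoded and fed back, keeping generation grounded.
if __name__ == "__main__":
    dim, n_video, n_action = 256, 16, 8
    block = MoTBlock(dim)
    modality = torch.cat([torch.zeros(n_video), torch.ones(n_action)]).long().unsqueeze(0)

    obs_tokens = torch.randn(1, n_video, dim)           # encoded current camera frame
    for step in range(3):
        action_queries = torch.zeros(1, n_action, dim)  # learned queries in practice
        tokens = block(torch.cat([obs_tokens, action_queries], dim=1), modality)
        pred_frame, pred_actions = tokens[:, :n_video], tokens[:, n_video:]
        # ...decode and execute pred_actions on the robot, then re-encode the
        # real camera image as the next input (random stand-in below):
        obs_tokens = torch.randn(1, n_video, dim)
```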

To overcome the computational bottleneck of running a large video world model on robot edge devices, LingBot-VA uses an asynchronous inference pipeline that parallelizes action prediction and motor execution; it also introduces a persistence mechanism built on a memory cache, together with a noisy-history augmentation strategy, so that stable, precise action commands can be produced with fewer generation steps at inference time (see the sketch below). Together, these optimizations give LingBot-VA both the deep understanding of a large model and the low-latency response required for real-robot control.
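
A hypothetical sketch of such an asynchronous pipeline is shown below: a background thread predicts the next action chunk while the main loop is still driving the motors with the current chunk, and a persistent cache object carried across calls stands in for the memory cache. `predict_chunk` and `execute_on_robot` are placeholder functions, not APIs from the release; note that the next chunk is necessarily predicted from an observation captured before the current chunk finishes executing, which is the latency-hiding trade-off such pipelines make.

```python
import queue
import threading
import time


def predict_chunk(observation, cache):
    """Stand-in for the world model: given the latest observation and a
    persistent cache (e.g. cached memory/context), return an action chunk."""
    time.sleep(0.05)                            # pretend inference latency
    chunk = [f"action[{observation}][{i}]" for i in range(4)]
    return chunk, cache                         # cache would be updated in reality


def execute_on_robot(chunk):
    """Stand-in for streaming the chunk to the motor controllers."""
    for _action in chunk:
        time.sleep(0.02)                        # pretend per-action execution time
    return f"obs_after({chunk[-1]})"            # new camera observation


def inference_worker(obs_q, chunk_q):
    cache = None                                # persists across prediction calls
    while True:
        obs = obs_q.get()
        if obs is None:                         # shutdown signal
            return
        chunk, cache = predict_chunk(obs, cache)
        chunk_q.put(chunk)


if __name__ == "__main__":
    obs_q, chunk_q = queue.Queue(), queue.Queue()
    threading.Thread(target=inference_worker, args=(obs_q, chunk_q), daemon=True).start()

    obs = "obs_0"
    obs_q.put(obs)
    chunk = chunk_q.get()                       # first chunk: nothing to overlap with yet
    for step in range(5):
        obs_q.put(obs)                          # start predicting the next chunk now...
        obs = execute_on_robot(chunk)           # ...while the current chunk is executed
        chunk = chunk_q.get()                   # next chunk is (almost) ready
    obs_q.put(None)                             # stop the worker
```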

Ant Lingbo stated that, following its earlier open-source releases of LingBot-World (simulation environment), LingBot-VLA (intelligent base), and LingBot-Depth (spatial perception), LingBot-VA charts a new path in which the world model empowers embodied manipulation. Ant Group will continue to work through the InclusionAI community to advance open source and industry collaboration, building foundational capabilities for embodied intelligence and accelerating an open-source, deeply integrated AGI ecosystem that serves real industrial scenarios.

Currently, the model weights and inference code of LingBot-VA are fully open-sourced.