NVIDIA Research has officially released the Lyra2.0 framework on the Hugging Face platform, marking a new milestone in AI-generated 3D world-building technology. Starting from a single input image, Lyra2.0 can generate large-scale 3D scenes that are persistent, consistent, and freely explorable, supporting real-time rendering, robot simulation, and immersive applications.

AIbase editors believe that this release not only advances the spatiotemporal consistency of video generation models but also provides a practical asset pipeline for physical AI, game development, and virtual environment construction.


Core Challenges and Breakthroughs: Saying Goodbye to Spatial Forgetting and Temporal Drift

Traditional long-horizon video generation models often suffer from "spatial forgetting" (the model cannot remember details of previously generated regions, producing inconsistent scenes) and "temporal drift" (object positions and appearances gradually shift over time), both of which severely degrade subsequent 3D reconstruction.

Lyra2.0 addresses these two major issues with innovative solutions:

  • Spatial Memory Mechanism: The system maintains 3D geometric information for each frame but uses it only for information routing: retrieving relevant historical frames and establishing dense correspondences. Appearance synthesis still relies on strong generative priors, which avoids the accumulation of geometric errors.
  • Self-Enhancing Training Strategy: During training, the model is exposed to its own degraded outputs, teaching it to actively correct drift rather than continue propagating it, thereby achieving longer 3D consistent video trajectories.
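The self-enhancing strategy described above resembles scheduled sampling: during training, some context frames are swapped for the model's own imperfect re-generations so it learns to correct drift rather than propagate it. The paper's exact mechanism is not detailed here, so the following is only a toy sketch; the function name, the `regenerate` callback, and the mixing probability are illustrative assumptions.

```python
import random

def self_enhanced_batch(clean_frames, regenerate, p_degrade=0.5, seed=0):
    """Toy sketch: build a training context where some frames are replaced
    by the model's own (possibly drifted) re-generations, so the model is
    trained to correct drift instead of continuing to propagate it."""
    rng = random.Random(seed)
    context = []
    for frame in clean_frames:
        if rng.random() < p_degrade:
            context.append(regenerate(frame))  # model's own imperfect output
        else:
            context.append(frame)              # ground-truth frame
    return context

# Stand-in "model" whose outputs drift slightly from ground truth.
drifty = lambda x: x + 0.1
mixed = self_enhanced_batch([0.0, 1.0, 2.0, 3.0], drifty, p_degrade=0.5, seed=0)
```

In a real training loop, `regenerate` would be the video model itself run in inference mode; the key idea is simply that the training distribution includes the model's own degraded outputs.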

Through this two-stage design, Lyra2.0 can generate long video sequences from a single image and a user-defined camera trajectory, then reliably lift them into high-quality 3D Gaussian splatting or mesh models that support real-time rendering and further simulation.
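The "information routing" role of the spatial memory can be pictured as a retrieval step: the stored 3D geometry decides which past frames are relevant to the current viewpoint, while the generator handles appearance. This is a minimal sketch under that assumption; ranking by camera-position distance is an illustrative simplification, not Lyra's actual correspondence mechanism.

```python
import math

def retrieve_context(history, cam_pos, k=2):
    """Toy stand-in for geometric information routing: rank stored frames
    by camera-position distance and return the k most relevant ones to
    condition generation on.  Appearance itself would still come from the
    generative model, not from this memory."""
    ranked = sorted(history, key=lambda f: math.dist(f["cam"], cam_pos))
    return ranked[:k]

history = [
    {"id": 0, "cam": (0.0, 0.0, 0.0)},
    {"id": 1, "cam": (5.0, 0.0, 0.0)},
    {"id": 2, "cam": (1.0, 0.0, 0.0)},
]
nearest = retrieve_context(history, cam_pos=(0.5, 0.0, 0.0), k=2)
```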

Usage Process: From Image to Explorable 3D World

  1. Input an image (optional with text prompts);
  2. Define the camera movement trajectory through an interactive 3D browser;
  3. The model autoregressively generates long, camera-controlled video clips;
  4. Lift the video sequence into an explicit 3D representation (point cloud, Gaussian splats, or mesh) and use it for continuous navigation;
  5. Finally, export assets directly usable in environments like Unity, Unreal, and Isaac Sim.
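Step 2 above, defining a camera trajectory, can be sketched as a simple list of camera poses. The pose format below (position plus look-at target on a circular orbit) is a hypothetical illustration; Lyra2.0's interactive browser and actual trajectory format may differ.

```python
import math

def orbit_trajectory(n_frames=8, radius=3.0, height=1.5):
    """Sample a circular camera path around the scene origin -- the kind
    of user-defined trajectory a camera-conditioned video model takes as
    input.  Illustrative format only."""
    poses = []
    for i in range(n_frames):
        theta = 2.0 * math.pi * i / n_frames
        poses.append({
            "position": (radius * math.cos(theta), height, radius * math.sin(theta)),
            "look_at": (0.0, 0.0, 0.0),  # every frame looks at the scene center
        })
    return poses

traj = orbit_trajectory(n_frames=8)
```

A denser trajectory (more frames, smaller angular steps) would correspond to a slower, smoother camera move in the generated video.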

Experiments show that Lyra2.0 outperforms several existing methods, including GEN3C, CaM, and Yume-1.5, on long video generation and 3D scene reconstruction metrics, particularly in scene scale and consistency. Generated scenes can span tens of meters, letting users freely return to previously visited areas, look around, and even deploy robots for real-time interaction.

Open Source and Application Value: Accelerating Physical AI and Virtual World Development

The model weights of Lyra2.0 are now available on Hugging Face (nvidia/Lyra-2.0), and the code repository is on GitHub (nv-tlabs/lyra), under the Apache 2.0 license, which permits commercial use. The underlying video backbone builds on powerful diffusion models such as Wan-14B, and the reconstruction stage integrates tools like Depth Anything V3, ensuring high-quality, practical output.

This framework is particularly suitable for:

  • Embodied AI and robot training: generating consistent simulation environments directly imported into Isaac Sim;
  • Games and Immersive Content: rapidly building exploratory virtual worlds;
  • 3D Asset Generation Pipeline: going from concept art to editable meshes in a single pass.

Compared to earlier versions, Lyra2.0 has made significant progress in scene persistence and scalability, paving the way for "world models" to move from demonstration to practical assets.

AIbase Editors' Comments: NVIDIA's latest open-source release not only demonstrates technical breakthroughs in spatiotemporal modeling with generative AI but also reflects the industry's ongoing commitment to open ecosystems. As tools like Lyra2.0 become more widespread, developers will be able to build large-scale, interactive 3D worlds more efficiently, accelerating the deployment of applications in robotics, autonomous driving, and the metaverse.

The project page, paper, and model are all publicly available. Interested developers can immediately visit Hugging Face and GitHub to experience them.

Paper URL: https://huggingface.co/papers/2604.13036

Model URL: https://huggingface.co/nvidia/Lyra-2.0