Today, Meituan's LongCat team officially released its new video generation model, LongCat-Video. With its ability to faithfully reconstruct how the real world behaves over time, the model marks a significant advance in Meituan's exploration of world models. A world model is a core engine for the next generation of artificial intelligence, helping AI better understand, predict, and reconstruct the dynamics of the real world.

LongCat-Video is built on a Diffusion Transformer (DiT) architecture and unifies text-to-video, image-to-video, and video continuation in a single model. It distinguishes these tasks simply by the number of conditioning frames it receives, which lets one backbone generate well under different input conditions. In text-to-video generation, LongCat-Video outputs high-definition 720p video at 30 fps, with semantic understanding and visual quality that lead the open-source field. In image-to-video generation, it strictly preserves the attributes and style of the reference image while producing natural, smooth motion.
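To make the task-routing idea concrete, here is a minimal Python sketch of how a single backbone might select among the three tasks by conditioning-frame count. The helper name, tensor shapes, and exact rule are illustrative assumptions, not the released API.

```python
from typing import Optional, Tuple

import torch

def route_task(cond_frames: Optional[torch.Tensor]) -> Tuple[str, int]:
    """Pick the generation task from the conditioning-frame count.

    Hypothetical helper: the announcement says one DiT backbone covers
    all three tasks, distinguished by how many conditioning frames are
    supplied; the exact rule below is an assumption.
    """
    if cond_frames is None or cond_frames.shape[0] == 0:
        return "text_to_video", 0           # text prompt only
    if cond_frames.shape[0] == 1:
        return "image_to_video", 1          # a single reference image
    return "video_continuation", cond_frames.shape[0]  # a leading clip

print(route_task(None))                          # ('text_to_video', 0)
print(route_task(torch.randn(1, 3, 480, 832)))   # ('image_to_video', 1)
print(route_task(torch.randn(16, 3, 480, 832)))  # ('video_continuation', 16)
```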

The most striking feature of LongCat-Video is its long video generation capability. Because the model is pre-trained on video continuation tasks, it can stably produce coherent videos of up to 5 minutes while avoiding common failure modes such as color drift, quality degradation, and motion discontinuities. This breakthrough not only raises the quality of generated video but also lays a solid technical foundation for deeply interactive scenarios such as autonomous driving and embodied intelligence.
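One plausible way continuation pre-training translates into minutes-long output is chunk-wise autoregressive sampling: generate a first clip from text, then repeatedly continue from the tail of what already exists. The sketch below assumes a hypothetical `model.sample` interface and illustrative frame counts (5 minutes at 30 fps is 9,000 frames); it is not LongCat-Video's actual schedule.

```python
import torch

def generate_long_video(model, prompt: str,
                        chunk_frames: int = 90,
                        context_frames: int = 15,
                        total_frames: int = 9000) -> torch.Tensor:
    """Chunk-wise continuation sketch for minutes-long videos."""
    # First chunk is plain text-to-video: no conditioning frames.
    video = model.sample(prompt, cond=None, num_frames=chunk_frames)
    while video.shape[0] < total_frames:
        # Condition each new chunk on the tail of the video so far, so
        # appearance and motion stay continuous across chunk borders
        # (the drift issues mentioned above arise when this fails).
        context = video[-context_frames:]
        chunk = model.sample(prompt, cond=context, num_frames=chunk_frames)
        video = torch.cat([video, chunk], dim=0)
    return video[:total_frames]
```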

For efficient inference, LongCat-Video adopts a two-stage coarse-to-fine generation strategy, combined with block-sparse attention (BSA) and model distillation. Together these optimizations yield a 10.1x speedup in inference while preserving generation quality, even on long videos.
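The BSA idea can be sketched in a few lines of PyTorch: score coarse block summaries first, then attend only within the top-scoring key/value blocks. The block size, the pooling used for summaries, and the keep ratio below are illustrative assumptions; a production kernel would skip masked blocks outright rather than build a dense mask.

```python
import torch
import torch.nn.functional as F

def block_sparse_attention(q, k, v, block: int = 64,
                           keep_ratio: float = 0.125) -> torch.Tensor:
    """Toy block-sparse attention over (B, H, N, D) tensors."""
    B, H, N, D = q.shape
    nb = N // block                                   # number of blocks
    # 1) Coarse scores between mean-pooled query and key blocks.
    q_blk = q.view(B, H, nb, block, D).mean(dim=3)    # (B, H, nb, D)
    k_blk = k.view(B, H, nb, block, D).mean(dim=3)
    coarse = q_blk @ k_blk.transpose(-1, -2)          # (B, H, nb, nb)
    # 2) Keep only the top-k key blocks for each query block.
    topk = max(1, int(keep_ratio * nb))
    keep = coarse.topk(topk, dim=-1).indices          # (B, H, nb, topk)
    mask = torch.zeros(B, H, nb, nb, dtype=torch.bool, device=q.device)
    mask.scatter_(-1, keep, torch.ones_like(keep, dtype=torch.bool))
    # 3) Expand the block mask to token resolution and attend densely
    #    (for clarity only; a real kernel never materializes this mask).
    mask = mask.repeat_interleave(block, 2).repeat_interleave(block, 3)
    return F.scaled_dot_product_attention(q, k, v, attn_mask=mask)

q = k = v = torch.randn(1, 8, 1024, 64)
out = block_sparse_attention(q, k, v)   # same shape as q: (1, 8, 1024, 64)
```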

In both internal and public benchmark testing, LongCat-Video performs strongly across dimensions such as text alignment, visual quality, and motion quality, reaching state-of-the-art (SOTA) levels among current open-source models. The team says the release will greatly simplify long-video creation, letting creators go from one second of inspiration to a five-minute finished product.

To let more people experience the technology, Meituan has released LongCat-Video's resources on GitHub and Hugging Face. The project not only gives individual creators a powerful tool but also injects new vitality into the entire video creation industry.
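For readers who want to fetch the released files, the standard huggingface_hub download pattern applies; the repository id below is inferred from the announcement, so verify it against the official pages.

```python
from huggingface_hub import snapshot_download

# Assumed repo id; confirm on Meituan's GitHub / Hugging Face pages.
local_dir = snapshot_download(repo_id="meituan-longcat/LongCat-Video")
print("LongCat-Video files downloaded to:", local_dir)
```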