AIbase Report: The University of Hong Kong and Kuaishou's Kling team recently published a paper titled "Context as Memory: Scene-Consistent Interactive Long Video Generation with Memory Retrieval," proposing a "Context-as-Memory" approach that tackles a core challenge of long video generation: keeping scenes consistent over time.

Innovative Concept: Treating Historical Context as a "Memory" Carrier

The core innovation of this study lies in treating historically generated context as "memory" and conditioning generation on it through in-context learning, thereby achieving strong scene consistency across a long video. The research team found that video generation models can implicitly learn 3D priors from video data without explicit 3D modeling, a finding that aligns with the design philosophy of Google's Genie 3.
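To make the idea concrete, here is a minimal Python sketch of how such a pipeline could be organized: previously generated frames and their camera poses are kept in a memory bank, and each new clip is generated conditioned on frames retrieved from that bank. All names here (MemoryBank, generate_long_video, model.generate, retrieve) are hypothetical illustrations under these assumptions, not the paper's actual API.

```python
# Sketch of the Context-as-Memory idea: historical frames are not discarded
# but kept as a memory bank, and each new clip is generated conditioned on
# frames retrieved from that bank. The model interface is a placeholder.

from dataclasses import dataclass, field

@dataclass
class MemoryBank:
    frames: list = field(default_factory=list)  # all previously generated frames
    poses: list = field(default_factory=list)   # camera pose for each frame

    def add(self, new_frames, new_poses):
        self.frames.extend(new_frames)
        self.poses.extend(new_poses)

def generate_long_video(model, init_image, trajectory, retrieve, clip_len=16):
    """Autoregressively generate a long video, one clip at a time."""
    memory = MemoryBank()
    memory.add([init_image], [trajectory[0]])
    for start in range(1, len(trajectory), clip_len):
        target_poses = trajectory[start:start + clip_len]
        # Select only the historical frames relevant to the upcoming clip,
        # instead of conditioning on the entire (unbounded) history.
        context = retrieve(memory, target_poses)
        clip = model.generate(context=context, camera=target_poses)
        memory.add(clip, target_poses)
    return memory.frames
```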

Technical Breakthrough: FOV-Based Memory Retrieval Mechanism Significantly Improves Efficiency

To address the computational burden of a theoretically unbounded history of frames, the research team proposed a memory retrieval mechanism based on the field of view (FOV) along the camera trajectory. From all historical frames, the mechanism selects only those most relevant to the clip currently being generated to serve as memory conditions, significantly improving computational efficiency and reducing training costs.

Through this dynamic retrieval strategy, the system judges the relevance between the frames to be predicted and historical frames by how much their camera FOVs overlap, greatly reducing the number of context frames the model must attend to and substantially improving training and inference efficiency.
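As an illustration of this FOV-based selection, the sketch below ranks historical frames by a crude view-overlap heuristic and keeps the top-k as memory conditions; it plugs into the retrieve slot of the earlier sketch. The scoring function is a stand-in assumption: the paper's criterion derives overlap from camera frusta along the trajectory, whereas this proxy only combines viewing-angle difference with camera distance.

```python
import numpy as np

def fov_overlap_score(pose_a, pose_b, fov_deg=90.0):
    """Crude proxy for FOV overlap between two camera poses.

    Each pose is assumed to be (position: (3,), view_dir: unit (3,)).
    Real frustum intersection is more involved; this heuristic only
    illustrates ranking historical frames by view overlap.
    """
    pos_a, dir_a = pose_a
    pos_b, dir_b = pose_b
    angle = np.degrees(np.arccos(np.clip(np.dot(dir_a, dir_b), -1.0, 1.0)))
    dist = np.linalg.norm(np.asarray(pos_a) - np.asarray(pos_b))
    if angle > fov_deg:  # looking in sufficiently different directions
        return 0.0
    return (1.0 - angle / fov_deg) / (1.0 + dist)

def retrieve(memory, target_poses, k=8):
    """Pick the k historical frames whose views best overlap the target clip."""
    scores = [
        max(fov_overlap_score(hist_pose, tgt) for tgt in target_poses)
        for hist_pose in memory.poses
    ]
    top = np.argsort(scores)[::-1][:k]
    return [memory.frames[i] for i in sorted(top)]  # keep temporal order
```

Because only the k retrieved frames enter the context window, the cost of each generation step stays roughly constant no matter how long the video grows.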

Data Construction and Application Scenarios

The research team built a diverse long-video dataset with precise camera trajectory annotations using Unreal Engine 5, providing a solid foundation for validating the technique. At inference time, a user needs only to provide an initial image and can then freely explore the generated virtual world along a chosen camera trajectory.

Performance Exceeds Existing Methods

Experimental results show that Context-as-Memory maintains excellent static scene memory over time spans of tens of seconds and generalizes well across different scenes. Compared with existing SOTA methods, it achieves significant improvements in scene memory for long video generation and maintains memory continuity even in unseen, open-domain scenarios.

This breakthrough marks an important step for AI video generation toward longer time horizons and higher consistency, opening up new possibilities for applications such as virtual world construction and film production.