According to media reports, Apple's research team recently released an adaptation of the SlowFast-LLaVA model that performs strongly on long-video analysis tasks, even surpassing models with larger parameter counts. The work offers an efficient new approach to analyzing long video content.
The model's core advantage is its dual-stream architecture, which addresses the information redundancy and context-window overflow that plague traditional frame-by-frame processing. The slow stream captures static detail and background information at a low frame rate, while the fast stream tracks rapid action at a high frame rate. Working together, the two streams substantially improve video-processing efficiency.
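To make the two sampling rates concrete, here is a minimal Python sketch of one way a video's frames could be split into a sparse slow stream and a dense fast stream. The function name, frame counts, and uniform-sampling strategy are assumptions for illustration, not Apple's actual implementation.

```python
import numpy as np

def sample_dual_stream(num_frames: int,
                       slow_count: int = 8,
                       fast_count: int = 64):
    """Pick frame indices for a slow (sparse) and a fast (dense) stream.

    The slow stream keeps few frames, leaving token budget for
    high-resolution spatial detail; the fast stream keeps many frames
    to track motion. All counts here are illustrative assumptions.
    """
    slow_idx = np.linspace(0, num_frames - 1, slow_count).round().astype(int)
    fast_idx = np.linspace(0, num_frames - 1, fast_count).round().astype(int)
    return slow_idx, fast_idx

# Example: a 10-minute clip decoded at 1 fps yields 600 frames.
slow, fast = sample_dual_stream(600)
print(slow)  # 8 widely spaced indices -> rich spatial tokens per frame
print(fast)  # 64 densely spaced indices -> compact tokens, more frames
```

The intuition behind such a split is that neighboring frames are highly redundant, so encoding every frame at full detail is wasteful; dividing the budget lets a model keep both fine spatial context and broad temporal coverage.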
SlowFast-LLaVA posted strong results on long-video benchmarks across its 1-billion-, 3-billion-, and 7-billion-parameter versions. For example, the 1-billion-parameter model scored 56.6 on the General VideoQA task of LongVideoBench, while the 7-billion-parameter version reached 71.5 on the Long-Form Video Understanding task. Beyond video, the model also performs well on image-understanding tasks such as knowledge reasoning and OCR.
Despite these results, the model has limitations: its input is capped at 128 frames, so long videos must be sparsely sampled and key moments may be missed. Apple's team says it will continue exploring memory-optimization techniques to improve the model's performance.
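To illustrate why the 128-frame cap matters, the hypothetical snippet below computes how far apart sampled frames end up once a video exceeds the budget; apart from the 128-frame limit itself, every number here is an assumption.

```python
MAX_FRAMES = 128  # input cap reported for the model

def effective_stride_seconds(duration_s: float, fps: float) -> float:
    """Seconds between sampled frames once a video exceeds the cap."""
    total_frames = duration_s * fps
    return duration_s / min(total_frames, MAX_FRAMES)

# A 2-hour video decoded at 30 fps must be thinned dramatically:
print(f"{effective_stride_seconds(7200, 30):.1f} s between frames")  # ~56.2 s
```

At roughly one frame per minute, any event shorter than the stride can fall entirely between samples, which is the information loss the limitation refers to.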
SlowFast-LLaVA was trained on publicly available datasets and has been open-sourced, giving the AI community both a new approach and an efficient tool for long-video understanding.