Recently, Alibaba's Tongyi Lab and the School of Computer Science at Nankai University jointly released LLaVA-Scissor, a novel token compression method for video large models. The technique targets a core bottleneck of conventional approaches: processing video frames produces so many tokens that inference speed and scalability suffer.
Video models typically encode each frame separately, so the token count grows rapidly with video length. Traditional token compression methods such as FastV, VisionZip, and PLLaVA work reasonably well on images, but in video understanding they suffer from incomplete semantic coverage and leave temporal redundancy intact. To address this, LLaVA-Scissor adopts a graph-theoretic algorithm, Semantic Connected Components (SCC), which identifies distinct semantic regions within the token set.
The SCC method computes pairwise similarities between tokens, builds a similarity graph from them, and finds the graph's connected components. All tokens within a component are then replaced by a single representative token, sharply reducing the token count. For efficiency, LLaVA-Scissor applies this in a two-step spatiotemporal strategy: spatial compression first identifies the semantic regions within each frame, and temporal compression then removes information that is redundant across frames, so the final set of tokens represents the entire video compactly. A sketch of this procedure follows.
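To make the procedure concrete, here is a minimal Python sketch of SCC-style compression. The cosine similarity measure, the 0.8 edge threshold, and mean-pooling as the representative token are illustrative assumptions for this sketch, not details taken from the LLaVA-Scissor paper itself.

```python
import numpy as np
from scipy.sparse import csr_matrix
from scipy.sparse.csgraph import connected_components

def scc_compress(tokens: np.ndarray, threshold: float = 0.8) -> np.ndarray:
    """Merge each connected component of a token-similarity graph
    into one representative token.

    tokens: (N, D) array of token embeddings.
    Returns a (K, D) array with K <= N.
    """
    # Pairwise cosine similarity between tokens.
    normed = tokens / np.linalg.norm(tokens, axis=1, keepdims=True)
    sim = normed @ normed.T

    # Similarity graph: an edge wherever similarity exceeds the threshold.
    adjacency = csr_matrix(sim >= threshold)

    # Each connected component is treated as one semantic region.
    n_regions, labels = connected_components(adjacency, directed=False)

    # Replace every region with one representative token
    # (here: the mean of its members, an illustrative choice).
    return np.stack([tokens[labels == r].mean(axis=0)
                     for r in range(n_regions)])

# Two-step spatiotemporal usage: compress each frame spatially,
# then compress the concatenation of all frames temporally.
frames = [np.random.randn(196, 768) for _ in range(8)]  # stand-in for 8 frames of ViT tokens
spatial = [scc_compress(f) for f in frames]
video_tokens = scc_compress(np.concatenate(spatial))
# On real video features, redundant tokens across frames would
# collapse into far fewer components than the original 8 * 196.
print(video_tokens.shape)
```

In this reading, the second pass merges per-frame representative tokens that recur across frames, and the similarity threshold controls the trade-off between token retention rate and semantic coverage.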
In experiments, LLaVA-Scissor performs strongly across multiple video understanding benchmarks, with its advantage most pronounced at low token retention rates. On video question-answering benchmarks, it matches the original model at a 50% token retention rate and outperforms competing methods at 35% and 10%. It also holds up on long-video understanding, reaching 57.94% accuracy on the EgoSchema dataset at a 35% token retention rate.
Beyond improving the efficiency of video processing, this compression technique opens new directions for future video understanding research, and the release of LLaVA-Scissor is likely to have a positive impact on the field of video AI.
Key Points:
🌟 LLaVA-Scissor is a token compression method for video large models developed jointly by Alibaba and Nankai University, designed to address the token explosion that traditional methods face when processing video.
🔍 The SCC method computes token similarities, builds a similarity graph, and identifies its connected components, reducing the token count while preserving key semantic information.
🏆 LLaVA-Scissor performs strongly across multiple video understanding benchmarks, with especially clear advantages at low token retention rates.