Recently, the VAE (Variational Autoencoder) has been steadily losing ground in the generative-modeling world. In a collaboration between Tsinghua University and Kuaishou's Kling team, a new generative model called SVG, a VAE-free latent diffusion model, has been introduced, reportedly delivering a 6,200% improvement in training efficiency and a 3,500% speed-up in generation.

The decline of the VAE in image generation stems largely from the "semantic entanglement" problem: when we try to change just one attribute of an image (say, a cat's fur color), other attributes (such as its size or expression) shift along with it, producing inaccurate results. To solve this, the SVG model from Tsinghua University and Kuaishou takes a different route, deliberately constructing a feature space that combines semantics with fine-grained detail.


In the SVG design, the team first uses the pre-trained DINOv3 model as a semantic extractor. Trained through large-scale self-supervised learning, DINOv3 can effectively identify and separate features of different categories, addressing the semantic confusion found in traditional VAE latents. To supplement fine detail, the team adds a specially designed lightweight residual encoder, built so that detail information does not conflict with the semantic features. A distribution-alignment mechanism then fuses the two kinds of features, keeping them on a comparable scale and ensuring high-quality generation; a rough sketch of this two-branch design is given below.
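To make the architecture concrete, here is a minimal PyTorch sketch of the two-branch idea. Everything in it is an assumption for illustration, not the paper's actual code: `DummySemanticBackbone` stands in for the real frozen DINOv3 extractor, the `ResidualDetailEncoder` layer sizes are invented, and the statistics-matching step is only a toy stand-in for the paper's distribution-alignment mechanism.

```python
import torch
import torch.nn as nn


class DummySemanticBackbone(nn.Module):
    """Stand-in for a frozen DINOv3 ViT: emits a 16x-downsampled feature map.
    (In practice you would load real DINOv3 weights; this stub just keeps
    the sketch self-contained.)"""
    def __init__(self, dim=768):
        super().__init__()
        self.proj = nn.Conv2d(3, dim, kernel_size=16, stride=16)

    def forward(self, x):
        return self.proj(x)  # (B, dim, H/16, W/16)


class ResidualDetailEncoder(nn.Module):
    """Lightweight conv encoder for the high-frequency detail that the
    semantic backbone discards (layer sizes are illustrative)."""
    def __init__(self, in_ch=3, width=64, out_dim=32):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_ch, width, 3, stride=2, padding=1), nn.GELU(),
            nn.Conv2d(width, width, 3, stride=2, padding=1), nn.GELU(),
            # total stride 16, matching the ViT patch grid
            nn.Conv2d(width, out_dim, 3, stride=4, padding=1),
        )

    def forward(self, x):
        return self.net(x)


class SVGStyleLatent(nn.Module):
    """Builds a VAE-free latent: frozen semantic features + aligned detail."""
    def __init__(self, backbone, detail_dim=32):
        super().__init__()
        self.backbone = backbone.eval()
        for p in self.backbone.parameters():   # semantic branch stays frozen
            p.requires_grad = False
        self.detail = ResidualDetailEncoder(out_dim=detail_dim)

    def forward(self, img):
        with torch.no_grad():
            sem = self.backbone(img)           # semantic feature map
        det = self.detail(img)                 # residual detail map
        # Distribution alignment (simplified assumption): rescale the detail
        # branch to the semantic branch's global statistics so the
        # concatenated latent lives on one consistent scale.
        det = (det - det.mean()) / (det.std() + 1e-6)
        det = det * sem.std() + sem.mean()
        return torch.cat([sem, det], dim=1)    # (B, 768 + 32, H/16, W/16)


if __name__ == "__main__":
    model = SVGStyleLatent(DummySemanticBackbone())
    z = model(torch.randn(2, 3, 256, 256))
    print(z.shape)  # torch.Size([2, 800, 16, 16])
```

The design intuition is that the frozen backbone guarantees a semantically disentangled layout of the latent, while the small trainable branch only has to fill in texture, so the two sources of information cannot fight over the same channels.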


Experimental results show that the SVG model surpasses traditional VAE-based approaches in both generation quality and multi-task generalization. On ImageNet, SVG reached an FID (Fréchet Inception Distance, which measures how close generated images are to real ones; lower is better) of 6.57 after only 80 training epochs, well ahead of VAE-based models of similar scale. In inference efficiency, SVG also performs strongly, producing clear images with fewer sampling steps. Moreover, its feature space can be used directly for downstream visual tasks such as image classification and semantic segmentation without additional fine-tuning, greatly improving application flexibility; a toy illustration of this follows.
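As a sketch of why the "no fine-tuning" claim is plausible: if the latent is already semantically organized, a single linear layer over pooled features suffices for classification. The `LinearProbe` module and the dimensions below are hypothetical, reusing the latent shape from the earlier sketch; they are not taken from the paper.

```python
import torch
import torch.nn as nn


class LinearProbe(nn.Module):
    """Hypothetical linear probe over a frozen SVG-style latent:
    global-average-pool the feature map, then one linear layer."""
    def __init__(self, latent_dim=800, num_classes=1000):  # 768 semantic + 32 detail channels
        super().__init__()
        self.head = nn.Linear(latent_dim, num_classes)

    def forward(self, latent):            # latent: (B, C, H, W)
        pooled = latent.mean(dim=(2, 3))  # (B, C)
        return self.head(pooled)          # class logits


# Only the probe's weights are trained; the encoder stays frozen,
# which is what "no additional fine-tuning" means in practice.
probe = LinearProbe()
logits = probe(torch.randn(2, 800, 16, 16))
print(logits.shape)  # torch.Size([2, 1000])
```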

This work from Tsinghua University and Kuaishou not only marks a significant shift in image generation but also shows strong potential for multimodal generation tasks.

Paper link: https://arxiv.org/pdf/2510.15301