Tencent Hunyuan New Technology Makes Large Models 'Less Oily' to Make AI-Generated Images More Realistic!

AIbase基地

Published in AI News · 4 min read · Sep 18, 2025

The Tencent Hunyuan team recently announced its latest research on its official WeChat account: SRPO (Semantic Relative Preference Optimization), a method for improving the realism of AI-generated images. It specifically targets the "oily" skin texture of people generated by the open-source text-to-image model Flux. The technique is expected to have a significant impact on the image generation field.

As digital art grows in popularity, the quality of AI-generated images matters more and more. Flux, a popular base model in the open-source text-to-image community, is often criticized for generating human skin that looks too smooth and unnatural. SRPO was proposed in joint research by the Tencent Hunyuan team, the Chinese University of Hong Kong (Shenzhen), and Tsinghua University. It combines several techniques, including online adjustment of reward preferences and optimization of the generation trajectory, to make generated images more realistic.

The core of SRPO is the concept of "semantic preference": the optimization objective of the reward model is adjusted by adding specific control prompts (such as "realism"). Experimental results show that this significantly improves the realism of generated images. However, the researchers also found that guidance from a single semantic prompt can lead to reward hacking. They therefore introduced the "Semantic Relative Preference Optimization" strategy, which uses both positive and negative words as guiding signals to balance out the bias of the reward model.
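The idea of pairing a positive and a negative word can be illustrated with a toy example. The sketch below is not the team's implementation: it assumes the reward is a simple inner product between an image embedding and a text embedding, and all vectors, names, and the `shared_bias` term are hypothetical. Under that assumption, any preference the reward model applies to every prompt appears in both scores and cancels in the difference.

```python
def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def add(a, b):
    """Element-wise sum of two equal-length vectors."""
    return [x + y for x, y in zip(a, b)]

def reward(image_emb, text_emb):
    """Toy text-conditioned reward (assumed form: embedding inner product)."""
    return dot(image_emb, text_emb)

def relative_reward(image_emb, pos_text_emb, neg_text_emb):
    """Score under the positive control word minus score under the negative one."""
    return reward(image_emb, pos_text_emb) - reward(image_emb, neg_text_emb)

# Hypothetical 3-d embeddings, chosen only for illustration.
image = [0.2, 0.9, 0.1]
pos_semantic = [0.0, 1.0, 0.0]  # stands in for a positive word such as "realism"
neg_semantic = [0.0, 0.0, 1.0]  # stands in for a negative word
shared_bias = [0.5, 0.0, 0.5]   # preference the reward model applies to every prompt

biased = relative_reward(
    image, add(pos_semantic, shared_bias), add(neg_semantic, shared_bias)
)
unbiased = relative_reward(image, pos_semantic, neg_semantic)
print(biased, unbiased)  # the two values agree up to floating-point rounding
```

A single-prompt reward would include the `shared_bias` contribution in full, which is the kind of signal an optimizer can exploit; the relative form leaves only the difference between the two control words.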

Notably, traditional generation optimization methods often optimize only the latter half of the generation process, which easily leads to overfitting on high-frequency information. With the Direct-Align strategy, the Tencent Hunyuan team injects controllable noise into the input image and uses that noise as a reference anchor for image reconstruction. This significantly reduces reconstruction error and lets the reward signal be passed back more accurately. The approach makes it possible to optimize the first half of the generation trajectory as well, which addresses the overfitting problem.
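Why a known noise sample helps can be seen in a small sketch. This is an illustration under stated assumptions, not the Direct-Align implementation: it assumes a linear interpolation between the clean image and the noise, `x_t = (1 - t) * x0 + t * eps`, and the function names are hypothetical. Because the injected noise `eps` is known, the clean image can be recovered in closed form even at a heavily noised, early point of the trajectory.

```python
import random

def add_noise(x0, eps, t):
    """Blend a clean signal with known noise (assumed linear interpolation)."""
    return [(1 - t) * x + t * e for x, e in zip(x0, eps)]

def recover(x_t, eps, t):
    """Invert the blend using the same known noise as the anchor."""
    return [(xt - t * e) / (1 - t) for xt, e in zip(x_t, eps)]

rng = random.Random(0)               # fixed seed so the run is repeatable
x0 = [0.1, -0.4, 0.8, 0.3]           # stand-in for clean image values
eps = [rng.gauss(0.0, 1.0) for _ in x0]

t = 0.9                              # early in the trajectory: mostly noise
x_t = add_noise(x0, eps, t)
recovered = recover(x_t, eps, t)

max_error = max(abs(a - b) for a, b in zip(recovered, x0))
print(max_error)                     # limited only by floating-point rounding
```

In this toy setting a reward computed on `recovered` would reflect the actual image content at any `t` below 1, which is the property that allows early timesteps to be optimized.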

SRPO is also highly efficient to train, surpassing the existing DanceGRPO method after just 10 minutes of training. According to the research, SRPO improves realism and aesthetic scores more than threefold, and training time is reduced by a factor of 75 compared with traditional methods. As the technique spreads, the realism of AI-generated images should improve considerably, opening up new possibilities for digital art creation.

Project Address: https://tencent.github.io/srpo-project-page/

This article is from AIbase Daily
