In the field of image editing, a new contender is changing the game. TuZhan Intelligent and the UniWorld team from Peking University have introduced a new-generation image editing model, UniWorld-V2. The model not only surpasses Nano Banana in fine-grained control over image edits, but also excels at understanding Chinese instructions.

UniWorld-V2 is built on an innovative visual reinforcement learning framework, UniWorld-R1, which is the first to apply reinforcement learning policy optimization to image editing, significantly improving editing accuracy and flexibility. Compared with traditional supervised fine-tuning, UniWorld-R1 is designed to mitigate data overfitting and poor generalization, allowing the model to respond better to diverse editing instructions.
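
To give a feel for what policy optimization means in this setting, here is a deliberately minimal toy sketch. It is not the actual UniWorld-R1 framework (which optimizes an image editing model against edit-quality rewards); instead, the "policy" merely chooses among three hypothetical discrete edit actions, and the hand-coded reward stands in for an edit-quality score. The point it illustrates is the core contrast with supervised fine-tuning: the model is updated from reward signals on its own sampled outputs rather than from fixed ground-truth targets.

```python
import math
import random

random.seed(0)

# Hypothetical edit actions; the reward below pretends the user asked
# for a gesture change, mirroring the article's "OK gesture" example.
ACTIONS = ["change_gesture", "replace_text", "adjust_lighting"]
logits = [0.0, 0.0, 0.0]  # policy parameters

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def reward(action):
    # Stand-in for an edit-quality score: 1 if the edit matches intent.
    return 1.0 if action == "change_gesture" else 0.0

lr = 0.5
for _ in range(200):
    probs = softmax(logits)
    # Sample an action from the current policy, then score it.
    a = random.choices(range(len(ACTIONS)), weights=probs)[0]
    r = reward(ACTIONS[a])
    # REINFORCE update: d log pi(a) / d logit_i = 1[i == a] - probs[i].
    for i in range(len(logits)):
        grad = (1.0 if i == a else 0.0) - probs[i]
        logits[i] += lr * r * grad

print(softmax(logits))  # probability mass shifts toward "change_gesture"
```

Because only rewarded samples drive updates, the policy's probability of the intent-matching action climbs toward 1 over the 200 iterations; the real framework applies the same reward-driven principle at the scale of a full image editing model.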

For example, when a user asks the AI to change a girl's gesture to "OK", UniWorld-V2 understands and applies the edit accurately, whereas Nano Banana fails to capture the user's intent. Even more impressively, in a poster-editing example, UniWorld-V2 renders complex Chinese artistic fonts, such as "Moon Full Mid-Autumn", with clear results and accurate semantics.

The model's fine-grained control is also remarkable. With a simple box-selection operation, users can specify the editing region and perform precise adjustments, such as moving a specific object outside the box. UniWorld-V2 also performs well in light-and-shadow handling, blending objects naturally into a scene and improving its overall harmony.

On the GEdit-Bench and ImgEdit benchmarks, UniWorld-V2 leads well-known models such as OpenAI's GPT-Image-1 and Gemini 2.0, scoring 7.83 on GEdit-Bench and 4.49 on ImgEdit. These results are backed by the versatility of the UniWorld-R1 framework, which not only boosts UniWorld-V2's performance but also brings significant improvements to other models.

The paper, code, and models for UniWorld-R1 are publicly available on GitHub and Hugging Face, laying the groundwork for future research. This release not only advances the multimodal field but also opens new possibilities for image editing technology.

Paper address:

https://arxiv.org/abs/2510.16888

GitHub link:

https://github.com/PKU-YuanGroup/UniWorld