Kunlun AI Open Sources 'Skywork UniPic 2.0' Model

AIbase基地

Published inAI News · 3 min read · Aug 13, 2025

Kunlun Wanyi Group announced on the third day of the SkyWork AI Technology Release Week that it has officially open-sourced its latest developed "Skywork UniPic2.0" model. The release of this unified multimodal model marks another major breakthrough in the field of multimodal artificial intelligence. Skywork UniPic2.0 is an efficient training and inference framework for unified multimodal modeling. By making the generation and editing modules lightweight, and through joint training of the multimodal understanding model, it builds core capabilities for understanding, image generation, and editing, aiming to achieve a "high-efficiency, high-quality, unified" multimodal generation model.

WeChat Screenshot_20250813091518.png

Skywork UniPic2.0 consists of three core modules: image generation and editing, unified model capabilities, and post-training of image generation and editing. Based on the SD3.5-Medium architecture, this model has been improved from supporting only text input to accepting both text and image input, expanding its image generation capability to dual capabilities of image generation and editing. By freezing the image generation and editing module, the multimodal model Qwen2.5-VL-7B and Pre-Train connector are used to build an integrated capability for understanding, generation, and editing. Then, by jointly fine-tuning the connector and the image generation and editing module, the final integrated model for understanding, image generation, and editing is achieved.

Skywork UniPic2.0 not only provides developers and researchers with a comprehensive open-source platform, including model weights, inference code, and reinforcement strategies, but also its generation module is trained based on the 2B parameter SD3.5-Medium architecture, achieving image generation and editing metrics that surpass other models with larger parameter counts. Additionally, the model introduces reinforcement learning, using the pioneering progressive dual-task reinforcement strategy called Flow-GRPO, effectively enhancing the model's ability to understand complex instructions and consistency in image generation and editing.

WeChat Screenshot_20250813091544.png

Project Homepage:

https://unipic-v2.github.io/

Technical Report:

https://github.com/SkyworkAI/UniPic/blob/main/UniPic-2/assets/pdf/UNIPIC2.pdf

GitHub Address:

https://github.com/SkyworkAI/UniPic/tree/main/UniPic-2

HuggingFace Gradio:

https://huggingface.co/spaces/Skywork/UniPic2-Metaquery

HuggingFace Model:

https://huggingface.co/Skywork/UniPic2-SD3.5M-Kontext-2B; https://huggingface.co/Skywork/UniPic2-Metaquery-9B

SkyworkUniPic2.0 MultimodalArtificialIntelligence KunlunWanweiGroup AITerminology

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

New Landscape of AI Applications in 2025: Translation, Search, and Browsers Are Completely Evolved!

In 2025, the application scenarios of AI have undergone significant changes, with various fields actively integrating into daily life. From translation, search to browsers, they are profoundly changing work and lifestyle. The competition in AI translation is intense, with professional software demonstrating strong counterattack capabilities, such as Youdao launching an upgraded version of the Translation Large Model 2.0.

Dec 30, 2025

180

Kunan 2.0 Launches with Great Impact! The Chinese Non-Ferrous Metals Industry Accelerates Full-Chain AI Transformation, Over a Hundred Scenarios in Implementation Spark New Quality Productivity

The Chinese non-ferrous metals industry is undergoing a deep transformation driven by AI. On December 26th, the China Non-Ferrous Metals Industry Association and Chinalco Group jointly launched the industry's first large model, Kunan 2.0, marking the transition of digital and intelligent transformation in the mineral resources sector from pilot exploration to large-scale implementation. The model not only achieves technological upgrades but also promotes the intelligent reconstruction of the entire supply chain, from exploration to smelting, helping the industry shift from traditional experiential models to precise intelligent production.

Dec 29, 2025

160

Lima v2.0 Launches with Great Impact: Evolving from a Container Tool into an Invisible Shield for Secure AI Workflows

Lima 2.0 shifts focus to AI, introducing a 'sandbox' mechanism that isolates AI coding agents in virtual machines to prevent access to sensitive host files or risky operations, ensuring secure development.....

Dec 24, 2025

180

Beijing Humanoid Robot Launches the First VLA Large Model XR-1 in Accordance with National Standards

Beijing Humanoid Robot Innovation Center open-sourced XR-1, China's first VLA model meeting national embodied intelligence standards, alongside RoboMIND2.0 data base and ArtVIP high-fidelity digital asset dataset, to advance robotics and support developers.....

Dec 22, 2025

280

GPT-5.2-Codex Launches Shockingly: Reshaping Software Engineering AI Agent Breaks React Security Vulnerability for the First Time

GPT‑5.2-Codex is officially launched, marking a major breakthrough in intelligent coding. Optimized from the GPT‑5.2 architecture, it integrates terminal operation expertise from GPT-5.1-Codex-Max to tackle complex software engineering and cybersecurity challenges. Key highlights include enhanced long-range task execution and improved efficiency and accuracy through native context compression technology.....

Dec 19, 2025

340

SenseTime Launches the Industry's First Multi-Series Generative AI Agent Seko2.0, Domestic AI Chip Successfully Integrates the Full Multimodal AIGC Pipeline

SenseTime launches Seko2.0, the world's first AI agent for multi-scene video generation, enabling continuous narratives from single clips. It ensures high consistency in characters, scenes, and style, advancing plot coherence and visual uniformity, scalable for short videos, ads, and education, powered by its proprietary multimodal model.....

Dec 15, 2025

480

Ant Group Open Sources LLaDA2.0, the Industry's First 100B-Parameter Diffusion Language Model

Ant Technology Research Institute launched the LLaDA2.0 series, including 16B and 100B versions, among which the 100B version is the industry's first billion-parameter discrete diffusion large language model. The model breaks through the scalability bottleneck of diffusion models, significantly improves generation quality and inference speed, and provides a new direction for the development of the field.

Dec 12, 2025

460

AI Daily: AI Animation Tool Seko 2.0 Launches; Super Strong Voice Model Qwen3-TTS Released; 2025 Annual Word and Phrase Candidates Announced

Welcome to the [AI Daily] column! This is your guide to exploring the world of artificial intelligence every day. Every day, we present you with the latest content in the AI field, focusing on developers, helping you understand technological trends and learn about innovative AI product applications. Discover new AI products: https://app.aibase.com/zh1, Alibaba released the super strong voice synthesis model Qwen3-TTS, offering 49 voice styles to meet your voice needs! 8, ChatGPT ranks first in the Apple App Store download chart, becoming the most popular app for American users.

Dec 11, 2025

410

Sensetime Seko 2.0 Launch: Generate 100 Episodes of Coherent Animation with One Sentence, AI Animation Production Cost Reduced to 'a Cup of Milk Tea Price'

SenseTime launches AI video agent 'Seko2.0', enabling users to generate up to 100 episodes of coherent, character-consistent animated series from a single sentence, with minimal production costs.....

Dec 11, 2025

820

Figma Launches AI Image Editing: One-Click Object Removal with Lasso, Automatic Expansion of the Canvas, Toolbar Reorganization

Figma introduces AI image editing features: lasso tool for object removal/isolation, auto background expansion, and lighting/color adjustments without text prompts. Lasso 2.0 allows direct deletion or dragging of selected objects while preserving the background. These tools will debut in Figma Design and Draw, with full platform rollout next year.....

Dec 11, 2025

290

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Services​

AI Model Compatibility Checker

AI Deployment Calculator

Kunlun AI Open Sources 'Skywork UniPic 2.0' Model

AIbase基地

This article is from AIbase Daily

AI News Recommendations

New Landscape of AI Applications in 2025: Translation, Search, and Browsers Are Completely Evolved!

Kunan 2.0 Launches with Great Impact! The Chinese Non-Ferrous Metals Industry Accelerates Full-Chain AI Transformation, Over a Hundred Scenarios in Implementation Spark New Quality Productivity

Lima v2.0 Launches with Great Impact: Evolving from a Container Tool into an Invisible Shield for Secure AI Workflows

Beijing Humanoid Robot Launches the First VLA Large Model XR-1 in Accordance with National Standards

GPT-5.2-Codex Launches Shockingly: Reshaping Software Engineering AI Agent Breaks React Security Vulnerability for the First Time

SenseTime Launches the Industry's First Multi-Series Generative AI Agent Seko2.0, Domestic AI Chip Successfully Integrates the Full Multimodal AIGC Pipeline

Ant Group Open Sources LLaDA2.0, the Industry's First 100B-Parameter Diffusion Language Model

AI Daily: AI Animation Tool Seko 2.0 Launches; Super Strong Voice Model Qwen3-TTS Released; 2025 Annual Word and Phrase Candidates Announced

Sensetime Seko 2.0 Launch: Generate 100 Episodes of Coherent Animation with One Sentence, AI Animation Production Cost Reduced to 'a Cup of Milk Tea Price'

Figma Launches AI Image Editing: One-Click Object Removal with Lasso, Automatic Expansion of the Canvas, Toolbar Reorganization

AI News Recommendations

New Landscape of AI Applications in 2025: Translation, Search, and Browsers Are Completely Evolved!

Kunan 2.0 Launches with Great Impact! The Chinese Non-Ferrous Metals Industry Accelerates Full-Chain AI Transformation, Over a Hundred Scenarios in Implementation Spark New Quality Productivity

Lima v2.0 Launches with Great Impact: Evolving from a Container Tool into an Invisible Shield for Secure AI Workflows

Beijing Humanoid Robot Launches the First VLA Large Model XR-1 in Accordance with National Standards

GPT-5.2-Codex Launches Shockingly: Reshaping Software Engineering AI Agent Breaks React Security Vulnerability for the First Time

SenseTime Launches the Industry's First Multi-Series Generative AI Agent Seko2.0, Domestic AI Chip Successfully Integrates the Full Multimodal AIGC Pipeline

Ant Group Open Sources LLaDA2.0, the Industry's First 100B-Parameter Diffusion Language Model

AI Daily: AI Animation Tool Seko 2.0 Launches; Super Strong Voice Model Qwen3-TTS Released; 2025 Annual Word and Phrase Candidates Announced

Sensetime Seko 2.0 Launch: Generate 100 Episodes of Coherent Animation with One Sentence, AI Animation Production Cost Reduced to 'a Cup of Milk Tea Price'

Figma Launches AI Image Editing: One-Click Object Removal with Lasso, Automatic Expansion of the Canvas, Toolbar Reorganization

GEO Services