Last night, a 1024×1024 neon Hanfu image rendered in just 2.3 seconds on an RTX 4090, with VRAM usage holding steady at 13GB. Z-Image-Turbo from Alibaba's Tongyi Lab stunned onlookers: with only 6B parameters, it matches and even slightly surpasses 20B+ closed-source flagship models.


Z-Image lets its results do the talking, with no flashy slogans:

- It delivers print-quality images in just 8 sampling steps, runs on consumer-grade GPUs such as a 6GB RTX 3060, and keeps peak VRAM usage within 16GB;

- It parses long, deeply nested Chinese prompts in one pass, from automatically resolving contradictions like "sunlight at night" to faithfully rendering details like "a milk tea in the left hand and a phone screen showing today's news," so Chinese and English text no longer comes out as scribbles;

- Its outputs capture skin pores, glass reflections, backlit rain and fog, and cinematic depth of field, and Z-Image-Turbo ranks among the top open-source models on Elo-based human preference leaderboards.


The secret lies in the new S3-DiT architecture: text, visual-semantic, and image tokens are processed as a single stream, cutting the parameter count to roughly one-third of its competitors while keeping inference efficient. The team also released Z-Image-Edit, which lets users swap faces and scenes in an existing image with natural-language instructions, so the community can start playing with it right away.
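As a rough mental model only (this is not the released S3-DiT code; the dimensions, token counts, and module names below are illustrative assumptions), a single-stream design concatenates text, visual-semantic, and image-patch tokens into one sequence and runs them through shared transformer blocks, rather than maintaining separate branches with their own cross-attention:

```python
import torch
import torch.nn as nn

class SingleStreamBlock(nn.Module):
    """Illustrative single-stream block: all modalities share one attention pass."""
    def __init__(self, dim: int = 1024, heads: int = 16):
        super().__init__()
        self.norm1 = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm2 = nn.LayerNorm(dim)
        self.mlp = nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.norm1(x)
        x = x + self.attn(h, h, h, need_weights=False)[0]  # joint attention over the whole stream
        return x + self.mlp(self.norm2(x))

# Hypothetical token counts and width, just to show the shapes involved.
batch, dim = 1, 1024
text_tokens = torch.randn(batch, 77, dim)      # prompt embeddings
semantic_tokens = torch.randn(batch, 32, dim)  # high-level visual-semantic conditioning
image_tokens = torch.randn(batch, 4096, dim)   # 64x64 latent patches for a 1024x1024 image

# One stream: concatenate along the sequence axis so every block attends across modalities.
stream = torch.cat([text_tokens, semantic_tokens, image_tokens], dim=1)
out = SingleStreamBlock(dim)(stream)
print(out.shape)  # torch.Size([1, 4205, 1024]); only the image slice is decoded back to pixels
```

A shared stream of this shape replaces duplicated per-modality branches, which is one plausible way to keep the parameter budget small; the actual S3-DiT layout is documented in the project repo.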

Alibaba has not officially said whether it will fully open source the model, but the weights are already available on ModelScope and Hugging Face, support has been merged into the diffusers main branch, and a single pip install is all it takes to load it. Once enterprise API pricing lands, Midjourney and Flux may need to start thinking about price cuts sooner than planned.
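For reference, a minimal loading sketch along those lines, assuming a recent diffusers build from main and a Tongyi-MAI/Z-Image-Turbo repo id on Hugging Face (check the project README and model card for the exact pipeline class, repo name, and recommended settings):

```python
# pip install git+https://github.com/huggingface/diffusers torch transformers accelerate
import torch
from diffusers import DiffusionPipeline

# Repo id and dtype are assumptions; consult the model card for the published values.
pipe = DiffusionPipeline.from_pretrained(
    "Tongyi-MAI/Z-Image-Turbo",
    torch_dtype=torch.bfloat16,
)
pipe.to("cuda")

prompt = "a woman in neon-lit Hanfu on a rainy city street at night, cinematic lighting"
image = pipe(
    prompt=prompt,
    num_inference_steps=8,  # the 8-step regime highlighted above
    height=1024,
    width=1024,
).images[0]
image.save("z_image_turbo_sample.png")

# Rough check against the ~13-16GB VRAM figures quoted above (CUDA only).
print(f"peak VRAM: {torch.cuda.max_memory_allocated() / 1024**3:.1f} GiB")
```

Argument names such as num_inference_steps, height, and width follow the common diffusers pipeline convention and may differ slightly for this specific pipeline.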

Z-Image's emergence is like a starting gun: the image-generation field has officially entered its "lightweight and high-quality" era, and compute democratization is no longer just a slogan. After all, who doesn't have a 16GB GPU?

Project address: https://github.com/Tongyi-MAI/Z-Image