Zhipu has officially open-sourced its next-generation image generation model, GLM-Image. The model's biggest breakthrough is that it is the first SOTA (state-of-the-art) multimodal model to complete the entire workflow, from data preprocessing to large-scale training, on a domestic-chip computing platform.

GLM-Image adopts an innovative "autoregressive + diffusion decoder" hybrid architecture, achieving deep integration between image generation and language modeling. This design lets the model excel at "knowledge-intensive" generation tasks, accurately following global instructions while rendering local details, and addresses long-standing challenges in AI image generation such as poster layout, slide (PPT) creation, and complex scientific illustrations.


GLM-Image supports both text-to-image and image-to-image generation within a single model.

  • Text-to-image: Generate high-detail images based on text descriptions, performing especially well in information-dense scenarios.
  • Image-to-image: Supports various tasks including image editing, style transfer, multi-subject consistency, and identity-preserving generation of people and objects.

In terms of technical benchmarks, GLM-Image demonstrates strong Chinese understanding and rendering. It ranks first among open-source models on multiple complex visual-text generation leaderboards, excelling in particular at the challenging task of generating Chinese characters. Additionally, the model natively supports arbitrary-aspect-ratio image generation at resolutions from 1024 to 2048 pixels per side without additional training, adapting to various resolutions automatically.
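The "arbitrary aspect ratio within 1024–2048" behavior can be illustrated with a small helper that fits a requested aspect ratio into that range. This is a sketch under stated assumptions: the function name, the clamping strategy, and the snap-to-multiple-of-64 rule (a common requirement for diffusion decoders) are illustrative, not part of the released model.

```python
def fit_resolution(aspect_ratio: float, min_side: int = 1024,
                   max_side: int = 2048, multiple: int = 64) -> tuple[int, int]:
    """Pick a (width, height) close to the requested aspect ratio whose
    sides stay within [min_side, max_side], snapped to a multiple.
    The snapping rule is an assumption for illustration only."""
    if aspect_ratio >= 1.0:
        # Landscape or square: maximize width, derive height.
        width = float(max_side)
        height = width / aspect_ratio
        if height < min_side:  # too wide: clamp the ratio
            height = float(min_side)
            width = min(height * aspect_ratio, max_side)
    else:
        # Portrait: maximize height, derive width.
        height = float(max_side)
        width = height * aspect_ratio
        if width < min_side:  # too tall: clamp the ratio
            width = float(min_side)
            height = min(width / aspect_ratio, max_side)

    def snap(v: float) -> int:
        return max(min_side, min(max_side, round(v / multiple) * multiple))

    return snap(width), snap(height)
```

For example, a 16:9 request yields 2048×1152, while a square request yields 2048×2048; extreme ratios are clamped so both sides stay in range.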

Currently, GLM-Image has been fully open-sourced on platforms such as GitHub and Hugging Face. To lower the usage barrier, the API call price is as low as 0.1 yuan per image. Zhipu stated that they will also launch a new version optimized for speed in the future, further improving commercial cost-effectiveness.
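As a sketch of what an API call at that price point might look like, the snippet below assembles a JSON request body for a text-to-image call. The field names, size format, and model identifier are all assumptions for illustration; consult the official API documentation for the real schema.

```python
import json


def build_glm_image_request(prompt: str, width: int = 1024,
                            height: int = 1024,
                            model: str = "glm-image") -> str:
    """Build a JSON body for a hypothetical text-to-image API call.
    Every field name here is illustrative, not a documented API detail."""
    payload = {
        "model": model,                 # assumed model identifier
        "prompt": prompt,
        "size": f"{width}x{height}",    # assumed "WxH" size format
    }
    return json.dumps(payload, ensure_ascii=False)


# Example: a poster-style request at a 1:2 portrait resolution.
body = build_glm_image_request("A poster with dense Chinese text", 1024, 2048)
```

The body would then be POSTed to the provider's image-generation endpoint with the usual authorization header.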


  • GitHub: https://github.com/zai-org/GLM-Image

  • Hugging Face: https://huggingface.co/zai-org/GLM-Image

Key points:

  • 🇨🇳 Fully domestic, self-developed stack: the complete training workflow ran on Huawei Ascend Atlas 800T A2 hardware and the MindSpore framework, verifying the feasibility of training top-tier models on domestic computing power.

  • 🎨 Breakthrough in text-image fusion: with its hybrid architecture, the model ranked first among open-source models on LongText-Bench and other long-text rendering benchmarks, significantly improving the accuracy of Chinese-character and complex text-in-image generation.

  • 💰 Highly cost-effective open source: the model supports adaptive image generation across resolutions and is offered to creators at an extremely low API price, aiming to popularize domestic generative AI technology.