On August 12, at the 2025 Financial AI Inference Application Implementation and Development Forum, Huawei will release UCM (Inference Memory Data Manager), an AI inference technology it describes as a breakthrough. The technology is expected to reduce China's reliance on HBM (High Bandwidth Memory) for AI inference and significantly improve the inference performance of large models in China.

UCM is centered on the KV Cache and integrates multiple cache-acceleration algorithms. By managing the memory data generated during inference across a hierarchy of storage tiers, it expands the effective context window, delivers high-throughput, low-latency inference, and lowers the cost per token. The solution alleviates problems such as task stalls and delayed responses caused by insufficient HBM capacity.
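To make the tiering idea concrete, here is a minimal sketch of hierarchical KV-cache management in Python. It is illustrative only: the `TieredKVCache` class, the tier sizes, and the LRU spill/promote policy are assumptions for exposition, not Huawei's actual UCM design or API. The point is that KV blocks evicted from scarce HBM spill into DRAM and then SSD instead of being discarded, so long contexts can be served without recomputing attention.

```python
from collections import OrderedDict

class TieredKVCache:
    """Toy three-tier KV cache: the hot tier models scarce HBM, the warm
    tier models DRAM, and the cold tier models SSD. Blocks evicted from a
    full tier spill into the next tier instead of being recomputed."""

    def __init__(self, hbm_slots=2, dram_slots=4, ssd_slots=8):
        # Each tier is (name, capacity, ordered store); insertion order
        # doubles as LRU order.
        self.tiers = [
            ("HBM", hbm_slots, OrderedDict()),
            ("DRAM", dram_slots, OrderedDict()),
            ("SSD", ssd_slots, OrderedDict()),
        ]

    def put(self, token_id, kv_block):
        """Insert into the hot tier; cascade LRU evictions downward."""
        victim = (token_id, kv_block)
        for name, capacity, store in self.tiers:
            if victim is None:
                break
            store[victim[0]] = victim[1]
            store.move_to_end(victim[0])  # mark as most recently used
            victim = store.popitem(last=False) if len(store) > capacity else None
        # If victim is still set here, the coldest tier overflowed and the
        # block is dropped; a real system would recompute it on demand.

    def get(self, token_id):
        """Look up a KV block; promote hits back to the hot tier."""
        for _, _, store in self.tiers:
            if token_id in store:
                block = store.pop(token_id)
                self.put(token_id, block)  # reinsert hot, refresh LRU order
                return block
        return None  # miss: attention for this token must be recomputed
```

As a usage illustration, after calling `cache.put(t, block)` for more tokens than the HBM tier holds, older blocks remain retrievable via `cache.get(t)` from DRAM or SSD rather than triggering recomputation. Production systems manage multi-token KV blocks and weigh tier bandwidth against recompute cost, but the spill-and-promote structure is the same.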


At the forum, Huawei will jointly announce the latest AI inference application results with China UnionPay. Experts from institutions including the China Academy of Information and Communications Technology, Tsinghua University, and iFlytek will also share practical experience in accelerating and optimizing large model inference. Fan Jie, Vice President of Huawei's Data Storage Product Line, said that future AI breakthroughs will depend heavily on unlocking high-quality industry data, and that high-performance AI storage can cut data loading time from hours to minutes, raising compute-cluster utilization from 30% to 60%.

Industry analysts believe the release of UCM comes at a critical moment, as the AI industry shifts from pursuing the limits of model capability to optimizing the inference experience, which has become a key measure of AI's commercial value. Great Wall Securities noted that as large model capabilities keep improving and commercial scenarios expand, companies across the computing-power industry chain are well placed to seize new development opportunities.