Cambricon has announced that it completed "Day 0" adaptation of DeepSeek's latest open-source DeepSeek-V4 model series on the vLLM inference framework. The adaptation covers both the 285B-parameter Flash version and the 1.6T-parameter Pro version, ensuring that the models ran stably on Cambricon hardware platforms on release day. The corresponding adaptation code has been open-sourced to the GitHub community.

To handle DeepSeek-V4's distinctive sparse-attention and compression structure, Cambricon accelerated core modules such as the Compressor with its in-house vector fusion operator library, Torch-MLU-Ops. Using its high-performance programming language BangC, the Cambricon team wrote highly optimized kernels for hot operators such as sparse attention and GroupGemm, and added full support in the vLLM framework for five-dimensional hybrid parallelism (TP/PP/SP/DP/EP), low-precision quantization, and prefill-decode (PD) disaggregated deployment. Together, these techniques significantly raise end-to-end inference token throughput while staying within latency constraints.
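DeepSeek-V4's exact sparse-attention scheme is not detailed here, but the general shape of the "hot operator" being optimized can be illustrated with a generic top-k sparse attention sketch in NumPy: each query attends only to its highest-scoring keys, so a fused kernel can skip most of the score matrix. The function name and the top-k formulation are illustrative assumptions, not Cambricon's or DeepSeek's actual implementation.

```python
import numpy as np

def topk_sparse_attention(q, k, v, top_k):
    """Single-head top-k sparse attention (illustrative sketch).

    Each query attends only to its top_k highest-scoring keys; the rest
    are masked out before the softmax. A fused device kernel would avoid
    materializing the full score matrix, which this NumPy version does
    only for clarity.
    """
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)                      # (n_q, n_k) raw scores
    # Indices of the top_k largest scores per query row.
    idx = np.argpartition(scores, -top_k, axis=-1)[:, -top_k:]
    # Additive mask: 0 at kept positions, -inf elsewhere.
    mask = np.full_like(scores, -np.inf)
    np.put_along_axis(mask, idx, 0.0, axis=-1)
    masked = scores + mask
    # Numerically stable softmax over the surviving entries.
    masked -= masked.max(axis=-1, keepdims=True)
    probs = np.exp(masked)                             # exp(-inf) -> 0
    probs /= probs.sum(axis=-1, keepdims=True)
    return probs @ v
```

When `top_k` equals the number of keys, the mask is all zeros and the result reduces to ordinary dense softmax attention, which is a convenient sanity check for a hand-written kernel.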

On the hardware side, Cambricon leverages the MLU's memory-access and sorting acceleration features to handle DeepSeek-V4's complex indexing structure efficiently. Combined with high interconnect bandwidth and low-latency communication, the solution minimizes communication overhead in both Prefill and Decode scenarios, improving inference utilization.
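One generic way sorting hardware helps with complex indexing is sort-assisted gathering: scattered lookups into a large cache are first sorted so memory is visited in near-sequential order, then the original request order is restored with the inverse permutation. The sketch below shows this pattern in NumPy; it is an assumption-laden illustration of the general technique, not a description of the MLU's actual hardware paths.

```python
import numpy as np

def sorted_gather(cache, indices):
    """Gather rows of `cache` at `indices` via sorted (ascending) addresses.

    Visiting memory in sorted order improves locality for scattered
    lookups; the inverse permutation then restores the caller's original
    request order, so the result equals a direct cache[indices] gather.
    """
    order = np.argsort(indices, kind="stable")   # permutation sorting the indices
    gathered = cache[indices[order]]             # near-sequential reads
    inverse = np.empty_like(order)
    inverse[order] = np.arange(order.size)       # inverse permutation
    return gathered[inverse]                     # back to original order
```

On CPUs this reordering is usually not worth the extra sort, but on accelerators with dedicated sorting units and wide memory transactions, coalescing the reads can pay for the permutation overhead.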

Industry analysis indicates that DeepSeek-V4, with its ultra-long context of one million (1M) words and top-tier logical-reasoning performance, places strict demands on the underlying computing architecture. Cambricon's same-day adaptation not only demonstrates that domestic computing platforms can support ultra-large, structurally complex models, but also signals that the domestic AI industry chain has reached maturity in software-hardware co-design, providing an efficient computing foundation for the broad adoption of large-model applications.