In the latest SemiAnalysis InferenceMAX benchmark round, Signal65 analyzed inference performance on the DeepSeek-R1 0528 mixture-of-experts (MoE) model, and the results showed NVIDIA's GB200 NVL72 rack-scale system significantly outperforming an AMD Instinct MI355X cluster of similar scale. An MoE model activates only a small subset of its "expert" subnetworks for each input, which improves compute efficiency but, when scaled out across nodes, puts heavy pressure on inter-node communication latency and bandwidth, which can become the bottleneck.
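The routing idea described above can be sketched in a few lines. This is a minimal, illustrative top-k MoE forward pass in NumPy; the function names, expert count, and top-2 routing are assumptions for demonstration, not DeepSeek-R1's actual implementation.

```python
import numpy as np

rng = np.random.default_rng(0)
n_experts, d, top_k = 8, 16, 2  # illustrative sizes, not DeepSeek-R1's

# Each "expert" is a small feed-forward weight matrix; a router scores them.
experts = [rng.standard_normal((d, d)) for _ in range(n_experts)]
router = rng.standard_normal((d, n_experts))

def moe_forward(x):
    """Route one token vector to its top-k experts and mix their outputs."""
    logits = x @ router                      # one score per expert
    idx = np.argsort(logits)[-top_k:]        # indices of the k best experts
    w = np.exp(logits[idx] - logits[idx].max())
    w /= w.sum()                             # softmax over the chosen experts
    # Only k of n_experts matrices are ever multiplied: the efficiency win.
    return sum(wi * (x @ experts[i]) for i, wi in zip(idx, w))

y = moe_forward(rng.standard_normal(d))
```

In a real deployment the experts live on different GPUs or nodes, so every token's activations must be shuffled to its selected experts and back, which is exactly the all-to-all communication pressure the article refers to.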

NVIDIA optimized the GB200 NVL72's architecture through its "extreme co-design" strategy. The system tightly interconnects 72 GPUs and provides up to 30TB of shared memory, significantly improving data-transfer efficiency and mitigating the communication-latency problem. According to the test data, the GB200 NVL72 achieves throughput of up to 75 tokens/second per GPU under comparable configurations, 28 times the performance of the AMD MI355X.

For large-scale cloud providers, total cost of ownership (TCO) is a critical consideration. Based on Oracle Cloud pricing data, Signal65 noted that the GB200 NVL72 is not only fast but also remarkably cost-effective: its relative cost per token is roughly one-fifteenth that of the AMD solution, while delivering a higher interaction rate.
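Cost per token is straightforward arithmetic once hourly price and sustained throughput are known. The sketch below shows the calculation with placeholder numbers; the $300/hour rack rate is purely hypothetical and does not reflect actual Oracle Cloud pricing, and only the 75 tokens/s/GPU figure comes from the article.

```python
def cost_per_million_tokens(hourly_rate_usd: float, tokens_per_sec: float) -> float:
    """USD cost to generate one million tokens at a given hourly rate."""
    tokens_per_hour = tokens_per_sec * 3600
    return hourly_rate_usd / tokens_per_hour * 1_000_000

# Hypothetical example: a 72-GPU rack billed at $300/hour (made-up rate),
# sustaining the article's 75 tokens/s per GPU.
rack_tokens_per_sec = 75 * 72  # 5400 tokens/s for the whole rack
print(round(cost_per_million_tokens(300, rack_tokens_per_sec), 2))  # → 15.43
```

The same formula applied to a competing system's real price and measured throughput is how a relative cost-per-token ratio like "one-fifteenth" is derived.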

Although NVIDIA dominates in mixture-of-experts workloads, AMD retains competitive advantages. The report states that the AMD MI355X remains a competitive option for dense models thanks to its high-capacity HBM3E memory. AMD has not yet launched a rack-scale solution to counter the GB200 NVL72, but as its Helios platform squares off against NVIDIA's Vera Rubin platform, competition in rack-scale solutions will only intensify.

Key Points:   

🟢 NVIDIA's GB200 NVL72 delivers 28 times the performance of the AMD MI355X, a significant lead.

🟢 The GB200 NVL72 mitigates data-transfer latency through its optimized architecture and high-speed shared memory.

🟢 Although NVIDIA holds the advantage, AMD remains competitive for dense models, and future competition will be fiercer still.