Nvidia Launches New Rubin CPX GPU to Power Massive Context AI Applications

AIbase基地

Published inAI News · 4 min read · Sep 10, 2025

Nvidia recently announced that its new Vera Rubin micro-architecture is in development and is scheduled to be launched in 2026. The Rubin CPX variant of this architecture will focus on meeting the needs of artificial intelligence workloads that require processing massive context windows. At a press conference, Nvidia CEO Jensen Huang stated: "The Vera Rubin platform will mark a new leap in AI computing, introducing the next-generation Rubin GPU and a new class of processor called CPX."

Rubin CPX is particularly suitable for applications that require processing over one million tokens, such as complex software development and high-definition video generation. According to Nvidia's plan, the Vera Rubin NDL144CPX GPU will be available by the end of 2026. The CPX model is specifically designed for applications requiring long context windows, offering 8 exaflops of AI performance, 30 PF NVFP4 context computing capability, and three times the exponential computation performance compared to the Nvidia GB300NVL72 system. In addition, the CPX model is equipped with 128GB GDDR7 memory, 4 encoders, and 4 decoders, designed specifically for video generation, and provides 100TB of fast memory.

Nvidia executives said that the Vera Rubin NDL144CPX can be considered part of a large artificial intelligence factory. To support the construction of large-scale data centers, Nvidia also plans to launch terascale reference designs. This means that Nvidia will closely collaborate with infrastructure companies to redesign data centers from a computing perspective, providing reference designs covering aspects such as building, design, simulation, and operation.

Before this release, Nvidia also announced the latest MLPerf inference test results, where the Blackwell GPU set a new record, surpassing the baseline of the Llama3.1405B interactive model. This innovative technology is called "disaggregated service," which allows the same hardware to achieve improved performance, providing additional revenue opportunities for enterprises that have already deployed solutions.

Key Points:
🔍 **Nvidia releases Rubin CPX GPU, aimed at supporting large-context AI applications.**
🚀 **This GPU will be available by the end of 2026, featuring powerful AI performance and memory configuration.**
🏢 **Nvidia plans to launch terascale reference designs for data centers, helping to build AI factories.**

OpenAI to Secure Over $100 Billion in Funding, Post-Valuation Could Exceed $850 Billion

On February 19, Bloomberg reported that OpenAI is about to complete a funding round exceeding $100 billion. This record-breaking capital injection is expected to push its post-investment valuation above $850 billion (approximately 5.88 trillion Chinese yuan at the current exchange rate), instantly drawing widespread attention from the tech and investment sectors. In the first batch of funds from this round, strategic investors occupy a central position, with Amazon, SoftBank Group, NVIDIA, and Microsoft all listed among the core participants. Further information indicates that if all parties execute at the maximum discussed amount, the investment size

NVIDIA and Meta's Collaboration Tops the Trending List, with the Latter Planning to Deploy Millions of Blackwell GPUs

Recently, NVIDIA officially announced a multi-year, cross-generation strategic partnership with Meta. According to the agreement reached between the two parties, Meta plans to deploy millions of NVIDIA's Blackwell GPUs, as well as the next-generation Rubin architecture GPU designed specifically for agent AI inference, within its large-scale AI data centers to strengthen its AI computing infrastructure.

Getting Rid of NVIDIA Dependency! OpenAI Joins Forces with Cerebras to Launch GPT-5.3-Codex-Spark: The First Fruit of a $10 Billion Computing Power

OpenAI is accelerating its strategy to reduce reliance on NVIDIA, and on February 12, 2026, it launched its first AI model based on Cerebras chips, GPT-5.3-Codex-Spark. This model is designed specifically for software engineers, offering a more flexible interactive experience, supporting instant interruption and switching, allowing developers to pause lengthy computations at any time and quickly handle other urgent coding tasks.

ByteDance's Self-Developed Chip Exposed: 100,000 Units in Mass Production Imminent, Aiming to Break NVIDIA Dependence

ByteDance is accelerating its development of the self-designed AI chip SeedChip, planning to mass-produce at least 100,000 units this year, mainly for inference tasks to ensure AI computing power supply. Although the company stated that the relevant reports are inaccurate, its AI procurement budget this year has exceeded 160 billion yuan, with half still used to purchase NVIDIA chips, reflecting the high inference cost pressure faced when advancing large models.

Mianbi Intelligence Launches Songguo Board: AI-Native Edge Development Board Opens New Paradigms in Hardware Development

Mianbi Intelligence releases its first AI edge development board, the Songguo Board, based on the NVIDIA Jetson module, integrated with multi-modal interfaces such as microphones and cameras, and compatible with its self-developed MiniCPM series models, aiming to enable developers to conveniently build intelligent hardware.

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Brand Visibility

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Services​

AI Model Compatibility Checker

AI Deployment Calculator

Nvidia Launches New Rubin CPX GPU to Power Massive Context AI Applications

AIbase基地

This article is from AIbase Daily

AI News Recommendations

OpenAI to Secure Over $100 Billion in Funding, Post-Valuation Could Exceed $850 Billion

NVIDIA and Meta's Collaboration Tops the Trending List, with the Latter Planning to Deploy Millions of Blackwell GPUs

Getting Rid of NVIDIA Dependency! OpenAI Joins Forces with Cerebras to Launch GPT-5.3-Codex-Spark: The First Fruit of a $10 Billion Computing Power

ByteDance's Self-Developed Chip Exposed: 100,000 Units in Mass Production Imminent, Aiming to Break NVIDIA Dependence

Valuation Surges to $23 Billion! Cerebras Teams Up with OpenAI to Challenge NVIDIA's Computing Dominance

Code Output Surges 300%! NVIDIA 30,000 Engineers Fully Upgrade to Custom AI Tools

Mianbi Intelligence Launches Songguo Board: AI-Native Edge Development Board Opens New Paradigms in Hardware Development

Huang Renxun Refutes the 'Death of Software' Theory: AI is a Useful Screwdriver, Not a Replacement

Challenging NVIDIA! Intel CEO Andrew Jiang Announces Entry into GPU Production, Focusing on the AI Computing Market

The Only Giant AI Startup Representative! Yang Zhilin from Moonshot AI Invited to NVIDIA GTC 2026 Conference

AI News Recommendations

OpenAI to Secure Over $100 Billion in Funding, Post-Valuation Could Exceed $850 Billion

NVIDIA and Meta's Collaboration Tops the Trending List, with the Latter Planning to Deploy Millions of Blackwell GPUs

Getting Rid of NVIDIA Dependency! OpenAI Joins Forces with Cerebras to Launch GPT-5.3-Codex-Spark: The First Fruit of a $10 Billion Computing Power

ByteDance's Self-Developed Chip Exposed: 100,000 Units in Mass Production Imminent, Aiming to Break NVIDIA Dependence

Valuation Surges to $23 Billion! Cerebras Teams Up with OpenAI to Challenge NVIDIA's Computing Dominance

Code Output Surges 300%! NVIDIA 30,000 Engineers Fully Upgrade to Custom AI Tools

Mianbi Intelligence Launches Songguo Board: AI-Native Edge Development Board Opens New Paradigms in Hardware Development

Huang Renxun Refutes the 'Death of Software' Theory: AI is a Useful Screwdriver, Not a Replacement

Challenging NVIDIA! Intel CEO Andrew Jiang Announces Entry into GPU Production, Focusing on the AI Computing Market

The Only Giant AI Startup Representative! Yang Zhilin from Moonshot AI Invited to NVIDIA GTC 2026 Conference

GEO Services