Performance Improved by Over Two Times: NVIDIA Releases Nemotron-Labs-TwoTower Diffusion Language Model

AIbase基地

Published inAI News · 3 min read · Jul 1, 2026

On the path to improving the efficiency of large model generation, NVIDIA has recently introduced a new solution. On July 1st, NVIDIA officially open-sourced its latest Nemotron-Labs-TwoTower diffusion language model, aiming to break through the throughput bottleneck of traditional autoregressive (AR) models through architectural innovation.

Traditional autoregressive models process text generation by decoding one token sequentially, which proves inefficient when handling large-scale synthesis tasks. NVIDIA's "two-tower" architecture takes an alternative approach, breaking the task into two parts: one is the "context tower" that remains frozen and handles prompts while preserving existing language understanding capabilities; the other is the "denoiser tower," specifically trained to generate in parallel and optimize tokens.

The ingenuity of this architectural design lies in balancing "quality" and "speed." In a testing environment with 2×H100 GPUs, the model successfully retained 98.7% of the baseline model's generation quality under default settings, while its actual generation throughput increased significantly by 2.42 times. This means that for data teams needing to mass-produce synthetic text, this model is undoubtedly a powerful tool combining high performance and efficiency.

In terms of operation, the model offers high flexibility, supporting three decoding modes: diffusion mode, simulated AR, and standard AR. Developers can choose freely according to their task requirements. Currently, the model is released as an open-weight project, following the NVIDIA Nemotron Open Model License Agreement, and fully supports commercial use.

Although the model shows a slight performance drop in code generation and mathematical reasoning tasks compared to the original baseline, and requires certain GPU memory, it provides a highly promising technical direction for accelerating large model inference. As artificial intelligence applications penetrate more frequent and large-scale scenarios, this approach of trading generation speed for algorithmic architectural optimization is becoming a new trend in model development.

LargeModel NVIDIA Two-TowerArchitecture DiffusionLanguageModel

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Get Started with a Production-Grade Speech AI Agent in Two Minutes: xAI Launches Voice Agent Builder Beta Version

xAI launches the Voice Agent Builder beta version, allowing enterprises to build high-level speech AI agents in just two minutes through a no-code platform and its self-developed Grok Voice model. Its core is a highly integrated end-to-end architecture that addresses the pain points of traditional solutions with fragmented processes such as speech-to-text conversion, significantly lowering development and operational barriers.

Jul 2, 2026

160

AI Cloud Platform Together AI Completes $800 Million Series C Funding, Valuation Reaches $8.3 Billion, Annual Bookings Exceed $1.1 Billion

Together AI raised $800M Series C at $8.3B valuation. Led by Saudi Aramco Ventures, with Nvidia, Vista Equity Partners. Founded 2022, provides AI infrastructure leasing like Nvidia GPU clusters.....

Jul 2, 2026

150

Intelligence Alternative to GPT-5? Qwen 3.6 27B Evaluation Shows Local Model Has Reached the Cutting-Edge Level

Qwen3.6 series overturns belief that local large models compromise. Tested on MacBook Max M5 128GB, Qwen3.6 27B with 8-bit GGUF quantization delivers incredible efficiency. It proves to be not only usable but a powerful general-intelligence tool without sacrifice, marking a new phase in local LLM deployment.....

Jul 1, 2026

200

Early Signs of Commercialization: Huang Zhenxin from Moonshot Explains Kimi's Differentiation Strategy

Large model industry enters deep water of deployment & cost battle. Moonshot AI's Kimi has clear commercialization. B-side head Huang Zhenxin says: insist on underlying architecture innovation, not mere engineering stacking. Kimi is high-performance model, will maintain this path despite high costs from global compute crunch.....

Jun 30, 2026

300

Huawei openPangu 2.0 Launches Two Versions: Accelerating the Dual Breakthrough of Computing Power and Ecosystem in the Agent Era

Huawei open-sources 92B-parameter Pangu model (openPangu-2.0-Flash) with weights, inference code, and training operators. It offers native training and inference reference for Ascend computing, accelerates AI business innovation for long-text and low-latency trends, builds an intelligent foundation for the Agent era, and enriches the Ascend developer ecosystem.....

Jun 30, 2026

360

Microsoft Invests Heavily in AI Computing Power: Azure Fully Integrates Anthropic Claude Models with NVIDIA GB300 Architecture

On June 29, Nvidia announced Microsoft Azure's full launch of Anthropic's Claude models, powered by the latest GB300 Blackwell Ultra superchip to deliver top-tier inference. This underscores the importance of computing architecture upgrades for AI deployment and marks a milestone in Microsoft's AI ecosystem.....

Jun 30, 2026

270

AI Agent Evolution Accelerates: Anthropic Claude Joins Forces with NVIDIA GB300 to Launch on Azure

Anthropic's Claude is now generally available on Microsoft Azure for enterprises, running on NVIDIA's latest Blackwell Ultra GB300 GPU platform with the GB300NVL72 system, ushering in a new era of high-performance computing for enterprise AI agents.....

Jun 30, 2026

190

OceanBase Launches Lake-Storage Integrated AI Database: Enabling Agents to Truly Understand Enterprises

AI breakthroughs contrast with unmet enterprise value, shifting focus from models to data. OceanBase launched a lake-house AI database, integrating massive storage, transactional analytics, and multimodal processing to build a strongly consistent data foundation, efficiently supporting AI Agents.....

Jun 29, 2026

290

Three-Year Delayed Long Article: Former OpenAI Security VP Wang Li Analyzes Scaling Laws: Your Model May Have Been Trained on the Wrong Data

Lilian Weng returns with a deep dive into scaling laws, arguing the industry consensus may be reversed: from Kaplan to Chinchilla, the mainstream data allocation might not be optimal. It examines compute, model size, and data quantity trade-offs, implying the billions-invested path requires reconsideration, prompting a re-evaluation of pretraining recipes.....

Jun 26, 2026

330

The Eve of AGI: DeepSeek Expands All Departments by Double, Intensifying the Competition for Top Talent in Large Models

DeepSeek announced plans to double team sizes across seven roles including full-stack development and AI core system R&D, highlighting its aggressive push towards AGI and strong capabilities. The high-profile AI innovator continues its global momentum.....

Jun 26, 2026

330

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Ranking Monitor

AI Conversation Insight

GEO Promotion Link Detection

Website AI Friendliness Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

LLM API Proxy Checker

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

Performance Improved by Over Two Times: NVIDIA Releases Nemotron-Labs-TwoTower Diffusion Language Model

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Get Started with a Production-Grade Speech AI Agent in Two Minutes: xAI Launches Voice Agent Builder Beta Version

AI Cloud Platform Together AI Completes $800 Million Series C Funding, Valuation Reaches $8.3 Billion, Annual Bookings Exceed $1.1 Billion

Intelligence Alternative to GPT-5? Qwen 3.6 27B Evaluation Shows Local Model Has Reached the Cutting-Edge Level

Early Signs of Commercialization: Huang Zhenxin from Moonshot Explains Kimi's Differentiation Strategy

Huawei openPangu 2.0 Launches Two Versions: Accelerating the Dual Breakthrough of Computing Power and Ecosystem in the Agent Era

Microsoft Invests Heavily in AI Computing Power: Azure Fully Integrates Anthropic Claude Models with NVIDIA GB300 Architecture

AI Agent Evolution Accelerates: Anthropic Claude Joins Forces with NVIDIA GB300 to Launch on Azure

OceanBase Launches Lake-Storage Integrated AI Database: Enabling Agents to Truly Understand Enterprises

Three-Year Delayed Long Article: Former OpenAI Security VP Wang Li Analyzes Scaling Laws: Your Model May Have Been Trained on the Wrong Data

The Eve of AGI: DeepSeek Expands All Departments by Double, Intensifying the Competition for Top Talent in Large Models

AI News Recommendations

Get Started with a Production-Grade Speech AI Agent in Two Minutes: xAI Launches Voice Agent Builder Beta Version

AI Cloud Platform Together AI Completes $800 Million Series C Funding, Valuation Reaches $8.3 Billion, Annual Bookings Exceed $1.1 Billion

Intelligence Alternative to GPT-5? Qwen 3.6 27B Evaluation Shows Local Model Has Reached the Cutting-Edge Level

Early Signs of Commercialization: Huang Zhenxin from Moonshot Explains Kimi's Differentiation Strategy

Huawei openPangu 2.0 Launches Two Versions: Accelerating the Dual Breakthrough of Computing Power and Ecosystem in the Agent Era

Microsoft Invests Heavily in AI Computing Power: Azure Fully Integrates Anthropic Claude Models with NVIDIA GB300 Architecture

AI Agent Evolution Accelerates: Anthropic Claude Joins Forces with NVIDIA GB300 to Launch on Azure

OceanBase Launches Lake-Storage Integrated AI Database: Enabling Agents to Truly Understand Enterprises

Three-Year Delayed Long Article: Former OpenAI Security VP Wang Li Analyzes Scaling Laws: Your Model May Have Been Trained on the Wrong Data

The Eve of AGI: DeepSeek Expands All Departments by Double, Intensifying the Competition for Top Talent in Large Models