NVIDIA Releases Open-Source Dual-Tower AI Model, Text Generation Speed Increased by 2.42 Times, Image Quality Retained at 98.7%

AIbase基地

Published inAI News · 3 min read · Jul 3, 2026

NVIDIA launched the Nemotron-Labs-TwoTower discrete diffusion language model on July 2nd, aiming to address the issue of slow token-by-token generation speed in large models. The related weights have been open-sourced on Huggingface. The model is based on the existing Nemotron backbone network, reusing pre-trained weights without requiring a complete training from scratch, significantly reducing development costs.

60B Two-Tower Architecture, Parallel Processing to Improve Generation Efficiency

The model has a total parameter count of 60B, split into two independent 30B neural networks working collaboratively. Each tower activates 3B parameters and is equipped with 128 routable expert modules. The context tower is fixed and frozen, responsible for retaining the overall semantic information; the denoising tower is specifically trained, generating text in parallel using the diffusion mechanism, and the two towers exchange data through cross-attention.

Traditional models output tokens sequentially one by one, while the two-tower architecture can write text in parallel, greatly increasing the inference throughput, while maintaining speed and output quality. Benchmark test results show that the model's comprehensive capabilities retain 98.7% of the original level, and the text generation throughput is directly increased by 2.42 times, with only slight declines in code and math tasks.

Open Source Deployment, Suitable for Multi-Scenario Inference

The model is released under NVIDIA's exclusive open-source license, allowing developers to freely download and test, as well as commercial deployment. It requires pairing two H100 or A100 80GB GPUs, with a single card only supporting pure autoregressive mode. Full two-tower inference requires dual-card collaboration. Testing covers multiple tasks such as common sense, mathematics, code, and reading comprehension, with most indicators remaining comparable to the original version, balancing generation speed and content quality.

Nemotron-Labs-TwoTower Discrete Diffusion Language Model 60B Dual-Tower Architecture NVIDIA

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

AI Daily: Alibaba Internally Reverses Disable of Claude; Microsoft's Pure Web-Based Aion System Exposed; Claude Premium Model Launches On-Demand Payment Model

Welcome to the [AI Daily] section! Here is your guide to exploring the world of artificial intelligence every day. Every day, we present you with the latest content in the AI field, focusing on developers to help you grasp technical trends and understand innovative AI product applications. Click to learn more about new AI products: https://app.aibase.com/zh1. 'Alibaba Internally Reverses Disable': Alibaba has completely delisted the Claude series AI tools. Alibaba has completely prohibited employees from using the Claude series AI tools, triggering discussions about AI technology security and data privacy.

Jul 3, 2026

640

AI Research Enters the Autonomous Driving Era: Yang Zhilin Discusses the Third Stage of Large Model Training

AI research paradigm is undergoing profound transformation. At the 2026 Zhongguancun Forum, Yang Zhilin, founder of Moonshot AI, noted that AI R&D has entered the third stage of 'AI-led research.' Starting 2026, past reliance on human-crafted rules and fine-tuning will be overturned, as AI increasingly leads its own development.....

Jul 3, 2026

170

AI Video Market Landscape Reimagined: Google Gemini Omni Flash Tops Blind Test Rankings

Google DeepMind's text-to-video model Gemini Omni Flash climbed to the top of the authoritative blind test ranking Video Arena with 1404 Elo points, demonstrating Google's multimodal technology strength and confirming that the video generation field is rapidly evolving.

Jul 3, 2026

110

Advertising Governance Embarks on a Visual Evolution: ByteDance Engine Launches Mamoda 2.5 Version to Achieve Comprehensive Video Coverage

ByteDance Engine launched its self-developed advertising governance large model, Mamoda 2.5, achieving an upgrade in content safety risk control technology. Starting from version 1.0, which could only identify basic prohibited text, the model has continuously evolved, expanding its capabilities. It now provides stronger support for efficiently and accurately identifying and managing prohibited content in the digital advertising ecosystem.

Jul 3, 2026

110

End of Subscription Benefits? Anthropic Tightens Its Strongest AI Model, Claude Fable5, Will Be Charged by Usage

Anthropic will remove access to its top model Claude Fable5 from subscriptions on July 7, 2026, excluding core features from membership. Costs for developers and heavy users will likely surge. The move follows the US lifting export curbs on Fable series, with global redeployment underway.....

Jul 3, 2026

150

The Digital Business Card of the Earth: China Releases the World's First Stratigraphic AI Large Model

Chinese scientists unveiled the first stratigraphic AI model and intelligent global stratigraphic correlation system at the 5th International Congress on Stratigraphy. It aims to integrate global geological data, replace tedious manual correlation with AI, efficiently interpret Earth's 4.6-billion-year evolutionary history, and serve as an 'intelligent steward' of the Earth.....

Jul 3, 2026

170

Claude Prime Model Fable 5 Launches Pay-As-You-Go Pricing Model, Subscription Users Have Limited Benefits

Starting July 7, Anthropic will remove its top model Claude Fable5 from subscription tiers and switch to usage-based credits. Pro and Max users, who previously could use up to 50% of their weekly allowance on this model, will lose access. The move has sparked user backlash.....

Jul 3, 2026

180

Freelancer Crisis? Latest AI Model Copes with 16% Remote Projects, the Design Industry is Changing

AI Safety Center's Remote Labor Index reveals a breakthrough in remote work automation. Claude Fable5 achieved a record 16.1% automation rate on freelance tasks like 3D modeling, architectural design, graphic design, and video animation, based on industry standards measuring AI output quality matching or surpassing human professionals.....

Jul 3, 2026

160

Tencent Games Launches 2026 Summer Unprotected Minors Special Action, Upgrading AI Dual-Engine Addiction Prevention Model

Tencent launched the '2026 Summer Unprotected Minors Special Action', introducing an 'AI Dual-Engine Addiction Prevention' mode on top of national gaming restrictions. This approach uses AI technology to strengthen health protection from both inside and outside games. Although the industry has already established time limits and spending controls, new measures aim to address regulatory gaps by leveraging AI, reinforcing a firewall.

Jul 3, 2026

200

Say Goodbye to Code Refactoring Anxiety: Alibaba Open Sources Page Agent to Help Large Models Understand Web Page Logic

Alibaba Open Sources Page Agent, Changing the Approach to Browser Automation. It allows large models to directly parse web page structures, rather than relying on external screenshots or protocol-driven methods, thereby dynamically adapting to changes and solving the 'reinventing the wheel' dilemma.

Jul 3, 2026

220

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Ranking Monitor

AI Conversation Insight

GEO Promotion Link Detection

Website AI Friendliness Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

LLM API Proxy Checker

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

NVIDIA Releases Open-Source Dual-Tower AI Model, Text Generation Speed Increased by 2.42 Times, Image Quality Retained at 98.7%

AIbase基地

60B Two-Tower Architecture, Parallel Processing to Improve Generation Efficiency

Open Source Deployment, Suitable for Multi-Scenario Inference

This article is from AIbase Daily

AI News Recommendations

AI Daily: Alibaba Internally Reverses Disable of Claude; Microsoft's Pure Web-Based Aion System Exposed; Claude Premium Model Launches On-Demand Payment Model

AI Research Enters the Autonomous Driving Era: Yang Zhilin Discusses the Third Stage of Large Model Training

AI Video Market Landscape Reimagined: Google Gemini Omni Flash Tops Blind Test Rankings

Advertising Governance Embarks on a Visual Evolution: ByteDance Engine Launches Mamoda 2.5 Version to Achieve Comprehensive Video Coverage

End of Subscription Benefits? Anthropic Tightens Its Strongest AI Model, Claude Fable5, Will Be Charged by Usage

The Digital Business Card of the Earth: China Releases the World's First Stratigraphic AI Large Model

Claude Prime Model Fable 5 Launches Pay-As-You-Go Pricing Model, Subscription Users Have Limited Benefits

Freelancer Crisis? Latest AI Model Copes with 16% Remote Projects, the Design Industry is Changing

Tencent Games Launches 2026 Summer Unprotected Minors Special Action, Upgrading AI Dual-Engine Addiction Prevention Model

Say Goodbye to Code Refactoring Anxiety: Alibaba Open Sources Page Agent to Help Large Models Understand Web Page Logic

AI News Recommendations

AI Daily: Alibaba Internally Reverses Disable of Claude; Microsoft's Pure Web-Based Aion System Exposed; Claude Premium Model Launches On-Demand Payment Model

AI Research Enters the Autonomous Driving Era: Yang Zhilin Discusses the Third Stage of Large Model Training

AI Video Market Landscape Reimagined: Google Gemini Omni Flash Tops Blind Test Rankings

Advertising Governance Embarks on a Visual Evolution: ByteDance Engine Launches Mamoda 2.5 Version to Achieve Comprehensive Video Coverage

End of Subscription Benefits? Anthropic Tightens Its Strongest AI Model, Claude Fable5, Will Be Charged by Usage

The Digital Business Card of the Earth: China Releases the World's First Stratigraphic AI Large Model

Claude Prime Model Fable 5 Launches Pay-As-You-Go Pricing Model, Subscription Users Have Limited Benefits

Freelancer Crisis? Latest AI Model Copes with 16% Remote Projects, the Design Industry is Changing

Tencent Games Launches 2026 Summer Unprotected Minors Special Action, Upgrading AI Dual-Engine Addiction Prevention Model

Say Goodbye to Code Refactoring Anxiety: Alibaba Open Sources Page Agent to Help Large Models Understand Web Page Logic