NVIDIA Releases New Small Model Nemotron-Nano-9B-V2: Free for Commercial Use and Excellent Performance

AIbase基地

Published inAI News · 5 min read · Aug 19, 2025

Small models are causing a storm, and NVIDIA is not lagging behind. After MIT and Google released small AI models that can run on smartwatches and smartphones, NVIDIA has launched its latest small language model (SLM)—Nemotron-Nano-9B-V2. The model performs well in multiple benchmark tests and achieved the highest level among similar products in specific tests.

Designed for Efficiency and Reasoning

The Nemotron-Nano-9B-V2 has 9 billion parameters. Although it is larger than some micro models with millions of parameters, it is significantly smaller than its previous 12 billion parameter version and is specifically optimized for a single NVIDIA A10 GPU. Oleksii Kuchiaev, Director of AI Model Post-Training at NVIDIA, explained that this adjustment was made to adapt to the popular A10 deployment GPU. In addition, the Nemotron-Nano-9B-V2 is a hybrid model, capable of processing larger batches and running six times faster than Transformer models of the same scale.

The model supports up to nine languages, including Chinese, English, German, French, Japanese, and Korean, and excels at handling instruction tracking and code generation tasks. Its pre-training dataset and the model itself are available on Hugging Face and NVIDIA's model catalog.

Combining Transformer and Mamba Architectures

The Nemotron-Nano-9B-V2 is based on the Nemotron-H series, which combines Mamba and Transformer architectures. Traditional Transformer models, although powerful, consume a lot of memory and computational resources when processing long sequences. The Mamba architecture introduces selective state space models (SSMs), which can process long information sequences with linear complexity, offering advantages in terms of memory and computational cost. The Nemotron-H series achieves a 2-3 times throughput improvement in long context processing by replacing most attention layers with linear state space layers, while maintaining high accuracy.

Unique Reasoning Control Features

A major innovation of this model is its built-in "reasoning" feature, which allows users to perform self-checks before the model provides the final answer. Users can enable or disable this feature using simple control tokens, such as /think or /no_think. The model also supports runtime "thinking budget" management, allowing developers to limit the number of tokens used for internal reasoning, thus achieving a balance between accuracy and latency. This is particularly crucial for application scenarios like customer support or autonomous agents that require fast response times.

Strict Open Licensing, Targeted at Enterprise Applications

NVIDIA released the Nemotron-Nano-9B-V2 under its Open Model License Agreement, which is enterprise-friendly and highly permissive. NVIDIA clearly states that enterprises can freely use the model for commercial purposes and do not have to pay any fees or royalties for using the model.

AI Star! A Photo Becomes a Movie Masterpiece. Gaga AI Revolutionizes Film Creation

Gaga AI launches the world's first film-level dialogue generation model, capable of generating 60-second videos from just a static photo and text prompts, supporting emotional performance, dual interaction, and multilingual support. This technology breaks traditional lip synchronization, enabling AI to transition from a tool to a creator, potentially revolutionizing the film industry's production barriers.

vivo Blue Heart Large Model Upgraded! Smart Assistant XIAO V Becomes a Thinking Expert, Free Interaction Without Wake-up Words!

At the 2025 vivo Developer Conference, the Blue Heart Language Large Model was upgraded, and the smart assistant XIAO V achieved significant progress. Core improvements include restructuring the intent control center, enhancing the accuracy of user intent understanding, enabling the decomposition of complex tasks and optimization of execution steps. A new deep thinking ability has been added, allowing XIAO V to provide more insightful and high-quality intelligent Q&A services.

vivo Blue Heart 3B On-Device Large Model Launches with Five Core Capabilities: Performance Exceeds All 8B Models

vivo launched the Blue Heart 3B On-Device Multimodal Reasoning Large Model at the 2025 Developer Conference. This 3 billion parameter model is the first One Model in the industry to integrate five core capabilities. After one year of training and optimization, it achieves a major breakthrough in deploying complex multimodal AI capabilities on mobile devices, establishing a leading position in the industry.

Seres Joins Forces with ByteDance's Varkid Engine to Promote the Industrial Upgrade of Embodied Intelligence

A subsidiary of Seres signed a framework agreement for business cooperation in embodied intelligence with Varkid Engine, a subsidiary of ByteDance. The two parties will conduct in-depth cooperation in the fields of intelligent robot decision-making, control, and human-machine enhancement based on multimodal cloud-edge collaboration. This marks the strategic layout of automotive manufacturers and internet AI giants in cutting-edge technology fields.

Time Magazine Unveils the 2025 Best Inventions of the Year: Chinese Innovations Such as Unitree, DeepSeek, Huawei, and BYD Make the List

Time Magazine has released the 2025 Best Inventions of the Year list, featuring 300 innovative products. Unitree Technology's humanoid bipedal robot R1 was selected for its breakthrough in traditional robot design, joining products from tech giants like Huawei, BYD, and Apple to showcase cutting-edge innovation.

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Submit Your Model

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Services​

AI Search Visibility Checker

AI Model Compatibility Checker

AI Dataset Collection

Intelligent Document Recognition

NVIDIA Releases New Small Model Nemotron-Nano-9B-V2: Free for Commercial Use and Excellent Performance

AIbase基地

Designed for Efficiency and Reasoning

Combining Transformer and Mamba Architectures

Unique Reasoning Control Features

Strict Open Licensing, Targeted at Enterprise Applications

This article is from AIbase Daily

AI News Recommendations

AI Star! A Photo Becomes a Movie Masterpiece. Gaga AI Revolutionizes Film Creation

Meitu RoboNeo Launches with Over a Million MAU in the First Month, Wu Xinhong Advocates AI Native

Microsoft Releases UserLM-8b: A Training Partner Model for Realistic Multi-turn Dialogues to Refine AI Assistants

vivo Blue Heart Large Model Upgraded! Smart Assistant XIAO V Becomes a Thinking Expert, Free Interaction Without Wake-up Words!

vivo Blue Heart 3B On-Device Large Model Launches with Five Core Capabilities: Performance Exceeds All 8B Models

Sora 2 Dominates App Store, CITIC Securities Continues to Support AI Industrial Chain

Seres Joins Forces with ByteDance's Varkid Engine to Promote the Industrial Upgrade of Embodied Intelligence

Time Magazine Unveils the 2025 Best Inventions of the Year: Chinese Innovations Such as Unitree, DeepSeek, Huawei, and BYD Make the List

Tsinghua Genius Yao Shunyu Resigns and Joins DeepMind to Forge a New Era!

Anthropic Opens Zero Slop Zone in New York to Resist Low-Quality AI Content

GEO Services