Nvidia Launches New Small Open Model Nemotron-Nano-9B-v2 with Smart Inference Switch

AIbase基地

Published inAI News · 4 min read · Aug 19, 2025

NVIDIA recently launched a new small language model, Nemotron-Nano-9B-v2, which shows excellent performance on multiple benchmarks and allows users to flexibly control the switch of its reasoning function. The parameter count of Nemotron-Nano-9B-v2 is 9 billion, significantly reduced from its predecessor's 12 billion parameters, aiming to meet deployment needs on a single NVIDIA A10 GPU.

Oleksii Kuchiaev, NVIDIA's Director of AI Model Post-Training, stated that this model is specifically optimized for the A10 GPU, achieving up to 6 times faster processing speed, suitable for various application scenarios. Nemotron-Nano-9B-v2 supports multiple languages, including English, German, Spanish, French, Italian, Japanese, as well as extended Korean, Portuguese, Russian, and Chinese, and is suitable for instruction following and code generation tasks.

The model is based on the Nemotron-H series, integrating Mamba and Transformer architectures, which can reduce memory and computational requirements when processing long sequences. Unlike traditional Transformer models, the Nemotron-H model uses selective state space models (SSM), ensuring accuracy while efficiently handling longer information sequences.

In terms of reasoning functions, Nemotron-Nano-9B-v2 can default to generating tracking records of the reasoning process. Users can also switch this feature using simple control instructions, such as /think or /no_think. In addition, the model introduces a runtime "thinking budget" management, allowing developers to set the maximum number of tokens used for reasoning, thus achieving a balance between accuracy and response speed.

In benchmark tests, Nemotron-Nano-9B-v2 demonstrated good accuracy. For example, under the "reasoning enabled" mode of the NeMo-Skills suite, the model performed well in multiple tests, showing advantages compared to other small open-source models.

NVIDIA released Nemotron-Nano-9B-v2 under an open model license, allowing commercial use, and developers can freely create and distribute derivative models. Notably, NVIDIA does not claim ownership of the output generated by the model, giving users full control over its usage.

The release of this model aims to provide developers with tools to balance reasoning capabilities and deployment efficiency in small-scale environments, marking NVIDIA's ongoing efforts to improve the efficiency and controllable reasoning capabilities of language models.

huggingface:https://huggingface.co/nvidia/NVIDIA-Nemotron-Nano-9B-v2

Key Points:
🌟 NVIDIA has launched a new small language model, Nemotron-Nano-9B-v2, which allows users to flexibly control the reasoning function.
⚙️ The model is based on an advanced hybrid architecture, enabling efficient processing of long sequence information, suitable for multilingual tasks.
📊 Nemotron-Nano-9B-v2 is released under an open model license, allowing developers to use it for commercial purposes and create and distribute derivative models.

AI Star! A Photo Becomes a Movie Masterpiece. Gaga AI Revolutionizes Film Creation

Gaga AI launches the world's first film-level dialogue generation model, capable of generating 60-second videos from just a static photo and text prompts, supporting emotional performance, dual interaction, and multilingual support. This technology breaks traditional lip synchronization, enabling AI to transition from a tool to a creator, potentially revolutionizing the film industry's production barriers.

vivo Blue Heart Large Model Upgraded! Smart Assistant XIAO V Becomes a Thinking Expert, Free Interaction Without Wake-up Words!

At the 2025 vivo Developer Conference, the Blue Heart Language Large Model was upgraded, and the smart assistant XIAO V achieved significant progress. Core improvements include restructuring the intent control center, enhancing the accuracy of user intent understanding, enabling the decomposition of complex tasks and optimization of execution steps. A new deep thinking ability has been added, allowing XIAO V to provide more insightful and high-quality intelligent Q&A services.

vivo Blue Heart 3B On-Device Large Model Launches with Five Core Capabilities: Performance Exceeds All 8B Models

vivo launched the Blue Heart 3B On-Device Multimodal Reasoning Large Model at the 2025 Developer Conference. This 3 billion parameter model is the first One Model in the industry to integrate five core capabilities. After one year of training and optimization, it achieves a major breakthrough in deploying complex multimodal AI capabilities on mobile devices, establishing a leading position in the industry.

Seres Joins Forces with ByteDance's Varkid Engine to Promote the Industrial Upgrade of Embodied Intelligence

A subsidiary of Seres signed a framework agreement for business cooperation in embodied intelligence with Varkid Engine, a subsidiary of ByteDance. The two parties will conduct in-depth cooperation in the fields of intelligent robot decision-making, control, and human-machine enhancement based on multimodal cloud-edge collaboration. This marks the strategic layout of automotive manufacturers and internet AI giants in cutting-edge technology fields.

Time Magazine Unveils the 2025 Best Inventions of the Year: Chinese Innovations Such as Unitree, DeepSeek, Huawei, and BYD Make the List

Time Magazine has released the 2025 Best Inventions of the Year list, featuring 300 innovative products. Unitree Technology's humanoid bipedal robot R1 was selected for its breakthrough in traditional robot design, joining products from tech giants like Huawei, BYD, and Apple to showcase cutting-edge innovation.

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Submit Your Model

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Services

AI Search Visibility Checker

AI Model Compatibility Checker

AI Dataset Collection

Intelligent Document Recognition

Nvidia Launches New Small Open Model Nemotron-Nano-9B-v2 with Smart Inference Switch

AIbase基地

This article is from AIbase Daily

AI News Recommendations

AI Star! A Photo Becomes a Movie Masterpiece. Gaga AI Revolutionizes Film Creation

Meitu RoboNeo Launches with Over a Million MAU in the First Month, Wu Xinhong Advocates AI Native

Microsoft Releases UserLM-8b: A Training Partner Model for Realistic Multi-turn Dialogues to Refine AI Assistants

vivo Blue Heart Large Model Upgraded! Smart Assistant XIAO V Becomes a Thinking Expert, Free Interaction Without Wake-up Words!

vivo Blue Heart 3B On-Device Large Model Launches with Five Core Capabilities: Performance Exceeds All 8B Models

Sora 2 Dominates App Store, CITIC Securities Continues to Support AI Industrial Chain

Seres Joins Forces with ByteDance's Varkid Engine to Promote the Industrial Upgrade of Embodied Intelligence

Time Magazine Unveils the 2025 Best Inventions of the Year: Chinese Innovations Such as Unitree, DeepSeek, Huawei, and BYD Make the List

Tsinghua Genius Yao Shunyu Resigns and Joins DeepMind to Forge a New Era!

Anthropic Opens Zero Slop Zone in New York to Resist Low-Quality AI Content

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Submit Your Model

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Services​

AI Search Visibility Checker

AI Model Compatibility Checker

AI Dataset Collection

Intelligent Document Recognition

Nvidia Launches New Small Open Model Nemotron-Nano-9B-v2 with Smart Inference Switch

AIbase基地

This article is from AIbase Daily

AI News Recommendations

AI Star! A Photo Becomes a Movie Masterpiece. Gaga AI Revolutionizes Film Creation

Meitu RoboNeo Launches with Over a Million MAU in the First Month, Wu Xinhong Advocates AI Native

Microsoft Releases UserLM-8b: A Training Partner Model for Realistic Multi-turn Dialogues to Refine AI Assistants

vivo Blue Heart Large Model Upgraded! Smart Assistant XIAO V Becomes a Thinking Expert, Free Interaction Without Wake-up Words!

vivo Blue Heart 3B On-Device Large Model Launches with Five Core Capabilities: Performance Exceeds All 8B Models

Sora 2 Dominates App Store, CITIC Securities Continues to Support AI Industrial Chain

Seres Joins Forces with ByteDance's Varkid Engine to Promote the Industrial Upgrade of Embodied Intelligence

Time Magazine Unveils the 2025 Best Inventions of the Year: Chinese Innovations Such as Unitree, DeepSeek, Huawei, and BYD Make the List

Tsinghua Genius Yao Shunyu Resigns and Joins DeepMind to Forge a New Era!

Anthropic Opens Zero Slop Zone in New York to Resist Low-Quality AI Content

GEO Services