Nvidia has officially launched a new small language model, Nemotron-Nano-9B-v2, the latest sign of renewed interest in small models.

The model has 9 billion parameters. Although that is larger than small models in the sub-billion-parameter range, it is a substantial reduction from the 12-billion-parameter model it was derived from, with the goal of running efficiently on a single Nvidia A10 GPU. Oleksii Kuchaiev, Director of AI Model Post-training at Nvidia, said on social media that the parameter count was cut specifically to fit this deployment target. The model uses a hybrid Mamba-Transformer architecture, which Nvidia says can deliver up to six times the throughput of similarly sized pure-transformer models at larger batch sizes.

Nemotron-Nano-9B-v2 supports multiple languages, including English, German, Spanish, French, Italian, and Japanese, and is suited to tasks such as instruction following and code generation. The model also has a notable feature: users can toggle its "reasoning" step, in which it works through a problem before giving a final answer, using simple control tokens. Reasoning traces are generated by default, but the behavior can be switched with the /think or /no_think commands. In addition, the model introduces a "thinking budget" mechanism that lets developers cap the number of tokens spent on reasoning, balancing accuracy against response speed.
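To make the toggle concrete, here is a minimal sketch of how it might be driven from Python with Hugging Face Transformers. The checkpoint name, the use of the system turn to carry /think or /no_think, and the flat cap on generated tokens standing in for the thinking budget are assumptions inferred from the description above, not an official recipe.

```python
# Minimal sketch (assumed checkpoint name and prompt convention, not Nvidia's official example).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "nvidia/NVIDIA-Nemotron-Nano-9B-v2"  # assumed Hugging Face checkpoint name
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",
    torch_dtype="auto",
    trust_remote_code=True,  # may be required for the hybrid Mamba-Transformer architecture
)

messages = [
    # Assumption: the reasoning toggle is passed in the system turn;
    # switch to "/no_think" to skip the reasoning trace.
    {"role": "system", "content": "/think"},
    {"role": "user", "content": "What is 17 * 24?"},
]

inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

# A crude stand-in for the "thinking budget": cap total new tokens.
# The real mechanism is described as limiting the reasoning-trace tokens specifically,
# so treat this cap as an illustration rather than the actual budget control.
outputs = model.generate(inputs, max_new_tokens=1024)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```

Rerunning the same request with "/no_think" in the system turn should return a direct answer without the intermediate reasoning trace, which is useful when latency matters more than step-by-step accuracy.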

According to published results, Nemotron-Nano-9B-v2 performs well across multiple benchmarks. With reasoning enabled, the model posts strong scores on AIME25, MATH500, GPQA, and LiveCodeBench, and it also does well on instruction-following and long-context benchmarks, with higher accuracy than comparable open-source small models.

Nvidia is releasing the model under an open model license that allows developers to use and distribute it commercially free of charge, and that explicitly disclaims any ownership claim over generated output. This means companies can put the model into production immediately, without additional negotiations, usage restrictions, or fees.

Nvidia's Nemotron-Nano-9B-v2 gives developers a new tool for combining reasoning capability with efficient deployment at small scale. Its thinking-budget control and reasoning toggle offer flexibility to system builders weighing accuracy against response speed, and push the development of small language models further along.

Key Points:

🌟 Nemotron-Nano-9B-v2 is a new small language model from Nvidia with 9 billion parameters, designed specifically for efficient deployment.

🧠 The model supports multiple languages and includes a reasoning toggle, letting users adjust its responses to their needs.

📈 An open license allows developers to freely use and distribute the model commercially, without worrying about additional fees or licensing negotiations.