Nemotron 3 Nano (30B A3B)

Nemotron 3 Nano is a hybrid mixture-of-experts (MoE) model with 31.6B total parameters, optimized for fast, long-context agentic reasoning. It combines Mamba-2 and Transformer layers with a sparse MoE router that activates roughly 3.6B parameters per token, delivering up to 4× higher throughput than Nemotron 2 and strong accuracy on math, coding, and tool-use tasks. The model supports a 1M-token context window and offers a Reasoning ON/OFF switch plus a thinking budget to control inference cost. It ships with open weights, data, and RL tooling (NeMo Gym/RL). Released on Dec 15, 2025 under the NVIDIA Open Model License, it is built as an efficient backbone for multi-agent systems at scale.
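
To make the "sparse MoE router" idea concrete, here is a minimal sketch of generic top-k expert routing in plain Python. It is not Nemotron's actual router: the expert count, the value of k, the logits, and the function names `route_top_k` and `active_fraction` are all hypothetical and chosen only for illustration. The last lines use the two figures quoted above (3.6B active of 31.6B total) to show that roughly 11% of the parameters are used per token.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def route_top_k(router_logits, k):
    """Pick the k highest-scoring experts and renormalize their weights.

    Generic top-k gating; an assumption, not the model's documented scheme.
    """
    probs = softmax(router_logits)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]

def active_fraction(active_params_b, total_params_b):
    """Share of parameters touched per token."""
    return active_params_b / total_params_b

# Hypothetical router scores for 8 experts (illustrative values only).
logits = [0.1, 2.0, -1.0, 0.5, 1.5, -0.3, 0.0, 0.9]
chosen = route_top_k(logits, k=2)
frac = active_fraction(3.6, 31.6)  # figures taken from the summary above

print(chosen)          # (expert index, weight) pairs for the selected experts
print(f"{frac:.1%}")   # share of parameters active per token
```

Only the selected experts run for a given token, which is why the per-token compute tracks the active parameter count rather than the total.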

Benchmark results

Benchmark              Score   Tags            Source
AIME 2025              99.2%   self-reported   llm-stats
Arena-Hard v2          67.7%   self-reported   llm-stats
GPQA                   75.0%   self-reported   llm-stats
Humanity's Last Exam   15.5%   self-reported   llm-stats
LiveCodeBench v6       68.3%   self-reported   llm-stats
MMLU-Pro               78.3%   self-reported   llm-stats
MMLU-ProX              59.5%   self-reported   llm-stats
Multi-Challenge        38.5%   self-reported   llm-stats
SciCode                33.3%   self-reported   llm-stats
SWE-Bench Verified     38.8%   self-reported   llm-stats
Tau2 Airline           48.0%   self-reported   llm-stats
Tau2 Retail            56.9%   self-reported   llm-stats
Tau2 Telecom           42.2%   self-reported   llm-stats
Terminal-Bench         8.5%    self-reported   llm-stats
WMT24++                86.2%   self-reported   llm-stats