Nemotron 3 Nano (30B A3B)
Nemotron 3 Nano is a 31.6B-parameter hybrid mixture-of-experts (MoE) model optimized for fast, long-context agentic reasoning. It combines Mamba-2 and Transformer layers with a sparse MoE router, so only about 3.6B parameters are active per token, and it delivers up to 4× higher throughput than Nemotron 2 along with strong accuracy on math, coding, and tool use. It supports a 1M-token context window, offers a Reasoning ON/OFF toggle and a thinking budget to control inference cost, and ships with open weights, data, and RL tooling (NeMo Gym/RL). Released on Dec 15, 2025 under the NVIDIA Open Model License, it is built as an efficient backbone for multi-agent systems at scale.
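To make the "sparse MoE" figures concrete, here is a minimal sketch of what the description implies: the share of weights touched per token, and a generic top-k gating step. The parameter counts come from the description above. The function names, the expert count of 8, and `k=2` are hypothetical choices for illustration and are not taken from the Nemotron architecture; the routing shown is textbook top-k softmax gating, not necessarily the router this model uses.

```python
import math
import random

TOTAL_PARAMS_B = 31.6   # total parameters in billions, from the description
ACTIVE_PARAMS_B = 3.6   # approximate active parameters per token, from the description


def active_fraction(total_b: float, active_b: float) -> float:
    """Share of the model's weights that take part in one token's forward pass."""
    return active_b / total_b


def top_k_route(router_logits: list[float], k: int) -> dict[int, float]:
    """Select the k highest-scoring experts and softmax-normalize their weights.

    Generic top-k gating for illustration only (assumption: not Nemotron's real router).
    Returns a mapping of expert index -> mixing weight; the weights sum to 1.
    """
    ranked = sorted(range(len(router_logits)),
                    key=lambda i: router_logits[i], reverse=True)
    chosen = ranked[:k]
    peak = max(router_logits[i] for i in chosen)  # subtract max for numerical stability
    exps = {i: math.exp(router_logits[i] - peak) for i in chosen}
    norm = sum(exps.values())
    return {i: e / norm for i, e in exps.items()}


if __name__ == "__main__":
    # 3.6 / 31.6 is roughly 11.4% of the weights per token.
    print(f"active share: {active_fraction(TOTAL_PARAMS_B, ACTIVE_PARAMS_B):.1%}")

    random.seed(0)
    logits = [random.gauss(0.0, 1.0) for _ in range(8)]  # 8 hypothetical experts
    print(top_k_route(logits, k=2))
```

Only the selected experts run for a given token, which is why per-token compute tracks the active parameter count rather than the total.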
Benchmark results
| Benchmark | Score | Tags |
|---|---|---|
| AIME 2025 | 99.2% | self-reported, llm-stats |
| Arena-Hard v2 | 67.7% | self-reported, llm-stats |
| GPQA | 75.0% | self-reported, llm-stats |
| Humanity's Last Exam | 15.5% | self-reported, llm-stats |
| LiveCodeBench v6 | 68.3% | self-reported, llm-stats |
| MMLU-Pro | 78.3% | self-reported, llm-stats |
| MMLU-ProX | 59.5% | self-reported, llm-stats |
| Multi-Challenge | 38.5% | self-reported, llm-stats |
| SciCode | 33.3% | self-reported, llm-stats |
| SWE-Bench Verified | 38.8% | self-reported, llm-stats |
| Tau2 Airline | 48.0% | self-reported, llm-stats |
| Tau2 Retail | 56.9% | self-reported, llm-stats |
| Tau2 Telecom | 42.2% | self-reported, llm-stats |
| Terminal-Bench | 8.5% | self-reported, llm-stats |
| WMT24++ | 86.2% | self-reported, llm-stats |
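The three Tau2 rows are subsets of one benchmark family, so a reader may want a single summary figure for them. The sketch below copies the scores from the table and computes an unweighted mean over rows sharing a name prefix. The `mean_score` helper and the equal weighting are assumptions made for illustration; the source does not report an aggregate, and an official Tau2 average may weight the domains differently.

```python
# Scores (percent) copied from the table above.
SCORES = {
    "AIME 2025": 99.2,
    "Arena-Hard v2": 67.7,
    "GPQA": 75.0,
    "Humanity's Last Exam": 15.5,
    "LiveCodeBench v6": 68.3,
    "MMLU-Pro": 78.3,
    "MMLU-ProX": 59.5,
    "Multi-Challenge": 38.5,
    "SciCode": 33.3,
    "SWE-Bench Verified": 38.8,
    "Tau2 Airline": 48.0,
    "Tau2 Retail": 56.9,
    "Tau2 Telecom": 42.2,
    "Terminal-Bench": 8.5,
    "WMT24++": 86.2,
}


def mean_score(scores: dict[str, float], prefix: str) -> float:
    """Unweighted mean of every benchmark whose name starts with `prefix`."""
    matched = [value for name, value in scores.items() if name.startswith(prefix)]
    if not matched:
        raise ValueError(f"no benchmark name starts with {prefix!r}")
    return sum(matched) / len(matched)


if __name__ == "__main__":
    # (48.0 + 56.9 + 42.2) / 3 is about 49.0
    print(f"Tau2 mean: {mean_score(SCORES, 'Tau2'):.1f}%")
```

Averaging across unrelated benchmarks (for example AIME and Terminal-Bench) would not be meaningful, since their scales and difficulty differ; the prefix filter keeps the mean within one family.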