Llama 3.2 3B Instruct
Llama 3.2 3B Instruct is a large language model that supports a context length of 128K tokens. It is state-of-the-art in its class for on-device use cases such as summarization, instruction following, and rewriting tasks running locally at the edge.
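To make the 128K-token context length concrete, here is a minimal sketch of a budget check for a long summarization prompt. It rests on two assumptions that are not stated on this page: that "128K" means 128 × 1024 tokens, and that English text averages about 4 characters per token. The helper names `estimate_tokens` and `fits_in_context` are hypothetical, and a real tokenizer will give different counts.

```python
# Assumption: "128K" is read as 128 * 1024 tokens; check the model's
# configuration for the exact limit.
CONTEXT_WINDOW_TOKENS = 128 * 1024

# Assumption: rough average of 4 characters per token for English text.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Crude token estimate from character count (ceiling division)."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(prompt: str, reserved_for_output: int = 1024) -> bool:
    """Return True if the estimated prompt size leaves room for the reply."""
    return estimate_tokens(prompt) + reserved_for_output <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    document = "word " * 50_000  # hypothetical 250,000-character input
    print(estimate_tokens(document), fits_in_context(document))
```

For an on-device deployment, the character heuristic would normally be replaced by the model's own tokenizer. The budget comparison itself stays the same.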
Benchmark results
| Benchmark | Score | Tags |
|---|---|---|
| ARC-C | 78.6% | self-reported, llm-stats |
| BFCL v2 | 67.0% | self-reported, llm-stats |
| GPQA | 32.8% | self-reported, llm-stats |
| GSM8k | 77.7% | self-reported, llm-stats |
| HellaSwag | 69.8% | self-reported, llm-stats |
| IFEval | 77.4% | self-reported, llm-stats |
| InfiniteBench/En.MC | 63.3% | self-reported, llm-stats |
| InfiniteBench/En.QA | 19.8% | self-reported, llm-stats |
| MATH | 48.0% | self-reported, llm-stats |
| MGSM | 58.2% | self-reported, llm-stats |
| MMLU | 63.4% | self-reported, llm-stats |
| Nexus | 34.3% | self-reported, llm-stats |
| NIH/Multi-needle | 84.7% | self-reported, llm-stats |
| Open-rewrite | 40.1% | self-reported, llm-stats |
| TLDR9+ (test) | 19.0% | self-reported, llm-stats |