Llama 3.2 3B Instruct

Llama 3.2 3B Instruct is a large language model that supports a context length of 128K tokens and are state-of-the-art in their class for on-device use cases like summarization, instruction following, and rewriting tasks running locally at the edge.

Benchmark results

Benchmark Score Tags Source
ARC-C 78.6% self-reported llm-stats link →
BFCL v2 67.0% self-reported llm-stats link →
GPQA 32.8% self-reported llm-stats link →
GSM8k 77.7% self-reported llm-stats link →
HellaSwag 69.8% self-reported llm-stats link →
IFEval 77.4% self-reported llm-stats link →
InfiniteBench/En.MC 63.3% self-reported llm-stats link →
InfiniteBench/En.QA 19.8% self-reported llm-stats link →
MATH 48.0% self-reported llm-stats link →
MGSM 58.2% self-reported llm-stats link →
MMLU 63.4% self-reported llm-stats link →
Nexus 34.3% self-reported llm-stats link →
NIH/Multi-needle 84.7% self-reported llm-stats link →
Open-rewrite 40.1% self-reported llm-stats link →
TLDR9+ (test) 19.0% self-reported llm-stats link →