IBM Granite 4.0 Tiny Preview

A preliminary version of the smallest model in the upcoming Granite 4.0 family, released in May 2025. It uses a novel hybrid Mamba-2/Transformer architecture with fine-grained mixture of experts (MoE): 7B total parameters, of which 1B are active at inference. This preview is only partially trained (2.5T tokens) but already demonstrates significant memory efficiency and performance potential, and it has been validated for context lengths of at least 128K without positional encoding.
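To put the headline figures in proportion, here is a rough back-of-the-envelope sketch. It uses only the rounded numbers quoted above (7B total, 1B active, 2.5T tokens). The 2-byte weight size is an assumption (16-bit weights such as bf16), not something stated in the description, and the function names are illustrative.

```python
# Back-of-the-envelope arithmetic from the rounded figures in the description.
TOTAL_PARAMS = 7e9      # 7B total parameters (from the description)
ACTIVE_PARAMS = 1e9     # 1B active at inference (from the description)
TRAIN_TOKENS = 2.5e12   # 2.5T training tokens so far (from the description)
BYTES_PER_PARAM = 2     # ASSUMPTION: 16-bit weights (e.g. bf16)


def active_fraction(total: float, active: float) -> float:
    """Share of the parameters used for any single token."""
    return active / total


def weight_memory_gb(params: float, bytes_per_param: int) -> float:
    """Memory needed to hold the weights alone, in GB (10^9 bytes)."""
    return params * bytes_per_param / 1e9


def tokens_per_param(tokens: float, params: float) -> float:
    """Training tokens seen per parameter."""
    return tokens / params


if __name__ == "__main__":
    print(f"active fraction: {active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS):.1%}")
    print(f"weights at 16 bit: {weight_memory_gb(TOTAL_PARAMS, BYTES_PER_PARAM):.0f} GB")
    print(f"tokens per parameter: {tokens_per_param(TRAIN_TOKENS, TOTAL_PARAMS):.0f}")
```

Note that in an MoE model all experts normally have to be resident in memory, so the weight footprint follows the 7B total while per-token compute follows the 1B active count. The estimate excludes activations and any cache or recurrent state.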

Benchmark results

Benchmark        Score   Tags            Source
AlpacaEval 2.0   35.2%   self-reported   llm-stats
Arena Hard       26.7%   self-reported   llm-stats
AttaQ            86.1%   self-reported   llm-stats
BIG-Bench Hard   55.7%   self-reported   llm-stats
DROP             46.2%   self-reported   llm-stats
GSM8k            70.1%   self-reported   llm-stats
HumanEval        82.4%   self-reported   llm-stats
HumanEval+       78.3%   self-reported   llm-stats
IFEval           63.0%   self-reported   llm-stats
MMLU             60.4%   self-reported   llm-stats
PopQA            22.9%   self-reported   llm-stats
TruthfulQA       58.1%   self-reported   llm-stats
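For readers who want to work with these numbers programmatically, the following minimal sketch holds the self-reported scores in a dictionary and provides two small helpers. The helper names are illustrative. Sorting by score is only a convenience for browsing: the benchmarks measure different things on different scales, so the order is not a ranking of capabilities.

```python
# Self-reported scores copied from the table above, in percent.
SCORES = {
    "AlpacaEval 2.0": 35.2,
    "Arena Hard": 26.7,
    "AttaQ": 86.1,
    "BIG-Bench Hard": 55.7,
    "DROP": 46.2,
    "GSM8k": 70.1,
    "HumanEval": 82.4,
    "HumanEval+": 78.3,
    "IFEval": 63.0,
    "MMLU": 60.4,
    "PopQA": 22.9,
    "TruthfulQA": 58.1,
}


def ranked(scores: dict[str, float]) -> list[tuple[str, float]]:
    """Return (benchmark, score) pairs sorted from highest to lowest score."""
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


def gap(scores: dict[str, float], first: str, second: str) -> float:
    """Difference in percentage points between two benchmarks, rounded to 0.1."""
    return round(scores[first] - scores[second], 1)


if __name__ == "__main__":
    for name, score in ranked(SCORES):
        print(f"{name:<16}{score:5.1f}%")
    print("HumanEval minus HumanEval+:", gap(SCORES, "HumanEval", "HumanEval+"))
```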