Granite 3.3 8B Instruct

Granite 3.3 models feature enhanced reasoning capabilities and support for Fill-in-the-Middle (FIM) code completion. They are built on permissively licensed open-source instruction datasets, together with internally curated synthetic datasets tailored for long-context problem-solving.
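
As a rough illustration of what FIM completion involves, the sketch below assembles a prompt from the code before and after a gap, then splices a generated middle back in. The sentinel strings and both helper names are assumptions made for this example and are not taken from this page; check the model's tokenizer configuration for the actual special tokens before using anything like this.

```python
# Hypothetical sentinel tokens in the common prefix/suffix/middle style.
# These are placeholders for illustration, not confirmed Granite 3.3 tokens.
FIM_PREFIX = "<fim_prefix>"
FIM_SUFFIX = "<fim_suffix>"
FIM_MIDDLE = "<fim_middle>"


def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Arrange the code before and after the gap so the model generates the middle."""
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"


def splice_completion(prefix: str, middle: str, suffix: str) -> str:
    """Reinsert the generated middle between the original prefix and suffix."""
    return prefix + middle + suffix


if __name__ == "__main__":
    before = "def add(a, b):\n    "
    after = "\n\nprint(add(1, 2))\n"
    prompt = build_fim_prompt(before, after)
    # The prompt would be sent to the model; here a hand-written middle stands in
    # for the model's output.
    print(splice_completion(before, "return a + b", after))
```

The prompt places the suffix before the generation point, so the model can condition on code that follows the gap as well as code that precedes it.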

HumanEval: 89.7%
AttaQ: 88.5%
HumanEval+: 86.1%
AIME 2024: 81.2%
GSM8k: 80.9%
IFEval: 74.8%
BIG-Bench Hard: 69.1%
MATH-500: 69.0%
TruthfulQA: 66.9%
MMLU: 65.5%
AlpacaEval 2.0: 62.7%
DROP: 59.4%
Arena Hard: 57.6%
PopQA: 26.2%