Magistral Medium

Magistral Medium is a reasoning model trained solely with reinforcement learning on top of Mistral Medium 3. It achieves strong performance on complex math and code tasks without relying on distillation from existing reasoning models. Training uses a reinforcement learning with verifiable rewards (RLVR) framework with modifications to Group Relative Policy Optimization (GRPO), which improves reasoning ability and multilingual consistency.

AIME 2024: 73.6%
GPQA: 70.8%
AIME 2025: 64.9%
LiveCodeBench: 50.3%
Aider-Polyglot: 47.1%
Humanity's Last Exam: 9.0%