Magistral Small 2506

Magistral Small builds on Mistral Small 3.1 (2503) and adds reasoning capabilities: it was trained with SFT on Magistral Medium traces, followed by RL. The result is a small, efficient reasoning model with 24B parameters. Once quantized, Magistral Small can be deployed locally, fitting within a single RTX 4090 or a MacBook with 32GB of RAM.
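As a rough illustration of why quantization matters for local deployment, the sketch below estimates the memory taken by the weights alone for a 24B-parameter model at a few bit widths. The 24 GB budget is the RTX 4090's VRAM size; the helper names (`weight_gb`, `fits`) and the chosen bit widths are assumptions for illustration, not something the model card specifies. The estimate ignores the KV cache, activations, and quantization metadata, so real usage will be higher.

```python
# Back-of-the-envelope estimate of weight memory for a 24B-parameter model.
# Assumptions (not from the model card): weights only, decimal gigabytes,
# and a flat bits-per-parameter figure with no quantization overhead.

PARAMS = 24e9          # 24B parameters, as stated in the description
RTX_4090_GB = 24       # VRAM of a single RTX 4090, in GB


def weight_gb(params: float, bits_per_param: int) -> float:
    """Return the size of the weights in GB (1 GB = 1e9 bytes)."""
    return params * bits_per_param / 8 / 1e9


def fits(params: float, bits_per_param: int, budget_gb: float) -> bool:
    """True if the weights alone are strictly below the memory budget."""
    return weight_gb(params, bits_per_param) < budget_gb


if __name__ == "__main__":
    for bits in (16, 8, 4):
        size = weight_gb(PARAMS, bits)
        verdict = "fits" if fits(PARAMS, bits, RTX_4090_GB) else "does not fit"
        print(f"{bits:>2}-bit weights: {size:.0f} GB -> {verdict} in {RTX_4090_GB} GB")
```

Under these assumptions the weights take about 48 GB at 16 bits, 24 GB at 8 bits, and 12 GB at 4 bits, so only a roughly 4-bit quantization leaves headroom on a 24 GB card for the context and runtime buffers.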

AIME 2024: 70.7%

GPQA: 68.2%

AIME 2025: 62.8%

LiveCodeBench: 51.3%