GPT-4.1 mini

GPT-4.1 mini provides a balance between intelligence, speed, and cost. It's a significant leap in small model performance, even beating GPT-4o in many benchmarks while reducing latency and cost.

CharXiv-D

88.4%

i
MMLU

87.5%

i
IFEval

84.1%

i
MMMLU

78.5%

i
MathVista

73.1%

i
MMMU

72.7%

i
Multi-IF

67.0%

i
GPQA

65.0%

i
Graphwalks BFS <128k

61.7%

i
Graphwalks parents <128k

60.5%

i
CharXiv-R

56.8%

i
TAU-bench Retail

55.8%

i
COLLIE

54.6%

i
AIME 2024

49.6%

i
ComplexFuncBench

49.3%

i
OpenAI-MRCR: 2 needle 128k

47.2%

i
Internal API instruction following (hard)

45.1%

i
MultiChallenge (o3-mini grader)

42.2%

i
AIME 2025

40.2%

i
TAU-bench Airline

36.0%

i
Multi-Challenge

35.8%

i
HMMT 2025

35.0%

i
Aider-Polyglot

34.7%

i
OpenAI-MRCR: 2 needle 1M

33.3%

i
Aider-Polyglot Edit

31.6%

i
SWE-Bench Verified

23.6%

i
Graphwalks BFS >128k

15.0%

i
Graphwalks parents >128k

11.0%

i
Humanity's Last Exam

3.7%

i

Pricing, uptime, and speed via OpenRouter — updated Jul 17, 2026, 04:19 AM.

Provider	Status	Input	Output	Limits	Uptime	Speed	Notes
OpenAI	available	$0.40/Mtok cache $0.10/Mtok	$1.60/Mtok	1.0M tokens context 33K tokens max output	99.6% 5m 99.8%	650 ms p50 TTFT 32 tok/s p50	$0.01/web search
Azure	available	$0.44/Mtok cache $0.11/Mtok	$1.76/Mtok	1.0M tokens context 33K tokens max output	—	827 ms p50 TTFT 56 tok/s p50	cache $0.01/web search