Gemini 2.5 Flash

A thinking model designed to balance price and performance. It builds on Gemini 2.0 Flash with upgraded reasoning, hybrid thinking control, multimodal input (text, image, video, audio), and a 1M-token input context window.

Benchmark	Score
Global-MMLU-Lite	88.4%
AIME 2024	88.0%
FACTS Grounding	85.3%
GPQA	82.8%
MMMU	79.7%
AIME 2025	72.0%
Vibe-Eval	65.4%
LiveCodeBench v5	63.9%
Aider-Polyglot	61.9%
SWE-Bench Verified	60.4%
Aider-Polyglot Edit	56.7%
MRCR	32.0%
SimpleQA	26.9%
Humanity's Last Exam	11.0%

Pricing, uptime, and speed via OpenRouter — updated Jul 17, 2026, 04:19 AM.

Provider	Status	Input	Output	Limits	Uptime	Speed	Notes
Google AI Studio	available	$0.54/Mtok (cache: $0.05/Mtok)	$4.50/Mtok	1.0M tokens context; 66K tokens max output	100.0% (5m: 100.0%)	2,587 ms p50 TTFT; 2.0 tok/s p50	$0.000001/image · $0.014/web search
Google	-5	$0.30/Mtok (cache: $0.03/Mtok)	$2.50/Mtok	1.0M tokens context; 66K tokens max output	58% (5m: 65%)	2,109 ms p50 TTFT; 30 tok/s p50	$0/image · $0.014/web search
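
As a rough illustration of how the per-Mtok prices in the table above turn into a per-request cost, here is a minimal Python sketch. The `Pricing` class and the `estimate_cost` helper are hypothetical names introduced for this example. The sketch assumes that the "cache" price applies to the cached part of the input and that the remaining input is billed at the regular input rate. It ignores the per-image and web-search fees.

```python
from dataclasses import dataclass

MTOK = 1_000_000  # table prices are quoted per million tokens


@dataclass(frozen=True)
class Pricing:
    """Per-provider prices in USD per 1M tokens (hypothetical container)."""
    input_per_mtok: float
    cached_per_mtok: float
    output_per_mtok: float


# Values copied from the pricing table; they change over time.
GOOGLE_AI_STUDIO = Pricing(input_per_mtok=0.54, cached_per_mtok=0.05, output_per_mtok=4.50)
GOOGLE = Pricing(input_per_mtok=0.30, cached_per_mtok=0.03, output_per_mtok=2.50)


def estimate_cost(pricing: Pricing, input_tokens: int, output_tokens: int,
                  cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption: cached input tokens are billed at the cache rate and the
    remaining input tokens at the regular input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * pricing.input_per_mtok
        + cached_tokens * pricing.cached_per_mtok
        + output_tokens * pricing.output_per_mtok
    )
    return total / MTOK


if __name__ == "__main__":
    # Hypothetical request: 100K input tokens (half of them cached), 10K output tokens.
    for name, pricing in [("Google AI Studio", GOOGLE_AI_STUDIO), ("Google", GOOGLE)]:
        cost = estimate_cost(pricing, input_tokens=100_000, output_tokens=10_000,
                             cached_tokens=50_000)
        print(f"{name}: ${cost:.4f}")
```

For long outputs the output rate dominates the total, so the gap between the two providers' output prices matters more than the gap between their input prices.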