Gemini 2.5 Flash-Lite

Gemini 2.5 Flash-Lite is a model developed by Google DeepMind for tasks including reasoning, science, mathematics, and code generation. It also targets multilingual use and long-context understanding.

Benchmark	Score
FACTS Grounding	84.1%
Global-MMLU-Lite	81.1%
MMMU	72.9%
GPQA	64.6%
Vibe-Eval	51.3%
AIME 2025	49.8%
LiveCodeBench	33.7%
SWE-Bench Verified	31.6%
Aider-Polyglot	26.7%
MRCR v2	16.6%
SimpleQA	10.7%
Humanity's Last Exam	5.1%
Arc	2.5%

Pricing, uptime, and speed via OpenRouter — updated Jul 17, 2026, 04:19 AM.

Provider	Status	Input	Cache	Output	Context	Max output	Uptime	Uptime (5m)	TTFT (p50)	Throughput (p50)	Image	Web search
Google	available	$0.10/Mtok	$0.01/Mtok	$0.40/Mtok	1.0M tokens	66K tokens	99.9%	100.0%	415 ms	100 tok/s	$0/image	$0.014/web search
Google AI Studio	available	$0.18/Mtok	$0.02/Mtok	$0.72/Mtok	1.0M tokens	66K tokens	99.3%	99.3%	n/a	n/a	$0/image	$0.014/web search
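
As a rough illustration of how the per-Mtok prices and speed figures in the table translate into a single request, here is a minimal Python sketch. It rests on several assumptions that the listing does not confirm: that "cache" is the rate billed for input tokens served from a cache, that cached tokens are billed at that rate instead of the regular input rate, and that latency is simply the p50 time to first token plus output tokens divided by p50 throughput. The names `Pricing`, `request_cost`, and `estimated_latency_s` are hypothetical helpers, not part of any provider SDK, and per-image and web-search fees are ignored.

```python
from dataclasses import dataclass

TOKENS_PER_MTOK = 1_000_000


@dataclass(frozen=True)
class Pricing:
    """USD prices per million tokens, copied from the provider table."""
    input_per_mtok: float
    cache_per_mtok: float   # assumed: rate for input tokens read from cache
    output_per_mtok: float


GOOGLE = Pricing(input_per_mtok=0.10, cache_per_mtok=0.01, output_per_mtok=0.40)
GOOGLE_AI_STUDIO = Pricing(input_per_mtok=0.18, cache_per_mtok=0.02, output_per_mtok=0.72)


def request_cost(pricing: Pricing, input_tokens: int, output_tokens: int,
                 cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption: cached_tokens is the part of input_tokens billed at the
    cache rate; the remainder is billed at the regular input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * pricing.input_per_mtok
             + cached_tokens * pricing.cache_per_mtok
             + output_tokens * pricing.output_per_mtok)
    return total / TOKENS_PER_MTOK


def estimated_latency_s(output_tokens: int, ttft_ms: float = 415.0,
                        tokens_per_s: float = 100.0) -> float:
    """Rough median latency: time to first token plus streaming time.

    Defaults are the p50 figures listed for the Google provider.
    """
    return ttft_ms / 1000.0 + output_tokens / tokens_per_s


if __name__ == "__main__":
    # 200K input tokens and 2K output tokens on each provider, no caching.
    for name, pricing in [("Google", GOOGLE), ("Google AI Studio", GOOGLE_AI_STUDIO)]:
        print(f"{name}: ${request_cost(pricing, 200_000, 2_000):.4f}")
    print(f"Estimated latency for 500 output tokens: {estimated_latency_s(500):.3f} s")
```

Under these assumptions, a 200K-token prompt with a 2K-token reply costs about $0.0208 on the Google provider, and the same request with 150K of those input tokens served from cache costs about $0.0073. Actual bills depend on how each provider meters caching, images, and web searches.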