Gemma 3n E4B Instructed

Gemma 3n is a multimodal model designed to run locally on the user's own hardware, accepting image, text, audio, and video inputs. It combines a language decoder, an audio encoder, and a vision encoder, and is available in two sizes: E2B and E4B.

Benchmark	Score
HumanEval	75.0%
MGSM	67.0%
MMLU	64.9%
Global-MMLU-Lite	64.5%
MBPP	63.6%
Global-MMLU	60.3%
Include	57.2%
MMLU-Pro	50.6%
WMT24++	50.1%
HiddenMath	37.7%
OpenAI MMLU	35.6%
LiveCodeBench v5	25.7%
GPQA	23.7%
MMLU-ProX	19.9%
ECLeKTic	19.0%
Codegolf v2.2	16.8%
LiveCodeBench	13.2%
AIME 2025	11.6%

Pricing, uptime, and speed via OpenRouter — updated Jul 17, 2026, 04:19 AM.

Provider	Status	Input	Output	Limits	Uptime	Speed	Notes
Together	available	$0.06/Mtok	$0.12/Mtok	33K tokens context	100.0%; 5m: 100.0%	325 ms p50 TTFT; 46 tok/s p50
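
To make the listed rates concrete, here is a rough sketch that turns the Together row into a per-request cost and a median-case latency estimate. The prices and speed figures are copied from the row above; the function names and the example request size (2,000 input tokens, 500 output tokens) are hypothetical. The latency formula assumes the output streams at the p50 rate after the p50 time to first token, which is a simplification.

```python
# Figures taken from the Together row above. "Mtok" is read as one million tokens.
INPUT_USD_PER_MTOK = 0.06
OUTPUT_USD_PER_MTOK = 0.12
TTFT_SECONDS_P50 = 0.325             # 325 ms p50 time to first token
OUTPUT_TOKENS_PER_SECOND_P50 = 46.0  # 46 tok/s p50 throughput


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Cost of one request at the listed per-million-token prices."""
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


def estimate_latency_seconds(output_tokens: int) -> float:
    """Assumed model: wait for the first token, then decode at a steady rate."""
    return TTFT_SECONDS_P50 + output_tokens / OUTPUT_TOKENS_PER_SECOND_P50


if __name__ == "__main__":
    # Hypothetical request size, chosen only for illustration.
    cost = estimate_cost_usd(input_tokens=2_000, output_tokens=500)
    latency = estimate_latency_seconds(output_tokens=500)
    print(f"cost: ${cost:.6f}")        # cost: $0.000180
    print(f"latency: {latency:.1f} s")  # latency: 11.2 s
```

Because both figures are medians from a single snapshot, treat the result as an order-of-magnitude guide rather than a guarantee.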