Gemma 4 E4B

Gemma 4 E4B is Google DeepMind's compact multimodal model with 4.5 billion effective parameters (8B with embeddings) and a 128K context window. Supports image, text, and audio inputs. Features Per-Layer Embeddings for efficient on-device deployment while maintaining strong multimodal capabilities.

Benchmark results

Benchmark Score Tags Source
AIME 2026 42.5% self-reported llm-stats link →
BIG-Bench Extra Hard 33.1% self-reported llm-stats link →
GPQA 58.6% self-reported llm-stats link →
LiveCodeBench v6 52.0% self-reported llm-stats link →
MathVision 59.5% self-reported llm-stats link →
MedXpertQA 28.7% self-reported llm-stats link →
MMLU-Pro 69.4% self-reported llm-stats link →
MMMLU 76.6% self-reported llm-stats link →
MMMU-Pro 52.6% self-reported llm-stats link →
MRCR v2 25.4% self-reported llm-stats link →
t2-bench 57.5% self-reported llm-stats link →