Gemma 4 E4B
Gemma 4 E4B is Google DeepMind's compact multimodal model with 4.5 billion effective parameters (8B with embeddings) and a 128K-token context window. It accepts image, text, and audio inputs, and uses Per-Layer Embeddings for efficient on-device deployment while maintaining strong multimodal capabilities.
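To put the two parameter counts in perspective for on-device use, the following is a minimal back-of-the-envelope sketch of weight storage at a few numeric precisions. The parameter counts come from the description above; the precision options (`bf16`, `int8`, `int4`) and the helper names `weight_gb` and `footprint_table` are assumptions made for illustration, not published deployment figures. The estimate covers weights only and ignores activations, the KV cache, and runtime overhead.

```python
# Rough weight-storage estimate for Gemma 4 E4B.
# Parameter counts are taken from the model description; everything else
# (precision choices, helper names) is a hypothetical illustration.

EFFECTIVE_PARAMS = 4.5e9  # "4.5 billion effective parameters"
TOTAL_PARAMS = 8e9        # "8B with embeddings"

# Bits per parameter for some common numeric formats (assumed options).
BITS_PER_PARAM = {"bf16": 16, "int8": 8, "int4": 4}


def weight_gb(num_params: float, bits: int) -> float:
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits / 8 / 1e9


def footprint_table() -> dict:
    """Map each format to (effective-only GB, with-embeddings GB)."""
    return {
        name: (weight_gb(EFFECTIVE_PARAMS, bits), weight_gb(TOTAL_PARAMS, bits))
        for name, bits in BITS_PER_PARAM.items()
    }


if __name__ == "__main__":
    for name, (effective, total) in footprint_table().items():
        print(f"{name}: effective ~{effective:.2f} GB, with embeddings ~{total:.2f} GB")
```

The gap between the two columns is the share of storage attributable to embeddings, which is the part the description associates with Per-Layer Embeddings.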
Benchmark results
| Benchmark | Score | Tags |
|---|---|---|
| AIME 2026 | 42.5% | self-reported, llm-stats |
| BIG-Bench Extra Hard | 33.1% | self-reported, llm-stats |
| GPQA | 58.6% | self-reported, llm-stats |
| LiveCodeBench v6 | 52.0% | self-reported, llm-stats |
| MathVision | 59.5% | self-reported, llm-stats |
| MedXpertQA | 28.7% | self-reported, llm-stats |
| MMLU-Pro | 69.4% | self-reported, llm-stats |
| MMMLU | 76.6% | self-reported, llm-stats |
| MMMU-Pro | 52.6% | self-reported, llm-stats |
| MRCR v2 | 25.4% | self-reported, llm-stats |
| t2-bench | 57.5% | self-reported, llm-stats |