Gemini 1.5 Flash
Gemini 1.5 Flash is a fast, versatile multimodal model built to scale across diverse tasks. It accepts audio, image, video, and text input and produces text output. The model is optimized for code generation, data extraction, text editing, and similar work, which makes it well suited to narrow, high-frequency tasks.
Benchmark results
| Benchmark | Score | Tags |
|---|---|---|
| AMC_2022_23 | 34.8% | self-reported, llm-stats |
| BIG-Bench Hard | 85.5% | self-reported, llm-stats |
| FLEURS | 9.6% | self-reported, llm-stats |
| FunctionalMATH | 53.6% | self-reported, llm-stats |
| GPQA | 51.0% | self-reported, llm-stats |
| GSM8k | 86.2% | self-reported, llm-stats |
| HellaSwag | 86.5% | self-reported, llm-stats |
| HiddenMath | 47.2% | self-reported, llm-stats |
| HumanEval | 74.3% | self-reported, llm-stats |
| MATH | 77.9% | self-reported, llm-stats |
| MathVista | 65.8% | self-reported, llm-stats |
| MGSM | 82.6% | self-reported, llm-stats |
| MMLU | 78.9% | self-reported, llm-stats |
| MMLU-Pro | 67.3% | self-reported, llm-stats |
| MMMU | 62.3% | self-reported, llm-stats |
| MRCR | 71.9% | self-reported, llm-stats |
| Natural2Code | 79.8% | self-reported, llm-stats |
| PhysicsFinals | 57.4% | self-reported, llm-stats |
| Vibe-Eval | 48.9% | self-reported, llm-stats |
| Video-MME | 76.1% | self-reported, llm-stats |
| WMT23 | 74.1% | self-reported, llm-stats |
| XSTest | 97.0% | self-reported, llm-stats |