Gemma 2 27B
Gemma 2 27B IT is an instruction-tuned version of Google's state-of-the-art open language model. Built from the same research and technology as Gemini, it's optimized for dialogue applications through supervised fine-tuning, distillation from larger models, and RLHF. The model excels at text generation tasks including question answering, summarization, and reasoning.
Benchmark results
| Benchmark | Score | Tags | Source |
|---|---|---|---|
| AGIEval | 55.1% | self-reported llm-stats | link → |
| ARC-C | 71.4% | self-reported llm-stats | link → |
| ARC-E | 88.6% | self-reported llm-stats | link → |
| BIG-Bench | 74.9% | self-reported llm-stats | link → |
| BoolQ | 84.8% | self-reported llm-stats | link → |
| GSM8k | 74.0% | self-reported llm-stats | link → |
| HellaSwag | 86.4% | self-reported llm-stats | link → |
| HumanEval | 51.8% | self-reported llm-stats | link → |
| MATH | 42.3% | self-reported llm-stats | link → |
| MBPP | 62.6% | self-reported llm-stats | link → |
| MMLU | 75.2% | self-reported llm-stats | link → |
| Natural Questions | 34.5% | self-reported llm-stats | link → |
| PIQA | 83.2% | self-reported llm-stats | link → |
| Social IQa | 53.7% | self-reported llm-stats | link → |
| TriviaQA | 83.7% | self-reported llm-stats | link → |
| Winogrande | 83.7% | self-reported llm-stats | link → |