Llama 3.3 70B Open-weight 70B model. Benchmark results Benchmark Score Tags Source HumanEval 88.4% MATH 77.0% MMLU 86.0%