lm-evaluation-harness, a tool to evaluate Llama models including quantized models focusing on quality. We also included a recipe that reproduces Meta 3.1 evaluation metrics Using lm-evaluation-harness and instructions that reproduce HuggingFace Open LLM Leaderboard v2 metrics.