|
|
1 éve | |
|---|---|---|
| .. | ||
| inference | 4be3eb0d17 Updates HF model_ids and readmes for 3.1 | 1 éve |
| llm_eval_harness | 8176c35731 small fix | 1 éve |
| README.md | 8176c35731 small fix | 1 éve |
lm-evaluation-harness, a tool to evaluate Llama models including quantized models focusing on quality. We also included a recipe that calculates Llama 3.1 evaluation metrics Using lm-evaluation-harness and instructions that calculate HuggingFace Open LLM Leaderboard v2 metrics.