|
11 mēneši atpakaļ | |
---|---|---|
.. | ||
inference | 1 gadu atpakaļ | |
llm_eval_harness | 11 mēneši atpakaļ | |
README.md | 11 mēneši atpakaļ |
lm-evaluation-harness
, a tool to evaluate Llama models including quantized models focusing on quality. We also included a recipe that reproduces Meta 3.1 evaluation metrics Using lm-evaluation-harness
and instructions that reproduce HuggingFace Open LLM Leaderboard v2 metrics.