Igor Kasianenko 4ecdbf9639 typo fixes преди 5 месеца
..
evals_synthetic_data 4ecdbf9639 typo fixes преди 5 месеца
inference b6ca01882b revert working link преди 8 месеца
llm_eval_harness 00691affbf add mmlu_instruct for 3.2 преди 9 месеца
README.md ae010af7d8 move and add Difflog преди 9 месеца

README.md

Benchmarks

  • inference - a folder contains benchmark scripts that apply a throughput analysis for Llama models inference on various backends including on-prem, cloud and on-device.
  • llm_eval_harness - a folder that introduces lm-evaluation-harness, a tool to evaluate Llama models including quantized models focusing on quality. We also included a recipe that calculates Llama 3.1 evaluation metrics Using lm-evaluation-harness and instructions that calculate HuggingFace Open LLM Leaderboard v2 metrics.