Kai Wu 9f0acebe02 first commit, local not working 1 рік тому
..
inference 4be3eb0d17 Updates HF model_ids and readmes for 3.1 1 рік тому
llm_eval_harness 4be3eb0d17 Updates HF model_ids and readmes for 3.1 1 рік тому
meta_eval_reproduce 9f0acebe02 first commit, local not working 1 рік тому
README.md fa02ded685 Create README.md 1 рік тому

README.md

Benchmarks

  • inference - a folder contains benchmark scripts that apply a throughput analysis for Llama models inference on various backends including on-prem, cloud and on-device.
  • llm_eval_harness - a folder contains a tool to evaluate fine-tuned Llama models including quantized models focusing on quality.