Benchmarks

inference - a folder containing benchmark scripts that run throughput analysis of Llama model inference on various backends, including on-prem, cloud, and on-device.
llm_eval_harness - a folder that introduces lm-evaluation-harness, a tool for evaluating Llama models, including quantized models, with a focus on quality. It also includes a recipe that calculates Llama 3.1 evaluation metrics using lm-evaluation-harness, and instructions for calculating HuggingFace Open LLM Leaderboard v2 metrics.
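
As a rough illustration of what a throughput analysis measures, the sketch below times a generation function over a set of prompts and reports output tokens per second. It is not taken from the scripts in the inference folder: `measure_throughput` and `fake_generate` are hypothetical names, and `fake_generate` is a stand-in for a real backend call.

```python
import time
from typing import Callable, Dict, List


def measure_throughput(
    generate: Callable[[str], List[int]], prompts: List[str]
) -> Dict[str, float]:
    """Time `generate` over all prompts and report output tokens per second."""
    total_tokens = 0
    start = time.perf_counter()
    for prompt in prompts:
        # Count only the tokens returned by the backend for this prompt.
        total_tokens += len(generate(prompt))
    elapsed = time.perf_counter() - start
    return {
        "requests": len(prompts),
        "output_tokens": total_tokens,
        "elapsed_s": elapsed,
        "tokens_per_s": total_tokens / elapsed if elapsed > 0 else float("inf"),
    }


def fake_generate(prompt: str) -> List[int]:
    # Hypothetical stand-in for a real backend (on-prem, cloud, or on-device).
    # It sleeps briefly to simulate latency and returns 16 dummy token ids.
    time.sleep(0.01)
    return list(range(16))


if __name__ == "__main__":
    stats = measure_throughput(fake_generate, ["Hello", "What is Llama?"])
    print(stats)
```

To benchmark a real backend under this sketch, `fake_generate` would be replaced with a function that sends the prompt to the model and returns the generated token ids. A sequential loop like this one does not capture concurrent-request throughput, which would need a separate measurement.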