@@ -6,7 +6,7 @@ As Meta Llama models gain popularity, evaluating these models has become increas
## Important Notes
1. **This tutorial is not the official implementation** of Meta Llama evaluation. It is based on public third-party libraries, and the implementation may differ slightly from our internal evaluation, leading to minor differences in the reproduced numbers.
-2. **Model Compatibility**: This tutorial is specifically for Llama 3 based models, as our prompts include Meta Llama 3 special tokens (e.g., `<|start_header_id|>user<|end_header_id|`). It will not work with models that are not based on Llama 3.
+2. **Model Compatibility**: This tutorial is specifically for Llama 3-based models, as our prompts include Meta Llama 3 special tokens, e.g. `<|start_header_id|>user<|end_header_id|>`. It will not work with models that are not based on Llama 3.
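
These special tokens are part of the Llama 3 instruct prompt format. As a minimal sketch of where they end up, the snippet below shows a prompt template in the style of an lm-evaluation-harness task config; the `doc_to_text` field name comes from that library, while the template itself (and the `{{question}}` placeholder) is illustrative rather than the exact prompt used in this tutorial:

```yaml
# Illustrative sketch: wrapping a question in the Llama 3 chat format.
# The template and placeholder are assumptions, not the tutorial's exact prompt.
doc_to_text: |-
  <|start_header_id|>user<|end_header_id|>

  {{question}}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
```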
### Hugging Face setup
@@ -65,7 +65,7 @@ test_split: latest
**Note**: Remember to change the eval dataset name according to the model type, and DO NOT use a pretrained evals dataset with an instruct model or vice versa.
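
To make the pairing concrete, here is a minimal sketch of the dataset fields for an instruct model. The dataset and subset names below assume the Meta Llama 3.1 evals datasets published on Hugging Face; verify them against the actual dataset cards before use:

```yaml
# Instruct model -> instruct evals dataset (names are assumptions;
# check the Hugging Face dataset cards for the exact spelling).
dataset_path: meta-llama/Meta-Llama-3.1-8B-Instruct-evals
dataset_name: Meta-Llama-3.1-8B-Instruct-evals__mmlu_pro__details
test_split: latest

# A pretrained (base) model would instead point at the corresponding
# base evals dataset, e.g. meta-llama/Meta-Llama-3.1-8B-evals.
```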
-**2.Configure for preprocessing, prompts and ground truth**
+**2. Configure preprocessing, prompts, and ground truth**
Here is an example YAML snippet from the MMLU-Pro task that handles dataset preprocessing, prompts, and ground truth.