| Justin Lee | dc406b4769 | setup meta-eval for benchmark, ray error | 8 months ago |
| Justin Lee | 21e04c29bf | update mmlu pro | 8 months ago |
| Justin Lee | e19b9e9e34 | added fix split, gitignore and download mmlu script | 8 months ago |
| Justin Lee | 8d3a0479e5 | updated env file | 9 months ago |
| Justin Lee | 9ffb292272 | added inspect and modified harness | 9 months ago |
| Justin Lee | eea96618cf | batching and parallelization, ran on baseline and lite | 9 months ago |
| Justin Lee | 4fd5f29414 | revert to previous changes | 9 months ago |
| Justin Lee | a6f448f362 | <Replace this line with a title. Use 1 line only, 67 chars or less> | 9 months ago |
| Justin Lee | becbe77ff3 | attempt to fix json output format in eval | 9 months ago |
| Justin Lee | 03f2b8eddd | change gpu parallel size docs | 9 months ago |
| Justin Lee | 0bec41f86a | updated readme | 9 months ago |
| Justin Lee | 2776a35314 | harness runcode | 9 months ago |
| Justin Lee | 314b6a874a | added updated llama-mmlu-pro and added human-eva | 9 months ago |
| Justin Lee | 5730a84b8a | beef up readme | 9 months ago |
| Justin Lee | 62b53676fb | update harness notebook | 9 months ago |
| Justin Lee | 1e4c6d22dd | update harness notebook | 9 months ago |
| Justin Lee | e52e1d1ab4 | updated prompt migration to use benchmark and also mipro, added meta implementation | 9 months ago |
| Justin Lee | 4d75fe97b5 | update dir | 9 months ago |
| Justin Lee | 90d16cd7de | minor changes in eval, deleted formatter | 10 months ago |
| Justin Lee | b85811d0b9 | change eval dataset, include more robust judging, improved main | 10 months ago |
| Justin Lee | 43a2cbc220 | adding eval dataset | 10 months ago |
| Justin Lee | 263b8b569d | placeholder readme | 10 months ago |
| Justin Lee | 096249bf33 | add .env settings and configure yml | 10 months ago |
| Justin Lee | a3e96e4e46 | add engine and eval dataset | 10 months ago |
| Justin Lee | 08e41d0d0a | add usage guide and init | 10 months ago |
| Justin Lee | 2570d1642a | added evaluator and formatter and main | 10 months ago |
| Igor Kasianenko | 4ad1c0f30c | Update README.md (#843) | 9 months ago |
| Naveen Reddy Gundlagutta | e951b567ba | Update README.md | 9 months ago |
| Sanyam Bhutani | 5311cde8ed | Add FAQ (#842) | 9 months ago |
| Sanyam Bhutani | cdf4a1ab46 | Update README.md | 9 months ago |