Justin Lee dc406b4769 setup meta-eval for benchmark, ray error 3 ماه پیش
..
__init__.py e52e1d1ab4 updated prompt migration to use benchmark and also mipro, added meta implementation 4 ماه پیش
datatypes.py e52e1d1ab4 updated prompt migration to use benchmark and also mipro, added meta implementation 4 ماه پیش
download_mmlu_pro.py e19b9e9e34 added fix split, gitignore and download mmlu script 3 ماه پیش
helpers.py e19b9e9e34 added fix split, gitignore and download mmlu script 3 ماه پیش
humaneval.py 314b6a874a added updated llama-mmlu-pro and added human-eva 4 ماه پیش
leaderboard_mmlu_pro.py e52e1d1ab4 updated prompt migration to use benchmark and also mipro, added meta implementation 4 ماه پیش
llama_mmlu_pro.py dc406b4769 setup meta-eval for benchmark, ray error 3 ماه پیش