Justin Lee e1d64ca2f4 update gitignore, added mmlu 0shot and ran a bunch of test 7 months ago
..
__init__.py e52e1d1ab4 updated prompt migration to use benchmark and also mipro, added meta implementation 8 months ago
datatypes.py e52e1d1ab4 updated prompt migration to use benchmark and also mipro, added meta implementation 8 months ago
download_mmlu_pro.py e1d64ca2f4 update gitignore, added mmlu 0shot and ran a bunch of test 7 months ago
helpers.py e1d64ca2f4 update gitignore, added mmlu 0shot and ran a bunch of test 7 months ago
humaneval.py 314b6a874a added updated llama-mmlu-pro and added human-eva 8 months ago
leaderboard_mmlu_pro.py e52e1d1ab4 updated prompt migration to use benchmark and also mipro, added meta implementation 8 months ago
llama_mmlu.py e1d64ca2f4 update gitignore, added mmlu 0shot and ran a bunch of test 7 months ago
llama_mmlu_pro.py dc406b4769 setup meta-eval for benchmark, ray error 7 months ago