Justin Lee e1d64ca2f4 update gitignore, added mmlu 0shot and ran a bunch of test hai 7 meses
..
bbh dc406b4769 setup meta-eval for benchmark, ray error hai 7 meses
gpqa dc406b4769 setup meta-eval for benchmark, ray error hai 7 meses
gpqa_cot dc406b4769 setup meta-eval for benchmark, ray error hai 7 meses
ifeval dc406b4769 setup meta-eval for benchmark, ray error hai 7 meses
math_hard dc406b4769 setup meta-eval for benchmark, ray error hai 7 meses
mmlu e1d64ca2f4 update gitignore, added mmlu 0shot and ran a bunch of test hai 7 meses
mmlu_pro e1d64ca2f4 update gitignore, added mmlu 0shot and ran a bunch of test hai 7 meses
meta_instruct.yaml 479b1fbbd7 updated mmlu meta-eval for prompt migration hai 7 meses
meta_pretrain.yaml dc406b4769 setup meta-eval for benchmark, ray error hai 7 meses