Justin Lee e1d64ca2f4 update gitignore, added mmlu 0shot and ran a bunch of test 7 months ago
bbh dc406b4769 setup meta-eval for benchmark, ray error 7 months ago
gpqa dc406b4769 setup meta-eval for benchmark, ray error 7 months ago
gpqa_cot dc406b4769 setup meta-eval for benchmark, ray error 7 months ago
ifeval dc406b4769 setup meta-eval for benchmark, ray error 7 months ago
math_hard dc406b4769 setup meta-eval for benchmark, ray error 7 months ago
mmlu e1d64ca2f4 update gitignore, added mmlu 0shot and ran a bunch of test 7 months ago
mmlu_pro e1d64ca2f4 update gitignore, added mmlu 0shot and ran a bunch of test 7 months ago
meta_instruct.yaml 479b1fbbd7 updated mmlu meta-eval for prompt migration 7 months ago
meta_pretrain.yaml dc406b4769 setup meta-eval for benchmark, ray error 7 months ago