Justin Lee 479b1fbbd7 updated mmlu meta-eval for prompt migration 7 月之前
..
bbh dc406b4769 setup meta-eval for benchmark, ray error 7 月之前
gpqa dc406b4769 setup meta-eval for benchmark, ray error 7 月之前
gpqa_cot dc406b4769 setup meta-eval for benchmark, ray error 7 月之前
ifeval dc406b4769 setup meta-eval for benchmark, ray error 7 月之前
math_hard dc406b4769 setup meta-eval for benchmark, ray error 7 月之前
mmlu 479b1fbbd7 updated mmlu meta-eval for prompt migration 7 月之前
mmlu_pro caeddccb8d update utils 7 月之前
meta_instruct.yaml 479b1fbbd7 updated mmlu meta-eval for prompt migration 7 月之前
meta_pretrain.yaml dc406b4769 setup meta-eval for benchmark, ray error 7 月之前