Aidan Do ab1b1450d7 Add gpqa and math evals for instruct models 6 月之前
..
bbh e1b7bc728c remove result section and change meta-llama 3.1 to llama 3.1 7 月之前
gpqa ab1b1450d7 Add gpqa and math evals for instruct models 6 月之前
gpqa_cot ab1b1450d7 Add gpqa and math evals for instruct models 6 月之前
ifeval 576e574e31 update readme 9 月之前
math_hard ab1b1450d7 Add gpqa and math evals for instruct models 6 月之前
mmlu 4f9050f748 Add mmlu eval for llama 3.2 pretrained models 6 月之前
mmlu_pro e1b7bc728c remove result section and change meta-llama 3.1 to llama 3.1 7 月之前
meta_instruct.yaml 576e574e31 update readme 9 月之前
meta_pretrain.yaml 576e574e31 update readme 9 月之前