| .. |
|
bbh
|
dc406b4769
setup meta-eval for benchmark, ray error
|
9 months ago |
|
gpqa
|
dc406b4769
setup meta-eval for benchmark, ray error
|
9 months ago |
|
gpqa_cot
|
dc406b4769
setup meta-eval for benchmark, ray error
|
9 months ago |
|
ifeval
|
dc406b4769
setup meta-eval for benchmark, ray error
|
9 months ago |
|
math_hard
|
dc406b4769
setup meta-eval for benchmark, ray error
|
9 months ago |
|
mmlu
|
423231e139
updated mmlu and harness
|
9 months ago |
|
mmlu_pro
|
423231e139
updated mmlu and harness
|
9 months ago |
|
meta_instruct.yaml
|
423231e139
updated mmlu and harness
|
9 months ago |
|
meta_pretrain.yaml
|
dc406b4769
setup meta-eval for benchmark, ray error
|
9 months ago |