bbh | dc406b4769 | setup meta-eval for benchmark, ray error | 7 months ago |
gpqa | dc406b4769 | setup meta-eval for benchmark, ray error | 7 months ago |
gpqa_cot | dc406b4769 | setup meta-eval for benchmark, ray error | 7 months ago |
ifeval | dc406b4769 | setup meta-eval for benchmark, ray error | 7 months ago |
math_hard | dc406b4769 | setup meta-eval for benchmark, ray error | 7 months ago |
mmlu | 423231e139 | updated mmlu and harness | 7 months ago |
mmlu_pro | 423231e139 | updated mmlu and harness | 7 months ago |
meta_instruct.yaml | 423231e139 | updated mmlu and harness | 7 months ago |
meta_pretrain.yaml | dc406b4769 | setup meta-eval for benchmark, ray error | 7 months ago |