.. |
bbh
|
dc406b4769
setup meta-eval for benchmark, ray error
|
hai 7 meses |
gpqa
|
dc406b4769
setup meta-eval for benchmark, ray error
|
hai 7 meses |
gpqa_cot
|
dc406b4769
setup meta-eval for benchmark, ray error
|
hai 7 meses |
ifeval
|
dc406b4769
setup meta-eval for benchmark, ray error
|
hai 7 meses |
math_hard
|
dc406b4769
setup meta-eval for benchmark, ray error
|
hai 7 meses |
mmlu
|
e1d64ca2f4
update gitignore, added mmlu 0shot and ran a bunch of test
|
hai 7 meses |
mmlu_pro
|
e1d64ca2f4
update gitignore, added mmlu 0shot and ran a bunch of test
|
hai 7 meses |
meta_instruct.yaml
|
479b1fbbd7
updated mmlu meta-eval for prompt migration
|
hai 7 meses |
meta_pretrain.yaml
|
dc406b4769
setup meta-eval for benchmark, ray error
|
hai 7 meses |