Justin Lee
|
dc406b4769
setup meta-eval for benchmark, ray error
|
2 months ago |
Justin Lee
|
21e04c29bf
update mmlu pro
|
2 months ago |
Justin Lee
|
e19b9e9e34
added fix split, gitignore and download mmlu script
|
2 months ago |
Justin Lee
|
4fd5f29414
revert to previous changes
|
3 months ago |
Justin Lee
|
becbe77ff3
attempt to fix json output format in eval
|
3 months ago |
Justin Lee
|
2776a35314
harness runcode
|
3 months ago |
Justin Lee
|
314b6a874a
added updated llama-mmlu-pro and added human-eva
|
3 months ago |
Justin Lee
|
e52e1d1ab4
updated prompt migration to use benchmark and also mipro, added meta implementation
|
3 months ago |