Justin Lee
|
dc406b4769
setup meta-eval for benchmark, ray error
|
8 months ago |
Justin Lee
|
21e04c29bf
update mmlu pro
|
8 months ago |
Justin Lee
|
e19b9e9e34
added fix split, gitignore and download mmlu script
|
8 months ago |
Justin Lee
|
8d3a0479e5
updated env file
|
8 months ago |
Justin Lee
|
9ffb292272
added inspect and modified harness
|
9 months ago |
Justin Lee
|
eea96618cf
batching and parallelization, ran on baseline and lite
|
9 months ago |
Justin Lee
|
4fd5f29414
revert to previous changes
|
9 months ago |
Justin Lee
|
a6f448f362
<Replace this line with a title. Use 1 line only, 67 chars or less>
|
9 months ago |
Justin Lee
|
becbe77ff3
attempt to fix json output format in eval
|
9 months ago |
Justin Lee
|
03f2b8eddd
change gpu parallel size docs
|
9 months ago |
Justin Lee
|
0bec41f86a
updated readme
|
9 months ago |
Justin Lee
|
2776a35314
harness runcode
|
9 months ago |
Justin Lee
|
314b6a874a
added updated llama-mmlu-pro and added human-eva
|
9 months ago |
Justin Lee
|
5730a84b8a
beef up readme
|
9 months ago |
Justin Lee
|
62b53676fb
update harness notebook
|
9 months ago |
Justin Lee
|
1e4c6d22dd
update harness notebook
|
9 months ago |
Justin Lee
|
e52e1d1ab4
updated prompt migration to use benchmark and also mipro, added meta implementation
|
9 months ago |
Justin Lee
|
4d75fe97b5
update dir
|
9 months ago |
Justin Lee
|
90d16cd7de
minor changes in eval, deleted formatter
|
10 months ago |
Justin Lee
|
b85811d0b9
change eval dataset, include more robust judging, improved main
|
10 months ago |
Justin Lee
|
43a2cbc220
adding eval dataset
|
10 months ago |
Justin Lee
|
263b8b569d
placeholder readme
|
10 months ago |
Justin Lee
|
096249bf33
add .env settings and configure yml
|
10 months ago |
Justin Lee
|
a3e96e4e46
add engine and eval dataset
|
10 months ago |
Justin Lee
|
08e41d0d0a
add usage guide and init
|
10 months ago |
Justin Lee
|
2570d1642a
added evaluator and formatter and main
|
10 months ago |
Igor Kasianenko
|
4ad1c0f30c
Update README.md (#843)
|
9 months ago |
Naveen Reddy Gundlagutta
|
e951b567ba
Update README.md
|
9 months ago |
Sanyam Bhutani
|
5311cde8ed
Add FAQ (#842)
|
9 months ago |
Sanyam Bhutani
|
cdf4a1ab46
Update README.md
|
9 months ago |