Justin Lee | 4768a41a20 | merge utils, added configs and sample notebook to run optimizer | 1 month ago
Justin Lee | 3174e5bd2a | code handoff | 2 months ago
Justin Lee | 423231e139 | updated mmlu and harness | 2 months ago
Justin Lee | 52c5a76165 | made changes to utils | 2 months ago
Justin Lee | 7a014b3e00 | update readme | 2 months ago
Justin Lee | d4638ba575 | updated gitignore | 2 months ago
Justin Lee | d214437e3e | Stop tracking files in eval_results/meta-llama__Llama-3.3-70B-Instruct | 2 months ago
Justin Lee | e1d64ca2f4 | update gitignore, added mmlu 0shot and ran a bunch of test | 2 months ago
Justin Lee | 479b1fbbd7 | updated mmlu meta-eval for prompt migration | 2 months ago
Justin Lee | caeddccb8d | update utils | 2 months ago
Justin Lee | f8a6c7d79f | running mmlu pro with meta eval - fixed error | 2 months ago
Chester Hu | 07b191b17e | Merge pull request #2 from pia-papanna/tools-refactory-chester | 10 months ago
Justin Lee | dc406b4769 | setup meta-eval for benchmark, ray error | 2 months ago
Justin Lee | 21e04c29bf | update mmlu pro | 2 months ago
Justin Lee | e19b9e9e34 | added fix split, gitignore and download mmlu script | 2 months ago
Justin Lee | 8d3a0479e5 | updated env file | 2 months ago
Justin Lee | 9ffb292272 | added inspect and modified harness | 2 months ago
Justin Lee | eea96618cf | batching and parallelization, ran on baseline and lite | 3 months ago
Justin Lee | 4fd5f29414 | revert to previous changes | 3 months ago
Justin Lee | a6f448f362 | <Replace this line with a title. Use 1 line only, 67 chars or less> | 3 months ago
Justin Lee | becbe77ff3 | attempt to fix json output format in eval | 3 months ago
Justin Lee | 03f2b8eddd | change gpu parallel size docs | 3 months ago
Justin Lee | 0bec41f86a | updated readme | 3 months ago
Justin Lee | 2776a35314 | harness runcode | 3 months ago
Justin Lee | 314b6a874a | added updated llama-mmlu-pro and added human-eva | 3 months ago
Justin Lee | 5730a84b8a | beef up readme | 3 months ago
Justin Lee | 62b53676fb | update harness notebook | 3 months ago
Justin Lee | 1e4c6d22dd | update harness notebook | 3 months ago
Justin Lee | e52e1d1ab4 | updated prompt migration to use benchmark and also mipro, added meta implementation | 3 months ago
Justin Lee | 4d75fe97b5 | update dir | 3 months ago