Aidan Do ab1b1450d7 Add gpqa and math evals for instruct models 9 months ago
gpqa_0shot.yaml ab1b1450d7 Add gpqa and math evals for instruct models 9 months ago
utils.py ab1b1450d7 Add gpqa and math evals for instruct models 9 months ago