Commit History

Autor SHA1 Mensaxe Data
  Justin Lee dc406b4769 setup meta-eval for benchmark, ray error hai 7 meses
  Justin Lee 9ffb292272 added inspect and modified harness hai 7 meses
  Justin Lee eea96618cf batching and parallelization, ran on baseline and lite hai 7 meses
  Justin Lee becbe77ff3 attempt to fix json output format in eval hai 7 meses
  Justin Lee 2776a35314 harness runcode hai 8 meses
  Justin Lee 62b53676fb update harness notebook hai 8 meses
  Justin Lee e52e1d1ab4 updated prompt migration to use benchmark and also mipro, added meta implementation hai 8 meses