Commit History

Autor SHA1 Mensaxe Data
  Justin Lee 21e04c29bf update mmlu pro hai 7 meses
  Justin Lee e19b9e9e34 added fix split, gitignore and download mmlu script hai 7 meses
  Justin Lee 8d3a0479e5 updated env file hai 7 meses
  Justin Lee 9ffb292272 added inspect and modified harness hai 8 meses
  Justin Lee eea96618cf batching and parallelization, ran on baseline and lite hai 8 meses
  Justin Lee 4fd5f29414 revert to previous changes hai 8 meses
  Justin Lee a6f448f362 <Replace this line with a title. Use 1 line only, 67 chars or less> hai 8 meses
  Justin Lee becbe77ff3 attempt to fix json output format in eval hai 8 meses
  Justin Lee 03f2b8eddd change gpu parallel size docs hai 8 meses
  Justin Lee 0bec41f86a updated readme hai 8 meses
  Justin Lee 2776a35314 harness runcode hai 8 meses
  Justin Lee 314b6a874a added updated llama-mmlu-pro and added human-eva hai 8 meses
  Justin Lee 5730a84b8a beef up readme hai 8 meses
  Justin Lee 62b53676fb update harness notebook hai 8 meses
  Justin Lee 1e4c6d22dd update harness notebook hai 8 meses
  Justin Lee e52e1d1ab4 updated prompt migration to use benchmark and also mipro, added meta implementation hai 8 meses
  Justin Lee 4d75fe97b5 update dir hai 8 meses
  Justin Lee 90d16cd7de minor changes in eval, deleted formatter hai 9 meses
  Justin Lee b85811d0b9 change eval dataset, include more robust judging, improved main hai 9 meses
  Justin Lee 43a2cbc220 adding eval dataset hai 9 meses
  Justin Lee 263b8b569d placeholder readme hai 9 meses
  Justin Lee 096249bf33 add .env settings and configure yml hai 9 meses
  Justin Lee a3e96e4e46 add engine and eval dataset hai 9 meses
  Justin Lee 08e41d0d0a add usage guide and init hai 9 meses
  Justin Lee 2570d1642a added evaluator and formatter and main hai 9 meses
  Igor Kasianenko 4ad1c0f30c Update README.md (#843) hai 8 meses
  Naveen Reddy Gundlagutta e951b567ba Update README.md hai 8 meses
  Sanyam Bhutani 5311cde8ed Add FAQ (#842) hai 8 meses
  Sanyam Bhutani cdf4a1ab46 Update README.md hai 8 meses
  varunfb f4dbbf9261 Refactor Llama-Recipes (#832) hai 8 meses