Commit History

Author SHA1 Message Date
  Hamid Shojanazeri 50e9d17045 add the default option for find the HF model_name/path from train_param.yaml 2 years ago
  Hamid Shojanazeri 41dd7ff1cb Merge branch 'main' into checkpoint_handler_path_fix 2 years ago
  Hamid Shojanazeri 31d6ce8bf6 adding expnadable sgement and dist debug flag info 2 years ago
  Hamid Shojanazeri a955ed1999 added checks for dist barrier and commented cuda exapnadable segements and dist_dbug 2 years ago
  Hamid Shojanazeri a2403c7c1a clean up 2 years ago
  Hamid Shojanazeri e9559d2669 fixing the train/eval_loss calcualtion 2 years ago
  Geeta Chauhan 2243b962fa Create spellcheck.yml (#50) 2 years ago
  Geeta Chauhan 3cc2b3787f Fix broken links in Dataset.md (#49) 2 years ago
  Geeta Chauhan 021ed8e312 adding active mem stat (#44) 2 years ago
  Geeta Chauhan 09db361d23 Templates updates (#67) 2 years ago
  Hamid Shojanazeri 4ba4400a75 adding dist barrier before and after checkpointing 2 years ago
  chauhang 95d59afcb8 Update PR template 2 years ago
  chauhang 857a3ade4e Add PR template 2 years ago
  chauhang 9f9532d34c comm 2 years ago
  Christian Miller 9b2f72e1f5 update README: python 3.8 rec + fix formatting 2 years ago
  Hamid Shojanazeri a49a2c2804 adding PT cuda allocation expand flag 2 years ago
  Geeta Chauhan 905f633dab adding issue tempalte (#57) 2 years ago
  Hamid Shojanazeri b814704b5f adding issue tempalte 2 years ago
  Hamid Shojanazeri 442c1ccf7c adding barrier to end of trainer loop 2 years ago
  Hamid Shojanazeri f74d57dc08 printing scores based on fsdp usage or single gpu 2 years ago
  Hamid Shojanazeri 3d887ea483 update with active memory and removing rank0 for eval score 2 years ago
  sekyonda 0d9c1a909f Update markdown_link_check_config.json 2 years ago
  Hamid Shojanazeri bedb96b78a fixing the full state path in checkpoint handler 2 years ago
  sekyondaMeta b625dceb9b Create spellcheck.yml 2 years ago
  Kaiser Pister b61c45d31d Fix broken links in Dataset.md 2 years ago
  Hamid Shojanazeri 569f8b7976 fixed arg names 2 years ago
  Hamid Shojanazeri 9e3b1b7f01 fixed arg names 2 years ago
  Hamid Shojanazeri 4b18e49f44 added steps for conversion of fsdp to Hf 2 years ago
  Geeta Chauhan 74bde65a62 Adding Supporting Files For link and Spell Check (#26) 2 years ago
  Hamid Shojanazeri a977145a9b change bf16 default to false 2 years ago