提交历史

作者 SHA1 备注 提交日期
  lchu 1cc9df19e6 remove unused import 2 年之前
  lchu c453b668fa add doc example about using low_cpu_fsdp 2 年之前
  lchu 80a4c36707 further fix #90 2 年之前
  lchu 0c51b47262 fix #90 2 年之前
  lchu c19c5c69aa fix fsdp construction on low_cpu_fsdp 2 年之前
  lchu e216c6f1f3 address #87 2 年之前
  lchu 895dfcea30 add nightly check for using low_cpu_fsdp mode 2 年之前
  lchu 1e64fc98d9 switch to simpler param_init_fn and meta device init 2 年之前
  lchu 101391f46a Revert "replace init_empty_weights with torch.device(meta)" 2 年之前
  lchu c8d4f38d23 replace init_empty_weights with torch.device(meta) 2 年之前
  lchu d8a81bb531 save cpu mem by leveraging FSDP rank0 broadcasting 2 年之前
  Geeta Chauhan 1387b76e11 fixing the full state path in checkpoint handler+loss report calculation (#51) 2 年之前
  Hamid Shojanazeri 88d3e1febc fix the save_train_param condition 2 年之前
  Hamid Shojanazeri b56028c98d fixing the word list/spell check 2 年之前
  Hamid Shojanazeri 62be60355a resolving conflicts 2 年之前
  Geeta Chauhan 174b856591 update README: python 3.9 rec + fix formatting (#63) 2 年之前
  Geeta Chauhan 0cd5694a14 Fsdp inference checkpoints (#39) 2 年之前
  Hamid Shojanazeri c4e96af6ee clean up 2 年之前
  Christian Miller 7c1884c690 recommend python 3.9 2 年之前
  Hamid Shojanazeri 7d2e06821e fixing the path to script 2 年之前
  Hamid Shojanazeri 5f97db8f0c fix spell check word list 2 年之前
  Hamid Shojanazeri 017cadd04b Merge branch 'checkpoint_handler_path_fix' of https://github.com/facebookresearch/llama-recipes into checkpoint_handler_path_fix 2 年之前
  Hamid Shojanazeri 4f70348b94 remove the redundant lr step 2 年之前
  Hamid Shojanazeri 9c95ed4bbe clean up 2 年之前
  Hamid Shojanazeri 311a5c1eec add notes for train_param.yaml 2 年之前
  Hamid Shojanazeri 5b916114eb merge main branch 2 年之前
  Hamid Shojanazeri 668c364f6b add rank to save_train_params 2 年之前
  Hamid Shojanazeri 231c9e7da9 adding train_param.yaml saving for fsdp checkpoint loading for inference 2 年之前
  Hamid Shojanazeri 475e67b4ec clean up 2 年之前
  Hamid Shojanazeri 50e9d17045 add the default option for find the HF model_name/path from train_param.yaml 2 年之前