Commit History

Author SHA1 Message Date
  Kai Wu d0b7a20c89 finetuning readme updated 2 years ago
  Kai Wu 7b1a9413d2 fixed a typo 2 years ago
  Kai Wu 41434dc825 formatted and removed duplicated or unused function get_total_flops() and byte2mb() 2 years ago
  Kai Wu f2e80bae22 created a FlopMeasure class on top of FlopCounterMode instead of keep of copy of our own tflop_counter.py 2 years ago
  Kai Wu 69e46887b4 handling incorrect profiling early stop caused by max_train_steps and add profiler.step() for each train step 2 years ago
  Kai Wu 34e0bf4c6e second draft of this feature, seems to be working now 2 years ago
  Kai Wu a35519ee90 fixed typo and handling unexpected exit 2 years ago
  Kai Wu 2a5de9b448 first draft of flop counter feature 2 years ago
  Hamid Shojanazeri aaa9e2c863 Adding a feature that will stop the training/eval process after reaching some max_steps (#428) 2 years ago
  Kai Wu e6f69f84ad add max_steps_reached to reduce redundancy 2 years ago
  Kai Wu 362cda0fa6 fixing test_gradient_accumulation and test_save_to_json 2 years ago
  Kai Wu fa0a389f74 add max_step feature for training and eval 2 years ago
  Hamid Shojanazeri 37c8f72211 Update location and name of llm.py example notebook (#417) 2 years ago
  Thomas Robinson 79266217ef Update location and name of llm.py example notebook 2 years ago
  Hamid Shojanazeri f7aa02af9f only save training params on rank 0 (#415) 2 years ago
  jpgard 6954b16b3b only save training params on rank 0 2 years ago
  Hamid Shojanazeri 64e189914f update due to peft new release (#407) 2 years ago
  Hamid Shojanazeri 11f51db28c adding the kbit prep in the code 2 years ago
  Hamid Shojanazeri f058ff6ccd update due to peft new release 2 years ago
  Hamid Shojanazeri 6a7478a6aa Reorg inference throughput folder structure (#404) 2 years ago
  Chester Hu 367e4869ac Reorg inference throughput folder structure 2 years ago
  Hamid Shojanazeri d6eb83f6c5 Add llm class so that externally-hosted models can be called (#398) 2 years ago
  Thomas Robinson 0346d0d5b8 Add documentation and examples 2 years ago
  Hamid Shojanazeri 43a1e5cdb0 Fix dead links after directory structure refactor (#397) 2 years ago
  Suraj Subramanian e2a35420c0 Remove octoai link that is 401-ing 2 years ago
  Suraj Subramanian 12602f32e2 Merge branch 'main' into subramen-patch-deadlinks 2 years ago
  Hamid Shojanazeri c8f4bdac41 Adding open in colab option for notebook (#395) 2 years ago
  Thomas Robinson 81984a9a44 Remove unnecessary spec format 2 years ago
  Suraj Subramanian f53f17138b fix dead links after refactor 2 years ago
  Thomas Robinson eee39a7463 Add llm.py class in order to call remotely hosted models 2 years ago