Commit History

Autor SHA1 Mensaxe Data
  Kai Wu bb990be967 not working, need create dataloader function hai 7 meses
  Kai Wu ee204ccb98 working now hai 8 meses
  Kai Wu b566582a86 finetune not working with fsdp hai 8 meses
  Matthias Reso 7a8c52cb38 Remove pkg_resources.packaging hai 11 meses
  simwiki 66e1867120 Fix save metric FileNotFoundError when finetuning hai 1 ano
  Kai Wu 26e877fd42 changed readme, unified the context interface and added get_flops_per_sec() hai 1 ano
  Kai Wu d9558c11ca changed context name and add more docs hai 1 ano
  Kai Wu 03f1ca7817 fixed some typo to pass spellcheck hai 1 ano
  Kai Wu 7b1a9413d2 fixed a typo hai 1 ano
  Kai Wu 41434dc825 formatted and removed duplicated or unused function get_total_flops() and byte2mb() hai 1 ano
  Kai Wu f2e80bae22 created a FlopMeasure class on top of FlopCounterMode instead of keep of copy of our own tflop_counter.py hai 1 ano
  Kai Wu 69e46887b4 handling incorrect profiling early stop caused by max_train_steps and add profiler.step() for each train step hai 1 ano
  Kai Wu 34e0bf4c6e second draft of this feature, seems to be working now hai 1 ano
  Kai Wu a35519ee90 fixed typo and handling unexpected exit hai 1 ano
  Kai Wu 2a5de9b448 first draft of flop counter feature hai 1 ano
  Kai Wu e6f69f84ad add max_steps_reached to reduce redundancy hai 1 ano
  Kai Wu fa0a389f74 add max_step feature for training and eval hai 1 ano
  jpgard 6954b16b3b only save training params on rank 0 hai 1 ano
  Hamid Shojanazeri 761b7e6e51 adding wandb_run ro eval hai 1 ano
  Hamid Shojanazeri ffdc93f00a Merge branch 'main' into wandb_logging hai 1 ano
  Matthias Reso c5a382e509 Make tests run on cpu only machines hai 1 ano
  Hamid Shojanazeri 162be4c045 Revert "Flop counter, profiling and GC (#357)" hai 1 ano
  Hamid Shojanazeri 1a09fb5d27 add logging for setting profiler hai 1 ano
  Hamid Shojanazeri 71d137c722 Merge branch 'main' into flop_counter_gc hai 1 ano
  Hamid Shojanazeri 8bf474b455 clean up hai 1 ano
  Hamid Shojanazeri 19089269d3 add gc hai 1 ano
  Hamid Shojanazeri dbfea484c6 Feature : Enable Intel GPU/XPU finetuning and inference (#116) hai 1 ano
  Beto 1f5b202c18 Adding tests for the save_metrics param in the train function hai 1 ano
  Beto 7474514fe0 Merging with main hai 1 ano
  gaopengzhi c7d410725b Merge branch 'main' into grad_clip hai 1 ano