Commit History

作者 SHA1 備註 提交日期
  celestinoalan 2a94bfff26 Append epoch rather than best val. loss to val_loss 11 月之前
  celestinoalan d6ae2031c3 Fix fine-tuning training loss accumulation (#725) 1 年之前
  Sanyam Bhutani 5fdeb55058 chore: update train_utils.py (#690) 1 年之前
  Lucas Ventura 2774065891 Improve model checkpoint saving logic (#691) 1 年之前
  Ikko Eltociear Ashimine 139824aafa chore: update train_utils.py 1 年之前
  Kai Wu c18a0d277f changed dataset to ocrvqa 1 年之前
  Kai Wu bd22f407d5 changed to aid2 dataset 1 年之前
  Kai Wu 79dbe05a94 batch fine-tuning lmm working 1 年之前
  Kai Wu 12da109823 Merge branch 'main' into lmm_finetune 1 年之前
  Kai Wu bb990be967 not working, need create dataloader function 1 年之前
  Matthias Reso 778e31e35c Fix checkpoint saving (#650) 1 年之前
  Kai Wu ee204ccb98 working now 1 年之前
  Kai Wu b566582a86 finetune not working with fsdp 1 年之前
  Matthias Reso eca526526c Use new get_model_state_dict api for save_pretrained peft model (#629) 1 年之前
  Matthias Reso 7a8c52cb38 Remove pkg_resources.packaging 1 年之前
  simwiki 66e1867120 Fix save metric FileNotFoundError when finetuning 1 年之前
  Kai Wu 26e877fd42 changed readme, unified the context interface and added get_flops_per_sec() 1 年之前
  Kai Wu d9558c11ca changed context name and add more docs 1 年之前
  Kai Wu 03f1ca7817 fixed some typo to pass spellcheck 1 年之前
  Kai Wu 7b1a9413d2 fixed a typo 1 年之前
  Kai Wu 41434dc825 formatted and removed duplicated or unused function get_total_flops() and byte2mb() 1 年之前
  Kai Wu f2e80bae22 created a FlopMeasure class on top of FlopCounterMode instead of keep of copy of our own tflop_counter.py 1 年之前
  Kai Wu 69e46887b4 handling incorrect profiling early stop caused by max_train_steps and add profiler.step() for each train step 1 年之前
  Kai Wu 34e0bf4c6e second draft of this feature, seems to be working now 1 年之前
  Kai Wu a35519ee90 fixed typo and handling unexpected exit 1 年之前
  Kai Wu 2a5de9b448 first draft of flop counter feature 1 年之前
  Kai Wu e6f69f84ad add max_steps_reached to reduce redundancy 1 年之前
  Kai Wu fa0a389f74 add max_step feature for training and eval 1 年之前
  jpgard 6954b16b3b only save training params on rank 0 1 年之前
  Hamid Shojanazeri 761b7e6e51 adding wandb_run ro eval 1 年之前