Commit History

Autor SHA1 Mensaxe Data
  Kai Wu c18a0d277f changed dataset to ocrvqa hai 10 meses
  Kai Wu bd22f407d5 changed to aid2 dataset hai 10 meses
  Kai Wu 79dbe05a94 batch fine-tuning lmm working hai 10 meses
  Kai Wu 12da109823 Merge branch 'main' into lmm_finetune hai 10 meses
  Kai Wu bb990be967 not working, need create dataloader function hai 10 meses
  Matthias Reso 778e31e35c Fix checkpoint saving (#650) hai 11 meses
  Kai Wu ee204ccb98 working now hai 11 meses
  Kai Wu b566582a86 finetune not working with fsdp hai 11 meses
  Matthias Reso eca526526c Use new get_model_state_dict api for save_pretrained peft model (#629) hai 11 meses
  Matthias Reso 7a8c52cb38 Remove pkg_resources.packaging hai 1 ano
  simwiki 66e1867120 Fix save metric FileNotFoundError when finetuning hai 1 ano
  Kai Wu 26e877fd42 changed readme, unified the context interface and added get_flops_per_sec() hai 1 ano
  Kai Wu d9558c11ca changed context name and add more docs hai 1 ano
  Kai Wu 03f1ca7817 fixed some typo to pass spellcheck hai 1 ano
  Kai Wu 7b1a9413d2 fixed a typo hai 1 ano
  Kai Wu 41434dc825 formatted and removed duplicated or unused function get_total_flops() and byte2mb() hai 1 ano
  Kai Wu f2e80bae22 created a FlopMeasure class on top of FlopCounterMode instead of keep of copy of our own tflop_counter.py hai 1 ano
  Kai Wu 69e46887b4 handling incorrect profiling early stop caused by max_train_steps and add profiler.step() for each train step hai 1 ano
  Kai Wu 34e0bf4c6e second draft of this feature, seems to be working now hai 1 ano
  Kai Wu a35519ee90 fixed typo and handling unexpected exit hai 1 ano
  Kai Wu 2a5de9b448 first draft of flop counter feature hai 1 ano
  Kai Wu e6f69f84ad add max_steps_reached to reduce redundancy hai 1 ano
  Kai Wu fa0a389f74 add max_step feature for training and eval hai 1 ano
  jpgard 6954b16b3b only save training params on rank 0 hai 1 ano
  Hamid Shojanazeri 761b7e6e51 adding wandb_run ro eval hai 1 ano
  Hamid Shojanazeri ffdc93f00a Merge branch 'main' into wandb_logging hai 1 ano
  Matthias Reso c5a382e509 Make tests run on cpu only machines hai 1 ano
  Hamid Shojanazeri 162be4c045 Revert "Flop counter, profiling and GC (#357)" hai 1 ano
  Hamid Shojanazeri 1a09fb5d27 add logging for setting profiler hai 1 ano
  Hamid Shojanazeri 71d137c722 Merge branch 'main' into flop_counter_gc hai 1 ano