Kai Wu
|
c18a0d277f
changed dataset to ocrvqa
|
7 months ago |
Kai Wu
|
bd22f407d5
changed to aid2 dataset
|
7 months ago |
Kai Wu
|
79dbe05a94
batch fine-tuning lmm working
|
7 months ago |
Kai Wu
|
12da109823
Merge branch 'main' into lmm_finetune
|
7 months ago |
Kai Wu
|
bb990be967
not working, need create dataloader function
|
7 months ago |
Matthias Reso
|
778e31e35c
Fix checkpoint saving (#650)
|
7 months ago |
Kai Wu
|
ee204ccb98
working now
|
7 months ago |
Kai Wu
|
b566582a86
finetune not working with fsdp
|
7 months ago |
Matthias Reso
|
eca526526c
Use new get_model_state_dict api for save_pretrained peft model (#629)
|
8 months ago |
Matthias Reso
|
7a8c52cb38
Remove pkg_resources.packaging
|
11 months ago |
simwiki
|
66e1867120
Fix save metric FileNotFoundError when finetuning
|
11 months ago |
Kai Wu
|
26e877fd42
changed readme, unified the context interface and added get_flops_per_sec()
|
11 months ago |
Kai Wu
|
d9558c11ca
changed context name and add more docs
|
1 year ago |
Kai Wu
|
03f1ca7817
fixed some typo to pass spellcheck
|
1 year ago |
Kai Wu
|
7b1a9413d2
fixed a typo
|
1 year ago |
Kai Wu
|
41434dc825
formatted and removed duplicated or unused function get_total_flops() and byte2mb()
|
1 year ago |
Kai Wu
|
f2e80bae22
created a FlopMeasure class on top of FlopCounterMode instead of keep of copy of our own tflop_counter.py
|
1 year ago |
Kai Wu
|
69e46887b4
handling incorrect profiling early stop caused by max_train_steps and add profiler.step() for each train step
|
1 year ago |
Kai Wu
|
34e0bf4c6e
second draft of this feature, seems to be working now
|
1 year ago |
Kai Wu
|
a35519ee90
fixed typo and handling unexpected exit
|
1 year ago |
Kai Wu
|
2a5de9b448
first draft of flop counter feature
|
1 year ago |
Kai Wu
|
e6f69f84ad
add max_steps_reached to reduce redundancy
|
1 year ago |
Kai Wu
|
fa0a389f74
add max_step feature for training and eval
|
1 year ago |
jpgard
|
6954b16b3b
only save training params on rank 0
|
1 year ago |
Hamid Shojanazeri
|
761b7e6e51
adding wandb_run to eval
|
1 year ago |
Hamid Shojanazeri
|
ffdc93f00a
Merge branch 'main' into wandb_logging
|
1 year ago |
Matthias Reso
|
c5a382e509
Make tests run on cpu only machines
|
1 year ago |
Hamid Shojanazeri
|
162be4c045
Revert "Flop counter, profiling and GC (#357)"
|
1 year ago |
Hamid Shojanazeri
|
1a09fb5d27
add logging for setting profiler
|
1 year ago |
Hamid Shojanazeri
|
71d137c722
Merge branch 'main' into flop_counter_gc
|
1 year ago |