| Kai Wu | c18a0d277f | changed dataset to ocrvqa | 1 year ago |
| Kai Wu | bd22f407d5 | changed to aid2 dataset | 1 year ago |
| Kai Wu | 79dbe05a94 | batch fine-tuning lmm working | 1 year ago |
| Kai Wu | 12da109823 | Merge branch 'main' into lmm_finetune | 1 year ago |
| Kai Wu | bb990be967 | not working, need create dataloader function | 1 year ago |
| Matthias Reso | 778e31e35c | Fix checkpoint saving (#650) | 1 year ago |
| Kai Wu | ee204ccb98 | working now | 1 year ago |
| Kai Wu | b566582a86 | finetune not working with fsdp | 1 year ago |
| Matthias Reso | eca526526c | Use new get_model_state_dict api for save_pretrained peft model (#629) | 1 year ago |
| Matthias Reso | 7a8c52cb38 | Remove pkg_resources.packaging | 1 year ago |
| simwiki | 66e1867120 | Fix save metric FileNotFoundError when finetuning | 1 year ago |
| Kai Wu | 26e877fd42 | changed readme, unified the context interface and added get_flops_per_sec() | 1 year ago |
| Kai Wu | d9558c11ca | changed context name and add more docs | 1 year ago |
| Kai Wu | 03f1ca7817 | fixed some typo to pass spellcheck | 1 year ago |
| Kai Wu | 7b1a9413d2 | fixed a typo | 1 year ago |
| Kai Wu | 41434dc825 | formatted and removed duplicated or unused function get_total_flops() and byte2mb() | 1 year ago |
| Kai Wu | f2e80bae22 | created a FlopMeasure class on top of FlopCounterMode instead of keep of copy of our own tflop_counter.py | 1 year ago |
| Kai Wu | 69e46887b4 | handling incorrect profiling early stop caused by max_train_steps and add profiler.step() for each train step | 1 year ago |
| Kai Wu | 34e0bf4c6e | second draft of this feature, seems to be working now | 1 year ago |
| Kai Wu | a35519ee90 | fixed typo and handling unexpected exit | 1 year ago |
| Kai Wu | 2a5de9b448 | first draft of flop counter feature | 1 year ago |
| Kai Wu | e6f69f84ad | add max_steps_reached to reduce redundancy | 1 year ago |
| Kai Wu | fa0a389f74 | add max_step feature for training and eval | 1 year ago |
| jpgard | 6954b16b3b | only save training params on rank 0 | 1 year ago |
| Hamid Shojanazeri | 761b7e6e51 | adding wandb_run ro eval | 1 year ago |
| Hamid Shojanazeri | ffdc93f00a | Merge branch 'main' into wandb_logging | 1 year ago |
| Matthias Reso | c5a382e509 | Make tests run on cpu only machines | 1 year ago |
| Hamid Shojanazeri | 162be4c045 | Revert "Flop counter, profiling and GC (#357)" | 1 year ago |
| Hamid Shojanazeri | 1a09fb5d27 | add logging for setting profiler | 1 year ago |
| Hamid Shojanazeri | 71d137c722 | Merge branch 'main' into flop_counter_gc | 1 year ago |