|  Kevin Slagle | 52a85e1564
							
							merge | 1 year ago | 
				
					
						|  Hamid Shojanazeri | 808a3f7a0c
							
							Adding support for FSDP+Qlora. (#572) | 1 year ago | 
				
					
						|  Kai Wu | 480c4f2b5e
							
							resume the finetuning given the path of the previous peft checkpoint folder | 1 year ago | 
				
					
						|  Kevin Slagle | 2f7001ef73
							
							document less obvious training config parameters | 1 year ago | 
				
					
						|  Matthias Reso | 091d58df17
							
							Disable prefix tuning as its currently not supported; Limit llama_adapter usage to non-FSDP only | 1 year ago | 
				
					
						|  Kai Wu | 26e877fd42
							
							changed readme, unified the context interface and added get_flops_per_sec() | 1 year ago | 
				
					
						|  Kai Wu | fe51935fa6
							
							Merge branch 'main' into feature/flop_counter | 1 year ago | 
				
					
						|  Hamid Shojanazeri | df03fd4b12
							
							Recipe to add a new language to Llama2 (#429) | 1 year ago | 
				
					
						|  Kai Wu | 03f1ca7817
							
							fixed some typo to pass spellcheck | 1 year ago | 
				
					
						|  Kai Wu | a35519ee90
							
							fixed typo and handling unexpected exit | 1 year ago | 
				
					
						|  Rahul A R | 2fa8e69b62
							
							add new argument: tokenizer_name | 1 year ago | 
				
					
						|  Kai Wu | fa0a389f74
							
							add max_step feature for training and eval | 1 year ago | 
				
					
						|  Hamid Shojanazeri | ffdc93f00a
							
							Merge branch 'main' into wandb_logging | 1 year ago | 
				
					
						|  Hamid Shojanazeri | 162be4c045
							
							Revert "Flop counter, profiling and GC (#357)" | 1 year ago | 
				
					
						|  Hamid Shojanazeri | 71d137c722
							
							Merge branch 'main' into flop_counter_gc | 1 year ago | 
				
					
						|  Beto | 7474514fe0
							
							Merging with main | 1 year ago | 
				
					
						|  kldarek | 989b6ee812
							
							wandb logging feedback | 1 year ago | 
				
					
						|  gaopengzhi | e2797abe9b
							
							Add gradient_clipping and gradient_clipping_threshold parameters | 1 year ago | 
				
					
						|  kldarek | cf373529f7
							
							basic wandb logging instrumentation | 1 year ago | 
				
					
						|  Beto | 17d02c3b44
							
							Adding config to conditionally save stats | 2 years ago | 
				
					
						|  gaopengzhi | 04befdef69
							
							Add gradient clipping feature | 2 years ago | 
				
					
						|  Matthias Reso | a647955fc8
							
							Make packing/padding a training setting | 2 years ago | 
				
					
						|  Hamid Shojanazeri | 35b394e49f
							
							adding profiler and flop_counter | 2 years ago | 
				
					
						|  Hamid Shojanazeri | d56d5c469d
							
							adding flop counter | 2 years ago | 
				
					
						|  Matthias Reso | 72a9832571
							
							Merge branch 'main' into feature/package_distribution | 2 years ago | 
				
					
						|  Matthias Reso | cf678b9bf0
							
							Adjust imports to package structure + cleaned up imports | 2 years ago | 
				
					
						|  Matthias Reso | 4c9cc7d223
							
							Move modules into separate src folder | 2 years ago |