|  Matthias Reso | 4c225c65eb
							
							Fix order of concat vs sampler | 2 år sedan | 
				
					
						|  Matthias Reso | f9756ca79d
							
							Added packing test for samsum | 2 år sedan | 
				
					
						|  Matthias Reso | 5a359b7bf2
							
							Fix sampler vs batch_sampler | 2 år sedan | 
				
					
						|  Matthias Reso | fe8122daf1
							
							Adapt alpaca dataset to ConcatDataset | 2 år sedan | 
				
					
						|  Matthias Reso | 5da84b2913
							
							Fix usage of dataclass for train_config and fsdp_config | 2 år sedan | 
				
					
						|  Matthias Reso | aa5dee241a
							
							Fix unit test to reflect batch packing | 2 år sedan | 
				
					
						|  Matthias Reso | 8620ab8ac2
							
							Fix invalid labels for context in custom dataset/oasst1 | 2 år sedan | 
				
					
						|  Matthias Reso | 52c417b7d5
							
							Merge branch 'fix/invalidate_label_for_chat' into feature/length_based_batch_sampling | 2 år sedan | 
				
					
						|  Matthias Reso | 653a79e3dd
							
							Invalidate context in labels for samsum + grammar | 2 år sedan | 
				
					
						|  Matthias Reso | d3015b4c80
							
							Remove max_word from alpaca; lets deal tokenizer deal with truncation | 2 år sedan | 
				
					
						|  Matthias Reso | a647955fc8
							
							Make packing/padding a training setting | 2 år sedan | 
				
					
						|  Matthias Reso | eafea7b366
							
							Invalidate labels in dialog dataset to disable loss | 2 år sedan | 
				
					
						|  Matthias Reso | cc8cc0d3c3
							
							fix grammar dataset | 2 år sedan | 
				
					
						|  Matthias Reso | 2e4bd2a665
							
							Resize vocab size to fix idx error | 2 år sedan | 
				
					
						|  Matthias Reso | 10f9367e56
							
							fix missing labels in datasets | 2 år sedan | 
				
					
						|  Matthias Reso | f2d02a9362
							
							Add unit test for dis sampler | 2 år sedan | 
				
					
						|  Matthias Reso | be63d9ec39
							
							Remove padding in alpaca ds; remove concat in grammar | 2 år sedan | 
				
					
						|  Matthias Reso | ddf58d205d
							
							Added dist length based batch sampler | 2 år sedan | 
				
					
						|  Matthias Reso | ca41c1c697
							
							Adjust tests to len based batch sampling | 2 år sedan | 
				
					
						|  Matthias Reso | 97a7871f4b
							
							Fix seed in test | 2 år sedan | 
				
					
						|  Matthias Reso | 17209cdabd
							
							Add license to test file | 2 år sedan | 
				
					
						|  Matthias Reso | d5054ecae9
							
							Move sampler test | 2 år sedan | 
				
					
						|  Matthias Reso | 63ce4ce7f6
							
							Moved sampler to data submodule | 2 år sedan | 
				
					
						|  Matthias Reso | f620f3589d
							
							Adds length based batch sampler | 2 år sedan | 
				
					
						|  Matthias Reso | 8ac44ef3be
							
							Fix vocab size mismatch in inference due to added pad token | 2 år sedan | 
				
					
						|  Geeta Chauhan | 40b32ba559
							
							Fix tqdm bar not change length after terminal is resized (#201) | 2 år sedan | 
				
					
						|  hongbo.mo | 6217635e87
							
							Fix tqdm bar not change length after terminal is resized | 2 år sedan | 
				
					
						|  Matthias Reso | 0b2fa40dba
							
							Add unit test for weight decay | 2 år sedan | 
				
					
						|  Shijie Wu | 91e2573aa8
							
							pass weight_decay into optimizer | 2 år sedan | 
				
					
						|  Hamid Shojanazeri | c38bf5bdd3
							
							Add FSDP CPU offloading option (#122) | 2 år sedan |