Matthias Reso | 8620ab8ac2 Fix invalid labels for context in custom dataset/oasst1 | 2 years ago
Matthias Reso | 52c417b7d5 Merge branch 'fix/invalidate_label_for_chat' into feature/length_based_batch_sampling | 2 years ago
Matthias Reso | 653a79e3dd Invalidate context in labels for samsum + grammar | 2 years ago
Matthias Reso | d3015b4c80 Remove max_word from alpaca; lets deal tokenizer deal with truncation | 2 years ago
Matthias Reso | a647955fc8 Make packing/padding a training setting | 2 years ago
Matthias Reso | eafea7b366 Invalidate labels in dialog dataset to disable loss | 2 years ago
Matthias Reso | cc8cc0d3c3 fix grammar dataset | 2 years ago
Matthias Reso | 2e4bd2a665 Resize vocab size to fix idx error | 2 years ago
Matthias Reso | 10f9367e56 fix missing labels in datasets | 2 years ago
Matthias Reso | f2d02a9362 Add unit test for dis sampler | 2 years ago
Matthias Reso | be63d9ec39 Remove padding in alpaca ds; remove concat in grammar | 2 years ago
Matthias Reso | ddf58d205d Added dist length based batch sampler | 2 years ago
Matthias Reso | ca41c1c697 Adjust tests to len based batch sampling | 2 years ago
Matthias Reso | 97a7871f4b Fix seed in test | 2 years ago
Matthias Reso | 17209cdabd Add license to test file | 2 years ago
Matthias Reso | d5054ecae9 Move sampler test | 2 years ago
Matthias Reso | 63ce4ce7f6 Moved sampler to data submodule | 2 years ago
Matthias Reso | f620f3589d Adds length based batch sampler | 2 years ago
Matthias Reso | 8ac44ef3be Fix vocab size mismatch in inference due to added pad token | 2 years ago
Geeta Chauhan | 40b32ba559 Fix tqdm bar not change length after terminal is resized (#201) | 2 years ago
hongbo.mo | 6217635e87 Fix tqdm bar not change length after terminal is resized | 2 years ago
Matthias Reso | 0b2fa40dba Add unit test for weight decay | 2 years ago
Shijie Wu | 91e2573aa8 pass weight_decay into optimizer | 2 years ago
Hamid Shojanazeri | c38bf5bdd3 Add FSDP CPU offloading option (#122) | 2 years ago
Howard Liberty | cc356b6017 Add FSDP CPU offloading option | 2 years ago
Yuanhao | e554c1c8bf The tokenizer will not add eos_token by default | 2 years ago
tim-a-davis | 3038020aa4 Replaced ClassVar config param with field | 2 years ago
Hamid Shojanazeri | cfacee4302 Update LLM_finetuning.md | 2 years ago
Hamid Shojanazeri | 62dd2b3f4b Update docs/LLM_finetuning.md | 2 years ago
varunfb | 6f2201c655 Updated spell checker to resolve the issues in LLM_finetuning.md | 2 years ago