Matthias Reso | 739483f262 Adjust test_grammar_datasets to stable sort | 1 year ago
Matthias Reso | b96e435cda Adjust test_samsum_dataset to second model | 1 year ago
Matthias Reso | fac41298b0 Adapt test_custom_dataset to new model | 1 year ago
Matthias Reso | 960014a3bb Fix test_custom_dataset by introducing a stable sort algorithm | 1 year ago
Matthias Reso | b5583b31d5 Adapt test_grammar_dataset to new model | 1 year ago
Matthias Reso | 17a6d16289 Test batching for both llama versions | 1 year ago
Matthias Reso | a414ca6a57 Update chat format for llama3 | 1 year ago
Matthias Reso | 113ea18bf1 Replace LlamaTokenizer with AutoTokenizer | 1 year ago
Hamid Shojanazeri | aaa9e2c863 Adding a feature that will stop the training/eval process after reaching some max_steps (#428) | 1 year ago
Kai Wu | e6f69f84ad add max_steps_reached to reduce redundancy | 1 year ago
Kai Wu | 362cda0fa6 fixing test_gradient_accumulation and test_save_to_json | 1 year ago
Kai Wu | fa0a389f74 add max_step feature for training and eval | 1 year ago
Hamid Shojanazeri | 37c8f72211 Update location and name of llm.py example notebook (#417) | 1 year ago
Thomas Robinson | 79266217ef Update location and name of llm.py example notebook | 1 year ago
Hamid Shojanazeri | f7aa02af9f only save training params on rank 0 (#415) | 1 year ago
jpgard | 6954b16b3b only save training params on rank 0 | 1 year ago
Hamid Shojanazeri | 64e189914f update due to peft new release (#407) | 1 year ago
Hamid Shojanazeri | 11f51db28c adding the kbit prep in the code | 1 year ago
Hamid Shojanazeri | f058ff6ccd update due to peft new release | 1 year ago
Hamid Shojanazeri | 6a7478a6aa Reorg inference throughput folder structure (#404) | 1 year ago
Chester Hu | 367e4869ac Reorg inference throughput folder structure | 1 year ago
Hamid Shojanazeri | d6eb83f6c5 Add llm class so that externally-hosted models can be called (#398) | 1 year ago
Thomas Robinson | 0346d0d5b8 Add documentation and examples | 1 year ago
Hamid Shojanazeri | 43a1e5cdb0 Fix dead links after directory structure refactor (#397) | 1 year ago
Suraj Subramanian | e2a35420c0 Remove octoai link that is 401-ing | 1 year ago
Suraj Subramanian | 12602f32e2 Merge branch 'main' into subramen-patch-deadlinks | 1 year ago
Hamid Shojanazeri | c8f4bdac41 Adding open in colab option for notebook (#395) | 1 year ago
Thomas Robinson | 81984a9a44 Remove unnecessary spec format | 1 year ago
Suraj Subramanian | f53f17138b fix dead links after refactor | 1 year ago
Thomas Robinson | eee39a7463 Add llm.py class in order to call remotely hosted models | 1 year ago