|  Abhilash Majumder | 4a7bad83af
							
							Merge branch 'main' into ipex_feature | 2 lat temu | 
				
					
						|  Geeta Chauhan | cfba150311
							
							adding llama code inference (#144) | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | 6105a3f886
							
							clarifying the infilling use-case | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | 8b0008433c
							
							fix typos | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | 564ef2f628
							
							remove padding logic | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | 277a292fbc
							
							adding autotokenizer | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | 3f2fb9167e
							
							adding notes to model not supporting infilling | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | c62428b99c
							
							setting defaults of temp and top_p | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | c014ae7cb8
							
							setting BT option to true | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | 4fa44e16d9
							
							add note for python llama not suited for llama infilling | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | b18a186385
							
							removing the option to take prompt from cli | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | 75991d8795
							
							fix the extra line added and remove take prompt from cli | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | d28fc9898a
							
							addressing doc comments | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | a234d1fe0c
							
							fix typos | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | 2d9f4796e8
							
							fixing the output format | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | 1e8ea70b26
							
							adding llama code inference | 2 lat temu | 
				
					
						|  Geeta Chauhan | 82e05c46e0
							
							fix a bug in the config for use_fast_kernels (#121) | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | 971c079aa6
							
							bugfix: remove duplicate load_peft_model (#124) | 2 lat temu | 
				
					
						|  hongbo.mo | fcc817e923
							
							bugfix: remove duplicate load_peft_model | 2 lat temu | 
				
					
						|  Brian Vaughan | 3faf005226
							
							fix a bug in the config for use_fast_kernels | 2 lat temu | 
				
					
						|  Abhilash Majumder | d5f39914e8
							
							Merge branch 'main' into ipex_feature | 2 lat temu | 
				
					
						|  abhilash1910 | 82d3ca6e06
							
							Fix bugs in data loading | 2 lat temu | 
				
					
						|  Geeta Chauhan | 03faba661f
							
							Update paddings  (#85) | 2 lat temu | 
				
					
						|  Geeta Chauhan | 205e5a4b81
							
							save cpu mem by leveraging FSDP rank0 broadcasting (#77) | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | 85a4ed1b65
							
							Merge branch 'main' into update_paddings | 2 lat temu | 
				
					
						|  abhilash1910 | ed7ba999a9
							
							enable xpu finetuning and inference | 2 lat temu | 
				
					
						|  lchu | feaa344af3
							
							resolve conflicts | 2 lat temu | 
				
					
						|  Geeta Chauhan | 3f1fef7a00
							
							adding flash attention and xformer memory efficient through PT SDPA (#97) | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | beab5726cc
							
							add notes for padding | 2 lat temu | 
				
					
						|  Hamid Shojanazeri | c3a11c4fbe
							
							update to main | 2 lat temu |