|  Matthias Reso | 00e0b0be6c
							
							Apply suggestions from code review | 1 年之前 | 
				
					
						|  Matthias Reso | 190d543b53
							
							Add fp8 references | 1 年之前 | 
				
					
						|  Matthias Reso | c167945448
							
							remove 405B ft doc | 1 年之前 | 
				
					
						|  Matthias Reso | b0b4e16aec
							
							Update docs/multi_gpu.md | 1 年之前 | 
				
					
						|  Matthias Reso | e2f77dbc21
							
							fix quant config | 1 年之前 | 
				
					
						|  Matthias Reso | 6ef9a78458
							
							Fix issues with quantization_config == None | 1 年之前 | 
				
					
						|  Matthias Reso | b319a9fb8c
							
							Fix lint issue | 1 年之前 | 
				
					
						|  Matthias Reso | a3fd369127
							
							Ref from infernce recipes to vllm for 405B | 1 年之前 | 
				
					
						|  Matthias Reso | a8f2267324
							
							Added multi node doc to multigpu_finetuning.md | 1 年之前 | 
				
					
						|  Matthias Reso | afb3b75892
							
							Add 405B + QLoRA + FSDP to multi_gpu.md doc | 1 年之前 | 
				
					
						|  Matthias Reso | 939c88fb04
							
							Add 405B + QLoRA ref to LLM finetung | 1 年之前 | 
				
					
						|  Matthias Reso | d2fd9c163a
							
							Added doc for multi-node vllm inference | 1 年之前 | 
				
					
						|  Matthias Reso | c9ae014459
							
							Enable pipeline parallelism through use of AsyncLLMEngine in vllm inferecen + enable use of lora adapter | 1 年之前 | 
				
					
						|  Matthias Reso | 0920b1a415
							
							Fix quantization for inference | 1 年之前 | 
				
					
						|  Matthias Reso | b36830fdf6
							
							Fix reading in stdin for chat_completion, remove padding as we're feeding single samples | 1 年之前 | 
				
					
						|  Matthias Reso | f0aa8e31ca
							
							Update url | 1 年之前 | 
				
					
						|  Matthias Reso | 9db61e5235
							
							Refactored infeence to allow multiple requests through gradio | 1 年之前 | 
				
					
						|  Thomas Robinson | fd9f52f710
							
							Modify prompt_format_utils with changes necessary for Llama Guard 3 (#1) | 1 年之前 | 
				
					
						|  Cyrus Nikolaidis | 0c57646481
							
							Prompt Guard Tutorial | 1 年之前 | 
				
					
						|  Hamid Shojanazeri | 808a3f7a0c
							
							Adding support for FSDP+Qlora. (#572) | 1 年之前 | 
				
					
						|  Jeff Tang | ba447971f0
							
							Port of DLAI LlamaIndex Agent short course lessons 2-4 to use Llama 3 (#594) | 1 年之前 | 
				
					
						|  Jeff Tang | 935ad46a0d
							
							wordlist update for DLAI LlamaIndex Agent short course | 1 年之前 | 
				
					
						|  Jeff Tang | af8838463e
							
							added lesson summary in each notebook and README | 1 年之前 | 
				
					
						|  Jeff Tang | aaeba04bd6
							
							README update | 1 年之前 | 
				
					
						|  Jeff Tang | 353ceaae74
							
							fix of cell order issue for L3 | 1 年之前 | 
				
					
						|  dongwang218 | ed3136f117
							
							Update hf weight conversion script to llama 3 (#551) | 1 年之前 | 
				
					
						|  Kai Wu | f6617fb86a
							
							changed --pure_bf16 to --fsdp_config.pure_bf16 and corrected "examples/" path (#587) | 1 年之前 | 
				
					
						|  Jeff Tang | 2e4ea5b728
							
							cell cleanup | 1 年之前 | 
				
					
						|  Jeff Tang | 0fef52e846
							
							README links fixed | 1 年之前 | 
				
					
						|  Jeff Tang | ebbf362576
							
							L4 - replace groq with fireworks to fix rate limit | 1 年之前 |