| .. | 
		
		
			
			
			
				
					| __init__.py | 207d2f80e9
					Make code-llama and hf-tgi inference runnable as module | %!s(int64=2) %!d(string=hai) anos | 
		
			
			
			
				
					| chat_utils.py | 6d9d48d619
					Use apply_chat_template instead of custom functions | hai 1 ano | 
		
			
			
			
				
					| checkpoint_converter_fsdp_hf.py | ce9501f22c
					remove relative imports | %!s(int64=2) %!d(string=hai) anos | 
		
			
			
			
				
					| llm.py | a404c9249c
					Notebook to demonstrate using llama and llama-guard together using OctoAI | hai 1 ano | 
		
			
			
			
				
					| model_utils.py | d51d2cce9c
					adding sdpa for flash attn | hai 1 ano | 
		
			
			
			
				
					| prompt_format_utils.py | bcdb5b31fe
					Fixing quantization config. Removing prints | hai 1 ano | 
		
			
			
			
				
					| safety_utils.py | f63ba19827
					Fixing tokenizer used for llama 3. Changing quantization configs on safety_utils. | hai 1 ano |