1 rok temu · 6a5c0f89de
--- a/recipes/responsible_ai/llama_guard/Llama
+++ b/recipes/responsible_ai/llama_guard/Llama
--- a/recipes/responsible_ai/llama_guard/README.md
+++ b/recipes/responsible_ai/llama_guard/README.md
@@ -2,62 +2,16 @@
 
																 <!-- markdown-link-check-disable -->
															
 
																 Meta Llama Guard is a language model that provides input and output guardrails for LLM inference. For more details and model cards, please visit the [PurpleLlama](https://github.com/meta-llama/PurpleLlama) repository.
															
 
																-This folder contains an example file to run inference with a locally hosted model, either using the Hugging Face Hub or a local path.
															
 
																+This notebook shows how to load the models with the transformers library and how to customize the categories.
															
 
																 ## Requirements
															
 
																-1. Access to Llama guard model weights on Hugging Face. To get access, follow the steps described [here](https://github.com/facebookresearch/PurpleLlama/tree/main/Llama-Guard#download)
															
 
																+1. Access to Llama guard model weights on Hugging Face. To get access, follow the steps described in the top of the model card in [Hugging Face](https://huggingface.co/meta-llama/Llama-Guard-3-1B)
															
 
																 2. Llama recipes package and it's dependencies [installed](https://github.com/meta-llama/llama-recipes?tab=readme-ov-file#installing)
															
 
																+3. Pillow package installed
															
 
																 ## Llama Guard inference script
															
 
																-For testing, you can add User or User/Agent interactions into the prompts list and the run the script to verify the results. When the conversation has one or more Agent responses, it's considered of type agent.
															
 
																-
															
 
																-
															
 
																-```
															
 
																-    prompts: List[Tuple[List[str], AgentType]] = [
															
 
																-        (["<Sample user prompt>"], AgentType.USER),
															
 
																-
															
 
																-        (["<Sample user prompt>",
															
 
																-        "<Sample agent response>"], AgentType.AGENT),
															
 
																-
															
 
																-        (["<Sample user prompt>",
															
 
																-        "<Sample agent response>",
															
 
																-        "<Sample user reply>",
															
 
																-        "<Sample agent response>",], AgentType.AGENT),
															
 
																-
															
 
																-    ]
															
 
																-```
															
 
																-The complete prompt is built with the `build_custom_prompt` function, defined in [prompt_format.py](../../../src/llama_recipes/inference/prompt_format_utils.py). The file contains the default Meta Llama Guard categories. These categories can adjusted and new ones can be added, as described in the [research paper](https://ai.meta.com/research/publications/llama-guard-llm-based-input-output-safeguard-for-human-ai-conversations/), on section 4.5 Studying the adaptability of the model.
															
 
																-<!-- markdown-link-check-enable -->
															
 
																-
															
 
																-To run the samples, with all the dependencies installed, execute this command:
															
 
																-
															
 
																-`python recipes/responsible_ai/llama_guard/inference.py`
															
 
																-
															
 
																-This is the output:
															
 
																-
															
 
																-```
															
 
																-['<Sample user prompt>']
															
 
																-> safe
															
 
																-
															
 
																-==================================
															
 
																-
															
 
																-['<Sample user prompt>', '<Sample agent response>']
															
 
																-> safe
															
 
																-
															
 
																-==================================
															
 
																-
															
 
																-['<Sample user prompt>', '<Sample agent response>', '<Sample user reply>', '<Sample agent response>']
															
 
																-> safe
															
 
																-
															
 
																-==================================
															
 
																-```
															
 
																-
															
 
																-To run it with a local model, you can use the `model_id` param in the inference script:
															
 
																-
															
 
																-`python recipes/responsible_ai/llama_guard/inference.py --model_id=/home/ubuntu/models/llama3/Llama-Guard-3-8B/ --llama_guard_version=LLAMA_GUARD_3`
															
 
																-
															
 
																-Note: Make sure to also add the llama_guard_version; by default it uses LLAMA_GUARD_3
															
 
																+To test the new models, follow this [notebook]().
															
 
																 ## Inference Safety Checker
															
 
																 When running the regular inference script with prompts, Meta Llama Guard will be used as a safety checker on the user prompt and the model output. If both are safe, the result will be shown, else a message with the error will be shown, with the word unsafe and a comma separated list of categories infringed. Meta Llama Guard is always loaded quantized using Hugging Face Transformers library with bitsandbytes.
															
@@ -66,7 +20,7 @@ In this case, the default categories are applied by the tokenizer, using the `ap
 
																 Use this command for testing with a quantized Llama model, modifying the values accordingly:
															
 
																-`python examples/inference.py --model_name <path_to_regular_llama_model> --prompt_file <path_to_prompt_file> --quantization 8bit --enable_llamaguard_content_safety`
															
 
																+`python examples/inference.py --model_name <path_to_regular_llama_model> --prompt_file <path_to_prompt_file> --quantization --enable_llamaguard_content_safety`
															
 
																 ## Llama Guard 3 Finetuning & Customization
															
 
																 The safety categories in Llama Guard 3 can be tuned for specific application needs. Existing categories can be removed and new categories can be added to the taxonomy. The [Llama Guard Customization](./llama_guard_customization_via_prompting_and_fine_tuning.ipynb) notebook walks through the process.