|
@@ -2,7 +2,7 @@
|
|
|
|
|
|
## Recipe Overview: Multi-Modal RAG using `Llama-3.2-11B` model:
|
|
|
|
|
|
-This is a complete workshop on how to label images using the new Llama 3.2-Vision Models and performing RAG using the image caption capablites of the model.
|
|
|
+This is a complete workshop on how to label images using the new Llama 3.2-Vision Models and performing RAG using the image caption capabilities of the model.
|
|
|
|
|
|
- **Data Labeling and Preparation:** We start by downloading 5000 images of clothing items and labeling them using `Llama-3.2-11B-Vision-Instruct` model
|
|
|
- **Cleaning Labels:** With the labels based on the notebook above, we will then clean the dataset and prepare it for RAG
|