Himanshu Shukla 6b1c0d582b added working in single file for 1. terminal inferencing, 2. gradio inferencing, 3. checkpoint inferencing 11 tháng trước cách đây
..
NotebookLlama 716c23f9d0 Update Step-1 PDF-Pre-Processing-Logic.ipynb (#756) 1 năm trước cách đây
RAG 017bee0356 Update hello_llama_cloud.ipynb (#754) 1 năm trước cách đây
Running_Llama3_Anywhere 0f632b3e3d Fix version number in Python example 1 năm trước cách đây
agents a6c7fe650b Fix 2 1 năm trước cách đây
finetuning 4377505e4f Moved the file code-merge-inference.py from fine-tuning firectory to local_inference 11 tháng trước cách đây
images 50fcb53165 removed unnecessary images and updated colab link 11 tháng trước cách đây
inference 6b1c0d582b added working in single file for 1. terminal inferencing, 2. gradio inferencing, 3. checkpoint inferencing 11 tháng trước cách đây
Getting_to_know_Llama.ipynb ee34e1be19 typo fix lama -> llama line 127 1 năm trước cách đây
Prompt_Engineering_with_Llama_3.ipynb cb05f6e01a Add files via upload 1 năm trước cách đây
README.md e814d7d672 Update README.md 11 tháng trước cách đây
build_with_Llama_3_2.ipynb 50fcb53165 removed unnecessary images and updated colab link 11 tháng trước cách đây

README.md

Llama-Recipes Quickstart

If you are new to developing with Meta Llama models, this is where you should start. This folder contains introductory-level notebooks across different techniques relating to Meta Llama.

  • The Build_with_Llama 3.2 notebook showcases a comprehensive walkthrough of the new capabilities of Llama 3.2 models, including multimodal use cases, function/tool calling, Llama Stack, and Llama on edge.
  • The Running_Llama_Anywhere notebooks demonstrate how to run Llama inference across Linux, Mac and Windows platforms using the appropriate tooling.
  • The Prompt_Engineering_with_Llama notebook showcases the various ways to elicit appropriate outputs from Llama. Take this notebook for a spin to get a feel for how Llama responds to different inputs and generation parameters.
  • The inference folder contains scripts to deploy Llama for inference on server and mobile. See also 3p_integrations/vllm and 3p_integrations/tgi for hosting Llama on open-source model servers.
  • The RAG folder contains a simple Retrieval-Augmented Generation application using Llama.
  • The finetuning folder contains resources to help you finetune Llama on your custom datasets, for both single- and multi-GPU setups. The scripts use the native llama-recipes finetuning code found in finetuning.py which supports these features: