Sanyam Bhutani 0fc9fca53f Update multi_modal_infer.py il y a 6 mois
..
code_llama 2f1cbfbbbf Merge remote-tracking branch 'upstream/main' into suraj-changes il y a 9 mois
local_inference 0fc9fca53f Update multi_modal_infer.py il y a 6 mois
mobile_inference 4487513793 Updating the folder name 3p_integrations il y a 9 mois
README.md d17e678659 Add Llama 3.1 example upgrade script (#5) il y a 9 mois
modelUpgradeExample.py d17e678659 Add Llama 3.1 example upgrade script (#5) il y a 9 mois

README.md

Quickstart > Inference

This folder contains scripts to get you started with inference on Meta Llama models.

  • Code Llama contains scripts for tasks relating to code generation using CodeLlama
  • Local Inference contains scripts to do memory efficient inference on servers and local machines
  • Mobile Inference has scripts using MLC to serve Llama on Android (h/t to OctoAI for the contribution!)
  • Model Update Example shows an example of replacing a Llama 3 model with a Llama 3.1 model.