| Name | Last commit | Updated |
|---|---|---|
| hf_text_generation_inference | 6d449a859b New folder structure (#1) | 2 years ago |
| vllm | 6d449a859b New folder structure (#1) | 2 years ago |
| README.md | 6d449a859b New folder structure (#1) | 2 years ago |
| llama-on-prem.md | eb4a0bb644 Update deprecated demo app links to recipes | 2 years ago |
This tutorial shows how to use Llama 2 with vLLM and Hugging Face TGI to build Llama 2 on-prem apps.
* To run a quantized Llama 2 model on iOS and Android, you can use the open-source MLC LLM or llama.cpp. You can even build a Linux OS that boots to Llama 2 (repo).
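
For the server-side path this tutorial covers (vLLM and Hugging Face TGI), an on-prem app usually talks to the model over HTTP. The sketch below is a minimal illustration, not the tutorial's own code. It assumes a vLLM OpenAI-compatible server at `localhost:8000` and a TGI server at `localhost:8080`; both addresses, the helper names `build_vllm_request`, `build_tgi_request`, and `send`, and the default token limits are hypothetical and should be adapted to your deployment.

```python
import json
import urllib.request

# Assumed local endpoints; change host and port to match your own servers.
VLLM_URL = "http://localhost:8000/v1/completions"  # vLLM OpenAI-compatible API
TGI_URL = "http://localhost:8080/generate"         # TGI generate route


def _json_post(url, payload):
    """Build (but do not send) a JSON POST request."""
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def build_vllm_request(prompt, model, max_tokens=64):
    """Completion request for a vLLM OpenAI-compatible server.

    `model` must match the model name the server was launched with.
    """
    payload = {"model": model, "prompt": prompt, "max_tokens": max_tokens}
    return _json_post(VLLM_URL, payload)


def build_tgi_request(prompt, max_new_tokens=64):
    """Generation request for a Hugging Face TGI server."""
    payload = {
        "inputs": prompt,
        "parameters": {"max_new_tokens": max_new_tokens},
    }
    return _json_post(TGI_URL, payload)


def send(request, timeout=60):
    """Send a prepared request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With a server running, `send(build_vllm_request("Hello", "meta-llama/Llama-2-7b-chat-hf"))` returns a JSON object whose generated text is under `choices[0]["text"]`, while `send(build_tgi_request("Hello"))` returns it under `generated_text`. Keeping request construction separate from sending makes the payloads easy to inspect without a live server.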