Matthias Reso
|
d2fd9c163a
Added doc for multi-node vllm inference
|
1 سال پیش |
Thomas Robinson
|
1a183c0a5e
Introduce Llama guard customization notebook and associated dataset loader example
|
1 سال پیش |
Cyrus Nikolaidis
|
301e51a340
Merge branch 'main' of github.com:meta-llama/llama-recipes-alpha
|
1 سال پیش |
Cyrus Nikolaidis
|
883def17f0
Prompt Guard Inference for long strings
|
1 سال پیش |
Suraj
|
4be3eb0d17
Updates HF model_ids and readmes for 3.1
|
1 سال پیش |
Matthias Reso
|
c9ae014459
Enable pipeline parallelism through use of AsyncLLMEngine in vllm inferecen + enable use of lora adapter
|
1 سال پیش |
Suraj
|
d1d08f9b82
Update promptguard model-id
|
1 سال پیش |
Suraj
|
308026aad5
Adds tentative llamaguard HF model id, eos_token_id for model.generate
|
1 سال پیش |
Matthias Reso
|
0920b1a415
Fix quantization for inference
|
1 سال پیش |
Matthias Reso
|
b36830fdf6
Fix reading in stdin for chat_completion, remove padding as we're feeding single samples
|
1 سال پیش |
Matthias Reso
|
f0aa8e31ca
Update url
|
1 سال پیش |
Matthias Reso
|
9db61e5235
Refactored infeence to allow multiple requests through gradio
|
1 سال پیش |
Thomas Robinson
|
fd9f52f710
Modify prompt_format_utils with changes necessary for Llama Guard 3 (#1)
|
1 سال پیش |
Cyrus Nikolaidis
|
0c57646481
Prompt Guard Tutorial
|
1 سال پیش |
Hamid Shojanazeri
|
808a3f7a0c
Adding support for FSDP+Qlora. (#572)
|
1 سال پیش |
Jeff Tang
|
ba447971f0
Port of DLAI LlamaIndex Agent short course lessons 2-4 to use Llama 3 (#594)
|
1 سال پیش |
Jeff Tang
|
935ad46a0d
wordlist update for DLAI LlamaIndex Agent short course
|
1 سال پیش |
Jeff Tang
|
af8838463e
added lesson summary in each notebook and README
|
1 سال پیش |
Jeff Tang
|
aaeba04bd6
README update
|
1 سال پیش |
Jeff Tang
|
353ceaae74
fix of cell order issue for L3
|
1 سال پیش |
dongwang218
|
ed3136f117
Update hf weight conversion script to llama 3 (#551)
|
1 سال پیش |
Kai Wu
|
f6617fb86a
changed --pure_bf16 to --fsdp_config.pure_bf16 and corrected "examples/" path (#587)
|
1 سال پیش |
Jeff Tang
|
2e4ea5b728
cell cleanup
|
1 سال پیش |
Jeff Tang
|
0fef52e846
README links fixed
|
1 سال پیش |
Jeff Tang
|
ebbf362576
L4 - replace groq with fireworks to fix rate limit
|
1 سال پیش |
Jeff Tang
|
945175a2ea
l3 cleanup
|
1 سال پیش |
Jeff Tang
|
b585e1f211
L2 llm fix - use fireworks llama 3 to overcome the groq rate limit
|
1 سال پیش |
Jeff Tang
|
c87fb189f7
Building_Agentic_RAG_with_Llamaindex L2,3,4 and README
|
1 سال پیش |
Jeff Tang
|
7bb72efcc8
colab links fixed for dlai agents notebooks (#593)
|
1 سال پیش |
Jeff Tang
|
cc569ef52b
colab links fixed
|
1 سال پیش |