radu/LLamaRecipes

Auteur	SHA1 Message	Date
Matthias Reso	a3fd369127 Ref from infernce recipes to vllm for 405B	il y a 1 an
Matthias Reso	a8f2267324 Added multi node doc to multigpu_finetuning.md	il y a 1 an
Matthias Reso	afb3b75892 Add 405B + QLoRA + FSDP to multi_gpu.md doc	il y a 1 an
Matthias Reso	939c88fb04 Add 405B + QLoRA ref to LLM finetung	il y a 1 an
Matthias Reso	d2fd9c163a Added doc for multi-node vllm inference	il y a 1 an
Matthias Reso	c9ae014459 Enable pipeline parallelism through use of AsyncLLMEngine in vllm inferecen + enable use of lora adapter	il y a 1 an
Matthias Reso	0920b1a415 Fix quantization for inference	il y a 1 an
Matthias Reso	b36830fdf6 Fix reading in stdin for chat_completion, remove padding as we're feeding single samples	il y a 1 an
Matthias Reso	f0aa8e31ca Update url	il y a 1 an
Matthias Reso	9db61e5235 Refactored infeence to allow multiple requests through gradio	il y a 1 an
Thomas Robinson	fd9f52f710 Modify prompt_format_utils with changes necessary for Llama Guard 3 (#1)	il y a 1 an
Cyrus Nikolaidis	0c57646481 Prompt Guard Tutorial	il y a 1 an
Hamid Shojanazeri	808a3f7a0c Adding support for FSDP+Qlora. (#572)	il y a 1 an
Jeff Tang	ba447971f0 Port of DLAI LlamaIndex Agent short course lessons 2-4 to use Llama 3 (#594)	il y a 1 an
Jeff Tang	935ad46a0d wordlist update for DLAI LlamaIndex Agent short course	il y a 1 an
Jeff Tang	af8838463e added lesson summary in each notebook and README	il y a 1 an
Jeff Tang	aaeba04bd6 README update	il y a 1 an
Jeff Tang	353ceaae74 fix of cell order issue for L3	il y a 1 an
dongwang218	ed3136f117 Update hf weight conversion script to llama 3 (#551)	il y a 1 an
Kai Wu	f6617fb86a changed --pure_bf16 to --fsdp_config.pure_bf16 and corrected "examples/" path (#587)	il y a 1 an
Jeff Tang	2e4ea5b728 cell cleanup	il y a 1 an
Jeff Tang	0fef52e846 README links fixed	il y a 1 an
Jeff Tang	ebbf362576 L4 - replace groq with fireworks to fix rate limit	il y a 1 an
Jeff Tang	945175a2ea l3 cleanup	il y a 1 an
Jeff Tang	b585e1f211 L2 llm fix - use fireworks llama 3 to overcome the groq rate limit	il y a 1 an
Jeff Tang	c87fb189f7 Building_Agentic_RAG_with_Llamaindex L2,3,4 and README	il y a 1 an
Jeff Tang	7bb72efcc8 colab links fixed for dlai agents notebooks (#593)	il y a 1 an
Jeff Tang	cc569ef52b colab links fixed	il y a 1 an
Jeff Tang	89cb5d0a8f dlai_agentic_rag all lesson notebooks	il y a 1 an
Jeff Tang	43b7754b2c 4 notebooks ported from 4 DLAI agent short courses using Llama 3 (#560)	il y a 1 an

Récemment Précédemment

Historique des commits Trouver

Historique des commits