Historique des commits

Auteur SHA1 Message Date
  Matthias Reso a3fd369127 Ref from infernce recipes to vllm for 405B il y a 1 an
  Matthias Reso a8f2267324 Added multi node doc to multigpu_finetuning.md il y a 1 an
  Matthias Reso afb3b75892 Add 405B + QLoRA + FSDP to multi_gpu.md doc il y a 1 an
  Matthias Reso 939c88fb04 Add 405B + QLoRA ref to LLM finetung il y a 1 an
  Matthias Reso d2fd9c163a Added doc for multi-node vllm inference il y a 1 an
  Matthias Reso c9ae014459 Enable pipeline parallelism through use of AsyncLLMEngine in vllm inferecen + enable use of lora adapter il y a 1 an
  Matthias Reso 0920b1a415 Fix quantization for inference il y a 1 an
  Matthias Reso b36830fdf6 Fix reading in stdin for chat_completion, remove padding as we're feeding single samples il y a 1 an
  Matthias Reso f0aa8e31ca Update url il y a 1 an
  Matthias Reso 9db61e5235 Refactored infeence to allow multiple requests through gradio il y a 1 an
  Thomas Robinson fd9f52f710 Modify prompt_format_utils with changes necessary for Llama Guard 3 (#1) il y a 1 an
  Cyrus Nikolaidis 0c57646481 Prompt Guard Tutorial il y a 1 an
  Hamid Shojanazeri 808a3f7a0c Adding support for FSDP+Qlora. (#572) il y a 1 an
  Jeff Tang ba447971f0 Port of DLAI LlamaIndex Agent short course lessons 2-4 to use Llama 3 (#594) il y a 1 an
  Jeff Tang 935ad46a0d wordlist update for DLAI LlamaIndex Agent short course il y a 1 an
  Jeff Tang af8838463e added lesson summary in each notebook and README il y a 1 an
  Jeff Tang aaeba04bd6 README update il y a 1 an
  Jeff Tang 353ceaae74 fix of cell order issue for L3 il y a 1 an
  dongwang218 ed3136f117 Update hf weight conversion script to llama 3 (#551) il y a 1 an
  Kai Wu f6617fb86a changed --pure_bf16 to --fsdp_config.pure_bf16 and corrected "examples/" path (#587) il y a 1 an
  Jeff Tang 2e4ea5b728 cell cleanup il y a 1 an
  Jeff Tang 0fef52e846 README links fixed il y a 1 an
  Jeff Tang ebbf362576 L4 - replace groq with fireworks to fix rate limit il y a 1 an
  Jeff Tang 945175a2ea l3 cleanup il y a 1 an
  Jeff Tang b585e1f211 L2 llm fix - use fireworks llama 3 to overcome the groq rate limit il y a 1 an
  Jeff Tang c87fb189f7 Building_Agentic_RAG_with_Llamaindex L2,3,4 and README il y a 1 an
  Jeff Tang 7bb72efcc8 colab links fixed for dlai agents notebooks (#593) il y a 1 an
  Jeff Tang cc569ef52b colab links fixed il y a 1 an
  Jeff Tang 89cb5d0a8f dlai_agentic_rag all lesson notebooks il y a 1 an
  Jeff Tang 43b7754b2c 4 notebooks ported from 4 DLAI agent short courses using Llama 3 (#560) il y a 1 an