Commit History

Author SHA1 Message Date
  rahul-sarvam 47556ce0a6 Update recipes/multilingual/README.md 1 year ago
  Matthias Reso 739483f262 Adjust test_grammar_datasets to stable sort 1 year ago
  Matthias Reso b96e435cda Adjust test_samsum_dataset to second model 1 year ago
  Matthias Reso fac41298b0 Adapt test_custom_dataset to new model 1 year ago
  Matthias Reso 960014a3bb Fix test_custom_dataset by introducing a stable sort algorithm 1 year ago
  Matthias Reso b5583b31d5 Adapt test_grammar_dataset to new model 1 year ago
  Matthias Reso 17a6d16289 Test batching for both llama versions 1 year ago
  Kai Wu 7b1a9413d2 fixed a typo 1 year ago
  Kai Wu 41434dc825 formatted and removed duplicated or unused function get_total_flops() and byte2mb() 1 year ago
  Kai Wu f2e80bae22 created a FlopMeasure class on top of FlopCounterMode instead of keep of copy of our own tflop_counter.py 1 year ago
  Matthias Reso a414ca6a57 Update chat format for llama3 1 year ago
  Kai Wu 69e46887b4 handling incorrect profiling early stop caused by max_train_steps and add profiler.step() for each train step 1 year ago
  Matthias Reso 113ea18bf1 Replace LlamaTokenizer with AutoTokenizer 1 year ago
  Beto 5979dbe996 Merging local with remote 1 year ago
  Kai Wu 34e0bf4c6e second draft of this feature, seems to be working now 1 year ago
  Beto d4cbfa1cc1 Merging upstream llama-recipes to current repo 1 year ago
  Kai Wu a35519ee90 fixed typo and handling unexpected exit 1 year ago
  Kai Wu 2a5de9b448 first draft of flop counter feature 1 year ago
  Hamid Shojanazeri aaa9e2c863 Adding a feature that will stop the training/eval process after reaching some max_steps (#428) 1 year ago
  Kai Wu e6f69f84ad add max_steps_reached to reduce redundancy 1 year ago
  rahul-sarvam 0efb8bd31e Update README.md 1 year ago
  rahul-sarvam 687c2dc5d8 Update README.md 1 year ago
  Rahul A R 2fa8e69b62 add new argument: tokenizer_name 1 year ago
  Rahul A R f8183b96fe use new tokenizer_name argument and resize embeddings if required 1 year ago
  Rahul A R 1e4e3e00fc adding new multilingual recipe 1 year ago
  Kai Wu 362cda0fa6 fixing test_gradient_accumulation and test_save_to_json 1 year ago
  Kai Wu fa0a389f74 add max_step feature for training and eval 1 year ago
  Suraj Subramanian 201daff2d1 Add note on CUDA version + remove 'test' from pytorch whl url 1 year ago
  Hamid Shojanazeri 37c8f72211 Update location and name of llm.py example notebook (#417) 1 year ago
  Thomas Robinson 79266217ef Update location and name of llm.py example notebook 1 year ago