radu/LLamaRecipes

Autor	SHA1 Nachricht	Datum
rahul-sarvam	47556ce0a6 Update recipes/multilingual/README.md	vor 1 Jahr
Matthias Reso	739483f262 Adjust test_grammar_datasets to stable sort	vor 1 Jahr
Matthias Reso	b96e435cda Adjust test_samsum_dataset to second model	vor 1 Jahr
Matthias Reso	fac41298b0 Adapt test_custom_dataset to new model	vor 1 Jahr
Matthias Reso	960014a3bb Fix test_custom_dataset by introducing a stable sort algorithm	vor 1 Jahr
Matthias Reso	b5583b31d5 Adapt test_grammar_dataset to new model	vor 1 Jahr
Matthias Reso	17a6d16289 Test batching for both llama versions	vor 1 Jahr
Kai Wu	7b1a9413d2 fixed a typo	vor 1 Jahr
Kai Wu	41434dc825 formatted and removed duplicated or unused function get_total_flops() and byte2mb()	vor 1 Jahr
Kai Wu	f2e80bae22 created a FlopMeasure class on top of FlopCounterMode instead of keep of copy of our own tflop_counter.py	vor 1 Jahr
Matthias Reso	a414ca6a57 Update chat format for llama3	vor 1 Jahr
Kai Wu	69e46887b4 handling incorrect profiling early stop caused by max_train_steps and add profiler.step() for each train step	vor 1 Jahr
Matthias Reso	113ea18bf1 Replace LlamaTokenizer with AutoTokenizer	vor 1 Jahr
Beto	5979dbe996 Merging local with remote	vor 1 Jahr
Kai Wu	34e0bf4c6e second draft of this feature, seems to be working now	vor 1 Jahr
Beto	d4cbfa1cc1 Merging upstream llama-recipes to current repo	vor 1 Jahr
Kai Wu	a35519ee90 fixed typo and handling unexpected exit	vor 1 Jahr
Kai Wu	2a5de9b448 first draft of flop counter feature	vor 1 Jahr
Hamid Shojanazeri	aaa9e2c863 Adding a feature that will stop the training/eval process after reaching some max_steps (#428)	vor 1 Jahr
Kai Wu	e6f69f84ad add max_steps_reached to reduce redundancy	vor 1 Jahr
rahul-sarvam	0efb8bd31e Update README.md	vor 1 Jahr
rahul-sarvam	687c2dc5d8 Update README.md	vor 1 Jahr
Rahul A R	2fa8e69b62 add new argument: tokenizer_name	vor 1 Jahr
Rahul A R	f8183b96fe use new tokenizer_name argument and resize embeddings if required	vor 1 Jahr
Rahul A R	1e4e3e00fc adding new multilingual recipe	vor 1 Jahr
Kai Wu	362cda0fa6 fixing test_gradient_accumulation and test_save_to_json	vor 1 Jahr
Kai Wu	fa0a389f74 add max_step feature for training and eval	vor 1 Jahr
Suraj Subramanian	201daff2d1 Add note on CUDA version + remove 'test' from pytorch whl url	vor 1 Jahr
Hamid Shojanazeri	37c8f72211 Update location and name of llm.py example notebook (#417)	vor 1 Jahr
Thomas Robinson	79266217ef Update location and name of llm.py example notebook	vor 1 Jahr

Neuer Älter

Commit Verlauf Finden

Commit Verlauf