radu/LLamaRecipes

Author	SHA1 Message	Date
Matthias Reso	739483f262 Adjust test_grammar_datasets to stable sort	1 year ago
Matthias Reso	b96e435cda Adjust test_samsum_dataset to second model	1 year ago
Matthias Reso	fac41298b0 Adapt test_custom_dataset to new model	1 year ago
Matthias Reso	960014a3bb Fix test_custom_dataset by introducing a stable sort algorithm	1 year ago
Matthias Reso	b5583b31d5 Adapt test_grammar_dataset to new model	1 year ago
Matthias Reso	17a6d16289 Test batching for both llama versions	1 year ago
Matthias Reso	a414ca6a57 Update chat format for llama3	1 year ago
Matthias Reso	113ea18bf1 Replace LlamaTokenizer with AutoTokenizer	1 year ago
Hamid Shojanazeri	aaa9e2c863 Adding a feature that will stop the training/eval process after reaching some max_steps (#428)	1 year ago
Kai Wu	e6f69f84ad add max_steps_reached to reduce redundancy	1 year ago
Kai Wu	362cda0fa6 fixing test_gradient_accumulation and test_save_to_json	1 year ago
Kai Wu	fa0a389f74 add max_step feature for training and eval	1 year ago
Hamid Shojanazeri	37c8f72211 Update location and name of llm.py example notebook (#417)	1 year ago
Thomas Robinson	79266217ef Update location and name of llm.py example notebook	1 year ago
Hamid Shojanazeri	f7aa02af9f only save training params on rank 0 (#415)	1 year ago
jpgard	6954b16b3b only save training params on rank 0	1 year ago
Hamid Shojanazeri	64e189914f update due to peft new release (#407)	1 year ago
Hamid Shojanazeri	11f51db28c adding the kbit prep in the code	1 year ago
Hamid Shojanazeri	f058ff6ccd update due to peft new release	1 year ago
Hamid Shojanazeri	6a7478a6aa Reorg inference throughput folder structure (#404)	1 year ago
Chester Hu	367e4869ac Reorg inference throughput folder structure	1 year ago
Hamid Shojanazeri	d6eb83f6c5 Add llm class so that externally-hosted models can be called (#398)	1 year ago
Thomas Robinson	0346d0d5b8 Add documentation and examples	1 year ago
Hamid Shojanazeri	43a1e5cdb0 Fix dead links after directory structure refactor (#397)	1 year ago
Suraj Subramanian	e2a35420c0 Remove octoai link that is 401-ing	1 year ago
Suraj Subramanian	12602f32e2 Merge branch 'main' into subramen-patch-deadlinks	1 year ago
Hamid Shojanazeri	c8f4bdac41 Adding open in colab option for notebook (#395)	1 year ago
Thomas Robinson	81984a9a44 Remove unnecessary spec format	1 year ago
Suraj Subramanian	f53f17138b fix dead links after refactor	1 year ago
Thomas Robinson	eee39a7463 Add llm.py class in order to call remotely hosted models	1 year ago
