|  lchu | c19c5c69aa
							
							fix fsdp construction on low_cpu_fsdp | 2 vuotta sitten | 
				
					
						|  lchu | e216c6f1f3
							
							address #87 | 2 vuotta sitten | 
				
					
						|  lchu | 895dfcea30
							
							add nightly check for using low_cpu_fsdp mode | 2 vuotta sitten | 
				
					
						|  lchu | 1e64fc98d9
							
							switch to simpler param_init_fn and meta device init | 2 vuotta sitten | 
				
					
						|  lchu | 101391f46a
							
							Revert "replace init_empty_weights with torch.device(meta)" | 2 vuotta sitten | 
				
					
						|  lchu | c8d4f38d23
							
							replace init_empty_weights with torch.device(meta) | 2 vuotta sitten | 
				
					
						|  lchu | d8a81bb531
							
							save cpu mem by leveraging FSDP rank0 broadcasting | 2 vuotta sitten | 
				
					
						|  Geeta Chauhan | 1387b76e11
							
							fixing the full state path in checkpoint handler+loss report calculation (#51) | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | 88d3e1febc
							
							fix the save_train_param condition | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | b56028c98d
							
							fixing the word list/spell check | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | 62be60355a
							
							resolving conflicts | 2 vuotta sitten | 
				
					
						|  Geeta Chauhan | 174b856591
							
							update README: python 3.9 rec + fix formatting (#63) | 2 vuotta sitten | 
				
					
						|  Geeta Chauhan | 0cd5694a14
							
							Fsdp inference checkpoints (#39) | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | c4e96af6ee
							
							clean up | 2 vuotta sitten | 
				
					
						|  Christian Miller | 7c1884c690
							
							recommend python 3.9 | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | 7d2e06821e
							
							fixing the path to script | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | 5f97db8f0c
							
							fix spell check word list | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | 017cadd04b
							
							Merge branch 'checkpoint_handler_path_fix' of https://github.com/facebookresearch/llama-recipes into checkpoint_handler_path_fix | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | 4f70348b94
							
							remove the redundant lr step | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | 9c95ed4bbe
							
							clean up | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | 311a5c1eec
							
							add notes for train_param.yaml | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | 5b916114eb
							
							merge main branch | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | 668c364f6b
							
							add rank to save_train_params | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | 231c9e7da9
							
							adding train_param.yaml saving for fsdp checkpoint loading for inference | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | 475e67b4ec
							
							clean up | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | 50e9d17045
							
							add the default option for find the HF model_name/path from train_param.yaml | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | 41dd7ff1cb
							
							Merge branch 'main' into checkpoint_handler_path_fix | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | 31d6ce8bf6
							
							adding expnadable sgement and dist debug flag info | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | a955ed1999
							
							added checks for dist barrier and commented cuda exapnadable segements and dist_dbug | 2 vuotta sitten | 
				
					
						|  Hamid Shojanazeri | a2403c7c1a
							
							clean up | 2 vuotta sitten |