zenodia 1367c593aa implement feedbacks from reviewers 3 years ago
2ndrun.JPG a807fe3a90 Swedish GPTBootcamp tutorials 3 years ago
Alt_callout2terminals.JPG a807fe3a90 Swedish GPTBootcamp tutorials 3 years ago
ColumnParallel.JPG a807fe3a90 Swedish GPTBootcamp tutorials 3 years ago
ComputeEstimate.JPG a807fe3a90 Swedish GPTBootcamp tutorials 3 years ago
Compute_Datasize_Parameters.JPG a807fe3a90 Swedish GPTBootcamp tutorials 3 years ago
GPT3_all.png a807fe3a90 Swedish GPTBootcamp tutorials 3 years ago
GPUs_utils_naive.JPG d41c326e5f update SuperPOD discreption and README 3 years ago
MegatronGPTtimelines.JPG a807fe3a90 Swedish GPTBootcamp tutorials 3 years ago
NVprofilingToolchain.JPG a807fe3a90 Swedish GPTBootcamp tutorials 3 years ago
RowParallel.JPG a807fe3a90 Swedish GPTBootcamp tutorials 3 years ago
Scale_numOfTokens_asModelgetsLarger.JPG a807fe3a90 Swedish GPTBootcamp tutorials 3 years ago
TrainingTimeEstimate.JPG a807fe3a90 Swedish GPTBootcamp tutorials 3 years ago
achieved_teraflops_per_gpu.JPG a807fe3a90 Swedish GPTBootcamp tutorials 3 years ago
customize_preprocess_data_script.JPG a807fe3a90 Swedish GPTBootcamp tutorials 3 years ago
data_loss_model_size_compute.JPG 1367c593aa implement feedbacks from reviewers 3 years ago
gpus_utils_improved.JPG d41c326e5f update SuperPOD discreption and README 3 years ago
modifyLSH_setuppy.JPG 585e5c91bc separate solution vs challenge 3 years ago
multigpu_naive_run.jpg a807fe3a90 Swedish GPTBootcamp tutorials 3 years ago
naive_run.JPG a807fe3a90 Swedish GPTBootcamp tutorials 3 years ago
profiling_workflow.JPG a807fe3a90 Swedish GPTBootcamp tutorials 3 years ago