2ndrun.JPG | a807fe3a90 | Swedish GPTBootcamp tutorials | 3 years ago
Alt_callout2terminals.JPG | a807fe3a90 | Swedish GPTBootcamp tutorials | 3 years ago
ColumnParallel.JPG | a807fe3a90 | Swedish GPTBootcamp tutorials | 3 years ago
ComputeEstimate.JPG | a807fe3a90 | Swedish GPTBootcamp tutorials | 3 years ago
Compute_Datasize_Parameters.JPG | a807fe3a90 | Swedish GPTBootcamp tutorials | 3 years ago
GPT3_all.png | a807fe3a90 | Swedish GPTBootcamp tutorials | 3 years ago
GPUs_utils_naive.JPG | d41c326e5f | update SuperPOD discreption and README | 3 years ago
MegatronGPTtimelines.JPG | a807fe3a90 | Swedish GPTBootcamp tutorials | 3 years ago
NVprofilingToolchain.JPG | a807fe3a90 | Swedish GPTBootcamp tutorials | 3 years ago
RowParallel.JPG | a807fe3a90 | Swedish GPTBootcamp tutorials | 3 years ago
Scale_numOfTokens_asModelgetsLarger.JPG | a807fe3a90 | Swedish GPTBootcamp tutorials | 3 years ago
TrainingTimeEstimate.JPG | a807fe3a90 | Swedish GPTBootcamp tutorials | 3 years ago
achieved_teraflops_per_gpu.JPG | a807fe3a90 | Swedish GPTBootcamp tutorials | 3 years ago
customize_preprocess_data_script.JPG | a807fe3a90 | Swedish GPTBootcamp tutorials | 3 years ago
data_loss_model_size_compute.JPG | 1367c593aa | implement feedbacks from reviewers | 3 years ago
gpus_utils_improved.JPG | d41c326e5f | update SuperPOD discreption and README | 3 years ago
modifyLSH_setuppy.JPG | 585e5c91bc | separate solution vs challenge | 3 years ago
multigpu_naive_run.jpg | a807fe3a90 | Swedish GPTBootcamp tutorials | 3 years ago
naive_run.JPG | a807fe3a90 | Swedish GPTBootcamp tutorials | 3 years ago
profiling_workflow.JPG | a807fe3a90 | Swedish GPTBootcamp tutorials | 3 years ago