John Lockman bf2e13a042 Issue#259: adding new nodes using mapping file 3 years ago
PyTorch b80557f027 adding cpu example in pytorch to close #62 4 years ago
TensorRT-InferenceServer 4355f72404 Update README.md 4 years ago
README.md 44fce1ea2b removed all instances of `master` from scripts and playbooks 4 years ago
host_inventory_file 28cdd7f8e4 adding new roles dir 3 years ago
host_inventory_file.ini 28cdd7f8e4 adding new roles dir 3 years ago
k8s-TensorFlow-resnet50-multinode-MPIOperator.yaml c599545fbf adding k8s and slurm submission examples 4 years ago
k8s-tensorflow-nvidia-ngc-resnet50-multinode-mpioperator.yaml 6e54ff6045 resolves issue #77 4 years ago
mapping_file.csv bf2e13a042 Issue#259: adding new nodes using mapping file 3 years ago
slurm-TensorFlow-resnet50-multinode-MPI.batch c599545fbf adding k8s and slurm submission examples 4 years ago

README.md

Examples

The Kubernetes (K8s) and Slurm submission examples are provided for running the resnet50 benchmark with TensorFlow on 8 GPUs across 2 C4140s.

Submitting the example

K8s

kubectl create -f k8s-TensorFlow-resnet50-multinode-MPIOperator.yaml
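
After submitting, it can help to confirm that the job and its pods were actually created. The following is a minimal sketch, not part of the repository: the `submit_and_list` function name is hypothetical, and the `kubectl get mpijobs` call assumes the manifest creates an `MPIJob` resource and that the MPI Operator CRD is installed on the cluster (suggested by the file name, but not confirmed here).

```shell
# Hypothetical helper (not part of this repository): submit a manifest,
# then list the resources it is expected to create.
submit_and_list() {
  # Fail early with a clear message if kubectl is not installed.
  if ! command -v kubectl >/dev/null 2>&1; then
    echo "kubectl not found on PATH" >&2
    return 1
  fi

  # Same command as above, with the manifest passed as the first argument.
  kubectl create -f "$1" || return 1

  # Assumption: the manifest defines an MPIJob (MPI Operator CRD installed).
  kubectl get mpijobs

  # Launcher and worker pods should appear here once they are scheduled.
  kubectl get pods -o wide
}
```

It would be called as `submit_and_list k8s-TensorFlow-resnet50-multinode-MPIOperator.yaml`; `kubectl logs <pod-name>` can then be used on the pod of interest to follow the benchmark output.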

Slurm

sbatch slurm-TensorFlow-resnet50-multinode-MPI.batch
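
To follow the job after submission, the job ID printed by `sbatch` can be captured and passed to the usual Slurm status commands. The sketch below is an illustration only: `job_id_from_sbatch` is a hypothetical helper name, and the log file name assumes the batch script does not override Slurm's default `slurm-<jobid>.out` output file.

```shell
# Hypothetical helper: pull the job ID out of sbatch's confirmation line,
# which has the form "Submitted batch job <id>".
job_id_from_sbatch() {
  awk '/^Submitted batch job/ { print $4 }'
}

# Usage on a cluster (commented out so the snippet is safe to source anywhere):
# jobid=$(sbatch slurm-TensorFlow-resnet50-multinode-MPI.batch | job_id_from_sbatch)
# squeue -j "$jobid"             # queue state of this job
# scontrol show job "$jobid"     # allocated nodes and requested resources
# tail -f "slurm-${jobid}.out"   # default log name, unless the script sets --output
```

Alternatively, `sbatch --parsable` prints the job ID directly (followed by `;` and the cluster name on multi-cluster setups), which avoids parsing the message text.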