|
@@ -1,30 +1,32 @@
|
|
|
-Dancing to the beat of a different drum.
|
|
|
+# Installing Omnia
|
|
|
|
|
|
-# Short Version:
|
|
|
+## TL;DR
|
|
|
|
|
|
+### Kubernetes
|
|
|
Install Kubernetes and all dependencies
|
|
|
```
|
|
|
-ansible-playbook -i host_inventory_file build-kubernetes-cluster.yml
|
|
|
+ansible-playbook -i host_inventory_file kubernetes/kubernetes.yml
|
|
|
```
|
|
|
|
|
|
-Initialize K8S cluster
|
|
|
+Initialize K8s cluster
|
|
|
```
|
|
|
-ansible-playbook -i host_inventory_file build-kubernetes-cluster.yml --tags "init"
|
|
|
+ansible-playbook -i host_inventory_file kubernetes/kubernetes.yml --tags "init"
|
|
|
```
|
|
|
-
|
|
|
-
|
|
|
-# What this does:
|
|
|
+### Slurm
|
|
|
+```
|
|
|
+ansible-playbook -i host_inventory_file slurm/slurm.yml
+```
|
|
|
|
|
|
## Build/Install
|
|
|
+### Kubernetes
|
|
|
|
|
|
-### Add additional repositories:
|
|
|
+#### Add additional repositories:
|
|
|
|
|
|
- Kubernetes (Google)
|
|
|
- ELRepo (NVIDIA drivers)
|
|
|
- NVIDIA (nvidia-docker)
|
|
|
- EPEL (Extra Packages for Enterprise Linux)
|
|
|
|
|
|
-### Install common packages
|
|
|
+#### Install common packages
|
|
|
- gcc
|
|
|
- python-pip
|
|
|
- docker
|
|
@@ -37,19 +39,19 @@ ansible-playbook -i host_inventory_file build-kubernetes-cluster.yml --tags "ini
|
|
|
- nvidia-container-runtime
|
|
|
- ksonnet (CLI framework for K8s configs)
|
|
|
|
|
|
-### Enable GPU Device Plugins (nvidia-container-runtime-hook)
|
|
|
+#### Enable GPU Device Plugins (nvidia-container-runtime-hook)
|
|
|
|
|
|
-### Modify kubeadm config to allow GPUs as schedulable resource
|
|
|
+#### Modify kubeadm config to allow GPUs as a schedulable resource
|
|
|
|
|
|
-### Start and enable services
|
|
|
+#### Start and enable services
|
|
|
- Docker
|
|
|
- Kubelet
|
|
|
|
|
|
-## Initialize Cluster
|
|
|
-### Head/master
|
|
|
+#### Initialize Cluster
|
|
|
+##### Head/master
|
|
|
- Start K8s and pass the startup token to compute/slaves
|
|
|
- Initialize networking (Currently using WeaveNet)
|
|
|
- Set up the K8s Dashboard
|
|
|
- Create dynamic/persistent volumes
|
|
|
-### Compute/slaves
|
|
|
+##### Compute/slaves
|
|
|
- Join the K8s cluster
|
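
The playbook runs listed in the TL;DR can be strung together in one small script. The following is a minimal sketch, not part of the Omnia repository: the inventory file name `example_inventory`, the group names `master` and `compute`, and the hostnames are all hypothetical placeholders, so check the `hosts:` entries in the playbooks for the group names they actually expect. By default the script only prints the commands; setting `RUN=1` executes them and assumes `ansible-playbook` is on the `PATH`.

```shell
#!/bin/sh
# Sketch only: the inventory name, group names, and hostnames are hypothetical
# placeholders, not values taken from the Omnia playbooks.
INVENTORY="${INVENTORY:-example_inventory}"

# Write a minimal INI-style Ansible inventory: one head node, two compute nodes.
cat > "$INVENTORY" <<'EOF'
[master]
head01.example.com

[compute]
compute01.example.com
compute02.example.com
EOF

# Print each command by default; execute it only when RUN=1 is set.
run() {
    if [ "${RUN:-0}" = "1" ]; then
        "$@"
    else
        echo "$*"
    fi
}

# Playbook runs in the order the TL;DR lists them.
run ansible-playbook -i "$INVENTORY" kubernetes/kubernetes.yml
run ansible-playbook -i "$INVENTORY" kubernetes/kubernetes.yml --tags "init"
run ansible-playbook -i "$INVENTORY" slurm/slurm.yml
```

Running the script as-is overwrites `example_inventory` in the current directory and prints the three commands, which makes it easy to review them before committing to a real run with `RUN=1`.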