In your HPC cluster, connect the Mellanox InfiniBand switches using the Fat-Tree topology. In the fat-tree topology, switches in layer 1 are connected through the switches in the upper layer, i.e., layer 2. And, all the compute nodes in the cluster, such as PowerEdge servers and PowerVault storage devices, are connected to switches in layer 1. With this topology in place, we ensure that a 1x1 communication path is established between the compute nodes. For more information on the fat-tree topology, see Designing an HPC cluster with Mellanox infiniband-solutions.
Omnia uses the server-based Subnet Manager (SM). SM runs in a Kubernetes namespace on the control plane. To enable the SM, Omnia configures the required parameters in the opensm.conf
file. Based on the requirement, the parameters can be edited.
Note: Install the InfiniBand hardware drivers by running the below command:
yum groupinstall "Infiniband Support" -y
(For Rocky)
Before running infiniband.yml
, ensure that HTTP and JSON Gateway are enabled on your switch. This can be verifed by running:
show web
(To check if HTTP is enabled)
show json-gw
(To check if JSON Gateway is enabled)
In case either service has been disabled, run:
web http enable
(To enable the HTTP gateway)
json-gw enable
(To enable the JSON gateway)
When connecting to a new or factory reset switch, the configuration wizard requests to execute an initial configuration:
Note: When initializing a factory reset switch, the user needs to ensure DHCP is enabled and an IPv6 address is not assigned. Omnia will assign an IP address to the Infiniband switch using DHCP with all other configurations.
Enter all relevant parameters for configuring your switches in the following files per the provided Input Parameter Guides.:
kubectl get svc -n awx
.kubectl get secret awx-admin-password -n awx -o jsonpath="{.data.password}" | base64 --decode
.http://<IP>:8052
, where IP is the awx-ui IP address and 8052 is the awx-ui port number. Log in to the AWX UI using the username as admin
and the retrieved password.