You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Support the heterogeneous (different) Intel GPU cards in the same OCP cluster.
Detail
In the Scenario, When in the same cluster, different Intel GPU cards like Max-1100, Flex-140, and Flex-170 are provisioned. A mechanism should be provided for the users to pick up the proper GPU card they want to run the workloads on.
To align with the taints/tolerance mechanism from Red Hat OpenShift AI accelerator Profile, We will use the same taints/tolerance mechanism for this feature.
To properly label(taint) the nodes in the cluster automatically, we will rely on the NFD node tainting feature.
The feature is for the heterogeneous (different) Intel GPU cards in the same OCP cluster.
The different Intel dGPU cards in the same node are not supported.
The text was updated successfully, but these errors were encountered:
uMartinXu
changed the title
upport the Heterogenous(different type of) Intel dGPU product
Support the Heterogenous(different type of) Intel dGPU products in the same cluster
Mar 4, 2024
uMartinXu
changed the title
Support the Heterogenous(different type of) Intel dGPU products in the same cluster
Support the Heterogenous(different type of) Intel GPU cards in the same OCP cluster
Mar 4, 2024
Summary
Support the heterogeneous (different) Intel GPU cards in the same OCP cluster.
Detail
In the Scenario, When in the same cluster, different Intel GPU cards like Max-1100, Flex-140, and Flex-170 are provisioned. A mechanism should be provided for the users to pick up the proper GPU card they want to run the workloads on.
To align with the taints/tolerance mechanism from Red Hat OpenShift AI accelerator Profile, We will use the same taints/tolerance mechanism for this feature.
To properly label(taint) the nodes in the cluster automatically, we will rely on the NFD node tainting feature.
So this feature rely on issue openshift/cluster-nfd-operator#356
Note
The feature is for the heterogeneous (different) Intel GPU cards in the same OCP cluster.
The different Intel dGPU cards in the same node are not supported.
The text was updated successfully, but these errors were encountered: