intel/intel-technology-enabling-for-openshift

NFD Feature Rules for GPU type aren't deployed

Closed this issue · 3 comments

Per https://github.com/intel/intel-device-plugins-for-kubernetes/blob/main/cmd/gpu_plugin/labels.md ,

Nodes can label the cards that are available using these rules: https://github.com/intel/intel-device-plugins-for-kubernetes/blob/main/deployments/nfd/overlays/node-feature-rules/platform-labeling-rules.yaml

This should probably be installed by default since this is via the OpenShift Operator deployment; otherwise admins have to create the rules themselves.

Here's a version that works, without the "extendedResources" tag

platform-labeling-rules.yaml.txt

@brgavino Thanks for your suggestion. :-)

To align with RH OpenShift AI Dashboard Accelerator Profile, the taints/tolerance mechanism will be used by us to support the Heterogenous(different type of) Intel dGPU product.

We are working on a related solution for the user.

And as cluster administrators, they can also pick the other way like the labelling you mentioned and use them with the nodeselect.

duplcated with #216