fix #3190 only use nvidia-container-runtime on gpu nodes #3192

johnhofman · 2018-06-06T14:36:38Z

What this PR does / why we need it:
Allows deployment of a cluster with cpu and gpu agent pools with the nvidia-device-plugin enabled. Without this the docker daemon on the cpu nodes fails to start.

Which issue this PR fixes *
This fixes #3190 by only making the changes to /etc/docker/daemon.json on gpu enabled nodes using the same template switch which enables/disables the installation of the GPU drivers.

acs-bot · 2018-06-06T14:36:40Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
To fully approve this pull request, please assign additional approvers.
We suggest the following additional approver: jackfrancis

Assign the PR to them by writing /assign @jackfrancis in a comment when ready.

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

sozercan · 2018-06-06T18:17:19Z

Thanks @johnhofman!

#3181 also addresses this issue and adds toleration for device plugin so it doesn't get scheduled on CPU nodes. I'll close this in favor of #3181 unless you see anything else missing?

fix Azure#3190 only use nvidia-container-runtime on gpu nodes

4a3df6f

acs-bot added the size/XS label Jun 6, 2018

sozercan closed this Jun 6, 2018

zzh8829 mentioned this pull request Jun 26, 2018

add GPU ExtendedResourceToleration admission controller support #3181

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix #3190 only use nvidia-container-runtime on gpu nodes #3192

fix #3190 only use nvidia-container-runtime on gpu nodes #3192

johnhofman commented Jun 6, 2018

acs-bot commented Jun 6, 2018

sozercan commented Jun 6, 2018

fix #3190 only use nvidia-container-runtime on gpu nodes #3192

fix #3190 only use nvidia-container-runtime on gpu nodes #3192

Conversation

johnhofman commented Jun 6, 2018

acs-bot commented Jun 6, 2018

sozercan commented Jun 6, 2018