Fixed a bug where hostname topology launched extra nodes #866

ellistarn · 2021-11-27T19:12:23Z

1. Issue, if available:

2. Description of changes:
This bug is caused by two things interacting: the provisioner controller compares the state of the current provisioner to the persisted provisioner state, and the hostname topology logic mutates that current state. Because the scheduling logic mutates the provisioner's requirements, the comparison inadvertently forces a refresh of the provisioner while nodes are being launched. The refresh drains the provisioner, which can happen after capacity has been launched but before the pods are bound. The result is multiple scale-outs (and eventual self-healing / scale-in).

The bug is due to me violating my own principles about not mutating state: https://github.com/aws/karpenter/blob/5d5798b5fefc757ef353889204c56138d8042066/pkg/controllers/provisioning/scheduling/topology.go#L99. In the long term, this mutation will be removed as part of a scheduling refactor.
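To illustrate the failure mode described above, here is a minimal, self-contained sketch. It is not Karpenter's code: `Provisioner`, `Requirement`, `needsRefresh`, `scheduleMutating`, and `scheduleCopying` are hypothetical stand-ins, and the copy-before-mutate approach shown is one common way to avoid this class of bug, not necessarily the change made in this PR.

```go
package main

import (
	"fmt"
	"reflect"
)

// Requirement and Provisioner are simplified, hypothetical types for
// illustration only; they are not Karpenter's real API types.
type Requirement struct {
	Key    string
	Values []string
}

type Provisioner struct {
	Name         string
	Requirements []Requirement
}

// DeepCopy returns a copy that shares no slices with the original.
func (p *Provisioner) DeepCopy() *Provisioner {
	out := &Provisioner{Name: p.Name}
	for _, r := range p.Requirements {
		values := make([]string, len(r.Values))
		copy(values, r.Values)
		out.Requirements = append(out.Requirements, Requirement{Key: r.Key, Values: values})
	}
	return out
}

// needsRefresh mimics a controller check that compares the running
// provisioner against the persisted one (assumed behavior).
func needsRefresh(running, persisted *Provisioner) bool {
	return !reflect.DeepEqual(running, persisted)
}

// injectHostname mimics a topology step that adds a hostname requirement in place.
func injectHostname(p *Provisioner, hostname string) {
	p.Requirements = append(p.Requirements,
		Requirement{Key: "kubernetes.io/hostname", Values: []string{hostname}})
}

// scheduleMutating changes the shared running state: the problematic pattern.
func scheduleMutating(running *Provisioner) {
	injectHostname(running, "node-a")
}

// scheduleCopying works on a private copy and leaves the running state untouched.
func scheduleCopying(running *Provisioner) *Provisioner {
	local := running.DeepCopy()
	injectHostname(local, "node-a")
	return local
}

func main() {
	persisted := &Provisioner{
		Name:         "default",
		Requirements: []Requirement{{Key: "topology.kubernetes.io/zone", Values: []string{"us-west-2a"}}},
	}

	running := persisted.DeepCopy()
	scheduleMutating(running)
	fmt.Println("refresh after in-place mutation:", needsRefresh(running, persisted)) // true

	running = persisted.DeepCopy()
	_ = scheduleCopying(running)
	fmt.Println("refresh after copy-then-mutate:", needsRefresh(running, persisted)) // false
}
```

In the first case the comparison sees a difference that was never persisted, which in the scenario above would trigger a spurious refresh mid-launch. In the second case the scheduling step gets the hostname requirement it needs without disturbing the state the controller compares.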

3. Does this change impact docs?

Yes, PR includes docs updates
Yes, issue opened: link to issue
No

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

netlify · 2021-11-27T19:12:29Z

✔️ Deploy Preview for karpenter-docs-prod ready!

🔨 Explore the source changes: 709562c

🔍 Inspect the deploy log: https://app.netlify.com/sites/karpenter-docs-prod/deploys/61a28319b520000007b1b528

😎 Browse the preview: https://deploy-preview-866--karpenter-docs-prod.netlify.app

JacobGabrielson

/lgtm

Fixed a bug where hostname topology caused extra nodes to be launched

709562c

ellistarn changed the title ~~Fixed a bug where hostname topology caused extra nodes to be launched~~ Fixed a bug where hostname topology launched extra nodes Nov 27, 2021

JacobGabrielson approved these changes Nov 28, 2021

JacobGabrielson merged commit cbdaa40 into aws:main Nov 28, 2021

ellistarn deleted the hostnametopology branch November 28, 2021 21:11
