NodeInfo processor to refine template-based NodeInfos #3761

bpineau · 2020-12-14T10:35:13Z

Comparing synthetic NodeInfos obtained from nodegroup's TemplateInfo() to NodeInfos obtained from real-world nodes is bound to fail, even with kube reservations provided through nodegroups labels/annotations (for instance: kernel mem reservation is hard to predict, in particular on AWS instances).

This makes balance-similar-node-groups likely to misbehave when scale-up-from-zero is enabled (and a first nodegroup gets a real node), for instance.

Following Maciek suggestion (from discussions on a previous attempt at solving this), we can implement a NodeInfo Processor that would improve template-generated NodeInfos whenever a node was created off a similar nodegroup.

We're storing node's virtual origin through machineid, which works fine but is a bit ugly (suggestions welcome).

Tested this solves balance-similar-node-groups + scale-up-from-zero, with various instance types on AWS and GCP.

Previous attempts to solve that issue/discussions:

k8s-ci-robot · 2020-12-14T10:35:42Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: bpineau
To complete the pull request process, please assign feiskyer after the PR has been reviewed.
You can assign the PR to them by writing /assign @feiskyer in a comment when ready.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

cluster-autoscaler/OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

MaciekPytel · 2020-12-21T11:38:32Z

A general comment - this strongly assumes that two NodeGroups that are "similar" according to NodeGroupSet processor are identical for scheduling purposes (except for location related labels). I think this is likely to be true in most cases, but it's still a pretty significant assumption.
If the NodeGroups in question are not identical this assumption can lead to incorrect scale-up decision (since the template used in binpacking would differ from actual node). This makes me think that enabling the processor should probably be controlled via a flag. WDYT?

cluster-autoscaler/processors/nodeinfos/refine_node_infos_processor.go

umialpha

IIUC, this PR tries to find real or similar real nodeinfo as much as possible. Could we try the following routine.

If it is a real nodeinfo, update nodecache.
Else,
- If nodecache has cached real nodeinfo, use cached nodeinfo.
- Else, use the most similar real nodeinfo and update the cache.

cluster-autoscaler/processors/nodeinfos/refine_node_infos_processor.go

bpineau · 2020-12-23T17:32:59Z

@MaciekPytel agreed, added a (default false) flag.

umialpha

Sorry for misunderstanding. I list my understanding as follows. Please point out if anything wrong.

This PR tries to do,
If a NodeInfo is a template, find the most "similar" RealNode to replace it. But instead of comparing template node with real nodes, it compare template node with template nodes. Right?

IIUC, if this logic only influences "BalanceSimilarNodeGroups", would it be better to make it closer to "BalanceSimilarNodeGroups" logic, e.g. put this logic into processors.NodeGroupSetProcessor.FindSimilarNodeGroups(context, bestOption.NodeGroup, nodeInfos)

This logic takes place at the very beginning of RunOnce logic. As @MaciekPytel said,

this strongly assumes that two NodeGroups that are "similar" according to NodeGroupSet processor are identical for scheduling purposes

If the assumption is incorrect, it will lead to incorrect scale-up decision.

WDYT? Also @MaciekPytel .

bpineau · 2020-12-24T11:20:12Z

@umialpha Yes, that PR logic description is correct. We're following suggestions from previous attempts at solving this (in particular #2892 (comment) and #3608 (comment) ).

You're right about the assumption, hence the new flag to be enabled when using scale-up-from-zero (otherwise we already have real nodes and this change is no-op) and where nodeGroups having similar nodeInfos templates are similar nodeGroups.

Having more accurately evaluated nodeGroups capacities (the intent of this PR) is useful in other situations than just keeping nodegroups balanced with FindSimilarNodeGroups during upscale; for instance when evaluating options to re-pack undersubscribed nodes.

MaciekPytel · 2020-12-29T17:50:23Z

Agreed, this goes beyond just balancing. The fact that TemplateNodeInfo generally doesn't get resources exactly right is a problem that can have more impact that not matching NodeGroups for balancing purposes. Consider an example where TemplateNodeInfo overestimates memory by a little bit and a pod that won't really fit on the node would look like it fits - in this case CA would trigger an incorrect scale-up. NodeGroupSetProcessor doesn't really apply to this situation.

Using a template from a real node is generally more likely to be correct and avoid problems like this if you are sure the new node will really be the same (except for location-related stuff). As per my previous comment I don't think we should make this assumption for the users, but if you happen to know that it's true in your env what this PR does makes a lot of sense to me. As such I'm ok with adding this as an opt-in feature (note: I haven't done a detailed review, just talking about general approach).

bpineau · 2021-01-18T15:30:49Z

@mwielgus (and/or @MaciekPytel ) mind taking an other look? it's updated according to the comments

cluster-autoscaler/main.go

cluster-autoscaler/core/utils/utils.go

cluster-autoscaler/processors/nodeinfos/refine_node_infos_processor.go

MaciekPytel · 2021-02-17T14:16:25Z

Sorry for delay, I left a few minor comments. Overall, looks good, they're all pretty minor comments.

cluster-autoscaler/main.go

bpineau · 2021-02-26T09:37:50Z

Thanks for the review @MaciekPytel ! Updated the PR accordingly, please take an other look

Comparing synthetic NodeInfos obtained from nodegroup's TemplateInfo() to NodeInfos obtained from real-world nodes is bound to fail, even with kube reservations provided through nodegroups labels/annotations (for instance: kernel mem reservation is hard to predict). This makes `balance-similar-node-groups` likely to misbehave when `scale-up-from-zero` is enabled (and a first nodegroup gets a real node), for instance. Following [Maciek Pytel suggestion](kubernetes#3608 (comment)) (from discussions on a previous attempt at solving this), we can implement a NodeInfo Processor that would improve template-generated NodeInfos whenever a node was created off a similar nodegroup. We're storing node's virtual origin through machineid, which works fine but is a bit ugly (suggestions welcome). Tested this solves balance-similar-node-groups + scale-up-from-zero, with various instance types on AWS and GCP. Previous attempts to solve that issue/discussions: * kubernetes#2892 (comment) * kubernetes#3608 (comment)

MaciekPytel · 2021-08-16T11:55:40Z

@bpineau I suspect the motivation for #4191 is to replace this with a version based on a new interface. Is that correct? Or do you still want to include this one?

bpineau · 2021-08-16T12:27:12Z

@MaciekPytel you guessed right; I think we can close this one

k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Dec 14, 2020

bpineau changed the title ~~NodeInfo processor to Refine synthetic NodeInfos~~ NodeInfo processor to refine template-based NodeInfos Dec 14, 2020

k8s-ci-robot requested review from aleksandra-malinowska and Jeffwan December 14, 2020 10:35

bpineau mentioned this pull request Dec 14, 2020

Fix BalanceSimilarNodeGroups when scaling from zero #3608

Closed

bpineau force-pushed the refine-nodeinfo-processor branch 2 times, most recently from 068bcf8 to 060e78d Compare December 14, 2020 11:47

umialpha reviewed Dec 22, 2020

View reviewed changes

cluster-autoscaler/processors/nodeinfos/refine_node_infos_processor.go Outdated Show resolved Hide resolved

mwielgus suggested changes Dec 22, 2020

View reviewed changes

cluster-autoscaler/processors/nodeinfos/refine_node_infos_processor.go Outdated Show resolved Hide resolved

umialpha reviewed Dec 23, 2020

View reviewed changes

cluster-autoscaler/processors/nodeinfos/refine_node_infos_processor.go Show resolved Hide resolved

bpineau force-pushed the refine-nodeinfo-processor branch from 060e78d to 7b92a87 Compare December 23, 2020 13:30

umialpha reviewed Dec 24, 2020

View reviewed changes

k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jan 14, 2021

bpineau force-pushed the refine-nodeinfo-processor branch from 7b92a87 to 449879e Compare January 18, 2021 15:10

k8s-ci-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jan 18, 2021

k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jan 21, 2021

bpineau force-pushed the refine-nodeinfo-processor branch from 449879e to 32d07a0 Compare January 25, 2021 11:32

k8s-ci-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Jan 25, 2021

MaciekPytel mentioned this pull request Jan 25, 2021

Flag for exclusive usage of template infos (with warnings) #3609

Closed

MaciekPytel reviewed Feb 17, 2021

View reviewed changes

cluster-autoscaler/main.go Outdated Show resolved Hide resolved

MaciekPytel reviewed Feb 17, 2021

View reviewed changes

cluster-autoscaler/core/utils/utils.go Outdated Show resolved Hide resolved

MaciekPytel reviewed Feb 17, 2021

View reviewed changes

cluster-autoscaler/processors/nodeinfos/refine_node_infos_processor.go Outdated Show resolved Hide resolved

MaciekPytel reviewed Feb 17, 2021

View reviewed changes

cluster-autoscaler/main.go Outdated Show resolved Hide resolved

bpineau force-pushed the refine-nodeinfo-processor branch from 32d07a0 to 5451133 Compare February 26, 2021 09:29

bpineau mentioned this pull request Apr 8, 2021

NodeInfo Processor for exclusive usage of template infos #4000

Closed

k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Apr 9, 2021

bpineau force-pushed the refine-nodeinfo-processor branch from 5451133 to a153405 Compare April 19, 2021 13:40

k8s-ci-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Apr 19, 2021

jbartosik added the area/cluster-autoscaler label Apr 23, 2021

k8s-ci-robot added needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. and removed needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. labels Aug 2, 2021

mwielgus closed this Aug 16, 2021

bpineau mentioned this pull request Mar 7, 2023

scaling from 0 only scales one of the ASG's detected in spite of balance-similar-node-groups #5352

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NodeInfo processor to refine template-based NodeInfos #3761

NodeInfo processor to refine template-based NodeInfos #3761

bpineau commented Dec 14, 2020 •

edited

Loading

k8s-ci-robot commented Dec 14, 2020

MaciekPytel commented Dec 21, 2020

umialpha left a comment

bpineau commented Dec 23, 2020

umialpha left a comment •

edited

Loading

bpineau commented Dec 24, 2020

MaciekPytel commented Dec 29, 2020

bpineau commented Jan 18, 2021

MaciekPytel commented Feb 17, 2021

bpineau commented Feb 26, 2021

MaciekPytel commented Aug 16, 2021

bpineau commented Aug 16, 2021

NodeInfo processor to refine template-based NodeInfos #3761

NodeInfo processor to refine template-based NodeInfos #3761

Conversation

bpineau commented Dec 14, 2020 • edited Loading

k8s-ci-robot commented Dec 14, 2020

MaciekPytel commented Dec 21, 2020

umialpha left a comment

Choose a reason for hiding this comment

bpineau commented Dec 23, 2020

umialpha left a comment • edited Loading

Choose a reason for hiding this comment

bpineau commented Dec 24, 2020

MaciekPytel commented Dec 29, 2020

bpineau commented Jan 18, 2021

MaciekPytel commented Feb 17, 2021

bpineau commented Feb 26, 2021

MaciekPytel commented Aug 16, 2021

bpineau commented Aug 16, 2021

bpineau commented Dec 14, 2020 •

edited

Loading

umialpha left a comment •

edited

Loading