Per nodegroup scale-down config #3789
Conversation
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: MaciekPytel. The full list of commands accepted by this bot can be found here. The pull request process is described here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing /approve in a comment.
/hold
Force-pushed from b8257de to bca80ad
/hold cancel
MaxEmptyBulkDelete int
// NodeGroupAutoscalingOptions contain various options to customize how autoscaling of
// a given NodeGroup works. Different options can be used for each NodeGroup.
type NodeGroupAutoscalingOptions struct {
I may have a dumb question, but why only these 4 scale-down parameters? Are ScaleDownDelayAfterAdd, ScaleDownDelayAfterDelete and so on not applicable to this new configuration method?
Those 4 values were the low-hanging fruit - they required relatively small changes in the code and they were by far the most commonly requested values to control per nodegroup.
I think ultimately it makes sense to have ScaleDownDelayAfter* as part of NodeGroupAutoscalingOptions, but it is much more tricky to do (details below) and I didn't want to block this PR on it. I'd rather take an iterative approach of gradually moving more options to NodeGroupAutoscalingOptions in separate PRs (though I don't want to make any promises on when I will be able to actually implement it for ScaleDownDelay flags; I'm also happy to just review if someone else feels like taking a shot at it).
Details:
ScaleDownDelayAfter* are currently implemented as a global check that results in completely skipping the scale-down logic, not as a per-node check like the 4 values in the new config. Having them per-nodegroup would require completely re-implementing them and figuring out some non-obvious questions about semantics: how should a per-nodegroup ScaleDownDelayAfterDelete work? It obviously blocks one nodegroup from scaling down, but should it be triggered only when a node from that nodegroup is deleted, or when any node in the cluster is deleted? The same goes for ScaleDownDelayAfterAdd. It seems to me that depending on the use-case you may want either behavior: block on any node if your pods can schedule on multiple nodegroups and you just want CA to pick the best one, or block only on that nodegroup if you have a dedicated nodegroup per workload and each pod can generally only go to one nodegroup.
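For reference, a rough sketch of what the per-NodeGroup options struct with those 4 values might look like - a sketch only, the field names are assumed to mirror the corresponding global flags, and the authoritative definition is in the cluster-autoscaler config package:

package config

import "time"

// NodeGroupAutoscalingOptions contain various options to customize how autoscaling of
// a given NodeGroup works. Different options can be used for each NodeGroup.
type NodeGroupAutoscalingOptions struct {
	// ScaleDownUtilizationThreshold is the utilization below which a node is considered for scale down.
	ScaleDownUtilizationThreshold float64
	// ScaleDownGpuUtilizationThreshold is the GPU utilization below which a GPU node is considered for scale down.
	ScaleDownGpuUtilizationThreshold float64
	// ScaleDownUnneededTime is how long a node has to be unneeded before it is eligible for scale down.
	ScaleDownUnneededTime time.Duration
	// ScaleDownUnreadyTime is how long an unready node has to be unneeded before it is eligible for scale down.
	ScaleDownUnreadyTime time.Duration
}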
OK I see, it's rather clear to me that CA should first be updated with the "simple" modifications.
As for ScaleDownDelayAfter*, I understand the impact, the amount of source code changes behind it, and the semantics questions. I may have a "dumb" solution, but what about adding a ScaleDownAfterBehaviour option with a global or local effect, or something like that?
Anyway, thanks for the hard work. I really appreciate the effort to add some per node-group config so quickly; it will really boost our autoscaling configuration for our managed solution 🚀
I don't think it's a dumb solution. I was considering a pretty much equivalent solution - keep the current scale-down-delay-after-* flags and add scale-down-delay-after-node-group-add/delete/etc (the name could use some work).
This gives maximum flexibility at the cost of adding even more complexity for the user. I guess it may not be a problem if you have a hosted solution, but configuring CA is already pretty hard, partly due to the huge number of flags we have. Still, it may be the best solution.
Whatever solution we decide on I'd rather do it in a separate PR though. I think this one is complex enough as is.
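To make that alternative concrete, something along these lines - a hypothetical sketch only, the new flag name is just the one floated above and nothing here is implemented or decided:

package main

import (
	"flag"
	"time"
)

var (
	// Existing global flag: after any scale-up, scale-down evaluation is paused for the whole cluster.
	scaleDownDelayAfterAdd = flag.Duration("scale-down-delay-after-add", 10*time.Minute,
		"How long after scale up that scale down evaluation resumes")

	// Hypothetical per-nodegroup variant: only the nodegroup that was scaled up would be blocked from scaling down.
	scaleDownDelayAfterNodeGroupAdd = flag.Duration("scale-down-delay-after-node-group-add", 0,
		"Hypothetical flag: how long a nodegroup is blocked from scale down after a node is added to it")
)

func main() {
	flag.Parse()
	_ = scaleDownDelayAfterAdd
	_ = scaleDownDelayAfterNodeGroupAdd
}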
Whatever solution we decide on I'd rather do it in a separate PR though. I think this one is complex enough as is.
And I agree :)
@towca Addressed your comment. I put the changes in a separate commit, since the fixes apply to different commits and amending them all would get tricky.
Thanks for addressing the comments! LGTM, but could you take a look at the second part of the remaining comment? If it's ok, feel free to remove the hold. /lgtm
This is the first step of implementing kubernetes#3583 (comment). A new method was added to the cloudprovider interface. All existing providers were updated with a no-op stub implementation that will result in no behavior change. The config values specified per NodeGroup are not yet applied.
This is the implementation of kubernetes#3583 (comment).
Force-pushed from f05df85 to c8e527b
/lgtm
Force-pushed from c8e527b to 65b3c8d
/lgtm
This PR implements #3583 (comment). It allows NodeGroups to override ScaleDownUnneededTime, ScaleDownUnreadyTime and utilization thresholds for nodes in that NodeGroup.
A new GetOptions method is added to the cloudprovider interface. Its implementation is optional: returning ErrNotImplemented will result in the current behavior of using the global config for all NodeGroups. A stub implementation returning ErrNotImplemented was added to all providers.
Note that this PR is only a framework for implementing the actual feature - there is no implementation for how the user would specify the options. Any provider is free to implement GetOptions (and the way to configure it) as they like.
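For illustration, a no-op stub along these lines is all a provider needs in order to keep the current behavior - a sketch, assuming the GetOptions signature shown below; see the cloudprovider interface change in this PR for the exact definition:

package exampleprovider

import (
	"k8s.io/autoscaler/cluster-autoscaler/cloudprovider"
	"k8s.io/autoscaler/cluster-autoscaler/config"
)

// exampleNodeGroup stands in for a provider's NodeGroup implementation; only the new method is shown here.
type exampleNodeGroup struct{}

// GetOptions returns the NodeGroupAutoscalingOptions to use for this NodeGroup.
// Returning cloudprovider.ErrNotImplemented keeps the current behavior of using the global config.
func (ng *exampleNodeGroup) GetOptions(defaults config.NodeGroupAutoscalingOptions) (*config.NodeGroupAutoscalingOptions, error) {
	return nil, cloudprovider.ErrNotImplemented
}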