
cluster-autoscaler: KubeSchedulerConfiguration plugin configuration PodTopologySpread #3879

Open
Ramyak opened this issue Feb 10, 2021 · 19 comments
Labels
area/cluster-autoscaler kind/feature Categorizes issue or PR as related to a new feature. lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness.

Comments

@Ramyak

Ramyak commented Feb 10, 2021

Which component are you using?:

component: cluster-autoscaler

Is your feature request designed to solve a problem? If so describe the problem this feature should solve.:

The scheduler supports PodTopologySpread cluster-level default constraints as of Kubernetes release 1.18 (commit).

1. PodTopologySpread defaultConstraints at the cluster level: Cluster-autoscaler does not take these cluster-level defaultConstraints into account.

apiVersion: kubescheduler.config.k8s.io/v1alpha2
kind: KubeSchedulerConfiguration
leaderElection:
  leaderElect: true
profiles:
  - pluginConfig:
      - name: PodTopologySpread
        args:
          defaultConstraints:
            - maxSkew: 1
              topologyKey: topology.kubernetes.io/zone
              whenUnsatisfiable: ScheduleAnyway
          defaultingType: List

Pods remain unscheduled and you get the following log message.
Note: the pod specs do not have topologySpreadConstraints in this case.

I0210 18:03:11.409934       1 filter_out_schedulable.go:118] Pod test-app-5b75d455c9-7gpf5 marked as unschedulable can be scheduled on node ip-172-21-145-192.ec2.internal (based on hinting). Ignoring in scale up.

2. PodTopologySpread set at the deployment level: works, because the pod spec then contains topologySpreadConstraints.

  topologySpreadConstraints:
  - labelSelector:
      matchLabels:
        app: some-app
        release-unixtime: "1611668612"
    maxSkew: 1
    topologyKey: topology.kubernetes.io/zone
    whenUnsatisfiable: DoNotSchedule

Describe the solution you'd like.:

Cluster-autoscaler should consider PodTopologySpread cluster-level default constraints during its attempts to schedule pods.

Describe any alternative solutions you've considered.:

Additional context.:

@Ramyak Ramyak added the kind/feature Categorizes issue or PR as related to a new feature. label Feb 10, 2021
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 11, 2021
@Ramyak
Author

Ramyak commented May 11, 2021

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 11, 2021
@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Aug 9, 2021
@Ramyak
Author

Ramyak commented Aug 9, 2021

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Aug 9, 2021
@k8s-triage-robot

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 14, 2021
@der-eismann

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 14, 2021
@k8s-triage-robot

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Mar 14, 2022
@lawliet89

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Mar 15, 2022
@k8s-triage-robot

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jun 13, 2022
@der-eismann

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jun 13, 2022
@k8s-triage-robot

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Sep 11, 2022
@k8s-triage-robot

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 10, 2022
@rohitagarwal003
Member

/remove-lifecycle stale
/lifecycle frozen

@k8s-ci-robot k8s-ci-robot added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Dec 18, 2022
@vadasambar
Member

vadasambar commented Mar 23, 2023

This is expected behavior. The ScheduleAnyway constraint is processed in the Scoring (priority function) part of the scheduling process, not in the Filtering (predicate) part.

CA only uses the Filtering part in its simulations (the PreFilter and Filter extension points, to be precise).

ScheduleAnyway is part of the scoring phase of the PodTopologySpread plugin, while DoNotSchedule is part of the filter/predicate phase.

As long as DoNotSchedule is used, CA should respect the constraint. One problem I see with the current implementation in CA is that we do not support custom default constraints; we use the built-in ones. If you specify a custom DoNotSchedule default constraint, CA might not respect it.
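
For illustration, a minimal sketch of the custom default constraint case described above: the same KubeSchedulerConfiguration from the issue description, with only whenUnsatisfiable changed to DoNotSchedule. Even though DoNotSchedule is evaluated in the filter phase, CA may still not honor it when it is defined only here, because CA does not load a custom KubeSchedulerConfiguration and uses the plugin's built-in defaults instead.

apiVersion: kubescheduler.config.k8s.io/v1alpha2
kind: KubeSchedulerConfiguration
profiles:
  - pluginConfig:
      - name: PodTopologySpread
        args:
          defaultConstraints:
            - maxSkew: 1
              topologyKey: topology.kubernetes.io/zone
              # Hard (filter-phase) constraint, but defined only in the scheduler's
              # own config, which CA's simulation does not read.
              whenUnsatisfiable: DoNotSchedule
          defaultingType: List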

@jdomag

jdomag commented Apr 24, 2023

@vadasambar
You said that CA respects the default cluster constraint, but also that CA doesn't support ScheduleAnyway.
However, the default cluster constraints use ScheduleAnyway according to the docs:

defaultConstraints:
  - maxSkew: 3
    topologyKey: "kubernetes.io/hostname"
    whenUnsatisfiable: ScheduleAnyway
  - maxSkew: 5
    topologyKey: "topology.kubernetes.io/zone"
    whenUnsatisfiable: ScheduleAnyway

Can you elaborate on this one, please?

@vadasambar
Member

vadasambar commented Apr 24, 2023

@jdomag I recently wrote a blog post on this (maybe it should be part of the docs) which might answer your question. Quoting the relevant part here:

CA imports the PreFilter and Filter parts of the default scheduler code, i.e., it doesn't allow making any changes to the default behavior. Because of this, CA's simulation of the scheduler won't accurately reflect the actual scheduler running in your cluster if your cluster/control-plane scheduler's behavior differs from CA's simulated scheduler. This creates problems because CA's autoscaling won't accurately match the needs of your cluster.
...

CA doesn't consider preferredDuringSchedulingIgnoredDuringExecution because it is part of the Scoring phase of the built-in NodeAffinity scheduler plugin. Every scheduler plugin can act on multiple extension points. NodeAffinity acts on extension points in both the Filtering and Scoring phases. The only problem is that it considers preferredDuringSchedulingIgnoredDuringExecution only in the Scoring phase (the PreScore and Score extension points, to be precise) and not in the Filtering phase.
...

Similarly, ScheduleAnyway is part of the scoring phase of the PodTopologySpread plugin.

https://vadasambar.com/post/kubernetes/would-ca-consider-my-soft-constraints/
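
To make the quoted distinction concrete, here is a hedged pod-spec sketch (the zone values and the disktype label are made-up examples, not from this issue). The required rule below is evaluated in the Filtering phase, so CA's simulation takes it into account; the preferred rule is evaluated only in the Scoring phase (PreScore/Score), so CA's simulation ignores it.

affinity:
  nodeAffinity:
    # Hard rule: Filter phase, respected by CA's simulation.
    requiredDuringSchedulingIgnoredDuringExecution:
      nodeSelectorTerms:
        - matchExpressions:
            - key: topology.kubernetes.io/zone
              operator: In
              values: ["us-east-1a", "us-east-1b"]
    # Soft rule: Scoring phase only, ignored by CA's simulation.
    preferredDuringSchedulingIgnoredDuringExecution:
      - weight: 100
        preference:
          matchExpressions:
            - key: disktype
              operator: In
              values: ["ssd"]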

@jdomag

jdomag commented Apr 25, 2023

@vadasambar
thanks, this is a great article, I wish it was part of the official docs :)

@vadasambar
Member

thanks, this is a great article, I wish it was part of the official docs :)

I will propose adding it to the docs at the upcoming SIG meeting (and thank you :))

@jan-kantert

jan-kantert commented Nov 21, 2024

This took us by surprise as well. In my opinion there should be a big red warning in the Kubernetes docs: https://kubernetes.io/docs/concepts/scheduling-eviction/topology-spread-constraints/#cluster-level-default-constraints. Currently this looks like a stable feature, but it can cripple your application if you are unlucky. We opened a PR to warn future users.
