Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] make new-pod-scale-up-delay configurable in autoscaler profile #1716

Closed
zhiweiv opened this issue Jul 8, 2020 · 8 comments
Assignees
Labels

Comments

@zhiweiv
Copy link

zhiweiv commented Jul 8, 2020

Per https://github.com/kubernetes/autoscaler/blob/master/cluster-autoscaler/FAQ.md

By default, scale-up is considered up to 10 seconds after pod is marked as unschedulable, and scale-down 10 minutes after a node becomes unneeded. There are multiple flags which can be used to configure these thresholds. For example, in some environments, you may wish to give the k8s scheduler a bit more time to schedule a pod than the CA's scan-interval. One way to do this is by setting --new-pod-scale-up-delay, which causes the CA to ignore unschedulable pods until they are a certain "age", regardless of the scan-interval. If k8s has not scheduled them by the end of that delay, then they may be considered by the CA for a possible scale-up.

That is exactly what we need for job pod pools. It is good to be able configure new-pod-scale-up-delay in AKS autoscaler profile.

@zhiweiv
Copy link
Author

zhiweiv commented Jul 12, 2020

@marwanad
Does AKS plan to implement this in near future? Or just added it to backlog?

@marwanad
Copy link

@zhiweiv yup it will be added soon. I'll follow-up with an update.

@github-actions
Copy link

Action required from @Azure/aks-pm

@TomGeske TomGeske added Needs Attention 👋 Issues needs attention/assignee/owner and removed action-required labels Jul 24, 2020
@TomGeske
Copy link

Current ETA for deployment is end of September 2020.

@TomGeske TomGeske added feature-request Requested Features and removed Needs Attention 👋 Issues needs attention/assignee/owner labels Jul 24, 2020
@zhiweiv
Copy link
Author

zhiweiv commented Sep 16, 2020

Any update?

@marwanad
Copy link

@zhiweiv This should be available with the September API. CLI support + Docs slated for end of month. Note that the acceptable time unit is seconds so valid values are: 10s, 360s, etc.

@marwanad
Copy link

This should be now available in CLI 2.12.0, CLI-extensions 0.4.63 and via ARM templates.

Feel free to report back on this thread if you hit issues.

@zhiweiv
Copy link
Author

zhiweiv commented Sep 24, 2020

It should work now, I can see newPodScaleUpDelay in autoScalerProfile after update.

Will https://docs.microsoft.com/en-us/azure/aks/cluster-autoscaler#using-the-autoscaler-profile be updated to reflect this change?

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
Development

No branches or pull requests

3 participants