Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Azure cluster-autoscaler e2e test #6989

Merged
merged 1 commit into from
Jun 28, 2024

Conversation

nojnhuh
Copy link
Contributor

@nojnhuh nojnhuh commented Jun 28, 2024

What type of PR is this?

/kind cleanup

What this PR does / why we need it:

This PR adds the first real functional test of the Azure provider for cluster-autoscaler. The test creates a bunch of Pods, waits for the cluster to scale up, deletes the pods, and waits for the cluster to scale back down.

Which issue(s) this PR fixes:

Fixes #

Special notes for your reviewer:

Does this PR introduce a user-facing change?

NONE

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:


@k8s-ci-robot k8s-ci-robot added kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels Jun 28, 2024
@k8s-ci-robot k8s-ci-robot requested a review from nilo19 June 28, 2024 22:03
@k8s-ci-robot k8s-ci-robot requested a review from tallaxes June 28, 2024 22:03
@k8s-ci-robot k8s-ci-robot added the area/provider/azure Issues or PRs related to azure provider label Jun 28, 2024
@@ -46,8 +46,15 @@ install-e2e: $(HELM)
--set image.repository=$(IMAGE)-$(GOARCH) \
--set image.tag=$(TAG) \
--set image.pullPolicy=Always \
--set extraArgs.scale-down-delay-after-add=10s \
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Basically what I was aiming for with this new config was "react as fast as possible" so any feedback on how to best achieve that is most welcome.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sgtm

)

BeforeEach(func() {
Eventually(allVMSSStable, "10m", "30s").Should(Succeed())
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

All of these timeouts are kind of arbitrary, so I'm totally open to tweaking these.

@@ -162,11 +162,6 @@ spec:
mode: System
owner:
name: ${CLUSTER_NAME}
tags:
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we try to isolate things like kube-system Pods from the side effects of these tests? That's what I was trying to achieve here by removing the autoscaler tags from this System pool but I can still see things like coredns sometimes scheduling onto User pools.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not sure about the connection here, but I think at this phase of the game this is good enough, and we're in good shape to raply make tweaks to cover these types of things that make testing cluster-autoscaler tricky.

@nojnhuh
Copy link
Contributor Author

nojnhuh commented Jun 28, 2024

/assign @jackfrancis @willie-yao

@jackfrancis
Copy link
Contributor

this is a great start!

/lgtm
/approve

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Jun 28, 2024
@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: jackfrancis, nojnhuh

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jun 28, 2024
@k8s-ci-robot k8s-ci-robot merged commit 069aab7 into kubernetes:master Jun 28, 2024
7 checks passed
@nojnhuh nojnhuh deleted the azure-e2e-test branch June 28, 2024 23:33
@willie-yao
Copy link
Contributor

Sorry I was late to this, lgtm!!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. area/cluster-autoscaler area/provider/azure Issues or PRs related to azure provider cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. lgtm "Looks good to me", indicates that a PR is ready to be merged. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants