-
Notifications
You must be signed in to change notification settings - Fork 430
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix managed cluster diff so aks added fields are omitted #1800
fix managed cluster diff so aks added fields are omitted #1800
Conversation
@devigned: GitHub didn't allow me to request PR reviews from the following users: richardchen331. Note that only kubernetes-sigs members and repo collaborators can review this PR, and authors cannot review their own PRs. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/test pull-cluster-api-provider-azure-e2e-exp |
345fdb6
to
980431f
Compare
/test pull-cluster-api-provider-azure-e2e-exp |
980431f
to
e205c14
Compare
/test pull-cluster-api-provider-azure-e2e-exp |
e205c14
to
66fa35c
Compare
/test pull-cluster-api-provider-azure-e2e-exp |
98fb4b6
to
bf10ce4
Compare
/test pull-cluster-api-provider-azure-e2e-exp |
9731287
to
b71c3e3
Compare
/hold Still investigating. |
/test pull-cluster-api-provider-azure-e2e-exp |
b71c3e3
to
429ec98
Compare
/test pull-cluster-api-provider-azure-e2e-exp |
3b8e8dd
to
fab59e5
Compare
fc72529
to
68e0cd0
Compare
/test pull-cluster-api-provider-azure-e2e-exp |
2 similar comments
/test pull-cluster-api-provider-azure-e2e-exp |
/test pull-cluster-api-provider-azure-e2e-exp |
Update: After debugging with @CecileRobertMichon and @jackfrancis we have determined the VMSSs created by the AKS cluster exist in the nodeResourceGroup, but the API is responding with an empty list. We are determining the root cause of this unexpected API response. /hold |
Is this only with all the type of nodepools ? or just user nodepools ? |
I don't think it has anything to do with the type of node pools. We have reached out to msft teams responsible for this API, so I should have a definitive answer soon. |
@devigned given that we have identified the source of test failures are we able to temporarily reduce regional coverage to those that are more likely to overcome these errors, and then move the changes in this PR forward? In other words, these changes are not exclusively intended to "fix" test failures, right? I think we should remove the various debug changes that were added to help debug, restore this PR to its state of intended change, and retest the "exp" job until we get a passing test. (Like, let's not block these changes on Azure platform errors that are largely out of our control.) |
34d783f
to
6189796
Compare
@devigned: The following test failed, say
Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
/test pull-cluster-api-provider-azure-e2e-exp |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm
@devigned is this ready or did you want to run a couple more retests? |
/hold cancel |
actually /hold I'll let @devigned remove the hold when we're good from a testing standpoint |
/test pull-cluster-api-provider-azure-e2e-exp give it one more just for good measure. |
/lgtm |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: CecileRobertMichon The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
What type of PR is this?
/kind bug
What this PR does / why we need it:
When an Azure Managed Cluster reconciles, will get into a never ending update loop trying to update the managed cluster due to the spec not matching the AKS computed fields for SKU and LoadBalancer related fields.
This PR does the following:
I don't particularly like this solution as it is brittle in the face of new computed defaults from the AKS side. I would much rather see a more robust solution where both the spec generated managed cluster and the fetched managed cluster are normalized, then diffed to ensure we are only comparing the properties that should trigger an update.
/cc @richardchen331 @alexeldeib
Please confirm that if this PR changes any image versions, then that's the sole change this PR makes.
TODOs:
Release note: