-
Notifications
You must be signed in to change notification settings - Fork 980
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Karpenter 1.0.2 fails to remove stale nodeclaims leftover from 0.37.3 upgrade #6981
Comments
Worked around this issue via #6847 (comment) |
Can you share the NodeClaim that existed prior to the conversion as well as the config for the CRD and the deployment? This looks like it should have been defaulted by the conversion webhook so I'm surprised that this converted over without the |
I have more explanation here #6847 (comment)
The CRDs were from version 1.0.2 of karpenter-crd chart installed into Relevant ec2nodeclass
|
I suspect that they didn't have the group because you changed versions without enabling the conversion webhooks. This is going to cause fields to get dropped and the apiserver doesn't know how to hydrate the data. Regardless, I suspect that this fix is going to solve your problem because we will no longer be relying on the spec to be configured correctly for the Patch to succeed. Allowing you to get unblocked from this issue. Regardless, I wouldn't recommend going through the path without the conversion webhooks because you may end up in an undefined state. |
Willing to try installing a snapshot to validate if this fixes the issue? Here's the command for installing the snapshot version of Karpenter with the patch fix
|
We plan to go from v0.37.3 to v1.0.2 with webhook enabled. Is this issue resolved? Or we should wait for a newer patch version in v1.0 |
I have the issue on |
This issue has been inactive for 14 days. StaleBot will close this stale issue after 14 more days of inactivity. |
Description
Observed Behavior:
After upgrade to 1.0.2 following upgrade path https://karpenter.sh/v1.0/upgrading/v1-migration/#upgrade-procedure
The webhook didn't properly convert the nodeclaims
as a result there are a number of claims that are failing:
These nodeclaims cannot be removed by force because karpenter 1.0.2 fails to delete them due to the aforementioned error.
These nodeclaims cannot be edited to remove the finalizer because they are immutable.
Expected Behavior:
Old nodeclaims left over from 0.37.3 properly cleaned up and replaced by new nodeclaims.
Reproduction Steps (Please include YAML):
Follow the upgrade path, see #6847 (comment)
Versions:
kubectl version
): 1.29+ eksThe text was updated successfully, but these errors were encountered: