-
Notifications
You must be signed in to change notification settings - Fork 430
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Upgrade from 1.20 to 1.21 is failing #1568
Comments
Initial investigation details: Nodes are staying NotReady because the CNI pods are failing to initialize. Kubelet logs show:
This is due to kubeadm defaulting to systemd cgroups starting in 1.21. The 1.21 image used by the new node was built by image builder and has containderd setup for systemd cgroups (kubernetes-sigs/image-builder#471) but because KCP still has version 1.20, the kubelet config is old and doesn't have https://github.com/kubernetes-sigs/cluster-api/pull/4236/files. Confirmed this only happens if worker nodes are upgraded before the control plane, which is not recommended. |
Also confirmed I cannot repro this for 1.19 -> 1.20, only 1.20 -> 1.21
|
kubernetes-sigs/cluster-api#4896 tracks changing the tests to not upgrade worker nodes before control plane |
/close this only affects 1.20 -> 1.21 upgrades where the worker nodes get upgraded before the control plane, which is against k8s upgrade recommendations (https://kubernetes.io/docs/tasks/administer-cluster/cluster-upgrade/) |
@CecileRobertMichon: Closing this issue. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/kind bug
What steps did you take and what happened:
[A clear and concise description of what the bug is.]
I was able to repro locally. New upgraded nodes are not becoming ready.
#1557 (comment)
CAPI e2e job has been broken https://testgrid.k8s.io/sig-cluster-lifecycle-cluster-api-provider-azure#capz-periodic-capi-e2e-main for the past week because it is trying to upgrade from 1.19 to 1.21 which is not allowed.
What did you expect to happen:
Anything else you would like to add:
[Miscellaneous information that will assist in solving the issue.]
Environment:
kubectl version
):/etc/os-release
):The text was updated successfully, but these errors were encountered: