OpenStack Cloud provider init failure on new clusters v2.24.0 #350

Closed
anders-elastisys opened this issue Feb 16, 2024 · 0 comments · Fixed by #356
Labels
kind/bug Something isn't working
anders-elastisys commented Feb 16, 2024

Describe the bug
There seem to be issues when creating new v2.24.0 clusters on OpenStack clouds: the OpenStack cloud provider pods start and taint the nodes before CoreDNS can start, leaving the CoreDNS pods in a Pending state and causing the OpenStack pods to crash because they fail to resolve the OpenStack endpoint:

Cloud provider could not be initialized: could not init cloud provider "openstack": Post "https://<openstack-endpoint>": dial tcp: lookup <openstack-endpoint> on 10.233.0.3:53: write udp ...->10.233.0.3:53: write: operation not permitted

Related upstream Kubespray issue: kubernetes-sigs/kubespray#10914

To Reproduce
Steps to reproduce the behavior:

  1. On an OpenStack cloud, create a cluster with v2.24.0; Kubespray will finish without errors
  2. Check the kube-system namespace and see the OpenStack pods crashing with logs similar to the output above
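The crashing pods can be observed with kubectl; this is a sketch assuming kubectl access to the new cluster, and the `k8s-app=openstack-cloud-controller-manager` label selector is an assumption based on the upstream cloud-provider-openstack manifests:

```shell
# List pods in kube-system; the OpenStack cloud controller manager pods
# should show CrashLoopBackOff while coredns is stuck in Pending.
kubectl get pods -n kube-system

# Dump the last lines of the crashing pods' logs to confirm the
# DNS lookup failure shown above (label selector is an assumption).
kubectl logs -n kube-system \
  -l k8s-app=openstack-cloud-controller-manager --tail=20
```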

Expected behavior
Creating new clusters with Kubespray should work on all cloud providers.

Version (add all relevant versions):

  • Compliant Kubernetes Kubespray v2.24.0-ck8s1

Additional context

A workaround for now is to add tolerations to the coredns pods. E.g. create a file tolerations.yaml:

```yaml
# tolerations.yaml
spec:
  template:
    spec:
      tolerations:
      - effect: NoSchedule
        key: node.cloudprovider.kubernetes.io/uninitialized
        value: "true"
      - effect: NoSchedule
        key: node-role.kubernetes.io/control-plane
```

And patch coredns with the tolerations in the file:

```shell
kubectl patch deployment coredns -n kube-system --patch "$(cat tolerations.yaml)"
```

Once the OpenStack pods run without crashing, you can remove the node.cloudprovider.kubernetes.io/uninitialized taint.
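Removing the taint can be done with kubectl; a minimal sketch, where `<node-name>` is a placeholder for each affected node:

```shell
# Remove the uninitialized taint (NoSchedule effect) from a node once
# the OpenStack pods are healthy; the trailing "-" deletes the taint.
kubectl taint nodes <node-name> \
  node.cloudprovider.kubernetes.io/uninitialized:NoSchedule-
```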

@anders-elastisys anders-elastisys added the kind/bug Something isn't working label Feb 16, 2024
@davidumea davidumea self-assigned this Mar 7, 2024