
Add retries for kubeadm join / UpdateStatus #2092

Closed
fabriziopandini opened this issue Mar 30, 2020 · 6 comments · Fixed by kubernetes/kubernetes#91952
Assignees
Labels
kind/bug Categorizes issue or PR as related to a bug. kind/feature Categorizes issue or PR as related to a new feature. priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now.
Milestone

Comments

@fabriziopandini
Member

Is this a BUG REPORT or FEATURE REQUEST?

BUG REPORT

Versions

kubeadm version: v1.17.*

What happened?

While executing Cluster API tests, kubeadm join failures were observed in some cases when updating the kubeadm-config config map

xref kubernetes-sigs/cluster-api#2769

What you expected to happen?

Update status should be made more resilient by adding a retry loop to this operation

How to reproduce it (as minimally and precisely as possible)?

This error happens only sometimes, most probably when there is a temporary blackout of the load balancer that sits in front of the API servers (e.g. haproxy reloading its configuration).
The error might also happen when a new API server enters the load-balancing pool but the underlying etcd member is not yet available, due to a slow network or slow I/O delaying etcd coming online or, in some cases, a change of the etcd leader.

Anything else we need to know?

Important: if possible the change should be kept as small as possible and backported

@neolit123 neolit123 added this to the v1.19 milestone Mar 30, 2020
@neolit123 neolit123 added kind/bug Categorizes issue or PR as related to a bug. kind/feature Categorizes issue or PR as related to a new feature. priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now. labels Mar 30, 2020
@rosti

rosti commented Apr 2, 2020

During the update status phase, we do the following 3 API calls:

  1. Create or mutate the kubeadm-config config map. This is hooked to a 20-step exponential backoff that totals ~10s of wait time.
  2. Node role is created. No retries are performed.
  3. Node role binding is created. Again, no retries are performed.

The first question here is, should we consider a timeout for the whole phase or per API call?
What timeouts do we envision here?

Having too big a timeout on a per-operation basis might frustrate end users. Having too short a timeout will cause failures of the kind seen by the Cluster API folks.

@neolit123
Member

neolit123 commented Apr 2, 2020

the ticket in CAPI proposed that CAPI should provide some metrics in terms of retries, yet this level of granularity will be hard for them to scope.

The first question here is, should we consider a timeout for the whole phase or per API call?

my vote goes for per-api call.

What timeouts do we envision here?

there is no sane answer for this.

a common exponential backoff capping at around 40 sec makes sense to me for general API calls.

BTW, at this point we seem to be applying a number of different backoffs, timeouts, and retry mechanics in different places, which is increasing the tech debt in kubeadm.

@fabriziopandini
Member Author

I have no strong opinions about applying retries per call or per phase; considering the need for backporting, I'm +1 for the simplest solution during this iteration

FYI, the current approach in clusterctl is to have retry loops for small groups of API calls (not for a single call), and everything is standardized around three backoff configurations:

  1. for groups of operations with at least one write (~40s)
  2. for groups of operations with only reads (~15s)
  3. specific for the connection to the API server (~15s)

https://github.com/kubernetes-sigs/cluster-api/blob/9fe8ad47e130758564d976dde7757d3edbba8c88/cmd/clusterctl/client/cluster/client.go#L206-L265

There are also special timeouts that apply to critical steps of the process (similar to the wait for the API server or the wait for TLS bootstrap in kubeadm)

@xlgao-zju

I'd like to help with this.
/assign

@neolit123
Member

@xlgao-zju hi, code freeze for 1.19 is June 25th.
would you be able to send a PR before that?

@xlgao-zju

would you be able to send a PR before that?

I will send the PR before June 15th, so that reviewers will have enough time to review it.
