This repository has been archived by the owner on Oct 16, 2020. It is now read-only.

locksmith: CoreOS autoupdate & Kubernetes node drain (klocksmith) #1274

Closed
skinny opened this issue May 10, 2016 · 16 comments

Comments

@skinny

skinny commented May 10, 2016

When running certain multi-pod applications (a Redis cluster in our case), sometimes during a CoreOS update run (installing & rebooting every node) the majority or all of the pods (3 in our example) end up on one physical machine. When that machine is rebooted, the Redis cluster is lost and (for now) requires manual intervention to get back up.

I learned that Kubernetes 1.2 introduced node drain functionality; this would be a great feature to use before rebooting a Kubernetes-enabled CoreOS node.

Are there any plans for implementing this kind of behaviour (relocating all the pods before a reboot), or does anyone know another way of avoiding this kind of scenario?

Mark

@philips philips changed the title CoreOS autoupdate & Kubernetes node drain locksmith: CoreOS autoupdate & Kubernetes node drain May 10, 2016
@philips

philips commented May 10, 2016

Hey Mark! I have been wanting to write a design doc on this. Here is a first draft: https://docs.google.com/document/d/1DHiB2UDBYRU6QSa2e9mCNla1qBivZDqYjBVn_DvzDWc/edit#

@skinny
Author

skinny commented May 10, 2016

Hi, thanks for the quick response!

I read your draft and I'm wondering about the need for the second "update-manager" pod. Wouldn't that introduce more issues, for example when that pod is scheduled on the node that needs rebooting?
_scrap that_

Also, on Loop 1, step 2: did you mean to tag no more than N nodes with the ok-to-reboot tag?

Mark

PS if you are still in Berlin, maybe we can have a quick chat?

@philips

philips commented May 10, 2016

@skinny Happy to chat. I am in Berlin until Saturday.

@philips

philips commented Jun 25, 2016

@skinny Still interested in working on this?

@chrissnell

+1 for this!

@yogurtnaturalny

Hi, this idea is very cool: locksmith would tell Kubernetes to evacuate the containers before an update. Something like: CoreOS tells etcd "I want to restart", etcd says "OK, hold on, I will inform Kubernetes", etcd tells Kubernetes "node 8 wants to restart, mark it as unschedulable and reschedule/restart its containers", Kubernetes answers "done", and etcd tells CoreOS "restart". Then, when CoreOS reports that the node is back in the cluster, etcd removes the unschedulable mark from the node.
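A rough sketch of how that handshake could map onto existing kubectl verbs (the node name below is an illustrative assumption, and the reboot step itself is left to whatever holds the reboot lock today):

```python
import subprocess

NODE = "node-8"  # illustrative node name, matching the example above

def kubectl(*args):
    """Run a kubectl command and fail loudly if it errors."""
    subprocess.run(["kubectl", *args], check=True)

# "mark as not for deploy": stop new pods from being scheduled onto the node
kubectl("cordon", NODE)

# "rollupdate/restart containers": evict the running pods so their
# controllers recreate them on other nodes
kubectl("drain", NODE, "--ignore-daemonsets", "--force")

# ... the reboot itself happens here (e.g. locksmith takes the reboot lock) ...

# "unmark not schedule": once the node reports back, allow scheduling again
kubectl("uncordon", NODE)
```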

@yogurtnaturalny

yogurtnaturalny commented Sep 11, 2016

This would help not only CoreOS with locksmith, but also other distros to schedule updates on their infrastructure (VPS and bare metal) and decrease downtime for services, etc.

@philips

philips commented Sep 28, 2016

We should probably not try to use the existing locksmith codebase and instead call this "klocksmith" or something. The deployment method (containers), backend (Kubernetes), etc. are all completely different here.

@philips philips changed the title locksmith: CoreOS autoupdate & Kubernetes node drain locksmith: CoreOS autoupdate & Kubernetes node drain (klocksmith) Sep 28, 2016
@snarlysodboxer

snarlysodboxer commented Oct 22, 2016

@philips I read through your doc and it looks like a great idea. I'll throw this thought out here just in case. Forgive me if I'm missing part of the picture.

What about just modifying locksmith itself to support preStop hooks? It could optionally run a command or httpGet a URL, blocking the reboot signal until that command or URL returns?

The command could obviously then be anybody's custom anything, and for the case of K8s, the command could be a simple bash script which runs kubectl drain <node name> and loops until Non-terminated Pods == 0, or something more sophisticated that hits the API directly.
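A minimal sketch of that script, in Python rather than bash for readability; the node name, poll interval, and the choice to skip DaemonSet-owned pods (which `kubectl drain --ignore-daemonsets` leaves in place) are assumptions:

```python
import json
import subprocess
import time

NODE = "worker-1"  # hypothetical node name handed to the preStop hook

# Ask Kubernetes to evict everything that can be rescheduled elsewhere.
subprocess.run(["kubectl", "drain", NODE, "--ignore-daemonsets", "--force"],
               check=True)

def owned_by_daemonset(pod):
    """DaemonSet pods survive a drain, so don't wait on them."""
    return any(ref.get("kind") == "DaemonSet"
               for ref in pod["metadata"].get("ownerReferences", []))

# Block the reboot until no non-terminated pods remain on the node.
while True:
    out = subprocess.run(
        ["kubectl", "get", "pods", "--all-namespaces", "-o", "json",
         "--field-selector", f"spec.nodeName={NODE}"],
        check=True, capture_output=True, text=True,
    ).stdout
    remaining = [p for p in json.loads(out)["items"]
                 if p["status"].get("phase") not in ("Succeeded", "Failed")
                 and not owned_by_daemonset(p)]
    if not remaining:
        break  # node is drained; locksmith can proceed with the reboot
    time.sleep(5)
```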

@so0k

so0k commented Nov 3, 2016

Adding preStop hooks seems like a simple/quick solution to the problem at hand?

@chrissnell

I would also like to be able to prevent a node from rebooting if a ReplicaSet or StatefulSet is not running the desired number of replicas. This is to prevent downtime for an application (like some data stores) that requires a minimum number of nodes to be running.

The scenario goes like this: let's say you're running something like Elasticsearch in a StatefulSet, and one of the StatefulSet's pods experiences a fatal event, like database or disk corruption, free-space exhaustion, etc., and fails a liveness probe. It's broken and won't come up without manual intervention. The StatefulSet is now running with fewer than the desired number of replicas. If locksmith were to initiate a reboot on a node running a pod from this StatefulSet, it could compromise application/cluster availability.

We should be able to prevent node reboots when there is a compromised ReplicaSet or StatefulSet. Maybe there's a way to do this already? I don't know, but this seems like an appropriate place to mention it. We're working around this very same situation with Cassandra running under Fleet (obviously less than ideal).

@sander-su

+1, this looks great. Currently our cluster reboots entirely way too fast, leaving no time for the applications to become available again. As this currently happens during nighttime it is not that big of a problem, but it could be better.

@chrissnell for compromised ReplicaSets & StatefulSets the PodDisruptionBudget would be the indicator.
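As a rough illustration of using that indicator (assuming the `status.disruptionsAllowed` field and that the workloads in question actually define a PodDisruptionBudget), a reboot gate could look something like:

```python
import json
import subprocess

# List every PodDisruptionBudget and refuse the reboot if any of them
# currently allows zero disruptions, i.e. the workload it guards is already
# at or below its minimum healthy count.
out = subprocess.run(
    ["kubectl", "get", "poddisruptionbudgets", "--all-namespaces", "-o", "json"],
    check=True, capture_output=True, text=True,
).stdout

blocked = [
    f'{pdb["metadata"]["namespace"]}/{pdb["metadata"]["name"]}'
    for pdb in json.loads(out)["items"]
    if pdb.get("status", {}).get("disruptionsAllowed", 0) == 0
]

if blocked:
    raise SystemExit(f"reboot blocked by PodDisruptionBudgets: {blocked}")
print("all PodDisruptionBudgets allow disruptions; safe to reboot")
```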

Can someone comment on the current status?

@crawford
Contributor

We are working on a Kubernetes-aware version of locksmith (lovingly called "klocksmith"). The plan is to deploy this component (consisting of a daemon set and a controller) onto the cluster and allow that to manage the reboots. We don't have anything to announce just yet, but we are getting close.
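For the curious, here's a very rough sketch of what the controller half of such a split might do, coordinating with the per-node agent through node annotations (the annotation keys below are made up for illustration, not the operator's actual protocol):

```python
import json
import subprocess

# Hypothetical annotation keys, invented for this sketch; the real operator
# defines its own agent/controller coordination protocol.
NEEDS_REBOOT = "example.com/reboot-needed"
REBOOT_OK = "example.com/reboot-ok"

def kubectl_json(*args):
    """Run a kubectl command and return its JSON output."""
    out = subprocess.run(["kubectl", *args, "-o", "json"],
                         check=True, capture_output=True, text=True).stdout
    return json.loads(out)

def annotations(node):
    return node["metadata"].get("annotations", {})

nodes = kubectl_json("get", "nodes")["items"]

# Controller side: allow at most one node to reboot at a time.
waiting = [n for n in nodes if annotations(n).get(NEEDS_REBOOT) == "true"]
rebooting = [n for n in nodes if annotations(n).get(REBOOT_OK) == "true"]

if waiting and not rebooting:
    name = waiting[0]["metadata"]["name"]
    # Drain the node first, then hand the go-ahead to the on-node agent
    # (the daemon set) by flipping the annotation.
    subprocess.run(["kubectl", "drain", name, "--ignore-daemonsets", "--force"],
                   check=True)
    subprocess.run(["kubectl", "annotate", "node", name,
                    f"{REBOOT_OK}=true", "--overwrite"], check=True)
```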

@crawford
Contributor

For those following along, we released https://github.com/coreos/container-linux-update-operator, which replaces Locksmith in Kubernetes clusters.

@euank
Contributor

euank commented Aug 15, 2017

The Container Linux Update Operator (the new name for "klocksmith") is now deployed by default on Tectonic clusters. It should also function just as well on regular Kubernetes clusters.

For specific enhancements related to it, please open additional issues here, against Tectonic, or against Kubernetes as appropriate.


@chrissnell
Enforcing a minimum health of a StatefulSet/Deployment will best be accomplished by the pod disruption budget feature, I think... which is still in development, but once it's available the update operator should use it.

@euank euank closed this as completed Aug 15, 2017
@dghubble
Member

Yep, for example, plain-old Kubernetes clusters like the Matchbox bootkube-install example cluster (non-Tectonic) now use the Container Linux Update Operator too.

https://github.com/coreos/matchbox/blob/master/Documentation/cluster-addons.md
