Refactor node draining to avoid race condition #130
Conversation
}, 3*time.Second, 3, true, ctx.Done())

done := make(chan bool)
go dn.getDrainLock(ctx, done)
Can we run dn.getDrainLock in the foreground and not use the done chan? I assume leaderelection.RunOrDie is a loop without a timeout, is that true?
leaderelection.RunOrDie runs forever unless the context is canceled. I didn't run it in the foreground because I want it to keep leading until the node finishes draining. That way, the rest of the nodes can only start the election after the prior one finishes draining, if there is no reboot required.
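For illustration, here is a minimal sketch of that pattern, not the PR's exact code: the election runs in a background goroutine so leadership is held for as long as the goroutine lives, and the caller blocks on the done channel until it is allowed to drain. getDrainLock and drainNode follow the diff above; the wrapper name drainUnderLock is hypothetical.

```go
// Sketch only: assumes a Daemon type with the getDrainLock/drainNode methods
// shown in the diff; drainUnderLock is a hypothetical wrapper for illustration.
func (dn *Daemon) drainUnderLock(ctx context.Context) error {
	ctx, cancel := context.WithCancel(ctx)
	defer cancel() // canceling the context ends RunOrDie and releases leadership

	done := make(chan bool)
	go dn.getDrainLock(ctx, done) // blocks inside leaderelection.RunOrDie
	<-done                        // signaled from OnStartedLeading once draining may begin

	return dn.drainNode()
}
```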
pkg/daemon/daemon.go
Outdated
@@ -18,6 +18,14 @@ import (
	"time"

	"github.com/golang/glog"
	sriovnetworkv1 "github.com/k8snetworkplumbingwg/sriov-network-operator/api/v1"
nit: could you separate local imports from external imports
This was updated automatically by an editor plugin. I'll change it back.
fixed
}

// start the leader election
leaderelection.RunOrDie(ctx, leaderelection.LeaderElectionConfig{
Looking at the leaderelection docs:
"Package leaderelection implements leader election of a set of endpoints. It uses an annotation in the endpoints object to store the record of the election state. This implementation does not guarantee that only one client is acting as a leader (a.k.a. fencing)."
Is that not an issue? I think what we are doing here is considered fencing.
How will the system behave when there is only one endpoint trying to take the lead on the LeaseLock?
Instead of using endpoints, I use the Lease API for leader election here, so I don't think that statement applies. All the clients race for the same Lease object, so I don't think there could be more than one acting as leader.
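For reference, a sketch of Lease-based election with client-go. The Lease name, namespace handling, and timing values here are illustrative assumptions, not necessarily what the PR uses; only the resourcelock.LeaseLock and leaderelection.RunOrDie APIs are standard client-go.

```go
package drain // illustrative sketch only

import (
	"context"
	"time"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/leaderelection"
	"k8s.io/client-go/tools/leaderelection/resourcelock"
)

// runDrainElection is a hypothetical helper: every node's config daemon calls
// it with its own node name as the identity, so all daemons compete for the
// same Lease object and at most one of them is elected at a time.
func runDrainElection(ctx context.Context, client kubernetes.Interface,
	namespace, nodeName string, onLeading func(context.Context)) {
	lock := &resourcelock.LeaseLock{
		LeaseMeta: metav1.ObjectMeta{
			Name:      "config-daemon-draining-lock", // illustrative Lease name
			Namespace: namespace,
		},
		Client:     client.CoordinationV1(),
		LockConfig: resourcelock.ResourceLockConfig{Identity: nodeName},
	}

	leaderelection.RunOrDie(ctx, leaderelection.LeaderElectionConfig{
		Lock:            lock,
		ReleaseOnCancel: true, // give up the Lease when the context is canceled
		LeaseDuration:   5 * time.Second,
		RenewDeadline:   3 * time.Second,
		RetryPeriod:     1 * time.Second,
		Callbacks: leaderelection.LeaderCallbacks{
			OnStartedLeading: onLeading,
			OnStoppedLeading: func() {},
		},
	})
}
```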
OK thanks for explaining
time.Sleep(1 * time.Second)
if dn.drainable {
	glog.V(2).Info("drainNode(): no other node is draining")
	err = dn.annotateNode(dn.name, annoDraining)
Using this mechanism, do we still need to annotate the node with annoDraining?
That is the trick. The leader election mechanism requires the leader to keep updating the Lease object, but in our case the node may reboot itself and then lose leadership. So I use a two-layer lock here. A node can only start draining when two conditions are met: 1) it becomes the leader, and 2) no other node is draining, as indicated by the annotation.
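As a sketch of that two-layer check (field and helper names follow the diff above; the surrounding wiring, such as how dn.drainable is maintained, is assumed): the OnStartedLeading callback keeps polling the drainable flag and only annotates the node and signals done once no other node is marked as draining.

```go
// Sketch of the body passed as OnStartedLeading, assuming dn.drainable is kept
// up to date by a node informer watching the draining annotation on all nodes,
// and annoDraining marks this node as draining (names follow the diff above).
onStartedLeading := func(ctx context.Context) {
	glog.V(2).Info("getDrainLock(): started leading")
	for {
		time.Sleep(1 * time.Second)
		if dn.drainable { // layer 2: no other node holds the draining annotation
			glog.V(2).Info("getDrainLock(): no other node is draining")
			if err := dn.annotateNode(dn.name, annoDraining); err != nil {
				glog.Errorf("getDrainLock(): failed to annotate node: %v", err)
				continue // retry on a transient API error
			}
			done <- true // leadership (layer 1) + annotation check (layer 2): start draining
			return
		}
	}
}
```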
I see, thanks for clarifying
@pliurh mind mentioning this in the commit message? So it's clear from the commit message how this mechanism is used to control node draining.
pkg/daemon/daemon.go
Outdated
RetryPeriod: 1 * time.Second,
Callbacks: leaderelection.LeaderCallbacks{
	OnStartedLeading: func(ctx context.Context) {
		glog.V(2).Info("drainNode(): started leading")
I see in your log messages you have drainNode() - should it not be getDrainLock()?
fixed
glog.V(2).Info("drainNode(): started leading")
for {
	time.Sleep(1 * time.Second)
	if dn.drainable {
Outside of any concern for this PR, because this pattern was here before this PR - but have you folks ever seen nodes getting stuck on this condition? It could happen if a node reboots and doesn't start up, and the daemonset is unable to update its draining status. Then line 847 is blocked indefinitely.
That is intentional. We don't want a configuration mistake to break more nodes. If users encounter such a problem, they should troubleshoot to find out why the node cannot come back.
Force-pushed from 55f619a to fb30f8c
I would prefer the package updates be performed in a separate commit in the PR (easier to review that way), but I would not block on it.
@SchSeba I see below you were requested as reviewer. Once you approve, this can be merged IMO.
Utilize the k8s leader election mechanism to prevent nodes from being annotated at the same time. The leader election mechanism requires the leader to keep updating the Lease object, but in our case the node may reboot itself and then lose leadership. So I use a two-layer lock here. A node can only start draining when two conditions are met: 1) it becomes the leader, and 2) no other node is draining, as indicated by the annotation.