Added Per-pod eviction backoff and retry logic #515
Conversation
Nice! Overall looks good. I'll take a deeper look tomorrow. We need some robust tests, though.
```go
}

// Evict adds a pod to the EvictionQueue
func (e *EvictionQueue) Add(pods []*v1.Pod) {
```
A variadic might be a bit cleaner here, since you could imagine evicting a single pod. 😉
Since the only code adding pods is our own, I thought it made more sense to take a slice here, since that's how we build the list of pods anyway. I have no strong feelings either way, though.
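For illustration, a minimal sketch of the two signatures being discussed; the struct stub and `enqueue` helper are assumptions to keep the sketch self-contained, not the PR's actual code:

```go
package eviction

import v1 "k8s.io/api/core/v1"

// Stubbed out so the sketch compiles on its own; the real struct is in the PR.
type EvictionQueue struct{}

// enqueue is a hypothetical helper standing in for the real queueing logic.
func (e *EvictionQueue) enqueue(pod *v1.Pod) {}

// Slice form, as in the PR: callers pass the list they already built.
func (e *EvictionQueue) Add(pods []*v1.Pod) {
	for _, pod := range pods {
		e.enqueue(pod)
	}
}

// Variadic alternative: AddPods(pod) and AddPods(podA, podB) both read
// naturally, and an existing slice still works via AddPods(pods...).
func (e *EvictionQueue) AddPods(pods ...*v1.Pod) {
	for _, pod := range pods {
		e.enqueue(pod)
	}
}
```

The trade-off is small either way: the variadic form reads better at single-pod call sites, while the slice form matches how the caller already assembles the pods.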
"sigs.k8s.io/controller-runtime/pkg/client" | ||
) | ||
|
||
const ( | ||
evictionQueueMaxDelay = 10 * time.Second |
Thoughts on pushing this to 60 seconds? I'm a bit wary of too much eviction QPS: 1,000 pods at 10 seconds is 100 QPS.
I'm a little wary of pushing it to 60 seconds. Imagine I have 5 pods with a PDB that allows evicting only one at a time. If our nodes take 50 seconds to be fully provisioned, there is enough time for the eviction delay on any one of these pods to reach a full minute. At worst, we could wait a whole minute after the PDB would already allow an eviction before evicting the next pod. If these pods all sit on the same node, that node would take much longer to drain and terminate.
I realize this is a very specific case, but since our allocation logic tends to group pods that are created together (scaling up a deployment being the most common source of pending pods), we may run into something like this more often than not.
I think we can fine-tune these numbers when we get to performance and scale testing. What do you think?
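For context on what this constant controls, per-item exponential backoff with a capped delay is typically wired up through client-go's workqueue rate limiter. A sketch under that assumption; the 100ms base delay is illustrative, as only the 10s cap appears in the diff above:

```go
package eviction

import (
	"time"

	"k8s.io/client-go/util/workqueue"
)

const (
	evictionQueueBaseDelay = 100 * time.Millisecond // assumed base; not shown in the diff
	evictionQueueMaxDelay  = 10 * time.Second       // the cap being debated above
)

func newEvictionWorkqueue() workqueue.RateLimitingInterface {
	// Each failed eviction doubles that pod's delay (100ms, 200ms, 400ms, ...)
	// until it hits evictionQueueMaxDelay. Raising the cap to 60s would only
	// affect pods that have already failed enough times to reach it, but those
	// are exactly the PDB-blocked pods in the scenario above.
	return workqueue.NewRateLimitingQueue(
		workqueue.NewItemExponentialFailureRateLimiter(evictionQueueBaseDelay, evictionQueueMaxDelay),
	)
}
```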
```diff
@@ -137,3 +145,37 @@ func ExpectReconcileSucceeded(reconciler reconcile.Reconciler, key client.Object
 	_, err := reconciler.Reconcile(context.Background(), reconcile.Request{NamespacedName: key})
 	Expect(err).ToNot(HaveOccurred())
 }
+
+func ExpectPodsEvictingSucceeded(c client.Client, pods ...*v1.Pod) {
```
Thoughts on calling this `ExpectEvicted(c client.Client, pods ...*v1.Pod)` and then actually doing the deletion after we verify the deletion timestamp is set?
I would rather have this helper only perform the check and delete the pod separately, in case there is ever a time when we don't want to delete it immediately and want to check some other things first.
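A minimal sketch of the check-only version of this helper, assuming the suite uses Gomega's `Eventually` and a controller-runtime client; the polling details here are illustrative, not the PR's code:

```go
package expectations

import (
	"context"

	. "github.com/onsi/gomega"
	v1 "k8s.io/api/core/v1"
	"k8s.io/apimachinery/pkg/api/errors"
	"sigs.k8s.io/controller-runtime/pkg/client"
)

// ExpectEvicted only asserts that eviction was requested (the deletion
// timestamp is set); the test deletes the pod separately when it wants to.
func ExpectEvicted(c client.Client, pods ...*v1.Pod) {
	for _, pod := range pods {
		Eventually(func() bool {
			p := &v1.Pod{}
			key := client.ObjectKey{Namespace: pod.Namespace, Name: pod.Name}
			if err := c.Get(context.Background(), key, p); err != nil {
				return errors.IsNotFound(err) // an already-deleted pod also counts
			}
			return !p.DeletionTimestamp.IsZero()
		}).Should(BeTrue())
	}
}
```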
```go
	queue        workqueue.RateLimitingInterface
	coreV1Client corev1.CoreV1Interface

	enqueued set.Set
```
Should we add this functionality to the newly added `utils/parallel/workqueue.go`? That implementation is slightly different, but there is a lot of shared functionality. This could definitely be done later, though.
Ellis and I talked about this, and we think there is a possibility to merge them. We decided to punt on that merging effort until later in favor of getting this feature in.
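For reference, the shared shape both implementations have in common is a rate-limited workqueue plus a set that deduplicates adds. The field names come from the diff above; the `Add` body and the choice of `deckarep/golang-set` for `set.Set` are assumptions:

```go
package eviction

import (
	set "github.com/deckarep/golang-set"
	v1 "k8s.io/api/core/v1"
	"k8s.io/apimachinery/pkg/types"
	corev1 "k8s.io/client-go/kubernetes/typed/core/v1"
	"k8s.io/client-go/util/workqueue"
)

type EvictionQueue struct {
	queue        workqueue.RateLimitingInterface
	coreV1Client corev1.CoreV1Interface // used by the worker to call the Eviction API

	enqueued set.Set // tracks pods already in the queue
}

// Add enqueues each pod once: the enqueued set filters out repeat calls,
// while the rate limiter spaces out retries for pods whose eviction fails.
func (e *EvictionQueue) Add(pods []*v1.Pod) {
	for _, pod := range pods {
		nn := types.NamespacedName{Namespace: pod.Namespace, Name: pod.Name}
		if !e.enqueued.Contains(nn) {
			e.enqueued.Add(nn)
			e.queue.Add(nn)
		}
	}
}
```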
Issue, if available:
#452
Description of changes:
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.