
Do not exec into pods during rolling update #308

Merged

merged 3 commits into main from fix-error-after-config-map-change on Sep 15, 2020

Conversation

@ansd (Member) commented Sep 2, 2020

This closes #304

Exec into pods only if StatefulSet is ready and up to date.

Before this commit, we observed in #304 that the controller tried to exec into pods at the same time as the pods were being updated due to a StatefulSet restart, resulting in connection errors.

The main change is to not set plugins if

sts.Status.ReadyReplicas < desiredReplicas || sts.Status.UpdatedReplicas < desiredReplicas

because during a StatefulSet restart, it happened multiple times in #304 that sts.Status.ReadyReplicas == desiredReplicas held, so the exec commands were executed and the connection then got interrupted because the pod was being updated.

The main finding is that sts.Status.ReadyReplicas == desiredReplicas can be true although the StatefulSet rolling update is still ongoing:

$ while true; do  kubectl get statefulsets.apps definition-rabbitmq-server --template={{.status}}; echo "" ; sleep 1; done

map[collisionCount:0 currentReplicas:2 currentRevision:definition-rabbitmq-server-56b65fdc4f observedGeneration:5 readyReplicas:3 replicas:3 updateRevision:definition-rabbitmq-server-6f667f5f7d]
…
map[collisionCount:0 currentReplicas:2 currentRevision:definition-rabbitmq-server-56b65fdc4f observedGeneration:5 readyReplicas:2 replicas:3 updateRevision:definition-rabbitmq-server-6f667f5f7d]
…
map[collisionCount:0 currentReplicas:2 currentRevision:definition-rabbitmq-server-56b65fdc4f observedGeneration:5 readyReplicas:2 replicas:3 updateRevision:definition-rabbitmq-server-6f667f5f7d updatedReplicas:1]
…
map[collisionCount:0 currentReplicas:1 currentRevision:definition-rabbitmq-server-56b65fdc4f observedGeneration:5 readyReplicas:3 replicas:3 updateRevision:definition-rabbitmq-server-6f667f5f7d updatedReplicas:1]
…
map[collisionCount:0 currentReplicas:1 currentRevision:definition-rabbitmq-server-56b65fdc4f observedGeneration:5 readyReplicas:2 replicas:3 updateRevision:definition-rabbitmq-server-6f667f5f7d updatedReplicas:1]
…
map[collisionCount:0 currentReplicas:1 currentRevision:definition-rabbitmq-server-56b65fdc4f observedGeneration:5 readyReplicas:2 replicas:3 updateRevision:definition-rabbitmq-server-6f667f5f7d updatedReplicas:2]
…
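
For illustration, a minimal sketch of this guard in Go (hypothetical function and package names; the actual operator code may differ):

package controllers

import appsv1 "k8s.io/api/apps/v1"

// allReplicasReadyAndUpdated reports whether it is safe to exec into pods:
// the StatefulSet must be both fully ready and fully updated, because
// ReadyReplicas alone can equal the desired count while a rolling update
// is still replacing pods, as the status output above shows.
func allReplicasReadyAndUpdated(sts *appsv1.StatefulSet) bool {
	desiredReplicas := int32(1)
	if sts.Spec.Replicas != nil {
		desiredReplicas = *sts.Spec.Replicas
	}
	return sts.Status.ReadyReplicas == desiredReplicas &&
		sts.Status.UpdatedReplicas == desiredReplicas
}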

Thank you @Gsantomaggio for helping troubleshoot 🙂

@mkuratczyk (Collaborator)

Why do we even try to run rabbitmq-plugins set if the cluster is restarting? Since #224 is done, we should only exec into the container to enable/disable plugins if the plugins ConfigMap is changed while the cluster is running. If the cluster configuration is changed (a different ConfigMap), we should restart the cluster and it should load plugins on startup based on the plugins ConfigMap - no need to run rabbitmq-plugins set at all. What am I missing? :)
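
(For illustration only: one way to detect "the plugins ConfigMap changed" is to compare a deterministic hash of its data against the value recorded at the previous reconcile. This is a hypothetical helper, not the operator's actual implementation.)

package controllers

import (
	"crypto/sha256"
	"encoding/hex"
	"sort"

	corev1 "k8s.io/api/core/v1"
)

// configMapDataHash hashes a ConfigMap's data in a stable key order, so a
// caller can compare it with the previously stored hash and only run
// 'rabbitmq-plugins set' when the plugins ConfigMap actually changed.
func configMapDataHash(cm *corev1.ConfigMap) string {
	keys := make([]string, 0, len(cm.Data))
	for k := range cm.Data {
		keys = append(keys, k)
	}
	sort.Strings(keys) // stable order keeps the hash deterministic
	h := sha256.New()
	for _, k := range keys {
		h.Write([]byte(k))
		h.Write([]byte(cm.Data[k]))
	}
	return hex.EncodeToString(h.Sum(nil))
}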

@Gsantomaggio (Member) commented Sep 3, 2020

It doesn't crash anymore, but I think there is still a problem.

I tried to change the config map by changing some values:
[Screenshot from 2020-09-03 18-40-27]

Changed:

queue_master_locator = random
cluster_formation.node_cleanup.interval = 50

Then if you check the config map, the values haven't changed:

$ kubectl get configmaps definition-rabbitmq-server-conf -o yaml | grep queue_master_locator
queue_master_locator                            = min-masters

@ansd (Member, Author) commented Sep 3, 2020

@mkuratczyk as we discussed this morning, you're right, we should only exec into the container to enable/disable plugins if the plugins ConfigMap is changed. We can and should still use the same guards as in this PR to only exec if all replicas are ready and not in the middle of a rolling update.

@ansd (Member, Author) commented Sep 4, 2020

@Gsantomaggio that's desired behaviour.

You changed the configmap directly (e.g. via kubectl edit configmap definition-rabbitmq-server-conf).
Your configmap did change, but your changes were overwritten shortly thereafter by the controller's reconcile loop.

You should instead update the RabbitmqCluster custom resource, setting additionalConfig like this:

apiVersion: rabbitmq.com/v1beta1
kind: RabbitmqCluster
...
spec:
  rabbitmq:
    additionalConfig: |
      queue_master_locator = random
      cluster_formation.node_cleanup.interval = 50

@ansd ansd force-pushed the fix-error-after-config-map-change branch from 37200e7 to 5a28d29 Compare September 8, 2020 08:04
@Gsantomaggio (Member)

Ok, thank you @ansd

So why does the operator reboot the pods when the config map is changed?
I mean, if the original configuration is restored, the operator should not reboot the pods, right? Or am I missing something? :)

@ferozjilla ferozjilla self-requested a review September 9, 2020 14:36
@ansd (Member, Author) commented Sep 9, 2020

Hey @Gsantomaggio 🙂

Whenever the configuration is updated, the operator restarts the StatefulSet.

That's desired behaviour because https://www.rabbitmq.com/configure.html states that

rabbitmq.conf and advanced.config changes take effect after a node restart

This logic happens in:

if builder.UpdateRequiresStsRestart() && operationResult == controllerutil.OperationResultUpdated {

In your specific case, you're right that the operator unnecessarily restarts the StatefulSet. So that's a side effect of changing the configuration in the wrong way.
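
(As an aside, a common pattern for triggering such a rolling restart from an operator is to bump a pod-template annotation. A minimal sketch with hypothetical names, not necessarily how this operator implements it:)

package controllers

import (
	"context"
	"time"

	appsv1 "k8s.io/api/apps/v1"
	"sigs.k8s.io/controller-runtime/pkg/client"
)

// triggerRollingRestart modifies the StatefulSet's pod template, which makes
// the StatefulSet controller replace the pods one by one, so RabbitMQ nodes
// pick up the new rabbitmq.conf on startup.
func triggerRollingRestart(ctx context.Context, c client.Client, sts *appsv1.StatefulSet) error {
	if sts.Spec.Template.Annotations == nil {
		sts.Spec.Template.Annotations = map[string]string{}
	}
	sts.Spec.Template.Annotations["example.com/restartedAt"] = time.Now().Format(time.RFC3339)
	return c.Update(ctx, sts)
}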

@ansd ansd force-pushed the fix-error-after-config-map-change branch from 5a28d29 to 6e8f8b1 Compare September 11, 2020 12:48
@Zerpet Zerpet self-assigned this Sep 15, 2020
ansd and others added 3 commits September 15, 2020 14:32
Fixes #304

Exec on pods only if StatefulSet is ready and up to date.

Before this commit, we observed in #304 that the controller tried to
exec into pods at the same time as the pods got updated due to a StatefulSet restart
resulting in connection errors.
Relates to #304

Before this commit, the controller exec'ed into every RabbitMQ cluster pod in
every reconcile loop to idempotently run 'rabbitmq-plugins set'.
Although simple and correct, these were unnecessary and expensive operations.

After this commit, the controller only execs into the pods if the
plugins config map got updated.
Importing an internal library into the tests guarantees that the test will
always pass, which weakens the test slightly: we want this test to become red
if we change the resource name, because that would be a user-facing change,
i.e. a change in the expectation of what our product creates.
@Zerpet Zerpet force-pushed the fix-error-after-config-map-change branch from 6e8f8b1 to f29f6bb Compare September 15, 2020 14:32
@Zerpet (Collaborator) commented Sep 15, 2020

I made the change in the system tests to not import the internal library and use a test constant. Given that we are 3 people at the moment and 2/3 have reviewed this, I will merge the PR when the checks pass.

@Zerpet Zerpet merged commit 8b92838 into main Sep 15, 2020
@Zerpet Zerpet deleted the fix-error-after-config-map-change branch September 15, 2020 16:14
Successfully merging this pull request may close these issues:

Reconcile Error after change the Config Map (#304)