Scale Down may not wait long enough before destroying PODs #104

Closed
maddisondavid opened this issue Dec 4, 2019 · 1 comment
Comments

@maddisondavid
Contributor

The Scale Down logic currently waits for the ConfigMap to be updated and then for 6 reconcile loops before updating the StatefulSet. I suspect the reconcile-loop wait is meant to give the PODs time to receive the new ConfigMap environment variables.

The problem is that this sync can take up to 1 minute (by default). From the Kubernetes documentation:

As a result, the total delay from the moment when the ConfigMap is updated to the moment when new keys are projected to the pod can be as long as kubelet sync period (1 minute by default) + ttl of ConfigMaps cache (1 minute by default) in kubelet.

https://kubernetes.io/docs/tasks/configure-pod-container/configure-pod-configmap/#mounted-configmaps-are-updated-automatically

Just waiting 6 reconcile loops does not seem a valid indicator of elapsed time, as the reconcile loop can be called multiple times in quick succession depending on which watches have fired.

Problem Location
https://github.com/pravega/zookeeper-operator/blob/master/pkg/controller/zookeepercluster/zookeepercluster_controller.go#L207-L210
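For illustration only (not the operator's actual code): a minimal Go sketch contrasting a reconcile-loop counter, the pattern described above, with a hypothetical wall-clock guard. The type names, the threshold handling, and the 2-minute wait (kubelet sync period plus ConfigMap cache TTL, per the linked docs) are assumptions made for the example.

```go
package main

import (
	"fmt"
	"time"
)

// reconcileCounterGuard mimics the pattern described above: count reconcile
// invocations after the ConfigMap update and proceed once the count reaches 6.
// Reaching the threshold says nothing about how much wall-clock time has
// passed, because watch events can trigger reconciles back-to-back.
type reconcileCounterGuard struct {
	loopsSinceConfigMapUpdate int
}

func (g *reconcileCounterGuard) ready() bool {
	g.loopsSinceConfigMapUpdate++
	return g.loopsSinceConfigMapUpdate >= 6
}

// timeElapsedGuard is a hypothetical alternative: record when the ConfigMap
// was updated and proceed only after enough wall-clock time has passed for
// the kubelet sync period plus the ConfigMap cache TTL (~2 minutes by default).
type timeElapsedGuard struct {
	configMapUpdatedAt time.Time
	minWait            time.Duration
}

func (g *timeElapsedGuard) ready() bool {
	return time.Since(g.configMapUpdatedAt) >= g.minWait
}

func main() {
	counter := &reconcileCounterGuard{}
	timer := &timeElapsedGuard{configMapUpdatedAt: time.Now(), minWait: 2 * time.Minute}

	// Simulate six reconciles fired in rapid succession by watch events: the
	// counter guard reports ready on the sixth call almost immediately, while
	// the time-based guard still reports not ready.
	for i := 1; i <= 6; i++ {
		fmt.Printf("reconcile %d: counter ready=%v, timer ready=%v\n",
			i, counter.ready(), timer.ready())
	}
}
```

The point of the sketch is that anything keyed to reconcile counts passes as soon as the events arrive, whereas a guard keyed to the ConfigMap update time (or to an explicit readiness signal from the PODs) would actually cover the propagation window the docs describe.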

@pbelgundi
Copy link
Contributor

Fixed by PR: #120
