This repository has been archived by the owner on Apr 4, 2023. It is now read-only.

Delete left over pilots #328

Closed. Wants to merge 1 commit from the 322-pilot-sync branch.

Conversation

@wallrj (Member) commented Apr 11, 2018

Contrary to the original idea of creating pilots during scale-out and deleting them during scale-in,
I've instead extended the existing pilot sync method to delete pilots with an index higher than the Replicas value of the corresponding StatefulSet.

Fixes: #322

Release note:

NONE

@jetstack-bot (Collaborator) commented:
[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
To fully approve this pull request, please assign additional approvers.
We suggest the following additional approver: kragniz

Assign the PR to them by writing /assign @kragniz in a comment when ready.

The full list of commands accepted by this bot can be found here.

The pull request process is described here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@wallrj (Member, Author) commented Apr 12, 2018

/retest

@munnerz (Contributor) commented Apr 17, 2018

I'm not sure we should be using currentReplicas here:

FIELD: currentReplicas <integer>

DESCRIPTION:
     currentReplicas is the number of Pods created by the StatefulSet controller
     from the StatefulSet version indicated by currentRevision.

During an update of the statefulset, this number will represent the total number of out of date replicas (i.e. it will gradually reduce from N to 0 during the upgrade, and then once the upgrade is complete will reset back to N).

FWIW, here are the various StatefulSet.status fields:

FIELDS:
   collisionCount	<integer>
     collisionCount is the count of hash collisions for the StatefulSet. The
     StatefulSet controller uses this field as a collision avoidance mechanism
     when it needs to create the name for the newest ControllerRevision.

   conditions	<[]Object>
     Represents the latest available observations of a statefulset's current
     state.

   currentReplicas	<integer>
     currentReplicas is the number of Pods created by the StatefulSet controller
     from the StatefulSet version indicated by currentRevision.

   currentRevision	<string>
     currentRevision, if not empty, indicates the version of the StatefulSet
     used to generate Pods in the sequence [0,currentReplicas).

   observedGeneration	<integer>
     observedGeneration is the most recent generation observed for this
     StatefulSet. It corresponds to the StatefulSet's generation, which is
     updated on mutation by the API Server.

   readyReplicas	<integer>
     readyReplicas is the number of Pods created by the StatefulSet controller
     that have a Ready Condition.

   replicas	<integer> -required-
     replicas is the number of Pods created by the StatefulSet controller.

   updateRevision	<string>
     updateRevision, if not empty, indicates the version of the StatefulSet used
     to generate Pods in the sequence [replicas-updatedReplicas,replicas)

   updatedReplicas	<integer>
     updatedReplicas is the number of Pods created by the StatefulSet controller
     from the StatefulSet version indicated by updateRevision.

I'd guess either replicas, or currentReplicas+updatedReplicas is what we need to use?

We could alternatively always base it on the statefulset.spec.replicas field - as once the StatefulSet has been scaled down, the cluster itself should already be in a state whereby it can scale down (i.e. all data evacuated from the node). Perhaps I've missed an edge case here though?
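
For illustration, a minimal sketch of the spec.replicas-based option (the helper name is invented here and it assumes the apps/v1 StatefulSet type; it is not code from this PR):

import appsv1 "k8s.io/api/apps/v1"

// desiredPilotCount reads the desired replica count from the StatefulSet
// spec rather than from any of the status counters above; spec.replicas is
// a *int32 which Kubernetes defaults to 1 when it is nil.
func desiredPilotCount(set *appsv1.StatefulSet) int32 {
    if set.Spec.Replicas != nil {
        return *set.Spec.Replicas
    }
    return 1
}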

@munnerz (Contributor) left a comment:

Sorry - I'd left this review sitting around in my browser cache/github somewhere!

-return err
+return errors.Wrap(err, "unable to create cluster nodepools selector")
 }
 pods, err := c.pods.Pods(cluster.Namespace).List(selector)
Contributor:

Why do we need a list of pods if we're just creating a Pilot per StatefulSet replica? Can we not infer the name of each pod from the name of the StatefulSet and its index?
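
For reference, StatefulSet pods follow the fixed <statefulset-name>-<ordinal> naming scheme, so the expected names can be derived without listing pods. A small sketch (the helper is hypothetical, not part of this PR):

import "fmt"

// podNamesForStatefulSet derives pod names purely from the StatefulSet name
// and its replica count, relying on the <name>-<ordinal> convention.
func podNamesForStatefulSet(setName string, replicas int32) []string {
    names := make([]string, 0, replicas)
    for i := int32(0); i < replicas; i++ {
        names = append(names, fmt.Sprintf("%s-%d", setName, i))
    }
    return names
}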

err, "unable to delete pilot %s/%s", pilot.Namespace, pilot.Name,
)
}
}
Contributor:

I'm confused by this function.

First, we get the statefulset. Then we list all pods managed by that set and create Pilots for each of them.
We then list all Pilots for the StatefulSet, and iterate over them, determining whether they should or shouldn't exist and deleting them.

Instead, we could create a function getExpectedPilotsForStatefulSet which could return a []string of Pilots that are expected to exist.

We could then get a list of all existing pilots for the StatefulSet, 'diff' the two lists into a 'to delete' and 'to create', and then go ahead and delete/create the Pilots.

I've done a similar thing in cert-manager: https://github.com/jetstack/cert-manager/blob/master/pkg/issuer/acme/prepare.go#L107-L108

Here, we first run a function that creates a list of challenges that should exist. We then pass that list to a cleanup function, which deletes any challenges that do exist but are not expected to exist anymore.

The later code then goes ahead and creates/realises those challenges (i.e. the 'pilot creation' part).
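
A rough sketch of that shape, with invented names and assuming pilots are keyed by name (not the PR's actual code):

import "k8s.io/apimachinery/pkg/util/sets"

// reconcilePilotNames diffs the pilot names that should exist against the
// pilots that actually exist, yielding the names to create and to delete.
func reconcilePilotNames(expected, actual []string) (toCreate, toDelete sets.String) {
    expectedSet := sets.NewString(expected...)
    actualSet := sets.NewString(actual...)
    // Expected but missing -> create; present but no longer expected -> delete.
    return expectedSet.Difference(actualSet), actualSet.Difference(expectedSet)
}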

Member Author:

You're right, I'd overcomplicated it.
I was trying to imagine (and handle) a case where the StatefulSet had been scaled in, the pods hadn't yet been deleted, but the pilot had been accidentally deleted.
I wanted to re-create the pilot in that case, so that the pod (C* node) could discover its desired configuration and decommission and remove itself from the cluster.
But after discussing with @kragniz, I now realise that the statefulset.spec.replicas value doesn't change until the C* node / Pod / Pilot has been decommissioned and reported that fact via the pilot status.

I've re-implemented as you suggested, as a couple of lists (actual / expected pilots) and a couple of sets (pilotsToCreate / pilotsToDelete).

See what you think.

glog.V(4).Infof(
"Not deleting pilot %s/%s because its pod still exists.",
pilot.Namespace, pilot.Name,
)
Contributor:

Can you collapse these sorts of things into single lines? My own text editor does line wrapping that works for me, and this makes it harder to quickly scan code for what is going on. What is actually a log message looks like a block of code when written like this and indented.

Member Author:

Yeah, but my editor doesn't wrap these lines (nor does GitHub in its web-based diffs).
I find it much easier to read the code this way.

Pilots(cluster.Namespace).Delete(pilot.Name, &metav1.DeleteOptions{})
if err == nil {
glog.V(4).Infof("Deleted pilot %s/%s.", pilot.Namespace, pilot.Name)
} else {
Contributor:

An else after an error check is a pattern avoided in Go (and linters will warn you about it).

This could be rewritten to:

if err != nil {
    if k8sErrors.IsNotFound(err) {
        continue
    }
    return errors.Wrapf(err, "unable to delete pilot %s/%s", pilot.Namespace, pilot.Name)
}

glog.V(4).Infof("Deleted pilot %s/%s.", pilot.Namespace, pilot.Name)

Member Author:

Agreed. Done.

return nil
}

func parsePodIndex(podName string) (int, bool) {
Contributor:

Why does this return (int, bool)? (int, error) makes sense (parsing the string may fail), but I can't work out how the bool is used.

A comment to explain this function's behaviour would also help 😄

Contributor:

Ah, I see the bool means 'found'. An error would make more sense here, and would allow you to provide more context on why something failed.
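
A minimal sketch of the (int, error) variant being suggested (illustrative only; the navigator code may have looked different):

import (
    "fmt"
    "strconv"
    "strings"
)

// parsePodIndex extracts the trailing ordinal from a StatefulSet pod name
// such as "mypool-2", returning an error rather than a bool so the caller
// can report why parsing failed.
func parsePodIndex(podName string) (int, error) {
    i := strings.LastIndex(podName, "-")
    if i < 0 || i == len(podName)-1 {
        return 0, fmt.Errorf("pod name %q does not end with -<ordinal>", podName)
    }
    index, err := strconv.Atoi(podName[i+1:])
    if err != nil {
        return 0, fmt.Errorf("parsing ordinal of pod name %q: %v", podName, err)
    }
    return index, nil
}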

Member Author:

I removed this function; it's no longer needed.

"k8s.io/api/core/v1"

metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
"k8s.io/apimachinery/pkg/runtime"

"github.com/kr/pretty"
Contributor:

Can you fix up grouping of imports here?

import (
//stdlib

//external pkgs

//navigator pkgs
)
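
For example, grouped in that order (the navigator import path below is assumed for illustration):

import (
    // stdlib
    "fmt"
    "time"

    // external pkgs
    metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
    "k8s.io/apimachinery/pkg/labels"

    // navigator pkgs (path assumed)
    "github.com/jetstack/navigator/pkg/apis/navigator/v1alpha1"
)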

Member Author:

Seems a bit nitpicky 😉
My editor isn't set up to automatically format the code that way.
If it's important, then I suggest we make it part of verify-lint.sh.

Member Author:

Ok.

But it still doesn't format the imports as we want them.

There's a discussion of this here: golang/go#20818

Also tried https://github.com/aristanetworks/goarista/tree/master/cmd/importsort, but that sorts them differently again.

I've fixed it by hand ☹️

cluster2pod1 := clusterPod(cluster2, "c2p1")
cluster2np1 := &cluster2.Spec.NodePools[0]
cluster2np1set := nodepool.StatefulSetForCluster(cluster2, cluster2np1)
cluster2np1pod0 := clusterPod(cluster2, cluster2np1set, 0)
Contributor:

It'd be great if we can avoid such a huge stack of objects declared here. It's really difficult when looking at a particular test case to see what's being tested (i.e. the inputs and outputs).

Not sure if there's an easy way to do this, but right now I am definitely struggling to understand all this.

I managed to get around this in cert-manager by accepting a pointer to the fixture itself as part of a 'PreFn': https://github.com/jetstack/cert-manager/blob/master/pkg/issuer/acme/http/ingress_test.go#L21-L54

If not, comments throughout all of this may help make this more readable 😄
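
A stripped-down sketch of that fixture/PreFn idea, with every name invented for illustration:

import (
    "testing"

    "k8s.io/apimachinery/pkg/runtime"
)

// testFixture holds the objects a case needs; each case's preFn mutates the
// fixture (e.g. adds pods or pilots) immediately before the sync under test
// runs, keeping per-case setup next to its expectations.
type testFixture struct {
    kubeObjects []runtime.Object
    navObjects  []runtime.Object
}

type testCase struct {
    name     string
    preFn    func(t *testing.T, f *testFixture)
    expectFn func(t *testing.T, f *testFixture)
}

func runTestCases(t *testing.T, cases []testCase) {
    for _, tc := range cases {
        t.Run(tc.name, func(t *testing.T) {
            f := &testFixture{}
            if tc.preFn != nil {
                tc.preFn(t, f)
            }
            // ... run the sync under test against f here ...
            if tc.expectFn != nil {
                tc.expectFn(t, f)
            }
        })
    }
}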

Member Author:

With the revised implementation I found I could remove many of these.
I tried to make the variable names self-explanatory.
Not sure what more I can add with comments.

v1alpha1.CassandraNodePoolNameLabel,
selection.Exists,
nil,
)
Contributor:

Flatten to a single line

Contributor:

Also why is this being set to nil? Doesn't make sense to me? 🤔

Member Author:

Exists selections cannot have a value.
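
For context, this is roughly how an Exists requirement is built with the apimachinery labels package (illustrative, not the exact code under review):

import (
    "k8s.io/apimachinery/pkg/labels"
    "k8s.io/apimachinery/pkg/selection"
)

// existsSelector matches any object carrying the given label key, whatever
// its value; Exists (and DoesNotExist) requirements take no values, hence
// the nil slice.
func existsSelector(labelKey string) (labels.Selector, error) {
    req, err := labels.NewRequirement(labelKey, selection.Exists, nil)
    if err != nil {
        return nil, err
    }
    return labels.NewSelector().Add(*req), nil
}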

@@ -66,6 +68,35 @@ func SelectorForCluster(c *v1alpha1.CassandraCluster) (labels.Selector, error) {
return labels.NewSelector().Add(*clusterNameReq), nil
}

func SelectorForClusterNodePools(c *v1alpha1.CassandraCluster, nodePoolNames ...string) (labels.Selector, error) {
Contributor:

A brief comment would be great.

Member Author:

Done.

For each nodepool StatefulSet:
* Create a number of pilots to match the number of pods that will be created for the StatefulSet.
* Delete higher-index pilots which have been left behind after the StatefulSet has been scaled in.
* Do not delete a pilot if there is a pod with a matching name
  (that pod won't be able to decommission itself
   unless it can read its desired configuration from its pilot).
* Do not delete a pilot unless it is owned by the cluster that is being synchronised
  (this is not an expected state,
   but we don't want to delete anything unless it was created by us).

Fixes: jetstack#322
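
A condensed, hypothetical sketch of the deletion rule described in the commit message above (names invented; the ownership and pod checks are simplified to booleans):

// shouldDeletePilot applies the rules above: only delete a pilot whose
// ordinal is at or beyond the desired replica count, whose pod no longer
// exists, and which is owned by the cluster being synchronised.
func shouldDeletePilot(pilotIndex int, desiredReplicas int32, podExists, ownedByCluster bool) bool {
    if int32(pilotIndex) < desiredReplicas {
        return false // still within the StatefulSet's desired range
    }
    if podExists {
        return false // the pod still needs its pilot in order to decommission itself
    }
    return ownedByCluster // never delete anything we did not create
}
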
@wallrj (Member, Author) commented Apr 27, 2018

/retest

@munnerz (Contributor) left a comment:

It'd be good to see the API surface for this use annotations that aren't so Cassandra-specific. I think general creation of Pilots for pods should be fairly generic between implementations, else administrators will have to understand the behaviour of each controller's implementation when debugging problems.

Perhaps this could be refactored to have some form of watch on pods with a create-pilot label to trigger creation of a Pilot resource?

We can include all the existing create/delete logic, but have it triggered via watches on pods (i.e. its workqueue should be filled with Pod resources)

Then, each controller implementation (elasticsearch or cassandra) can just update the Pilot with any specific configuration that's required.

To do this, we'll need to somehow access the replicas field of the pod-owning resource (e.g. StatefulSet). If we expand beyond StatefulSets in future, the logic here can be extended to use either the 'scale' subresource or some other kind of polymorphic client.

  • If a pod is created with the required label, a blank Pilot resource is 'ensured' (i.e. created if it does not exist)
  • If a Pilot is deleted, trigger a resync of the pod it was created for

This means we create Pilots when Pods are created, and we delete them when Pods are deleted (provided their index < the replicas count on the owner). We can then also consider adding a 5m delay or similar on deletion, if required (which would apply universally across controllers)
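
Very roughly, the generic flow being proposed might look like the sketch below (the label key, helper functions, and their signatures are all placeholders, not an agreed design):

import (
    corev1 "k8s.io/api/core/v1"
    k8sErrors "k8s.io/apimachinery/pkg/api/errors"
)

// ensurePilotForPod illustrates a Pod-driven pilot controller: any pod
// carrying the (hypothetical) create-pilot label gets a blank Pilot ensured
// for it, keyed by the pod's own name. getPilot and createPilot stand in for
// whatever lister/client the real controller would use.
func ensurePilotForPod(
    pod *corev1.Pod,
    getPilot func(namespace, name string) error,
    createPilot func(pod *corev1.Pod) error,
) error {
    if pod.Labels["navigator.jetstack.io/create-pilot"] != "true" {
        return nil // not a pod this controller manages
    }
    err := getPilot(pod.Namespace, pod.Name)
    if err == nil {
        return nil // pilot already exists
    }
    if !k8sErrors.IsNotFound(err) {
        return err
    }
    // Create a blank Pilot; the database-specific controllers (cassandra,
    // elasticsearch) later update it with their own configuration.
    return createPilot(pod)
}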

CassandraNodePoolNameLabel = "navigator.jetstack.io/cassandra-node-pool-name"
CassandraClusterNameLabel = "navigator.jetstack.io/cassandra-cluster-name"
CassandraNodePoolNameLabel = "navigator.jetstack.io/cassandra-node-pool-name"
CassandraNodePoolIndexLabel = "navigator.jetstack.io/cassandra-node-pool-index"
Contributor:

This appears to be set, but not actually used (at least as part of this PR).

It also could introduce some issues around generalising this code between controllers if we ever have a controller that doesn't use a StatefulSet.

Can we remove it for now in favour of discussion at a later date of storing the index vs the pod name on the Pilot?

wallrj added a commit to wallrj/navigator that referenced this pull request May 15, 2018
* Watches for Pilots and deletes them if there isn't a corresponding Pod.
* Watches for Pods and creates a Pilot if there isn't already one.
* Creates blank pilots which are later updated by database specific controllers.

Fixes: jetstack#328
wallrj added a commit to wallrj/navigator that referenced this pull request Jun 7, 2018
* Watches for Pods and creates a Pilot if there isn't already one.
* Pilots have a pod as their controller reference.
* Garbage collector will delete the pilot when it deletes the pod.
* Pilots are later updated by database specific controllers.

Fixes: jetstack#328
@wallrj (Member, Author) commented Jun 7, 2018

Supplanted by #353

@wallrj wallrj closed this Jun 7, 2018
@wallrj wallrj deleted the 322-pilot-sync branch June 7, 2018 15:24