[WIP] Promote KEP-1672 to GA #2938

andrewsykim · 2021-09-02T20:48:35Z

One-line PR description: Promote feature EndpointSliceTerminatingCondition to GA and ProxyTerminatingEndpoints to Beta.

Issue link: Proxy Terminating Endpoints #1669 & Tracking Terminating Endpoints #1672

Other comments:

andrewsykim · 2021-09-02T20:56:57Z

Marking this one WIP for now, I think some of the PRR questions need to be answered for KEP-1669

wojtek-t · 2021-09-03T06:07:56Z

/assign

thockin

Thanks!

/lgtm
/approve

k8s-ci-robot · 2021-09-03T17:24:51Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: andrewsykim, thockin
To complete the pull request process, please ask for approval from wojtek-t after the PR has been reviewed.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

keps/prod-readiness/OWNERS
~~keps/sig-network/OWNERS~~ [thockin]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

wojtek-t · 2021-09-06T08:07:50Z

keps/sig-network/1672-tracking-terminating-endpoints/kep.yaml


 # The milestone at which this feature was, or is targeted to be, at each stage.
 milestone:
  alpha: "v1.20"
+  beta: "v1.22"


Did we went Beta in 1.22 without:
(a) having this tracked appropriately by RT?
(b) having PRR approved?

This sounds very wrong to me...

There was a last-minute decision during a SIG Network call to include this feature as beta in v1.22 -- I guess we slipped the KEP updates though. Sorry, that's my bad :(

I have no particular recollection of how this came to be, and certainly did not mean to bypass any process.

wojtek-t · 2021-09-06T08:08:52Z

keps/sig-network/1672-tracking-terminating-endpoints/kep.yaml

@@ -20,16 +20,18 @@ see-also:
 replaces: []

 # The target maturity stage in the current dev cycle for this KEP.
-stage: alpha
+stage: stable


Please fill in the PRR questionaire for this KEP.

hmm there's no diff for PRR cause all the questions were answered in the last release, let me know if there's a specific question I missed.

There should be a diff - there are couple questions that weren't answered back then, including:

Are there any tests for feature enablement/disablement?

Have the tests been added? If so, please link them. If not - we shouldn't even go to beta...

What specific metrics should inform a rollback?

We were discussing adding a metric - has that happened?

Were upgrade and rollback tested? Was the upgrade->downgrade->upgrade path tested?

Has that happened? Findings?

What are the SLIs (Service Level Indicators) an operator can use to determine the health of the service?

Has the metric mentioned there been added?

Have the tests been added? If so, please link them. If not - we shouldn't even go to beta...

Oh yes, of course we added tests for feature enablement, do we want the PRR to link to every test case we added? The answer to the question says:

Yes, there will be strategy API unit tests validating if the new API field is allowed based on the feature gate.

Is that enough or do you want more details on the specific test cases (there's a lot)?

Let me add more details on metrics, I think there was some back and forth on the viability of per endpoint metrics due to cardinality. But maybe total endpoints by condition is acceptable.

Is that enough or do you want more details on the specific test cases (there's a lot)?

Please link them - no need to describe.

Added all the PRs that adds unit, integration or e2e tests for this feature under Are there any tests for feature enablement/disablement?

wojtek-t · 2021-09-06T08:14:04Z