-
Notifications
You must be signed in to change notification settings - Fork 193
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Bug 1884334: UpdateError: enhance for ability to determine when upgrade failing #486
Conversation
@jottofar: This pull request references Bugzilla bug 1884334, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker. 3 validation(s) were run on this bug
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/test unit |
ad4fd97
to
6c36937
Compare
/retitle Bug 1884334: UpdateError: add and use Progressing field |
@jottofar: This pull request references Bugzilla bug 1884334, which is valid. The bug has been moved to the POST state. 3 validation(s) were run on this bug
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
6c36937
to
4b26289
Compare
/test ci/prow/e2e-agnostic-upgrade |
@jottofar: The specified target(s) for
Use
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/test e2e-agnostic-upgrade |
1 similar comment
/test e2e-agnostic-upgrade |
8193ef0
to
ead8788
Compare
/bugzilla refresh |
@jottofar: This pull request references Bugzilla bug 1884334, which is valid. 3 validation(s) were run on this bug
In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
8ced187
to
c9a55d8
Compare
/test e2e-agnostic-upgrade |
/test unit |
/retest |
Name: actual.Name, | ||
Nested: nestedMessage, | ||
UpdateEffect: payload.UpdateEffectNone, | ||
Reason: "ClusterOperatorNotAvailable", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We might want a different reason for this, because we know by this point that it is available and not degraded.
But we can twiddle that in follow-up work.
type UpdateEffectType string | ||
|
||
const ( | ||
// "None" defines an error as having no affect on the update state. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: We should add Go linter. I expect it would want these godocs to start with the exposed name, so:
// UpdateEffectNone defines an error as having no affect on the update state.
and similar for the other constants below.
/lgtm Holding in case you want to address either of the two new nits. I'm fine punting them both to follow up work as well, in which case, just pull the hold. |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: jottofar, wking The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/unhold Let's address them in a follow-up so this gets some soak time before code cutoff. |
/test unit |
@jottofar: All pull requests linked via external trackers have merged: Bugzilla bug 1884334 has been moved to the MODIFIED state. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Currently during an upgrade the cluster will be marked as failing when an operator is still updating but not necessarily failing. Add the ability to set whether a given cluster operator error should result in the cluster being set to failing.
The bool
UpgradeSucceeding
was added to theUpdateError
structure to indicate whether the error should result inFailing=True
. The name was chosen such that!UpgradeSucceeding== Failing
. In this way the default for a given error is that it results in the cluster being marked as failing and explicit action must be taken by a developer, for example when a newUpdateError
is added, to determine and set the error such that the cluster is not marked as failing. I chose to stick with a new field inUpdateError
as opposed to anUpdateError
sibling to avoid having to change method signatures especially since this first round is only addressing cluster operator errors.This PR also adds the ability to track each cluster operator's sync'ing time such that those marked as
ClusterOperatorNotAvailable
will not result in clusterFailing=True
unless sync'ing time has exceeded 40 minutes.