-
Notifications
You must be signed in to change notification settings - Fork 400
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement release blocking job criteria #347
Comments
Are we consolidating all non-blocking dashboards into -informing? I'd be in favor of that. |
/cc |
/milestone v1.14 I would like for us to implement this for the v1.14 release |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
/remove-lifecycle stale |
/help |
/assign
this is done
I would recommend someone look at https://github.com/kubernetes/test-infra/tree/master/metrics for this |
/assign @jberkus |
I actually bet I do if people don't ping me directly. I just opened a list of those where I was reviewer and I didn't see majority of those. If something is critical and really requires my attention, I should be at the very least assigned as approver (those I'm trying to follow, but I'm also missing some percentage of those).
I disagree with that. We shouldn't create a policy that we know want work as soon as we have soak tests. Those are important enough that I really believe we will prioritize them soon.
There was a 1-hour-long discussion that happened during the zoom meeting. Where I think we roughly converged. @jberkus is actually documenting the outcome of it in his PR. |
This is a separate discussion but ... this kinda defeats the purpose of having assigned reviewers. Please consider dropping a note to contribex about why / how this doesn't work so we can fix this. :/
There has been no indication that we will have soak tests. What we call soak tests now are easily among the longest failing tests we have and likely to be removed in the near future... These have been relatively unmaintained for on the order of year(s). Who is prioritizing them? I've heard zero discussion of this in SIG-Testing or SIG-Release ...
Excellent. 👍 |
I've got an initial attempt at a dashboard that displays metrics relevant to release-blocking criteria: kubernetes/test-infra#13879 (comment) http://velodrome.k8s.io/dashboard/db/job-health-release-blocking |
I caught that the bazel jobs were postsubmits that didn't meet the "scheduled at least 3 hours" criteria so swapped them with periodics that did kubernetes/test-infra#13907 |
The serial job takes way too long, and is failing due to timeout. I think we should kick out egregious offenders, and encourage a pattern of adding a feature-specific job if the feature is truly necessary to be release-blocking. Opened issues to start this for
|
/assign @tpepper |
/assign @guineveresaenger |
@msau42 is looking to split out serial storage tests into another release-blocking job as well kubernetes/test-infra#13936 I feel like splitting tests into more parallel blocking jobs is a sound approach for now. But, it's only going to get us but so far before we run into new limits:
At some point it's worth questioning why these tests need to be release-blocking, and if there is some sort of bar they should be held to. We presumably do this for Conformance tests, though IMO it's not as rigorously measured as it could be, and relies on extensive human review. |
Not completed despite the merger of #752. Mostly because there's still some issues unanswered:
|
will start with #775 |
/remove-help |
This is an umbrella issue for followup work to #346
That PR describes aspirational release blocking job criteria. This issue is intended to track the followup work, including:
assignment of owners to all jobsdescriptions added to all release-master-blocking jobspropose the creation of [email protected] or reuse of sig-foo-test-failures for all sigs that need to be responsive to test failuresthe creation of the release-informing dashboard, and moving jobs out of release-master-blocking to that dashboard/sig release
this is sig-release policy
/sig testing
this will be assisted by sig-testing tooling
EDIT 2019-07-23: AFAIK metrics is the only thing that remains to close this out
The text was updated successfully, but these errors were encountered: