USHIFT-2348: microshift y-2 upgrades #1562

dhellmann · 2024-02-08T17:16:33Z

/assign @DanielFroehlich @pmtk @jerpeter1

openshift-ci-robot · 2024-02-08T17:16:37Z

@dhellmann: This pull request references USHIFT-2348 which is a valid jira issue.

Warning: The referenced jira issue has an invalid target version for the target branch this PR targets: expected the story to target the "4.16.0" version, but no target version was set.

In response to this:

/assign @DanielFroehlich @pmtk @jerpeter1

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

DanielFroehlich

Please add some discussion of concerns about the underlying OS, e.g. RHEL versions. We currently have increased test efforts with 4.14 being supported on 9.2 and 9.3. What would be the impact on Y+2 upgrades? Allowing both Y+2 and RHEL+1 sounds like a lot of combinations we might need to test.

There is an enhancement for the RHEL+1 support. How does that relate to this one? Would it also need to be updated?

enhancements/microshift/y-2-upgrades.md

dhellmann · 2024-02-08T19:09:25Z

The latest draft addresses all of @DanielFroehlich's comments.

dhellmann · 2024-02-08T19:27:08Z

/assign @jogeo

enhancements/microshift/y-minus-2-upgrades.md

dhellmann · 2024-02-13T16:27:55Z

#1555 is changing the enhancement template in a way that will cause the header check in the linter job to fail for existing PRs. If this PR is merged within the development period for 4.16 you may override the linter if the only failures are caused by issues with the headers (please make sure the markdown formatting is correct). If this PR is not merged before 4.16 development closes, please update the enhancement to conform to the new template.

jogeo

A few small changes
/lgtm

enhancements/microshift/y-minus-2-upgrades.md

dhellmann · 2024-02-16T13:51:04Z

@jogeo I committed your suggestions, which removed the lgtm you applied. Could you re-apply it, please?

dhellmann · 2024-02-16T14:02:21Z

The linter is failing because of the template change referenced in #1562 (comment)

jerpeter1 · 2024-02-16T18:03:40Z

enhancements/microshift/y-minus-2-upgrades.md

+schema, and content).
+
+MicroShift does not automatically create `StorageVersionMigration` CRs
+to trigger data migration. The core kubernetes APIs are safe because


While this is true right now, if we were to incorporate something that did require data migrations in the future (e.g. a monitoring stack transitioning from using Elasticsearch for log storage to Loki), we might need to carry the same version of the monitoring stack through a couple of MicroShift releases in order to ensure that data migrations take place.

dhellmann · 2024-02-16

Do you mean the migration stack? I don't think monitoring is involved in the storage migration, right? At least not the way we're using it.

ggiguash · 2024-02-18T11:20:44Z

enhancements/microshift/y-minus-2-upgrades.md

+
+The main drawback to implementing this enhancement is the increased
+test matrix for upgrades. We can automate those tests to minimize the
+impact.


Should we consider minimizing the testing effort by declaring y-2 upgrade support only between EUS releases?

As an example, in 4.15 / 4.16 releases we would have to set up the following tests:

1. 4.14 -> 4.16 (y-2 released)
2. 4.15 -> 4.17 (y-2 fake)
3. 4.15 -> 4.16 (y-1 released)
4. 4.15 -> 4.17 (y-2 fake)

If we only support EUS-to-EUS y-2 upgrades, we'd need tests 1 and 3 from the above list.

Note that we do not have to do anything "special" to limit our support to EUS releases only in the y-2 context; it can be purely declarative. Potentially, we can add a simple check in the code to allow only even-version upgrades in y-2 scenarios.

dhellmann · 2024-02-18

We do not need the fake test scenarios. Those use the same set of code and only ensure that the version number prevents an upgrade. We have unit tests for testing that restriction.

After we have a 4.N+2 release, we could go back to the 4.N branch and add a new test that ensures that no changes go into 4.N that would prevent an upgrade to 4.N+2. However, given the fact that we do not make architectural or file format changes in stable branches, the risk of introducing such a breaking change is very low and adding such a test would add complexity to our test matrix for little benefit. Any such issue would be caught by the periodic job on the 4.N+2 branch.

If at some point there is an issue with upgrading from the code in version 4.N to 4.N+2 caused by introducing a breaking change into 4.N, we must fix the problem in 4.N+2 because we cannot ensure that users update to a new 4.N.z before trying to update to 4.N+2.
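
The version restriction discussed in this thread can be sketched as follows. This is a minimal illustration, not MicroShift's actual implementation: the names `parse_version`, `upgrade_allowed`, `max_skip`, and `eus_only_skip` are hypothetical, and the handling of downgrades and major-version changes is an assumption. The `eus_only_skip` flag models the "only allow even version upgrade for y-2 scenarios" variant proposed above.

```python
from typing import NamedTuple


class Version(NamedTuple):
    major: int
    minor: int


def parse_version(text: str) -> Version:
    """Parse the X.Y prefix of a version string such as '4.14.7'."""
    major, minor, *_ = text.split(".")
    return Version(int(major), int(minor))


def upgrade_allowed(current: str, target: str, max_skip: int = 2,
                    eus_only_skip: bool = False) -> bool:
    """Return True if moving from `current` to `target` is within the skew.

    Assumptions (not taken from the MicroShift source): downgrades and
    major-version changes are rejected, and z-stream-only updates pass.
    """
    cur, new = parse_version(current), parse_version(target)
    if cur.major != new.major:
        return False
    skew = new.minor - cur.minor
    if skew < 0 or skew > max_skip:
        return False
    if eus_only_skip and skew == 2:
        # Hypothetical EUS-only rule: skipping a release is allowed only
        # when starting from an even (EUS) minor version.
        return cur.minor % 2 == 0
    return True
```

With the default arguments, both `4.14 -> 4.16` and `4.15 -> 4.17` pass; with `eus_only_skip=True`, only the first does. That difference is the extra branch in the version check that the enhancement text cites as added complexity.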

ggiguash · 2024-02-18T11:22:04Z

enhancements/microshift/y-minus-2-upgrades.md

+and also moving from the EUS version to non-EUS version. The aspects
+of testing the OS support during upgrades are orthogonal to the work
+for this enhancement, however, and should not require additional
+expansion of the test matrix, either in CI or by QE.


If we only support EUS-to-EUS upgrades for MicroShift, it would be in sync with EUS-to-EUS upgrades of the OS.

dhellmann · 2024-02-18

I expect in practice most users will stick to EUS versions and update 2 at a time. The risk of only actually testing EUS upgrades is that we have to remember to do different tests for every other version of MicroShift. If we always test that upgrading 2 versions at a time works, we will always maintain support for it.

ggiguash · 2024-02-18T11:23:29Z

enhancements/microshift/y-minus-2-upgrades.md

+     1. RHEL 9.2 / 4.14.latest → RHEL 9.4 / 4.16.Z
+     1. RHEL 9.3 / 4.14.latest → RHEL 9.4 / 4.16.Z
+     1. RHEL 9.2 / 4.15.latest → RHEL 9.4 / 4.16.Z
+     1. RHEL 9.3 / 4.15.latest → RHEL 9.4 / 4.16.Z


We also need to add a test for 4.15 -> 4.17 (fake), unless we support EUS-to-EUS only.

dhellmann · 2024-02-18

The QE team should not be testing using fake packages like what we build in CI.
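
The four upgrade paths quoted in this thread are the cross product of two source MicroShift versions and two source RHEL versions, all converging on one target. A small sketch of that enumeration, with hypothetical variable and function names, shows how each additional source version adds a full row of combinations:

```python
from itertools import product

# Values taken from the quoted test plan; the names are illustrative only.
SOURCE_MICROSHIFT = ["4.14.latest", "4.15.latest"]
SOURCE_RHEL = ["9.2", "9.3"]
TARGET_RHEL, TARGET_MICROSHIFT = "9.4", "4.16.Z"


def upgrade_paths() -> list[str]:
    """List every source combination paired with the single target."""
    return [
        f"RHEL {rhel} / {ushift} → RHEL {TARGET_RHEL} / {TARGET_MICROSHIFT}"
        for ushift, rhel in product(SOURCE_MICROSHIFT, SOURCE_RHEL)
    ]


for path in upgrade_paths():
    print(path)
```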

ggiguash · 2024-02-18T11:26:25Z

enhancements/microshift/y-minus-2-upgrades.md

+(4.14 to 4.16 would be OK, but 4.15 to 4.17 would not). This would
+make the version checking logic more complicated and would introduce
+opportunities for that skip-level upgrade process to be broken in a
+non-EUS version so that it has to be fixed before the next EUS


I think we might want to distinguish between supported and tested upgrade version skews.
If there is a concern about breaking features between non-EUS upgrades, we can still test those in CI without declaring support for them.

enhancements/microshift/y-minus-2-upgrades.md

pmtk · 2024-02-20T15:33:44Z

looks good to me

pmtk · 2024-02-20T16:36:11Z

+1

openshift-ci · 2024-02-21T17:17:29Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: pacevedom

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [pacevedom]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

pmtk · 2024-02-22T09:42:52Z

/lgtm

openshift-ci · 2024-02-22T09:49:59Z

@dhellmann: all tests passed!

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

ShaunaDiaz · 2024-03-04T18:12:08Z

enhancements/microshift/y-minus-2-upgrades.md

+   installed.
+2. Software runs, time passes.
+3. Edge device administrator updates the host to run MicroShift 4.Y.
+  * For ostree-based systems, the host is automatically rebooted as


Where would it roll back to if the upgrade fails (for some other reason)? Does ostree have any validation of version skew?

dhellmann · 2024-03-06

The rollback is always to the previous image that was installed on the host, regardless of what the new image contains or what causes the rollback.

ShaunaDiaz · 2024-03-04T18:16:20Z

enhancements/microshift/y-minus-2-upgrades.md

+not make a distinction between types of versions and requires stepping
+through one release at a time. MicroShift upgrades are significantly
+simpler because they are all single-node (so disruption is expected)
+and there are no operators for managing the host or cluster


Can add-ons that are used through OLM be a problem here?

dhellmann · 2024-03-06

Yes, just as with OCP it is possible for an OLM-installed operator to be an issue by not having compatibility with the version skew of MicroShift. It's less of a concern for MicroShift, because there are fewer core APIs and less likelihood of that incompatibility, but it's still there.

ShaunaDiaz · 2024-03-04T18:26:15Z

enhancements/microshift/y-minus-2-upgrades.md

+   1. Initial cluster bring-up will be a mix of deployments from ISO
+      installer and rpm-ostree upgrades from a bare RHEL host
+   1. The following upgrade paths will be covered:
+      1. RHEL 9.2 / 4.14.latest → RHEL 9.4 / 4.16.Z


Is "4.14.latest" going to start with the 4.14. z-stream version when the 4.16.0 version is released? So can users update from a z-stream prior, down to 4.14.0? Or does the Y-version check in reality ignore the z-stream skew?

dhellmann · 2024-03-06

In CI we test by starting from the latest release in a z-stream. That ensures there is at least some path to update. Since it is extremely unlikely that we would introduce a change into the z-stream that would break updates, the benefit of testing every combination of z-stream versions against the new release is not worth the effort involved.

@jogeo, I assume QE is taking a similar stance?
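
As an illustration of the CI starting point described in this reply, the sketch below picks the highest z-stream release for a given minor version from a list of known releases. The function name `latest_z` and the input list are hypothetical; the actual CI jobs may resolve the starting version differently.

```python
def latest_z(releases: list[str], minor_stream: str) -> str:
    """Return the highest X.Y.Z in `releases` whose X.Y equals `minor_stream`."""
    candidates = [r for r in releases if r.rsplit(".", 1)[0] == minor_stream]
    if not candidates:
        raise ValueError(f"no releases found for {minor_stream}")
    # Compare numerically so that 4.14.10 sorts after 4.14.2.
    return max(candidates, key=lambda r: tuple(int(p) for p in r.split(".")))
```

Testing only from this single starting point per stream keeps the matrix at one entry per source minor version instead of one per z-stream release.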

ShaunaDiaz · 2024-03-04T18:30:46Z

enhancements/microshift/y-minus-2-upgrades.md

+
+N/A
+
+### Tech Preview -> GA


What is the status of this feature in 4.16? TP or GA?

DanielFroehlich · 2024-03-05

I would love to get this GAed directly, as it makes sense for an EUS release to be used directly on production systems. @dhellmann, any concerns?

dhellmann · 2024-03-06

It will be GA directly.

openshift-ci-robot added the jira/valid-reference label (indicates that this PR references a valid Jira ticket of any type) Feb 8, 2024

openshift-ci bot assigned DanielFroehlich, jerpeter1 and pmtk Feb 8, 2024

dhellmann mentioned this pull request Feb 8, 2024

USHIFT-2170: support upgrading 2 Y versions at a time openshift/microshift#2952

Merged

openshift-ci bot requested review from coreydaley and dmage February 8, 2024 17:23

DanielFroehlich reviewed Feb 8, 2024

dhellmann force-pushed the USHIFT-2348-microshift-y-2-upgrades branch from 762ad26 to 45a9931 Compare February 8, 2024 19:09

dhellmann force-pushed the USHIFT-2348-microshift-y-2-upgrades branch from 45a9931 to ee74a10 Compare February 8, 2024 19:26

openshift-ci bot assigned jogeo Feb 8, 2024

pmtk reviewed Feb 9, 2024

pmtk reviewed Feb 9, 2024

dhellmann force-pushed the USHIFT-2348-microshift-y-2-upgrades branch 3 times, most recently from 7a9b0bb to 2d495da Compare February 12, 2024 20:27

jogeo reviewed Feb 16, 2024

openshift-ci bot added and removed the lgtm label (indicates that a PR is ready to be merged) Feb 16, 2024

dhellmann added the tide/merge-method-squash label (denotes a PR that should be squashed by tide when it merges) Feb 16, 2024

jerpeter1 reviewed Feb 16, 2024

USHIFT-2348: microshift y-2 upgrades

9ae5f40

dhellmann force-pushed the USHIFT-2348-microshift-y-2-upgrades branch from 5c78f32 to dd9aca3 Compare February 16, 2024 18:52

ggiguash reviewed Feb 18, 2024

pmtk reviewed Feb 20, 2024

dhellmann added 2 commits February 20, 2024 10:49

add QE test plan

391bf98

feedback from architecture call

f080d37

dhellmann force-pushed the USHIFT-2348-microshift-y-2-upgrades branch from dd9aca3 to f080d37 Compare February 20, 2024 16:03

pacevedom approved these changes Feb 21, 2024

openshift-ci bot added the approved label (indicates a PR has been approved by an approver from all required OWNERS files) Feb 21, 2024

openshift-ci bot added the lgtm label (indicates that a PR is ready to be merged) Feb 22, 2024

openshift-merge-bot bot merged commit 44489cd into openshift:master Feb 22, 2024
2 checks passed

ShaunaDiaz reviewed Mar 4, 2024
