Switch to rhel-coreos (9) #3596

cgwalters · 2023-03-08T17:05:27Z

Forking from #3485

This version of the PR uses rhel-coreos, not rhel-coreos-9 per discussion.

ensures that RHCOS 9 SSH keys are in the right place

OKD release controller is out-of-date

ensures SSH keys get moved to the correct location

When we move from RHCOS 8 -> RHCOS 9, the SSH keys are not being written
to the new location because:

When the upgrade configs are written to the node, it is still running RHCOS 8, so the keys are not being written to the new location.
The node reboots into RHCOS 9 to complete the upgrade.
The "are we on the latest config" functions detect that we are indeed on the latest config and so it does not attempt to perform an update.

teaches TestIgn3Cfg about the new RHCOS 9 key path

checks perms for SSH key path dirs as well

Switch to rhel-coreos (9)

ref: https://issues.redhat.com/browse/COS-1983

We introduced a new rhel-coreos that is RHEL 9 to aid having a switch be
an atomic operation. After design discussion we realized it's easier
to have an "unversioned" image though, so this drops the -8.

daemon: Also override kernel-modules-core

Unfortunately rpm-ostree requires this right now; we have an issue
and code to provide a better API in coreos/rpm-ostree#2542
But using that will require shipping the updated rpm-ostree in RHEL 8.6.z
or at least OCP 4.12.z, which is problematic.

Because we know the new MCD will always be upgrading to RHEL9,
for now let's update this hardcoded list. In the future we can
detect when the running host has --remove-installed-kernel and
use it instead.

openshift-azure-routes: Avoid synchronizing too quickly

Rapid file changes triggering the path unit can start the
service here frequently, and then this can cause the start
limit to be hit, and then systemd will refuse further
activations (unless we bumped the limit).

I don't think we need to synchronize the iptables
rules more than once every 3 seconds.

When we move from RHCOS 8 -> RHCOS 9, the SSH keys are not being written to the new location because: 1. When the upgrade configs are written to the node, it is still running RHCOS 8, so the keys are not being written to the new location. 2. The node reboots into RHCOS 9 to complete the upgrade. 3. The "are we on the latest config" functions detect that we are indeed on the latest config and so it does not attempt to perform an update.

ref: https://issues.redhat.com/browse/COS-1983 We introduced a new `rhel-coreos` that is RHEL 9 to aid having a switch be an atomic operation. After design discussion we realized it's easier to have an "unversioned" image though, so this drops the `-8`.

Unfortunately rpm-ostree requires this right now; we have an issue and code to provide a better API in coreos/rpm-ostree#2542 But using that will require shipping the updated rpm-ostree in RHEL 8.6.z or at least OCP 4.12.z, which is problematic. Because we know the new MCD will always be upgrading to RHEL9, for now let's update this hardcoded list. In the future we can detect when the running host has `--remove-installed-kernel` and use it instead.

Rapid file changes triggering the path unit can start the service here frequently, and then this can cause the start limit to be hit, and then systemd will refuse further activations (unless we bumped the limit). I don't think we need to synchronize the iptables rules more than once every 3 seconds.

sdodson · 2023-03-08T18:17:56Z

/lgtm

sdodson · 2023-03-08T18:19:15Z

@jupierce lets do this

openshift-ci · 2023-03-08T18:20:41Z

@cgwalters: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name	Commit	Details	Required	Rerun command
ci/prow/okd-images	`b7a5887`	link	false	`/test okd-images`
ci/prow/okd-scos-e2e-gcp-ovn-upgrade	`b7a5887`	link	false	`/test okd-scos-e2e-gcp-ovn-upgrade`
ci/prow/okd-scos-images	`b7a5887`	link	true	`/test okd-scos-images`
ci/prow/okd-scos-e2e-aws-ovn	`b7a5887`	link	false	`/test okd-scos-e2e-aws-ovn`

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

openshift-ci · 2023-03-08T18:22:54Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: cgwalters, sdodson

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [cgwalters]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

Since openshift/machine-config-operator#3596 merged.

cgwalters · 2023-03-08T18:29:59Z

Followup PRs:

xref openshift@1ad53a7 and the rename in openshift/machine-config-operator#3596

xref 1ad53a7 and the rename in openshift/machine-config-operator#3596

Since openshift/machine-config-operator#3596 merged.

cgwalters · 2023-03-10T01:18:29Z

/cherry-pick release-4.13

openshift-cherrypick-robot · 2023-03-10T01:19:09Z

@cgwalters: new pull request created: #3603

In response to this:

/cherry-pick release-4.13

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

cheesesashimi and others added 8 commits March 8, 2023 10:43

ensures that RHCOS 9 SSH keys are in the right place

4c212ef

OKD release controller is out-of-date

fc98d02

teaches TestIgn3Cfg about the new RHCOS 9 key path

9412f1d

checks perms for SSH key path dirs as well

14d30fe

Switch to rhel-coreos (9)

44f9ad5

ref: https://issues.redhat.com/browse/COS-1983 We introduced a new `rhel-coreos` that is RHEL 9 to aid having a switch be an atomic operation. After design discussion we realized it's easier to have an "unversioned" image though, so this drops the `-8`.

openshift-ci bot requested review from cheesesashimi and jkyros March 8, 2023 17:09

openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Mar 8, 2023

jupierce merged commit 832de9e into openshift:master Mar 8, 2023

openshift-ci bot assigned sdodson Mar 8, 2023

openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Mar 8, 2023

cgwalters mentioned this pull request Mar 8, 2023

mco: Update to use rhel-coreos in master+4.14 openshift/release#37082

Merged

cgwalters added a commit to cgwalters/release that referenced this pull request Mar 8, 2023

mco: Update to use rhel-coreos in master+4.14

40fdbbb

Since openshift/machine-config-operator#3596 merged.

cgwalters added a commit to cgwalters/release that referenced this pull request Mar 8, 2023

openshift/kubernetes: Use rhel-coreos

94b0d10

Since openshift/machine-config-operator#3596 merged.

cgwalters mentioned this pull request Mar 8, 2023

openshift/kubernetes: Use rhel-coreos openshift/release#37083

Merged

cgwalters mentioned this pull request Mar 8, 2023

Switch to rhel-coreos-9 #3485

Closed

This was referenced Mar 8, 2023

MCO-116: Ensures that SSH keys are in the right place on RHCOS 9 #3534

Closed

chore: use new rhel-coreos OS image tags openshift/release#37087

Closed

jkyros mentioned this pull request Mar 9, 2023

Make OKD/SCOS Dockerfile regexes match again after rhel-coreos image name change #3597

Merged

cgwalters mentioned this pull request Mar 9, 2023

infra-periodics: Also don't prune rhel-coreos openshift/release#37117

Merged

cgwalters added a commit to cgwalters/release that referenced this pull request Mar 9, 2023

infra-periodics: Also don't prune rhel-coreos

6bc8ecf

xref openshift@1ad53a7 and the rename in openshift/machine-config-operator#3596

openshift-merge-robot pushed a commit to openshift/release that referenced this pull request Mar 9, 2023

infra-periodics: Also don't prune rhel-coreos (#37117)

681444d

xref 1ad53a7 and the rename in openshift/machine-config-operator#3596

openshift-merge-robot pushed a commit to openshift/release that referenced this pull request Mar 9, 2023

openshift/kubernetes: Use rhel-coreos (#37083)

139b7d2

Since openshift/machine-config-operator#3596 merged.

openshift-merge-robot pushed a commit to openshift/release that referenced this pull request Mar 9, 2023

mco: Update to use rhel-coreos in master+4.14 (#37082)

57c55f0

Since openshift/machine-config-operator#3596 merged.

openshift-cherrypick-robot mentioned this pull request Mar 10, 2023

[release-4.13] Switch to rhel-coreos (9) #3603

Closed

cgwalters mentioned this pull request Apr 11, 2023

layering: Use rhel-coreos now openshift/openshift-docs#58486

Closed

mburke5678 mentioned this pull request Apr 11, 2023

GH58486: layering: Use rhel-coreos now openshift/openshift-docs#58561

Closed

1 task

sohankunkerkar mentioned this pull request Nov 19, 2024

cri-o: use rhel-coreos openshift/release#58994

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Switch to rhel-coreos (9) #3596

Switch to rhel-coreos (9) #3596

cgwalters commented Mar 8, 2023 •

edited

Loading

sdodson commented Mar 8, 2023

sdodson commented Mar 8, 2023

openshift-ci bot commented Mar 8, 2023

openshift-ci bot commented Mar 8, 2023

cgwalters commented Mar 8, 2023

cgwalters commented Mar 10, 2023

openshift-cherrypick-robot commented Mar 10, 2023

Switch to rhel-coreos (9) #3596

Switch to rhel-coreos (9) #3596

Conversation

cgwalters commented Mar 8, 2023 • edited Loading

sdodson commented Mar 8, 2023

sdodson commented Mar 8, 2023

openshift-ci bot commented Mar 8, 2023

openshift-ci bot commented Mar 8, 2023

cgwalters commented Mar 8, 2023

cgwalters commented Mar 10, 2023

openshift-cherrypick-robot commented Mar 10, 2023

cgwalters commented Mar 8, 2023 •

edited

Loading