preparations for e2e tests on baremetal SNP #730

Freax13 · 2024-07-16T09:33:55Z

This PR implements all the necessary changes to run CI on our baremetal machine in the office.

Unfortunately the launch measurement is vCPU type dependent (it doesn't have to be that way, but QEMU sets up different vCPU types differently, so thanks, QEMU?!). Assume Genoa for now.

This will allow us to test on platforms other than aks-clh-snp.

The generic default manifest (not the default for AKS) isn't valid because it's missing TCB values. We want to emit invalid manifests and it's up to the user to fill in the missing values. Instead of failing, we now tell the user to fix the reference values for the selected platform.

We use this kernel module with `dm-mod.create="dm-verit...` to protect the image file.

The calculation of the launch measurement has been adjusted accordingly.

The previous rc0 had a bug somewhere that influenced the launch measurement. This bug has been fixed in rc1.

msanft

Mostly LGTM, just some minor things.

msanft · 2024-08-14T09:01:32Z

.github/workflows/e2e_openssl_baremetal.yml

+jobs:
+  test:
+    runs-on:
+      labels: snp


I think this is a GHA shortcoming, but it'd be very nice if we could make this test work on both SNP and TDX, without another duplication. But afaict, you cannot have dynamic values (e.g. an input) in runs-on. Not saying this PR should or can do anything about that, but just keeping it here as a note.

If I understand this section in the docs correctly, this might work.

I think this would enable us to create tests that run on all platforms unconditionally, but still not one test that runs on one, selectable platform

You can also run steps conditionally.

Yeah, but this won't be a step. We could execute everything but the actual step on GH-hosted runners and then transfer files over, but I fear that this is going to have a higher total cost in the end.

cli/cmd/generate.go

e2e/internal/kubeclient/deploy.go

e2e/openssl/openssl_test.go

nodeinstaller/internal/constants/constants.go

packages/by-name/kata/snp-launch-digest/package.nix

When the node-installer restarts K3s, the watch call fails. Watch has a retry loop internally, but it only retries starting the request, once it has established a request and that request dies spuriously, watch, doesn't reconnect.

Freax13 added the no changelog PRs not listed in the release notes label Jul 16, 2024

Freax13 force-pushed the tom/snp-baremetal-ci branch 27 times, most recently from f98a8f9 to f740fbd Compare July 23, 2024 05:41

Freax13 changed the title ~~CI: run e2e test on baremetal SNP~~ preparations for e2e test son baremetal SNP Jul 23, 2024

Freax13 marked this pull request as ready for review July 23, 2024 05:48

Freax13 force-pushed the tom/snp-baremetal-ci branch 4 times, most recently from 2e4cce5 to a4546ef Compare August 13, 2024 14:27

Freax13 marked this pull request as ready for review August 13, 2024 14:48

Freax13 requested a review from katexochen August 14, 2024 06:00

Freax13 force-pushed the tom/snp-baremetal-ci branch from a4546ef to afa1b22 Compare August 14, 2024 08:29

Freax13 added 11 commits August 14, 2024 10:33

contrast: properly calculate launch digest

0add2e9

Unfortunately the launch measurement is vCPU type dependent (it doesn't have to be that way, but QEMU sets up different vCPU types differently, so thanks, QEMU?!). Assume Genoa for now.

just: support QEMU-SNP in node-installer target

b21786f

e2e: pass platform as CLI parameter

8dc0ae9

This will allow us to test on platforms other than aks-clh-snp.

e2e: add build tag to constrasttest

80b9a74

e2e/openssl: wait for coordinator after restarting

69b35a6

kata.kata-kernel-uvm: enable dm-init

4bfe50a

We use this kernel module with `dm-mod.create="dm-verit...` to protect the image file.

node-installer: add kernel_params for setting up dm-verity

41774fb

The calculation of the launch measurement has been adjusted accordingly.

CI: add workflow for openssl test on baremetal SNP

b6c64e3

qemu-static: bump to rc1

1fadfcd

The previous rc0 had a bug somewhere that influenced the launch measurement. This bug has been fixed in rc1.

justfile: pass platform explicitly to more targets

3a5bc19

Freax13 force-pushed the tom/snp-baremetal-ci branch from afa1b22 to 00dc2c6 Compare August 14, 2024 08:33

msanft reviewed Aug 14, 2024

View reviewed changes

Freax13 added 4 commits August 14, 2024 13:35

e2e: add retry loop for WaitFor

717e272

When the node-installer restarts K3s, the watch call fails. Watch has a retry loop internally, but it only retries starting the request, once it has established a request and that request dies spuriously, watch, doesn't reconnect.

e2e/openssl: wait a little longer after coordinator restart

e036d6e

kubeclient: log correct error

ed1c4f8

e2e/openssl: fill in missing manifest values for baremetal SNP

825b761

Freax13 force-pushed the tom/snp-baremetal-ci branch from 00dc2c6 to 825b761 Compare August 14, 2024 11:35

Freax13 requested a review from msanft August 14, 2024 11:35

msanft approved these changes Aug 14, 2024

View reviewed changes

katexochen approved these changes Aug 14, 2024

View reviewed changes

Freax13 merged commit f6e54af into main Aug 14, 2024
12 checks passed

Freax13 deleted the tom/snp-baremetal-ci branch August 14, 2024 12:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

preparations for e2e tests on baremetal SNP #730

preparations for e2e tests on baremetal SNP #730

Freax13 commented Jul 16, 2024 •

edited

Loading

msanft left a comment

msanft Aug 14, 2024

Freax13 Aug 14, 2024

msanft Aug 14, 2024

Freax13 Aug 14, 2024

msanft Aug 14, 2024

preparations for e2e tests on baremetal SNP #730

preparations for e2e tests on baremetal SNP #730

Conversation

Freax13 commented Jul 16, 2024 • edited Loading

msanft left a comment

Choose a reason for hiding this comment

msanft Aug 14, 2024

Choose a reason for hiding this comment

Freax13 Aug 14, 2024

Choose a reason for hiding this comment

msanft Aug 14, 2024

Choose a reason for hiding this comment

Freax13 Aug 14, 2024

Choose a reason for hiding this comment

msanft Aug 14, 2024

Choose a reason for hiding this comment

Freax13 commented Jul 16, 2024 •

edited

Loading