-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
System tests: add test tags #19302
System tests: add test tags #19302
Conversation
This is a preliminary, probably embarrassingly incomplete list of tests. Before we start in on the optimum subset, let's follow along in #19299 to see if this approach is even possible. Oh, and I forgot to link to BATS tag documentation. |
Discussion on #19299 is positive; looks like this is doable on the OpenQA front. Okay, @containers/podman-maintainers, have at it. Please review my initial tags, then (the hard part) search for what I've missed. |
I can actually PoC this before merging, even. I'm doing a scratch build (on F38 because build on Rawhide appears to be failing); if that succeeds I can just monkeypatch the test on openQA staging and run it against the scratch build and you can see how it will look. |
uh, not yet maybe? We still have a bit of a problem with mismatched |
sure, i don't care about tests failing on staging. they do that a lot. :P |
Watch it live! https://openqa.stg.fedoraproject.org/tests/3003796#live (and probably get to laugh at me when it turns out I messed up the monkey edit somehow) |
OK, so it ran and failed, so you can kinda see how it would look: it shows that the 'podman' test step is the one that failed, and the red outlined 'Failed' step shows you where it failed - if you click on that it tells you "# Test died: command 'bats --filter-tags openqa /usr/share/podman/test/system' failed at /usr/lib/os-autoinst/autotest.pm line 387." (openQA was designed around screenshot matching; originally all failures were screenshot match failures, so it kinda assumes every failure will be a thumbnail. when 'advanced' support for running stuff at consoles and checking the exit code was added, it was kinda shoehorned in with a 'fake thumbnail' like this). The screenshots around that show you what was on the screen at the time of the failure. If we make it redirect the output of the test command to a file and upload that file, you would find it on the Logs & Assets tab. I can hack that in quick and run it again. One wider thing to note, though: update testing is kind of a "full service" situation. I hate failed update tests. I never want there to be any failed update tests. Any time an update test fails, I will notice it and investigate it, and usually turn it into something more actionable - a bug report with the relevant logs attached, or whatever. In general, you (the upstream/package maintainer) aren't required to be looking at the openQA raw results, though you are of course welcome to. |
Fun story: before the somewhat-clever console support was added in years ago, we used to just literally print the exit code (with an identifier before it) and screenshot match on that, if it wasn't a 0, the test failed! |
Test failure looks pretty much like what I was expecting, although, is there any way to capture previous lines? All I see is the end (final) summary lines. This might be consistent with what you say about screenshot-capture origins. So, yes, I think capturing text logs would be nice. I'm feeling hopeful about this. Thanks for all your efforts. |
yeah, to get previous lines we'd definitely just want to upload the text output. I'm re-running the test with that wired in right now. |
It seems insane not to: of course we want to have early warning of integration problems. And it seems cruel to ask you to diagnose and report problems with our code/tests. We, too strive for zero errors, but we don't have that and never will as long as networks are involved. How do we (or, say, me) subscribe to openqa results for a given component? Also, my work week is long since over, but I'll try to check in tomorrow morning. |
Well, I'm the QA guy, it's kinda my job :P But of course, you certainly can. There isn't a real convenient 'off the shelf' way to subscribe to results, but the way you'd basically want to do it is via fedora messaging. Usually, when an openQA test finishes, three messages get published: a 'native' openQA message, a message in the kinda-standardized ci-messages format, and a message from ResultsDB when the result of the test is submitted there. For these trial-run tests only the 'native' openQA message happens, because there isn't a ci-messages standard for tests run on a scratch build, and we don't submit results for tests on scratch builds to ResultsDB (as they're assumed to usually be this kinda 'experiemental' test). Here's the 'openQA native' message from the trial run earlier: https://apps.stg.fedoraproject.org/datagrepper/v2/id?id=da3f88c8-57de-4278-9778-e31ec1d4d4fe&is_raw=true&size=extra-large . For a test run on an update (as is the usual case), the Here's how a ci-messages message and a resultsdb result message for a completed update test generally look. What I guess you'd want to do is just write a consumer for one of those types of message and have it ping you somehow if it sees a failure of the 'podman' test. Any of the messages should provide enough info for the ping to tell you what update the test failed on and generate a URL to the test (the messages have the test ID in them, the URL format is just https://openqa[.stg].fedoraproject.org/tests/(testid) ). We do also run the test on composes, which produces similar messages but with somewhat different information about the 'thing under test' (it'll be a compose ID, not a Bodhi update alias). I dunno if you've dealt with fedora messaging before, but if not, the docs are at https://fedora-messaging.readthedocs.io/en/stable/ . Writing a simple consumer is pretty easy, there's a sample at https://fedora-messaging.readthedocs.io/en/stable/user-guide/consuming.html (https://fedora-messaging.readthedocs.io/en/stable/user-guide/configuration.html#consumer-options explains what to put in the TOML config file). |
Here's a new run of the test where I redirected the output to a text file and uploaded it: https://openqa.stg.fedoraproject.org/tests/3003804 You can find the output on the Logs & Assets tab as |
I guess it might be nice if the new FMN let you just configure yourself to get notified of openQA failures, but I dunno if I have enough time to look at implementing that. |
Ok I fully I understand that but the problem is with a selective list we will always miss certain things. |
That's exactly what I'm requesting, it just got lost in the conversation. |
Yeah, as this will gate other updates, please focus on critical features that would constitute a broken experience if the tests fail. |
Since no response: I revisited my list of tagged tests, added a few more (including network). @containers/podman-maintainers, in the interests of getting something going in terms of , can we merge this, then fine-tune the essential-test list later? I don't see any possible way that this PR introduces any risk to existing CI or gating tests (but please review with that in mind). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: edsantiago, vrothberg The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
ef10d62
to
83a9f29
Compare
btw, I hate to bikeshed, but 'openqa' seems like a possibly over-targeted name. can anyone think of a name which means more 'tests that may be useful as integration tests for distros', or however we want to describe the real concept here? |
Naming is usually the hardest part, so this is the time to bikeshed.
|
|
BATS 1.8.0 introduces tags: metadata that can be applied to a single test or one entire file, then used for filtering in a test run. Issue containers#19299 introduces the possibility of using OpenQA for podman reverse dependency testing: continuous CI on all packages that can affect podman, so we don't go two months with no bodhi builds then get caught by surprise when systemd or kernel or crun change in ways that break us. This PR introduces one bats tag, "distro-integration". The intention is for OpenQA (or other) tests to install the podman-tests package and run: bats --filter-tags distro-integration /usr/share/podman/test/system Goal is to keep the test list short and sweet: we do not need to test command-line option parsing. We *DO* need to test interactions with systemd, kernel, nethack, and other critical components. Signed-off-by: Ed Santiago <[email protected]>
Going once, going twice, |
cool! once this makes it to official downstream (Fedora) builds, I can put it into openQA for reals. it would be best if this can land in F37, F38 and Rawhide so I can run the additional test unconditionally; if it can only go in Rawhide, I can just only run the test on Rawhide updates. |
Until the 4.7 release (probably somewhere in September), it will only land in Rawhide. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@containers/podman-maintainers PTAL
/lgtm |
cool. poke me when it lands in Rawhide and I will add the openQA part to run only on Rawhide updates for now. |
[ Clean cherry-pick of containers#19302. This is a low-risk change with potentially very high ROI: the opportunity to catch interaction problems with updates in other system components. ] BATS 1.8.0 introduces tags: metadata that can be applied to a single test or one entire file, then used for filtering in a test run. Issue containers#19299 introduces the possibility of using OpenQA for podman reverse dependency testing: continuous CI on all packages that can affect podman, so we don't go two months with no bodhi builds then get caught by surprise when systemd or kernel or crun change in ways that break us. This PR introduces one bats tag, "distro-integration". The intention is for OpenQA (or other) tests to install the podman-tests package and run: bats --filter-tags distro-integration /usr/share/podman/test/system Goal is to keep the test list short and sweet: we do not need to test command-line option parsing. We *DO* need to test interactions with systemd, kernel, nethack, and other critical components. Signed-off-by: Ed Santiago <[email protected]>
It looks like this landed in 4.6.1 which is landing in Fedora across all branches, so I'll look at extending the openQA tests in production. thanks! |
hmm, actually, 4.6.1 is failing the Fedora CI checks ATM, so maybe I'll wait for that to be sorted. |
OK, this is now done in openQA. |
BATS 1.8.0 introduces tags: metadata that can be applied to
a single test or one entire file, then used for filtering
in a test run.
Issue #19299 introduces the possibility of using OpenQA
for podman reverse dependency testing: continuous CI on
all packages that can affect podman, so we don't go two
months with no bodhi builds then get caught by surprise
when systemd or kernel or crun change in ways that break us.
This PR introduces one bats tag, "distro-integration". The intention
is for OpenQA (or other) tests to install the podman-tests package
and run:
Goal is to keep the test list short and sweet: we do not
need to test command-line option parsing. We DO need to
test interactions with systemd, kernel, nethack, and other
critical components.
Signed-off-by: Ed Santiago [email protected]