Improve handling of harness-level errors in wptrunner #10444

jgraham · 2018-04-12T13:13:11Z

Split the internal handling of errors during a test into two cases:
INTERNAL-ERROR, which is produced if there's an exception in the harness,
and ERROR, which is for exceptions reported by the test. Both are still
reported as ERROR to the user, but in the INTERNAL-ERROR case the
runner is always restarted, just like EXTERNAL-TIMEOUT, since we
assume that the internal state is compromised somehow.
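
A minimal sketch of the split described above, for readers who want to see the control flow. The names `classify`, `needs_restart`, and `ALWAYS_RESTART` are assumptions made for illustration and are not taken from wptrunner; only the status strings come from the description.

```python
# Illustrative sketch only; these names are assumptions, not wptrunner's API.

# Internal statuses after which the runner's state is assumed to be compromised.
ALWAYS_RESTART = frozenset({"INTERNAL-ERROR", "EXTERNAL-TIMEOUT"})


def classify(run_test):
    """Return the internal status for one test.

    `run_test` is a hypothetical callable that returns the status reported by
    the test itself (for example "OK" or "ERROR") and raises if the harness
    code fails.
    """
    try:
        return run_test()
    except Exception:
        # The failure came from the harness, not from the test.
        return "INTERNAL-ERROR"


def needs_restart(internal_status):
    """Decide whether the runner must be restarted before the next test."""
    return internal_status in ALWAYS_RESTART
```

Whether other statuses also lead to a restart in the real runner is outside the scope of this sketch; it only covers the two that are always restarted.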

jgraham · 2018-04-12T13:14:24Z

@jugglinmike I think you might be interested in this.

foolip

LGTM % docs, but I'd also like @jugglinmike to have a look.

Does any documentation of the different statuses exist? Having a description of "INTERNAL-ERROR" under docs/ (added in this PR) would make it easier to review, and benefits everyone trying to understand it in the future.

foolip · 2018-04-12T15:00:04Z

Also, do we need to run this through Gecko's CI to have confidence that this works?

jgraham · 2018-04-12T15:46:36Z

We don't currently have much documentation on the wptrunner internals, so there's nowhere obvious to add this. It isn't ever exposed to the user (it gets mapped down to ERROR before output because mozlog only supports a limited set of statuses, in order for consistency between users).

jgraham · 2018-04-12T15:48:04Z

https://treeherder.mozilla.org/#/jobs?repo=try&revision=890c9d39ed8c4239340954a1aed87fa3c02ba38e&selectedJob=173288584 is the try run started by our bot, although that doesn't include all tests by default. I can do another try run with everything if that looks OK.

jugglinmike · 2018-04-12T16:35:33Z

tools/wptrunner/wptrunner/testrunner.py

@@ -563,10 +563,12 @@ def test_ended(self, test, results):
        # TODO: consider changing result if there is a crash dump file

        # Write the result of the test harness
+        result_subns = {"INTERNAL-ERROR": "ERROR",
+                        "EXTERNAL-TIMEOUT": "TIMEOUT"}


The word "result" is being applied to the object that contains this status string. What do you think of status_subns or status_map?
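
To make the naming point concrete, here is a sketch in which the mapping is keyed by status strings and applied to the `status` attribute of a result object. `Result` and `reported_status` are hypothetical names used only for illustration; the two mapping entries are the ones in the diff above.

```python
from collections import namedtuple

# Hypothetical stand-in for the harness result object.
Result = namedtuple("Result", ["status", "message"])

# Keyed by status strings, hence "status_map" rather than "result_subns".
status_map = {"INTERNAL-ERROR": "ERROR",
              "EXTERNAL-TIMEOUT": "TIMEOUT"}


def reported_status(result):
    """Return the status string to log for a result object."""
    return status_map.get(result.status, result.status)
```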

jugglinmike · 2018-04-12T16:38:17Z

@foolip's request for documentation seems reasonable to me, but I also understand @jgraham's reluctance to bloat usage instructions with implementation details. A few ideas:

James writes, "It isn't ever exposed to the user (it gets mapped down to ERROR before output because mozlog only supports a limited set of statuses, in order for consistency between users)." If this detail were included as an in-line comment, we could address Philip's concern about "everyone trying to understand it in the future" without adding to the information we present to the average user.

This patch overloads the string "ERROR" a bit. Depending on the context, it could mean "test error, not harness error", but it could also mean "test error or harness error". This could be confusing for others. Since the latter definition is unrestricted, it more closely matches the string. Would it be too onerous to introduce a new value dedicated to the "test error, not harness error" status? (This would also help future contributors get their bearings since the distinction James is making would be explicitly reflected in the code.)

Thanks for taking the lead on this, James!

jgraham · 2018-04-12T17:07:54Z

Adding a new error status to mozlog is hard, because it involves updating it, making a release, updating all formatters to ensure that they don't choke on the new value, and hoping you didn't miss any external ones.
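
A rough sketch of why a new status value is risky for consumers. The formatter below is hypothetical and is not mozlog code, and its status set is illustrative; it only shows how code written against a fixed set of status strings can fail when it meets a value it has never seen.

```python
# Hypothetical formatter written against a fixed set of statuses.
# This is NOT mozlog code; the status set here is illustrative.
SYMBOLS = {"OK": ".", "ERROR": "E", "TIMEOUT": "T", "CRASH": "C"}


def format_status(status):
    """Render one status; raises KeyError on a value it does not know."""
    return SYMBOLS[status]
```

Calling `format_status("INTERNAL-ERROR")` raises `KeyError`, which is the kind of breakage every formatter would need to be checked for.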

It's about now you realise why it is that people like languages with enums and irrefutable pattern matching… mozlog would be great in Rust ;)

I'm happy to add a comment about what's going on in the source though.

jugglinmike · 2018-04-12T17:25:06Z

> Adding a new error status to mozlog is hard, because it involves updating it,
> making a release, updating all formatters to ensure that they don't choke on
> the new value, and hoping you didn't miss any external ones.

I expected the path to updating mozlog would be a difficult one, but I also
don't think it's necessarily appropriate. The distinction we're discussing
doesn't necessarily make much sense in a general-purpose logging utility. What
I meant to suggest was the introduction of a new string value for internal use,
one to mirror "INTERNAL-ERROR" (maybe "TEST-ERROR") that would be likewise
translated to "ERROR" for integration with mozlog. My thinking was when reading
the source (and debugging runtime values), seeing that value would be more
helpful because you wouldn't have to wonder, "is this describing a test error?
Or has the mozlog translation already occurred, meaning it might be describing
an internal error?"
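
A sketch of the suggestion above, assuming a dedicated internal value for test-reported errors. `InternalStatus`, `TO_LOGGED`, and `to_logged` are hypothetical names, and "TEST-ERROR" is only the name floated in this comment; none of this is existing wptrunner code.

```python
from enum import Enum


class InternalStatus(Enum):
    """Hypothetical internal-only statuses; "TEST-ERROR" is the proposed name."""
    TEST_ERROR = "TEST-ERROR"
    INTERNAL_ERROR = "INTERNAL-ERROR"
    EXTERNAL_TIMEOUT = "EXTERNAL-TIMEOUT"


# Translation applied once, at the boundary with the logger.
TO_LOGGED = {InternalStatus.TEST_ERROR: "ERROR",
             InternalStatus.INTERNAL_ERROR: "ERROR",
             InternalStatus.EXTERNAL_TIMEOUT: "TIMEOUT"}


def to_logged(status):
    """Translate an internal status; already-logged strings pass through."""
    if isinstance(status, InternalStatus):
        return TO_LOGGED[status]
    return status
```

Under this scheme a bare "ERROR" string can only appear after translation, so a reader who sees it while debugging knows the internal distinction has already been dropped.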

> It's about now you realise why it is that people like languages with enums
> and irrefutable pattern matching… mozlog would be great in Rust ;)

Still looking for an excuse to learn that language...

jgraham · 2018-04-12T17:43:12Z

Oh, I see. The "ERROR" string comes pretty much directly from testharness.js, so I'd prefer not to change that internally just to change it back again later.

jugglinmike · 2018-04-12T20:02:56Z

This looks good to me. I don't know how to interpret the Treeherder UI, though. Does it make sense to move forward with that complete test run, now?

gsnedders · 2018-04-17T01:13:11Z

> We don't currently have much documentation on the wptrunner internals, so there's nowhere obvious to add this.

tools/wptrunner/docs?

jgraham requested a review from gsnedders April 12, 2018 13:13

wpt-pr-bot added the infra label Apr 12, 2018

jgraham force-pushed the wptrunner_internal_error branch from 0022300 to 2d1b290 Compare April 12, 2018 14:31

jgraham mentioned this pull request Apr 12, 2018

wptrunner: Always restart on test error #10186

Closed

foolip approved these changes Apr 12, 2018

jgraham force-pushed the wptrunner_internal_error branch from 2d1b290 to 2bd39e3 Compare April 12, 2018 16:00

jugglinmike reviewed Apr 12, 2018

jgraham force-pushed the wptrunner_internal_error branch from 2bd39e3 to 712be3d Compare April 12, 2018 17:17

jgraham merged commit 6736e3f into master Apr 15, 2018

gsnedders deleted the wptrunner_internal_error branch April 17, 2018 01:12

This was referenced Apr 17, 2018

Reject incomplete result sets web-platform-tests/results-collection#466

Open

Unexpected closure of Sauce Labs Connect proxy web-platform-tests/results-collection#544

Open
