Some Focus Area percentages on dashboard don't match the results percentage #4107

cookiecrook · 2024-11-06T08:18:31Z

Some Focus Area percentages on dashboard don't match the results percentage

For example, the dashboard lists the Accessibility Focus Area at 97.8% and 99.7% respectively for Firefox and Safari…

screen shot of dashboard listing scores described in the issue prose: 97.8% and 99.7%

But if you click through to the Focus Area results, the scores are 98% and 99.9%. (Bottom row right.)

screen shot of results listing scores described in the issue prose: 98 and 99.9%.

I haven't dug into which is correct, but it seems clear one of them is wrong, and the dashboard percentages should be consistent throughout.

KyleJu · 2024-11-06T18:43:06Z

Assign @DanielRyanSmith just so this issue has an owner. Feel free to discuss this in our meeting!

cookiecrook · 2024-12-03T02:35:57Z

Any movement on this @DanielRyanSmith? Thanks.

DanielRyanSmith · 2024-12-03T03:38:51Z

Edit: The below hypotheses of the problem were incorrect, but I'll leave this comment around if, for some reason, people are interested in the steps I took to investigate.

Sorry, I missed this until now - I'm not quite sure immediately what is happening here yet, since safari does not have any flaky tests that might cause this (that was my initial guess).

I see Safari with a 99.7% on the dashboard and a 99.9% on the results page.

If I had to guess again, it might be the fact that some tests/subtests under the label "interop-2024-accessibility" were added/removed at some point through the year, and the interop score aggregation script is using an incorrect number of tests to calculate the passing percentage.

Edit: Looking at older runs (back from March) onward, it does seem that the amount of subtests related to interop-2024-accessibility shrinks from 1133 down to 1095, then back up to 1112. I have to imagine that this is the problem with the dashboard's score, as it's probably using tests/subtests that have been removed from the category to aggregate score totals.

cookiecrook · 2024-12-03T23:31:57Z

Thanks for digging in... The total number has fluctuated a bit as invalid or disputed tests were removed or modified. There were quite a few that were erroneously added midyear too, and if any of those caused new failures in the engines we moved some of those to .tentative test files.

DanielRyanSmith · 2024-12-04T05:06:55Z

So I took a deeper look into this and it turns out that the interop dashboard score is definitely the more correct score, and the discrepancy here is due to how the rounding was handled on the test results page. I've made an extremely simple fix at #4142, which has an explanation of the problem.

cookiecrook · 2024-12-05T06:46:12Z

@DanielRyanSmith Original cases linked above still show a discrepancy. Albeit a smaller one. 99.7 to 99.8 instead of the original 99.7 to 99.9.

cookiecrook · 2024-12-05T06:47:14Z

I don’t have a way to reopen the issue. Would you prefer a new one?

DanielRyanSmith · 2024-12-05T17:08:36Z

Hmm, this is still the same issue, but my quick change did not match the approach for the interop score aggregation script, which does no rounding and is the intended result. The test results page is still rounding up, just a little less. I'll readjust things. Sorry about that!

The dashboard score is still correct here.

DanielRyanSmith · 2024-12-13T00:17:58Z

As a follow-up, this is now in production and the Accessibility scores on the results page look as expected. 🙂

Note that there is a possibility that score discrepancies might pop up at some point, and the likely causes have been documented in these issues:

Interop results page shows most recent aligned run results, which may not have been used to score the Interop Dashboard #4160
Search cache aggregation does not distinguish failed harness results from normal subtests #4159

However, the cause for the original situation described in this issue has been fixed. Thanks for the patience as I investigated here 😊

cookiecrook mentioned this issue Nov 6, 2024

Meeting: November 5, 2024, @ 9 AM PST web-platform-tests/interop-accessibility#146

Closed

KyleJu assigned DanielRyanSmith Nov 6, 2024

DanielRyanSmith added the interop Issues with the Interop dashboards label Dec 3, 2024

cookiecrook mentioned this issue Dec 3, 2024

Meeting: December 3, 2024, @ 9 AM PST web-platform-tests/interop-accessibility#154

Open

DanielRyanSmith mentioned this issue Dec 4, 2024

Update rounding logic for results page #4142

Merged

DanielRyanSmith closed this as completed in #4142 Dec 4, 2024

DanielRyanSmith reopened this Dec 5, 2024

DanielRyanSmith mentioned this issue Dec 7, 2024

Remove rounding on interop test results totals cells #4145

Merged

DanielRyanSmith closed this as completed in #4145 Dec 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Some Focus Area percentages on dashboard don't match the results percentage #4107

Some Focus Area percentages on dashboard don't match the results percentage #4107

cookiecrook commented Nov 6, 2024

KyleJu commented Nov 6, 2024

cookiecrook commented Dec 3, 2024

DanielRyanSmith commented Dec 3, 2024 •

edited

Loading

cookiecrook commented Dec 3, 2024 •

edited

Loading

DanielRyanSmith commented Dec 4, 2024

cookiecrook commented Dec 5, 2024

cookiecrook commented Dec 5, 2024

DanielRyanSmith commented Dec 5, 2024

DanielRyanSmith commented Dec 13, 2024

Some Focus Area percentages on dashboard don't match the results percentage #4107

Some Focus Area percentages on dashboard don't match the results percentage #4107

Comments

cookiecrook commented Nov 6, 2024

KyleJu commented Nov 6, 2024

cookiecrook commented Dec 3, 2024

DanielRyanSmith commented Dec 3, 2024 • edited Loading

Edit: The below hypotheses of the problem were incorrect, but I'll leave this comment around if, for some reason, people are interested in the steps I took to investigate.

cookiecrook commented Dec 3, 2024 • edited Loading

DanielRyanSmith commented Dec 4, 2024

cookiecrook commented Dec 5, 2024

cookiecrook commented Dec 5, 2024

DanielRyanSmith commented Dec 5, 2024

DanielRyanSmith commented Dec 13, 2024

DanielRyanSmith commented Dec 3, 2024 •

edited

Loading

cookiecrook commented Dec 3, 2024 •

edited

Loading