
[cookie-store] Improve cleanup logic #14364

Closed

Conversation

jugglinmike
Contributor

While doing research for a WPT infrastructure project, I found these tests behaving erratically. In a handful of cases, the cleanup invocation was not properly awaited. Since we've been meaning to take advantage of the new "async cleanup" functionality, I took the opportunity to use that API. I'm submitting this in two commits to make the bug fix easier to identify.
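For concreteness, here is a minimal sketch of the pattern in question (illustrative only, not the actual diff; the cookie name and assertion are made up). Returning the promise from the cleanup function lets testharness.js's asynchronous cleanup support await it before the test is considered complete:

    promise_test(async t => {
      // Register cleanup first so it runs even if the test body throws.
      // Returning the promise is the fix: testharness.js awaits it
      // before marking the test complete.
      t.add_cleanup(() => cookieStore.delete('example-cookie'));

      await cookieStore.set('example-cookie', 'example-value');
      const cookie = await cookieStore.get('example-cookie');
      assert_equals(cookie.value, 'example-value');
    }, 'hypothetical cookie-store test with awaited async cleanup');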

To validate this change set, I ran the tests in Chromium (70.0.3538.77) on master (3960fff) and on this branch via the command

 $ xvfb-run --auto-servernum ./wpt run --no-pause --log-mach - --log-mach-level DEBUG --channel dev --no-manifest chrome cookie-store/

The results were identical:

On master:

web-platform-test
~~~~~~~~~~~~~~~~~
Ran 338 checks (35 tests, 303 subtests)
Expected results: 328
Unexpected results: 10
  subtest: 10 (10 fail)

With patch applied:

web-platform-test
~~~~~~~~~~~~~~~~~
Ran 338 checks (35 tests, 303 subtests)
Expected results: 328
Unexpected results: 10
  subtest: 10 (10 fail)

Commit 1:
Ensure that the test is not considered "complete" until after the
cleanup logic has been executed.

Commit 2:
Use the recently-implemented asynchronous cleanup functionality of
testharness.js to ensure that cleanup logic executes regardless of the
passing/failing status of the test.

Rejected promises during cleanup will not prompt test failure, but they
will cause the harness to report an error.
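As a rough illustration of that last point (a hypothetical test, not code from this change set): a cleanup promise that rejects does not flip the test's own result, but testharness.js reports a harness-level error for the file:

    promise_test(async t => {
      // Hypothetical failing cleanup: the returned promise rejects.
      t.add_cleanup(() => Promise.reject(new Error('cleanup failed')));
      assert_true(true);
    }, 'this test passes; the rejected cleanup becomes a harness error');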
@jugglinmike
Contributor Author

It looks like these tests are still unstable even with my patch applied. I tried to reproduce the instability locally using the following command:

./wpt run --no-pause --log-mach - --channel dev --verify chrome cookie-store

...but that didn't catch anything on my local system (Chromium 70.0.3538.77) or in the Docker image used by Taskcluster (Google Chrome 72.0.3622.0 dev). Even the exact command reported in the Taskcluster logs (too lengthy to post here) didn't demonstrate the instability. Those CRASHes are suspicious, though. I don't think it's a memory limitation, since memory utilization is recorded in the logs:

2018-12-05 01:37:06,210 - tc-run - INFO - Identifying tests affected in range 'HEAD^'...
mem avail: 29462 MiB (96 %), swap free: 0 MiB (0 %)
2018-12-05 01:37:34,339 - tc-run - INFO - Identified 17 affected tests

[...]

PID 1140 | Only local connections are allowed.
mem avail: 27550 MiB (89 %), swap free: 0 MiB (0 %)
PID 1452 | Starting ChromeDriver 2.44.609551 (5d576e9a44fe4c5b6a07e568f1ebc753f1214634) on port 4444

[...]

PID 10044 | Only local connections are allowed.
mem avail: 28787 MiB (93 %), swap free: 0 MiB (0 %)
PID 10320 | Starting ChromeDriver 2.44.609551 (5d576e9a44fe4c5b6a07e568f1ebc753f1214634) on port 4444

It's strange that the container has no swap memory, but I'm not sure that's a problem. It doesn't seem as though we'd need to do any paging with 28 gigabytes of free memory. Unfortunately, I can't replicate that condition via Docker running on my system due to kernel limitations.

@Hexcles do you have thoughts on any of this?

@Hexcles
Member

Hexcles commented Dec 5, 2018

Right, our Docker containers don't have swap, which shouldn't have any negative effects. We should never use anything even close to the total available memory (guaranteed to be at least 8 GB).

@pwnall
Contributor

LGTM.

Thank you very much for the cleanup! I don't know what's up with the bots, but this looks like a step forward for the tests.

@jugglinmike
Contributor Author

This was addressed in Chromium directly and merged to WPT via gh-17949. I've verified that the patch addressed all of the instances of asynchronous cleanup that we identified here. It's good that the change eventually made it through stability checks!
