Bug 1656934 - Scan pending pings directories after dealing with upload status #1205

brizental · 2020-09-10T10:32:21Z

The intermittent failure was caused by scanning the pending pings directories before running the on_upload_disabled function, which deletes all pings that are not deletion-request pings.

I fix that in this PR by moving the scan to after the on_upload_disabled function runs.

As I mentioned in the bug, I was able to reproduce the intermittent reliably on a specific branch of mine. With the fix, I no longer get the errors on that branch either.
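
A minimal sketch of the ordering issue, using hypothetical in-memory stand-ins rather than the actual glean-core types. `PendingDir`, `UploadQueue`, `init_scan_first`, `init_scan_last`, and the name-based filter are assumptions made for illustration; only the name `on_upload_disabled` comes from the PR.

```rust
use std::collections::BTreeSet;

/// Hypothetical stand-in for the pending pings directories on disk.
#[derive(Default)]
struct PendingDir {
    files: BTreeSet<String>,
}

/// Hypothetical stand-in for the upload manager's in-memory queue.
#[derive(Default)]
struct UploadQueue {
    enqueued: Vec<String>,
}

impl UploadQueue {
    /// Enqueue every ping file that exists at the moment of the scan.
    fn scan(&mut self, dir: &PendingDir) {
        self.enqueued.extend(dir.files.iter().cloned());
    }
}

/// Assumed effect of the upload-disabled cleanup:
/// keep only deletion-request pings (identified by name in this sketch).
fn on_upload_disabled(dir: &mut PendingDir) {
    dir.files.retain(|name| name.starts_with("deletion-request"));
}

/// Old order: scan first, clean up afterwards.
/// The queue can end up referencing files that no longer exist.
fn init_scan_first(dir: &mut PendingDir, upload_enabled: bool) -> UploadQueue {
    let mut queue = UploadQueue::default();
    queue.scan(dir);
    if !upload_enabled {
        on_upload_disabled(dir);
    }
    queue
}

/// Fixed order: clean up first, scan afterwards.
/// The queue only sees files that survived the cleanup.
fn init_scan_last(dir: &mut PendingDir, upload_enabled: bool) -> UploadQueue {
    if !upload_enabled {
        on_upload_disabled(dir);
    }
    let mut queue = UploadQueue::default();
    queue.scan(dir);
    queue
}
```

With upload disabled and a directory holding one regular ping and one deletion-request ping, `init_scan_first` enqueues both while `init_scan_last` enqueues only the deletion-request ping.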

Dexterp37 · 2020-09-10T17:13:47Z

glean-core/src/lib.rs

+        // If upload is disabled, we delete all pending pings files
+        // and we need to do that **before** scanning the pending pings folder
+        // to ensure we don't enqueue pings before their files are deleted.
+        let _scanning_thread = glean.upload_manager.scan_pending_pings_directories();


Eugh. So this was an actual bug and not a test error. We should add a changelog entry. Do we have any way to estimate how frequently we sent pings out?

I suspect it's not very frequent, but it's important to understand this better. We could also draft a PSA email if the impact turns out to be big.

Do we have any way to estimate how frequently we sent pings out?

Hm, I need to think about this. When this bug occurs, it is usually followed by an error on deleting the ping file: we send the request to the caller, it (usually) uploads successfully, and then it tries to delete an already deleted file, which logs an error. But that is all we have, from what I remember off the top of my head.

I'll think about it a bit further. If you have any ideas, let me know.
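
The failure mode described above can be reproduced with plain standard-library file operations. This is only a sketch: `stale_delete_error` and the file name are hypothetical, and the real upload step is replaced by a comment.

```rust
use std::fs;
use std::io;
use std::path::Path;

/// Replays the sequence inside `dir` and returns the error kind produced by
/// the final delete, or `None` if that delete unexpectedly succeeded.
fn stale_delete_error(dir: &Path) -> io::Result<Option<io::ErrorKind>> {
    // A pending ping file exists on disk (hypothetical name).
    let ping = dir.join("hypothetical-ping");
    fs::write(&ping, b"{}")?;

    // 1. The scan enqueues the path while the file is still there.
    let enqueued = ping.clone();

    // 2. The upload-disabled cleanup deletes the file.
    fs::remove_file(&ping)?;

    // 3. The enqueued request is handed to the caller and uploaded (not modeled).

    // 4. After the upload, deleting the same file again fails.
    Ok(fs::remove_file(&enqueued).err().map(|e| e.kind()))
}
```

The second `remove_file` call fails with `io::ErrorKind::NotFound`, which corresponds to the logged error on deleting the ping file.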

I have an idea to validate this: check if any pings are sent from a client_id after this client_id sent a deletion-request ping. Will come back to this thread with results when I have them.

https://sql.telemetry.mozilla.org/queries/74658/source
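
The validation idea can be sketched in Rust as follows. This is not the linked query: `PingRecord`, its fields, and `clients_sending_after_deletion` are hypothetical, and the sketch assumes submission timestamps are comparable across pings from the same client.

```rust
use std::collections::{BTreeSet, HashMap};

/// Hypothetical, simplified view of a received ping.
struct PingRecord<'a> {
    client_id: &'a str,
    ping_type: &'a str,
    /// Submission time as seconds since the Unix epoch (assumed field).
    submitted_at: u64,
}

/// Returns the client_ids that submitted any non-deletion-request ping
/// strictly after their earliest deletion-request ping.
fn clients_sending_after_deletion<'a>(pings: &[PingRecord<'a>]) -> BTreeSet<&'a str> {
    // Earliest deletion-request timestamp per client_id.
    let mut first_deletion: HashMap<&str, u64> = HashMap::new();
    for p in pings.iter().filter(|p| p.ping_type == "deletion-request") {
        first_deletion
            .entry(p.client_id)
            .and_modify(|t| *t = (*t).min(p.submitted_at))
            .or_insert(p.submitted_at);
    }

    // Any other ping submitted later by the same client_id is suspicious.
    pings
        .iter()
        .filter(|p| p.ping_type != "deletion-request")
        .filter(|p| {
            first_deletion
                .get(p.client_id)
                .map_or(false, |&t| p.submitted_at > t)
        })
        .map(|p| p.client_id)
        .collect()
}
```

A non-empty result would suggest that pings were still uploaded after upload had been disabled for those clients.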

BONUS: add CHANGEOG.md entry

brizental · 2020-09-11T12:19:00Z

glean-core/src/lib.rs

        upload_manager.set_rate_limiter(
            /* seconds per interval */ 60, /* max tasks per interval */ 15,
        );

+        // We only scan the pending ping sdirectories when calling this from a subprocess,


Having this makes me question the name of this function. If you have better ideas I'd be up for changing it.

travis79 · 2020-09-11T12:56:17Z

glean-core/src/lib.rs

        upload_manager.set_rate_limiter(
            /* seconds per interval */ 60, /* max tasks per interval */ 15,
        );

+        // We only scan the pending ping sdirectories when calling this from a subprocess,


Oops: pending ping sdirectories

Scan pending pings directories after dealing with upload status

65d1622

auto-assign bot requested a review from Dexterp37 September 10, 2020 10:32

Dexterp37 reviewed Sep 10, 2020

Also scan pending pings folder in subprocess

65cc869

BONUS: add CHANGEOG.md entry

brizental requested a review from Dexterp37 September 11, 2020 12:12

brizental commented Sep 11, 2020

Dexterp37 approved these changes Sep 11, 2020

brizental merged commit 523a358 into mozilla:main Sep 11, 2020

brizental deleted the 1656934-non-deletion-intermittents branch September 11, 2020 12:33

travis79 reviewed Sep 11, 2020
