backfill: rate limit progress updating #67523

Closed
pbardea opened this issue Jul 13, 2021 · 2 comments · Fixed by #68215
Labels
A-disaster-recovery, C-bug (Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior.), T-disaster-recovery

Comments

@pbardea
Contributor

pbardea commented Jul 13, 2021

Describe the problem

Recently we witnessed the following scenario: a create index job with a 1MB payload was being updated every 10s. This led to the range in the jobs table backpressuring writes and thus blocking them. See #62627. (A 100 KB/s write rate to the table fills up the range pretty quickly.)

The index backfiller should be less aggressive with its job progress updates. Rather than just updating at a fixed rate (today, one update every 10s), it should backpressure based on how fast it is writing to the jobs table. This is likely exacerbated by the fact that the index backfiller maintains a list of spans that it still needs to process: as completed work comes in from all of the nodes (100 in this case), it chops up the list of todo spans, so the progress payload can grow quite large.

Rather than writing once every 10s regardless of how large the payload is, we should at least throttle updates to the jobs record to some target write throughput. We may also want to consider capping the size of todoSpans, at the cost of losing some progress, to prevent these payloads from growing too large.
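
To make the throughput idea concrete, here is a minimal Go sketch, assuming a hypothetical progressThrottle helper (the type, fields, and numbers are illustrative, not the actual backfiller code): each checkpoint is gated on a bytes-per-second budget, so a 1MB payload naturally spaces its writes out well beyond 10s.

// Hypothetical sketch only (not CockroachDB's implementation): gate job
// progress checkpoints on a bytes-per-second budget rather than a fixed timer.
package main

import (
	"fmt"
	"time"
)

// progressThrottle spaces out checkpoint writes so that the average write
// rate to the jobs table stays under maxBytesPerSec.
type progressThrottle struct {
	maxBytesPerSec float64       // budget for jobs-table writes, e.g. 10 KiB/s
	minInterval    time.Duration // never checkpoint more often than this
	lastWrite      time.Time
}

// canWrite reports whether a checkpoint payload of size bytes may be written
// now without exceeding the throughput budget.
func (t *progressThrottle) canWrite(now time.Time, size int) bool {
	elapsed := now.Sub(t.lastWrite)
	if elapsed < t.minInterval {
		return false
	}
	// A payload of this size requires size/maxBytesPerSec seconds of spacing.
	required := time.Duration(float64(size) / t.maxBytesPerSec * float64(time.Second))
	return elapsed >= required
}

func main() {
	t := &progressThrottle{maxBytesPerSec: 10 << 10, minInterval: 10 * time.Second}
	payload := 1 << 20 // a 1 MB progress payload, as in the issue

	// First write is allowed; a 1 MB payload then implies roughly 100s of
	// spacing before the next one (1 MiB / 10 KiB/s ≈ 102s), not a 10s cadence.
	fmt.Println(t.canWrite(time.Now(), payload)) // true
	t.lastWrite = time.Now()
	fmt.Println(t.canWrite(time.Now().Add(10*time.Second), payload)) // false
}

Capping the size of todoSpans would be complementary to this: the cap bounds the payload itself, while the throttle bounds how often it is rewritten.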

gz#9008

Epic CRDB-8816

gz#9032

gz#8907

Jira issue: CRDB-8588

@pbardea pbardea added the C-bug and A-disaster-recovery labels Jul 13, 2021
@pbardea pbardea changed the title from "backfill: rate limit" to "backfill: rate limit progress updating" Jul 13, 2021
@pbardea pbardea self-assigned this Jul 22, 2021
@dankinder

Hey @pbardea, would this issue have a bigger effect when creating a partial index?

For background, I originally filed #67487 where I hit "split failed while applying backpressure" on the jobs table during an index build, and @ajwerner pointed me to this ticket and gave me a workaround.

I'm building a simple partial index, along the lines of create index on mytable (the_column) where the_column is not null; in my data, the_column is NULL for the majority of rows.

Here is what I'm setting currently to try to work around it:

-- Note: @ajwerner recommended 300 for this one, but that failed, so I lowered it even further. This should be safe since I am not using incremental backups.
ALTER TABLE system.jobs CONFIGURE ZONE USING gc.ttlseconds = 30;

ALTER TABLE system.jobs CONFIGURE ZONE USING range_max_bytes = 2<<30;
SET CLUSTER SETTING kv.range.backpressure_range_size_multiplier = 8;

This got it much further (failing at around 70% instead of 5%), but the index build still failed with an error like this: job-update: split failed while applying backpressure to Put [/Table/15/1/677975849697181697/1/1,/Min), [txn: 3dbf0eeb] on range r812869:/Table/15/1/677975849{697181697-721102337} [(n4,s23):1, (n41,s245):2, (n37,s217):3, (n23,s137):4, (n44,s261):5, next=6, gen=384]: operation "split queue process replica 812869" timed out after 1m0s: could not find valid split key

What's odd is that, right around the time the index build stalls, I start getting timeouts trying to view the jobs table in the console. Presumably it's getting tons of write activity and has a lot of tombstones right then.

So I was theorizing that, because it's a partial index, maybe it is hitting big swaths of rows that don't meet the condition (rows where the_column is null), and progress advances so fast through them that it overwhelms the jobs table. Anyway, just a guess.

If you know of any other workaround to make this index build succeed, I'd love to hear it. Thanks.

craig bot pushed a commit that referenced this issue Jul 30, 2021
68215: backfill: reduce checkpoint interval r=adityamaru a=pbardea

Informs #67523.

This commit introduces a cluster setting and reduces the default
checkpoint progress interval. Backfill progress is checkpointed by
writing the set of spans that are left to do. However, this set of spans
can get quite large, so each update to the job record can quickly fill
a range on backfills of large tables.

This change is a start toward improving the situation by introducing a
knob that can be dialed back for large backfills. Ideally, the schema
change would rate limit itself based on the size of the progress updates.

Release note (ops change): Introduce a cluster setting,
bulkio.index_backfill.checkpoint_interval, to control the rate at which
backfills checkpoint their progress. This is useful when checkpointing
needs to be dialed back on backfills of large tables.

Co-authored-by: Paul Bardea <[email protected]>
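
As a rough illustration of what the commit describes, the sketch below gates checkpoint writes on a configurable interval; in CockroachDB that interval comes from the bulkio.index_backfill.checkpoint_interval cluster setting, but the helper here (maybeCheckpoint) is a simplified stand-in rather than the actual schema-change code.

// Simplified stand-in for interval-gated checkpointing, in the spirit of the
// setting added in #68215; not the real backfill implementation.
package main

import (
	"fmt"
	"time"
)

// maybeCheckpoint runs write only if the configured interval has elapsed since
// the last checkpoint, and returns the (possibly updated) last-write time.
func maybeCheckpoint(last time.Time, interval time.Duration, write func() error) (time.Time, error) {
	now := time.Now()
	if now.Sub(last) < interval {
		return last, nil // too soon; skip this update
	}
	if err := write(); err != nil {
		return last, err
	}
	return now, nil
}

func main() {
	// Assume the operator dialed the interval up for a large backfill.
	interval := 2 * time.Minute
	last := time.Now().Add(-3 * time.Minute) // last checkpoint was 3 minutes ago

	if _, err := maybeCheckpoint(last, interval, func() error {
		fmt.Println("writing todo spans to the job record")
		return nil
	}); err != nil {
		fmt.Println("checkpoint failed:", err)
	}
}
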
@pbardea pbardea linked a pull request Jul 31, 2021 that will close this issue
@pbardea
Contributor Author

pbardea commented Aug 30, 2021

Hi @dankinder -- sorry for the delay! The workarounds that you provided are a good start. A cluster setting, bulkio.index_backfill.checkpoint_interval, was recently added in #68215; it can further reduce the frequency of checkpointing, which may help the backfills in this case. It has also been backported to the 21.1 release branch.
