
sql: large latency spikes when creating a large storing index while running tpcc #45888

Closed
rohany opened this issue Mar 9, 2020 · 8 comments

Labels: C-bug (code not up to spec/doc; solution expected to change code/behavior), no-issue-activity, S-3-ux-surprise (issue leaves users wondering whether CRDB is behaving properly; likely to hurt reputation/adoption), T-disaster-recovery, X-stale

Comments

@rohany (Contributor) commented Mar 9, 2020

Investigation into #44501 and #44504 has uncovered that the core of the problem seems to be creating the index that stores all of the columns.

To reproduce:

roachprod create $CLUSTER -n 4 --clouds=aws  --aws-machine-type-ssd=c5d.4xlarge
roachprod stage $CLUSTER:1-3 cockroach
roachprod stage $CLUSTER:4 workload
roachprod start $CLUSTER:1-3
roachprod adminurl --open $CLUSTER:1
roachprod run $CLUSTER:1 -- "./cockroach workload fixtures import tpcc --warehouses=2500 --db=tpcc --checks=false"
roachprod run $CLUSTER:4 "./workload run tpcc --ramp=5m --warehouses=2500 --active-warehouses=2000 --split --scatter {pgurl:1-3}"

After the ramp period, run the following in another shell:

roachprod sql $CLUSTER:3
> use tpcc;
> create unique index on customer (c_w_id, c_d_id, c_id) storing (c_first, c_middle, c_last, c_street_1, c_street_2, c_city, c_state, c_zip, c_phone, c_since, c_credit, c_credit_lim, c_discount, c_balance, c_ytd_payment, c_payment_cnt, c_delivery_cnt, c_data);

After some time, large p99 latency spikes can be witnessed, sometimes going up to multiple seconds.
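Not part of the original report, but one way to watch the backfill's progress from another SQL shell is the jobs table; this is a sketch and the exact columns may vary by version:

> SELECT job_id, status, fraction_completed
    FROM [SHOW JOBS]
   WHERE job_type = 'SCHEMA CHANGE' AND status = 'running';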

Epic CRDB-8816

Jira issue: CRDB-5120

@rohany self-assigned this Mar 9, 2020
@awoods187 added the C-bug and S-3-ux-surprise labels Mar 9, 2020
@rohany (Contributor, Author) commented Mar 9, 2020

Investigation and discussion with @ajwerner seem to have led to some understanding of this. The SQL latency spikes observed correlate directly with spikes in transaction restarts. The intuition is the following: when we create a second primary key, we introduce a large amount of new artificial contention, because every txn that updates a row in the primary key also has to update the corresponding row in the index being built.
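For illustration (with hypothetical key values, not from the original report): while the new storing index is in DELETE_AND_WRITE_ONLY, a single-row payment-style update like the one below has to write both the primary index entry and the new secondary index entry for the row, roughly doubling its contention footprint.

-- Hypothetical single-row update; during the backfill it must also maintain
-- the row's entry in the index being built, since c_balance and
-- c_ytd_payment are stored columns of that index.
> UPDATE customer
     SET c_balance = c_balance - 10.0,
         c_ytd_payment = c_ytd_payment + 10.0
   WHERE c_w_id = 1 AND c_d_id = 1 AND c_id = 1;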

Due to this, it seems that running tpcc with 2k warehouses and a large storing index will overload a 3-node cluster. I see fewer latency spikes at 1k warehouses, and only one spike at 500 warehouses.

However, it is unclear what users should do if they have a workload running and need to change their primary key.

@ajwerner (Contributor) commented

This feels very much the same as #36925 (schema: latency spike with new index backfill on tpc-c 10k (with partitioning)).

Interestingly, this relates to some other discussions about the inner workings of schema changes that perform backfills into new indexes. Here are some thoughts that relate to several recent conversations. The thrust of this is (a) adding a new

Today we have two forms of backfill: the "index backfill" and the "column backfill". When we perform the column backfill, we do that work in normal transactions because we're writing to a live index. For the index backfill, we're writing to an index that hasn't been written to (sort of). Rather, we put the new index in DELETE_AND_WRITE_ONLY mode, where foreground writes go to the new index in parallel with the AddSSTable requests from the backfill. This is safe because the MVCC timestamps in the AddSSTables precede all foreground writes.

Unfortunately there's no access to this fanciness for the column backfill. The upshot of the column backfill is that it doesn't create a second index that foreground traffic needs to write to. This means that transactions which used to hit the 1PC optimization still do, and for transactions that are contended, the contention footprint is not doubled.
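For concreteness, a rough (assumed, not from this thread) mapping of statements to backfill type: adding a column with a default value goes through the column backfill inside the existing primary index, while creating a new index goes through the index backfill described above. (c_note here is a hypothetical column used only for illustration.)

> ALTER TABLE customer ADD COLUMN c_note STRING DEFAULT '';  -- column backfill
> CREATE INDEX ON customer (c_last);                         -- index backfill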

As we can see in this specific issue, that contention increase can be a big deal.

For the upcoming work on #9851 (changing data types), we've got what seem like two bad options:

  1. We could do it all with column backfills. That would require two passes, one to add the new column type and then another to remove the old one. Furthermore, column backfills are quite a bit less efficient.

  2. We could do it with an index backfill: create a new index with the new column type, then swap out the old one. This is bad because, in the meantime, clients will need to write to both, and for workloads that rely on the 1PC optimization, losing it can make or break the workload. (A rough sketch follows this list.)
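Very roughly, option 2 would look something like the sketch below (hypothetical statements and column names; the real mechanism would live inside the schema changer rather than literal SQL). The window in which writes must maintain both the old and new data is where the contention and lost 1PC show up.

-- Hypothetical sketch of option 2: change the type of customer.c_data by
-- backfilling a shadow column/index and then swapping it in.
> ALTER TABLE customer ADD COLUMN c_data_new STRING;
> -- An index backfill builds the replacement index that contains c_data_new;
> -- while that index is in DELETE_AND_WRITE_ONLY, every write to customer
> -- must update both the old and the new index.
> ALTER TABLE customer DROP COLUMN c_data;
> ALTER TABLE customer RENAME COLUMN c_data_new TO c_data;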

The concrete proposal here is to buy into #36850 and give ourselves a third option.

@dt (Member) commented Mar 12, 2020

Intuition for this is the following: when we create a second primary key, we introduce a large amount of new artificial contention, as all txns that update rows in the primary key also have to go and update the row in the index being built.

...

This feels very much the same as #36925

I'm having trouble squaring these in my understanding: if it were just the fact that we have a secondary index, then wouldn't the latency linger post-backfill? My understanding from #36925 was that it was only _during_ the backfill that we see issues?

@dt (Member) commented Mar 12, 2020

I mentioned this in person last week but I think this investigation would be well served by a trace of a query which sees the latency spike, run before, during and after the backfill that shows exactly where we are spending the extra time.
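For reference, one way to capture such a trace from a SQL session is session tracing (a sketch; the /debug endpoints or an external tracer are alternatives):

> SET tracing = on;
> -- run the statement that exhibits the latency spike, e.g. one of the
> -- TPC-C updates against customer
> SET tracing = off;
> SELECT * FROM [SHOW TRACE FOR SESSION];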

@ajwerner (Contributor) commented

I mentioned this in person last week but I think this investigation would be well served by a trace of a query which sees the latency spike, run before, during and after the backfill that shows exactly where we are spending the extra time.

That's reasonable. You're right that if adding the index is going to cause problems, keeping the backfill out of the way is not going to solve anything. When I entered the conversation, my mindset had been on the primary key changes, where we're going to remove the old index after the backfill.

@rohany (Contributor, Author) commented Mar 12, 2020

I mentioned this in person last week but I think this investigation would be well served by a trace of a query which sees the latency spike, run before, during and after the backfill that shows exactly where we are spending the extra time.

I was having trouble collecting traces with jaeger due to memory issues, but the traces I collected from the /debug/ endpoint mostly just showed a large number of retries (visible on the admin UI as well). The traces were sql.txn traces, and they showed that the txns got pushed for a long time -- some of the largest offenders were pushed for up to 60 seconds.

At least for primary key changes, the old primary index is dropped, so after the backfill everything calms down. I'll try again while keeping the storing index around.

@awoods187 (Contributor) commented

What's our thinking now that we've completed additional investigations here?

@github-actions (bot) commented

We have marked this issue as stale because it has been inactive for
18 months. If this issue is still relevant, removing the stale label
or adding a comment will keep it active. Otherwise, we'll close it in
10 days to keep the issue queue tidy. Thank you for your contribution
to CockroachDB!

@github-actions closed this as not planned (won't fix, can't repro, duplicate, or stale) Sep 26, 2023