
Using index for NULL values is slower than full table scan (not using index) #62963

Closed
kjlubick opened this issue Apr 1, 2021 · 11 comments

Labels: C-bug (Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior.), O-community (Originated from the community), X-blathers-triaged (blathers was able to find an owner)

Comments


kjlubick commented Apr 1, 2021

Describe the problem

I have a table (TiledTraceDigests) with ~20 million rows. I realized the table needed another column (grouping_id) and an index on the new column, so I added them with ALTER TABLE and then CREATE INDEX.

I then needed to fill in this column with data by looking it up in another table (Traces), so I ran several updates like:

UPDATE TiledTraceDigests SET (grouping_id) = (
  SELECT grouping_id FROM Traces WHERE Traces.trace_id = TiledTraceDigests.trace_id)
WHERE grouping_id is NULL limit 1000000;

The LIMIT of 1 million was to keep each update from taking too long or having to be retried if new data came in.
This went fine for the first 16 or so updates, each taking around 60 seconds. Then an update suddenly stalled, running for over 15 minutes before I killed it from the UI.

I tried running the same update command with LIMIT 5; instead of completing in tens or hundreds of milliseconds, it took over 10 seconds to update 5 rows.

I used EXPLAIN ANALYZE to see where the time was going. It was blocked on getting rows from TiledTraceDigests.

Here are some interesting queries:

  • select * From TiledTraceDigests@grouping_digest_idx where grouping_id is null limit 10; took 6.7 seconds
  • select * From TiledTraceDigests@primary where grouping_id is null limit 10; took 5.3 seconds
  • select * From TiledTraceDigests@grouping_digest_idx where grouping_id = x'a181394e13962c65455837cbdd3a8da8' limit 10; took 4 milliseconds (as I would expect).

It appears that querying the NULL portion of this index is very slow.
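For reference, the EXPLAIN ANALYZE mentioned above was run against the slow scan, roughly like this (a sketch; the full output is in the statement bundle attached below):

EXPLAIN ANALYZE
  SELECT * FROM TiledTraceDigests@grouping_digest_idx
  WHERE grouping_id IS NULL
  LIMIT 10;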

I first noticed this on v20.2.3, but the problem appears to persist after updating to v20.2.7.

Expected behavior
I expect querying a few rows from an index to be fast (milliseconds), not slower than a full table scan that avoids the index.

Additional data / screenshots
SQL Schemas:

CREATE TABLE IF NOT EXISTS TiledTraceDigests (
  trace_id BYTES,
  tile_id INT4,
  digest BYTES NOT NULL,
  -- The following column was added with an ALTER TABLE
  -- grouping_id BYTES NOT NULL,
  PRIMARY KEY (trace_id, tile_id, digest)
  -- The following index was added after the alter table
  -- INDEX grouping_digest_idx (grouping_id, digest)
);
CREATE TABLE IF NOT EXISTS Traces (
  trace_id BYTES PRIMARY KEY,
  corpus STRING AS (keys->>'source_type') STORED NOT NULL,
  grouping_id BYTES NOT NULL,
  keys JSONB NOT NULL,
  matches_any_ignore_rule BOOL,
  INDEX grouping_ignored_idx (grouping_id, matches_any_ignore_rule),
  INDEX ignored_grouping_idx (matches_any_ignore_rule, grouping_id),
  INVERTED INDEX keys_idx (keys)
);
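The schema change described above would have been roughly the following (a sketch; the exact statements aren't shown here, and note that the column was actually added without the NOT NULL constraint, as clarified later in the thread):

ALTER TABLE TiledTraceDigests ADD COLUMN grouping_id BYTES;
CREATE INDEX grouping_digest_idx ON TiledTraceDigests (grouping_id, digest);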

I've attached the zip file taken from Statement Diagnostics in the UI.
stmt-bundle-646243008118882309.zip

Environment:

  • CockroachDB version: 20.2.3 (where I first noticed this; the problem persists after updating to 20.2.7)
  • Server OS: Debian/Kubernetes
  • Client app: cockroach sql

Additional context
What was the impact?

My new column is only partially filled out, and I'm not sure how long it will take me to finish.

@kjlubick kjlubick added the C-bug Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior. label Apr 1, 2021
@blathers-crl blathers-crl bot added O-community Originated from the community X-blathers-triaged blathers was able to find an owner labels Apr 1, 2021

kjlubick commented Apr 1, 2021

FWIW: Deleting and recreating the index appears to have made the "WHERE grouping_id IS NULL" queries fast again. Not a great solution, but a solution.
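Roughly what that workaround looks like (a sketch, assuming the index definition from the schema above):

DROP INDEX TiledTraceDigests@grouping_digest_idx;
CREATE INDEX grouping_digest_idx ON TiledTraceDigests (grouping_id, digest);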

@mgartner mgartner self-assigned this Apr 6, 2021
RaduBerinde (Member) commented:

> select * From TiledTraceDigests@grouping_digest_idx where grouping_id is null limit 10; took 6.7 seconds
> select * From TiledTraceDigests@primary where grouping_id is null limit 10; took 5.3 seconds

These queries should result in a contradiction when we generate index constraints (since the column is defined as NOT NULL). We may have a bug where the contradiction becomes a full table scan because there are no spans.


kjlubick commented Apr 6, 2021

Oh, sorry, I should clarify something: the grouping_id BYTES column was added, but without the NOT NULL constraint. I planned to add that constraint after I had filled in all the data.
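A sketch of that follow-up constraint, to be added once the backfill is complete:

ALTER TABLE TiledTraceDigests ALTER COLUMN grouping_id SET NOT NULL;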


mgartner commented Apr 6, 2021

@kjlubick Are you running multiple UPDATEs concurrently? When you timed those three "interesting" queries, were there any ongoing UPDATEs?


kjlubick commented Apr 7, 2021

Yes, there would have been multiple INSERTs to that table in parallel. I noticed that if an INSERT happened during my query, it would take approximately 2 to 3 times as long (retries, I presume). The data I provided was captured when no INSERTs were being executed.


kjlubick commented Apr 7, 2021

FWIW, those INSERTs were something like
INSERT INTO TiledTraceDigests (trace_id, tile_id, digest) VALUES ($1, $2, $3), ($4, $5, $6)... ON CONFLICT DO NOTHING, executed via crdbpgx.ExecuteTx to retry retryable errors. The batch size was up to 200 rows per INSERT.
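Written out, that insert shape is roughly the following (placeholders illustrative; each statement was batched and retried via crdbpgx.ExecuteTx):

INSERT INTO TiledTraceDigests (trace_id, tile_id, digest)
VALUES ($1, $2, $3), ($4, $5, $6)  -- ... up to ~200 rows per statement
ON CONFLICT DO NOTHING;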


mgartner commented Apr 7, 2021

Contention may be the culprit: scanning for rows where grouping_id is null would contend with the inserts you describe. However, I don't see how dropping and recreating the index would have improved performance in this case. Has performance degraded at all since recreating the index?


kjlubick commented Apr 7, 2021

Performance appears normal after recreating the index:

> select * From TiledTraceDigests@primary where grouping_id is null limit 10;
  trace_id | tile_id | digest | grouping_id
-----------+---------+--------+--------------
(0 rows)

Time: 21.636s total (execution 21.535s / network 0.101s)

> select * From TiledTraceDigests@grouping_digest_idx where grouping_id is null limit 10;
  trace_id | tile_id | digest | grouping_id
-----------+---------+--------+--------------
(0 rows)

Time: 103ms total (execution 102ms / network 1ms)

There are currently about 22 million rows in TiledTraceDigests as I write this.


mgartner commented Apr 7, 2021

Another possibility is that you ran into #54029. If so, that first UPDATE of 1 million rows could have created general slowness for the table.

Even if it's unrelated to #54029, I'd suggest reducing the number of rows updated in each batch to around 10k. You should get more consistent performance by doing so.
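A sketch of the original UPDATE with that smaller batch size, run repeatedly until it reports 0 rows updated:

UPDATE TiledTraceDigests SET (grouping_id) = (
  SELECT grouping_id FROM Traces WHERE Traces.trace_id = TiledTraceDigests.trace_id)
WHERE grouping_id IS NULL LIMIT 10000;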


mgartner commented Apr 9, 2021

@kjlubick I'm going to close this issue for now because it's unlikely we'll get to the bottom of this unless you encounter it again. Our best guess is that it's related to #54029, and reducing the batch size of the updates to ~10k should mitigate it. Please leave a comment if you see it again, and we can try to diagnose it.

@mgartner mgartner closed this as completed Apr 9, 2021
@mgartner mgartner moved this to Done in SQL Queries Jul 24, 2023