Feature Request: Ignore all write_only vindexes when query planning #7336

jmoldow · 2021-01-21T05:20:28Z

Feature Description

Right now, the Vitess query planner / EXPLAIN information has no way to distinguish between a vindex that, for a given parametrized query:

Happens to be hitting all shards for the requested parameter values and for the current data in the lookup tables; vs.
Is always guaranteed to scatter to all shards, no matter the parameter values and the data in the lookup tables

This explains the behavior observed in #7328:

EXPLAIN FORMAT=vitess reported SelectEqualUnique because, as far as it knew, a unique vindex was being used on a single value.
vtexplain reported that the query would go to all shards, because Map for the write_only lookup vindex returned a list containing all the shards.

If instead the Vitess query planner had special support for understanding and handling write_only, then such vindexes could be ignored entirely during query planning. Then, rather than being unable to tell between the two cases above, the vindex would fall into a third category. And these problematic queries would fall through to all-shards scatter queries, which would be notated appropriately in EXPLAIN FORMAT=vitess, vtexplain, and /debug/scatter_stats.

As a bonus, the tools EXPLAIN FORMAT=vitess, vtexplain, and /debug/scatter_stats should let you know that a candidate vindex is available but still in write_only mode. That would make the situation even more clear.

Use Case(s)

Fix for #7328.

The text was updated successfully, but these errors were encountered:

jmoldow · 2021-01-21T16:59:58Z

One other interesting thing here.

Suppose there is a query with WHERE a=? AND b=?. Suppose this query currently utilizes a Lookup NonUnique vindex on column a.

If one runs CreateLookupVindex to create a Lookup Unique on column b, then the new vindex will have a lower cost https://vitess.io/docs/reference/features/vindexes/#cost , and will therefore be chosen.

But because the new vindex is write_only, the Map function will return all shards, leading to an all-shards scatter. Whereas if the original Lookup NonUnique vindex on column a was chosen, it wouldn't necessarily scatter to all-shards.

Whereas if write_only vindexes were ignored by the planner, the original Lookup NonUnique vindex will continue to be used, until ExternalizeVindex is run on the new vindex.

I haven't tried to repro this, but it makes sense, based on what I know, that this bug would manifest as described.

systay self-assigned this Jan 21, 2021

jmoldow mentioned this issue Jan 23, 2021

update reference/features/vindex vitessio/website#610

Merged

deepthi added the Component: Query Serving label Jan 25, 2021

askdba added the P3 label Feb 1, 2021

deepthi mentioned this issue Aug 28, 2021

Fixing error on deletes from owning table for a lookup being populated by CreateLookupVindex after a SwitchWrites #8701

Merged

3 tasks

ajm188 removed the P3 label Mar 9, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature Request: Ignore all write_only vindexes when query planning #7336

Feature Request: Ignore all write_only vindexes when query planning #7336

jmoldow commented Jan 21, 2021 •

edited

Loading

jmoldow commented Jan 21, 2021 •

edited

Loading

Feature Request: Ignore all write_only vindexes when query planning #7336

Feature Request: Ignore all write_only vindexes when query planning #7336

Comments

jmoldow commented Jan 21, 2021 • edited Loading

Feature Description

Use Case(s)

jmoldow commented Jan 21, 2021 • edited Loading

jmoldow commented Jan 21, 2021 •

edited

Loading

jmoldow commented Jan 21, 2021 •

edited

Loading