[vtgate planner] Routing & Merging refactor #12197
@@ -1150,7 +1150,7 @@
 "Original": "select Id from user where 1 in ('aa', 'bb')",
 "Instructions": {
 "OperatorType": "Route",
-"Variant": "Scatter",
+"Variant": "None",
We are now using the `evalengine` to check if we can evaluate the expression at plan time. If we can, and if the result is `false`, we can use the `None` opcode, which is very cheap.
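The idea in this comment can be sketched in a few lines of Go. This is only an illustration, not the planner's code: `Opcode`, `InLiteral`, `constEval` and `pickOpcode` are hypothetical names, and the tiny evaluator compares literals as plain strings (ignoring MySQL type coercion) where Vitess would use the `evalengine`.

```go
package main

import "fmt"

// Opcode is a hypothetical stand-in for the route opcodes shown in the plan.
type Opcode string

const (
	Scatter Opcode = "Scatter"
	None    Opcode = "None"
)

// InLiteral models a predicate such as `1 in ('aa', 'bb')` where every
// operand is a literal. This shape is an assumption made for the sketch.
type InLiteral struct {
	Left string
	List []string
}

// constEval computes the predicate's value. Because all operands are
// literals, the result is already known at plan time.
func constEval(p InLiteral) bool {
	for _, v := range p.List {
		if v == p.Left {
			return true
		}
	}
	return false
}

// pickOpcode switches the route to None when the predicate is provably
// false, so no shard has to be queried at all.
func pickOpcode(current Opcode, p InLiteral) Opcode {
	if !constEval(p) {
		return None
	}
	return current
}

func main() {
	p := InLiteral{Left: "1", List: []string{"aa", "bb"}}
	fmt.Println(pickOpcode(Scatter, p)) // prints "None"
}
```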
@@ -279,9 +279,9 @@
 "Sharded": false
 },
 "FieldQuery": "select RC.CONSTRAINT_NAME, ORDINAL_POSITION from INFORMATION_SCHEMA.KEY_COLUMN_USAGE as KCU, INFORMATION_SCHEMA.REFERENTIAL_CONSTRAINTS as RC where 1 != 1",
-"Query": "select RC.CONSTRAINT_NAME, ORDINAL_POSITION from INFORMATION_SCHEMA.KEY_COLUMN_USAGE as KCU, INFORMATION_SCHEMA.REFERENTIAL_CONSTRAINTS as RC where KCU.TABLE_SCHEMA = :__vtschemaname and KCU.TABLE_NAME = :KCU_TABLE_NAME and KCU.COLUMN_NAME = 'id' and KCU.REFERENCED_TABLE_SCHEMA = 'test' and KCU.CONSTRAINT_NAME = 'data_type_table_id_fkey' and KCU.CONSTRAINT_NAME = RC.CONSTRAINT_NAME order by KCU.CONSTRAINT_NAME asc, KCU.COLUMN_NAME asc",
+"Query": "select RC.CONSTRAINT_NAME, ORDINAL_POSITION from INFORMATION_SCHEMA.KEY_COLUMN_USAGE as KCU, INFORMATION_SCHEMA.REFERENTIAL_CONSTRAINTS as RC where KCU.TABLE_SCHEMA = :__vtschemaname and KCU.TABLE_NAME = :KCU_TABLE_NAME and KCU.COLUMN_NAME = 'id' and KCU.REFERENCED_TABLE_SCHEMA = :__vtschemaname and KCU.CONSTRAINT_NAME = 'data_type_table_id_fkey' and KCU.CONSTRAINT_NAME = RC.CONSTRAINT_NAME order by KCU.CONSTRAINT_NAME asc, KCU.COLUMN_NAME asc",
The old query was wrong: we should never use the given schema name. Instead, we have to replace the literal value with the argument `:__vtschemaname`, which is then filled in by vttablet with the name of the underlying MySQL database.
 "SysTableTableName": "[KCU_TABLE_NAME:VARCHAR(\"data_type_table\")]",
-"SysTableTableSchema": "[VARCHAR(\"test\")]",
+"SysTableTableSchema": "[VARCHAR(\"test\"), VARCHAR(\"test\")]",
"Keyspace": {
"Name": "main",
"Sharded": false
"OperatorType": "Join",
The old plan was wrong. Given `WHERE kcu.table_schema = ? AND rc.constraint_schema = ?`, we won't know until runtime if the user wants to look at information from the same or different keyspaces/schemas, and so merging these routes into a single one is invalid.
@@ -547,7 +571,7 @@
 "FieldQuery": "select fk.referenced_table_name as to_table, fk.referenced_column_name as primary_key, fk.column_name as `column`, fk.constraint_name as `name`, rc.update_rule as on_update, rc.delete_rule as on_delete from information_schema.referential_constraints as rc, information_schema.key_column_usage as fk where 1 != 1",
 "Query": "select fk.referenced_table_name as to_table, fk.referenced_column_name as primary_key, fk.column_name as `column`, fk.constraint_name as `name`, rc.update_rule as on_update, rc.delete_rule as on_delete from information_schema.referential_constraints as rc, information_schema.key_column_usage as fk where rc.constraint_schema = :__vtschemaname and rc.table_name = :rc_table_name and fk.referenced_column_name is not null and fk.table_schema = :__vtschemaname and fk.table_name = :fk_table_name",
 "SysTableTableName": "[fk_table_name:VARCHAR(\"table_name\"), rc_table_name:VARCHAR(\"table_name\")]",
-"SysTableTableSchema": "[VARCHAR(\"table_schema\"), VARCHAR(\"table_schema\")]",
+"SysTableTableSchema": "[VARCHAR(\"table_schema\")]",
Since we have merged the two routes, there is no need to specify the `table_schema` schema twice.
"SysTableTableName": "[tc_table_name:VARCHAR(\"table_name\")]",
"SysTableTableSchema": "[VARCHAR(\"constraint_schema\"), VARCHAR(\"table_schema\")]",
"Table": "information_schema.check_constraints, information_schema.table_constraints"
"OperatorType": "Join",
Here we know that the two schemas being searched for are different: `WHERE tc.table_schema = 'table_schema' AND ... cc.constraint_schema = 'constraint_schema'`. Merging these two was wrong.
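The three information_schema comments above follow one rule, which can be sketched as below. This is a deliberately strict simplification and not the planner's logic: `SchemaExpr`, `sameKnownSchema` and `canMerge` are hypothetical names, a bind variable is always treated as unknown, and two routes with no schema predicates at all are assumed to be mergeable.

```go
package main

import "fmt"

// SchemaExpr is a hypothetical stand-in for a schema predicate value: either
// a literal known at plan time or a bind variable resolved only at runtime.
type SchemaExpr struct {
	Value string
	IsArg bool
}

// sameKnownSchema reports whether every expression is a literal and all of
// them name the same schema. An empty list counts as "same".
func sameKnownSchema(exprs []SchemaExpr) bool {
	for _, e := range exprs {
		if e.IsArg || e.Value != exprs[0].Value {
			return false
		}
	}
	return true
}

// canMerge decides whether two information_schema routes may become one:
// only when the planner can prove both sides look at the same schema.
func canMerge(a, b []SchemaExpr) bool {
	all := append(append([]SchemaExpr{}, a...), b...)
	return sameKnownSchema(all)
}

func main() {
	lit := func(s string) SchemaExpr { return SchemaExpr{Value: s} }
	arg := func(s string) SchemaExpr { return SchemaExpr{Value: s, IsArg: true} }

	// Same literal on both sides: safe to merge.
	fmt.Println(canMerge([]SchemaExpr{lit("table_schema")}, []SchemaExpr{lit("table_schema")}))
	// Different literals: the routes target different schemas.
	fmt.Println(canMerge([]SchemaExpr{lit("table_schema")}, []SchemaExpr{lit("constraint_schema")}))
	// Bind variables: unknown until runtime, so keep the routes apart.
	fmt.Println(canMerge([]SchemaExpr{arg("v1")}, []SchemaExpr{arg("v2")}))
}
```

When `canMerge` returns false, the plan keeps two routes and combines them with a `Join` operator, as in the changed plans above.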
@@ -22,7 +22,7 @@
 "Original": "select * from ambiguous_ref_with_source",
 "Instructions": {
 "OperatorType": "Route",
-"Variant": "Reference",
+"Variant": "Unsharded",
`ambiguous_ref_with_source` exists both as an unsharded table in the `main` keyspace, and as a reference table in the `user` keyspace. The latter is a copy of the unsharded table spread out to all shards so that all joins can be local.
During route planning we have decided that we want to send the query to the unsharded `main` keyspace. The OpCode is more accurate if it's `Unsharded` for this route.
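A small sketch of the reasoning: the opcode should describe where the query is actually sent, not every role the table name can play. `Keyspace`, `TableEntry` and `opcodeFor` are hypothetical names used only for this illustration, and the fallback to `Scatter` is an assumption.

```go
package main

import "fmt"

// Keyspace is a hypothetical, minimal description of a keyspace.
type Keyspace struct {
	Name    string
	Sharded bool
}

// TableEntry is one place a table name can resolve to.
type TableEntry struct {
	Keyspace    Keyspace
	IsReference bool // true for a reference copy kept on every shard
}

// opcodeFor derives the opcode from the entry the planner picked.
func opcodeFor(t TableEntry) string {
	switch {
	case !t.Keyspace.Sharded:
		return "Unsharded"
	case t.IsReference:
		return "Reference"
	default:
		return "Scatter"
	}
}

func main() {
	source := TableEntry{Keyspace: Keyspace{Name: "main"}}
	copyInUser := TableEntry{Keyspace: Keyspace{Name: "user", Sharded: true}, IsReference: true}
	fmt.Println(opcodeFor(source), opcodeFor(copyInUser)) // prints "Unsharded Reference"
}
```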
This is great, it's an important change that makes things easier for all of us and that fills some gaps we had, thank you @systay 🙏🏻
I left a few questions, comments and nits after this first pass
Signed-off-by: Andres Taylor <[email protected]>
type InfoSchemaRouting struct {
	SysTableTableSchema []sqlparser.Expr
	SysTableTableName   map[string]sqlparser.Expr
	Table               *QueryTable
}

func (isr *InfoSchemaRouting) UpdateRoutingParams(_ *plancontext.PlanningContext, rp *engine.RoutingParameters) error {
	rp.SysTableTableSchema = nil
	for _, expr := range isr.SysTableTableSchema {
Now that we do not merge if `SysTableTableSchema` is different, should `SysTableTableSchema` be a single `Expr` rather than a slice of `Expr`? Similarly, do we need `SysTableTableName` as a map?
valid points. wdyt about doing these fixes in a separate PR? this one has grown enough for now :)
We should add a task for it
Looks good to me!
The rewriting on v16 didn't consider the case where we already had an extract subquery. In that case we don't extract again, to avoid infinite recursion. This does not affect v17 and later as this was fixed in the refactor in vitessio#12197. Signed-off-by: Dirkjan Bussink <[email protected]>
Description
This PR refactors how routing of queries is done during query planning.
Why?
The logic for which routes can be merged together is an important and complex part of the query planner.
Making the code easy to understand and talk about is critical to get this correct.
The old `Route` operator consisted of a set of fields. Of these, only `Source`, `RouteOpCode` and `MergedWith` are valid for all types of routes. All other fields only make sense for some of the OpCodes that the route represents.
The fields `VindexPreds` and `Selected` only make sense for sharded tables, which are represented by a lot of OpCodes, such as `Scatter`, `EqualUnique`, etc. `SysTableTableSchema` and `SysTableTableName` are only used for information_schema tables (OpCode DBA).
In a lot of places, we had to use a switch statement on the OpCode to handle things differently depending on the type of `Route` we were dealing with.
The Change
To me, this screamed for an interface and multiple different implementations of this interface, depending on which type of route we have.
The new operator now contains a `Routing`.
The `Routing` interface is then used for picking the best plan per table in the query, and then for merging multiple `Route`s into as few as possible.
While doing this refactoring, I tried to keep the tests intact and only change the code behind them. For the few exceptions to this rule, I have added comments in this PR explaining why the change was introduced.
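To make the shape of the change concrete, here is a reduced sketch of the interface-based design. It is an illustration under assumptions, not the code in this PR: the method set (`OpCode`, `Cost`), the cost numbers, the string-typed fields and the `cheapest` helper are all invented for the example. Only the type names and the field names `Source`, `MergedWith`, `VindexPreds`, `Selected`, `SysTableTableSchema` and `SysTableTableName` come from the description and diff above.

```go
package main

import "fmt"

// Routing is a hypothetical, reduced form of the interface described above.
type Routing interface {
	OpCode() string
	Cost() int
}

// ShardedRouting carries the fields that only make sense for sharded tables.
type ShardedRouting struct {
	VindexPreds []string
	Selected    string // chosen vindex predicate; empty means none was usable
}

func (r ShardedRouting) OpCode() string {
	if r.Selected == "" {
		return "Scatter"
	}
	return "EqualUnique"
}

// Cost uses made-up numbers: hitting one shard is cheaper than hitting all.
func (r ShardedRouting) Cost() int {
	if r.Selected == "" {
		return 20
	}
	return 1
}

// InfoSchemaRouting carries the fields used only for information_schema tables.
type InfoSchemaRouting struct {
	SysTableTableSchema []string
	SysTableTableName   map[string]string
}

func (InfoSchemaRouting) OpCode() string { return "DBA" }
func (InfoSchemaRouting) Cost() int      { return 0 }

// Route keeps only what is valid for every kind of route and delegates the
// rest to its Routing, so callers no longer switch on the opcode.
type Route struct {
	Source     string
	MergedWith []*Route
	Routing    Routing
}

// cheapest picks the candidate routing with the lowest cost, or nil if
// there are no candidates.
func cheapest(candidates []Routing) Routing {
	var best Routing
	for _, c := range candidates {
		if best == nil || c.Cost() < best.Cost() {
			best = c
		}
	}
	return best
}

func main() {
	candidates := []Routing{
		ShardedRouting{VindexPreds: []string{"id = 5"}},
		ShardedRouting{VindexPreds: []string{"id = 5"}, Selected: "id = 5"},
	}
	r := Route{Source: "user", Routing: cheapest(candidates)}
	fmt.Println(r.Routing.OpCode()) // prints "EqualUnique"
}
```

Each implementation owns the fields that apply to it, so a field such as `Selected` can no longer be set on an information_schema route by accident.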