-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[vtctld] Add v0 GetWorkflows rpc and workflow/vexec packages #7575
[vtctld] Add v0 GetWorkflows rpc and workflow/vexec packages #7575
Conversation
WaitForPositionDelays map[string]time.Duration | ||
// keyed by tablet alias. injects a sleep to the end of the function | ||
// regardless of parent context timeout or error result. | ||
WaitForPositionPostDelays map[string]time.Duration | ||
// WaitForPosition(tablet *topodatapb.Tablet, position string) error, so we | ||
// key by tablet alias and then by position. | ||
WaitForPositionResults map[string]map[string]error | ||
// keyed by tablet alias. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just sorting these, I briefly forgot how the alphabet worked at some point.
// | ||
// It has the same signature as the vtctlservicepb.VtctldServer's GetWorkflows | ||
// rpc, and grpcvtctldserver delegates to this function. | ||
func (s *Server) GetWorkflows(ctx context.Context, req *vtctldatapb.GetWorkflowsRequest) (*vtctldatapb.GetWorkflowsResponse, error) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm deferring testing this because I want to start iterating with this in vtadmin. I promise to file issues and come to write tests for these.
@@ -41,6 +42,86 @@ message ExecuteVtctlCommandResponse { | |||
logutil.Event event = 1; | |||
} | |||
|
|||
// TableMaterializeSttings contains the settings for one table. | |||
message TableMaterializeSettings { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
proto/vtctldata.proto
Outdated
ReplicationLocation source = 2; | ||
ReplicationLocation target = 3; | ||
int64 max_v_replication_lag = 4; | ||
map<string, ShardReplicationStatus> shard_statuses = 5; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Suggestion from @rohit-nayak-ps to rename these from ReplicationStatus
to Stream
which I like much better ✨
|
||
// VReplicationQueryPlanner implements the QueryPlanner interface for queries on | ||
// the _vt.vreplication table. | ||
type VReplicationQueryPlanner struct { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A lot of the stuff in vexec is a refactoring of the existing query planner code in go/vt/wrangler/{vexec,vexec_plan}.go
in a publicly exported and reusable form.
bool is_primary_serving = 3; | ||
} | ||
|
||
message Stream { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@rohit-nayak-ps let me know what you think of the new naming
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is good!
Signed-off-by: Andrew Mason <[email protected]>
… everything Signed-off-by: Andrew Mason <[email protected]>
This will require a significant amount of polish, testing, and documentation. I'll also need to see if it even works at all with how wrangler currently does vexec queries. Signed-off-by: Andrew Mason <[email protected]>
Signed-off-by: Andrew Mason <[email protected]>
Signed-off-by: Andrew Mason <[email protected]>
…ents Signed-off-by: Andrew Mason <[email protected]>
This is to support individual workflow queries, instead of just bulk queries on multiple workflows (i.e. `GetWorkflow` vs `GetWorkflows`, or `workflow ks.workflowname show` in the old API) Signed-off-by: Andrew Mason <[email protected]>
Signed-off-by: Andrew Mason <[email protected]>
…s/logs Signed-off-by: Andrew Mason <[email protected]>
This matches the old implementation, based on the constant values that are always set for that version's planner.params() function. Note also that in the old implementation, `planner.params().insertTemplates == nil`, so `buildInsertPlan` would always return error, which is why we don't even bother to define a function for it here. Only schema migration workflows seem to support inserts. Signed-off-by: Andrew Mason <[email protected]>
…up them Signed-off-by: Andrew Mason <[email protected]>
Signed-off-by: Andrew Mason <[email protected]>
Signed-off-by: Andrew Mason <[email protected]>
Signed-off-by: Andrew Mason <[email protected]>
Signed-off-by: Andrew Mason <[email protected]>
Signed-off-by: Andrew Mason <[email protected]>
Signed-off-by: Andrew Mason <[email protected]>
…neric than "Manager" Signed-off-by: Andrew Mason <[email protected]>
Signed-off-by: Andrew Mason <[email protected]>
231d794
to
098ac4f
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
very nice
vexec / queryplanner APIs to fit with the QueryParams thing that wrangler | ||
uses, which _should_ work, but who knows?? Time will tell. | ||
*/ | ||
package workflow |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not entirely sure what you mean by vexec workflows vs the others. If you mean there will be both wrangler and vexec based functionality, that is correct atm. We have "delegated" some APIs like Workflow to VExec since they are common to all vrep workflow types (MoveTables/Reshard/Materialize).
With V2 workflows some of the Workflow command functionality is provided under the umbrella MoveTables/Reshard commands. There is still a little bit of work to do, but we will be in a position to deprecate the Workflow command once completed. I am still a bit ambivalent about a deprecation since Workflow-functionalities work across all types of workflow and might be more user-friendly: the user doesn't need to know which type of workflow they are operating on. I am also going to work soon on the ability to update the BinlogSource attribute to make it easier to setup custom Materialize workflows instead of VReplicationExec based scripting.
That said, VExec is pretty powerful for arbitrary operations on workflows and we may always need it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nvm, I see from the code below what you mean by vexec workflows ... Leaving the ramblings above as an fyi ;-)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, this is definitely not as clear as it could be; I'd prefer you didn't have to read all the code to understand what this is trying to say 😅
I can definitely try to clean this up in a later PR (probably when I add the GetWorkflow
), as I want to get this merged soon-ish to unblock us in vtadmin-land to start working with this API.
} | ||
|
||
return nil, assert.AnError | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice
// method called something like "VReplicationExec(ctx, query, Options{DryRun: true})" | ||
// DryRun(ctx context.Context) error | ||
|
||
// PlanQuery contsructs and returns a QueryPlan for a given statement. The |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
typo: contsructs => constructs
Signed-off-by: Andrew Mason <[email protected]>
[vtctld] Add v0 GetWorkflows rpc and workflow/vexec packages Signed-off-by: Andrew Mason <[email protected]> Conflicts: go/vt/proto/vtctldata/vtctldata.pb.go go/vt/proto/vtctlservice/vtctlservice.pb.go go/vt/vtctl/grpcvtctldclient/client_gen.go go/vt/vtctl/grpcvtctldserver/server.go go/vt/wrangler/vexec.go proto/vtctldata.proto proto/vtctlservice.proto
[vtctld] Add v0 GetWorkflows rpc and workflow/vexec packages Signed-off-by: Andrew Mason <[email protected]> Conflicts: go/vt/proto/vtctldata/vtctldata.pb.go go/vt/proto/vtctlservice/vtctlservice.pb.go go/vt/vtctl/grpcvtctldclient/client_gen.go go/vt/vtctl/grpcvtctldserver/server.go go/vt/wrangler/vexec.go proto/vtctldata.proto proto/vtctlservice.proto
Note (@rafael): This one required a bunch of manual intervention [vtctld] Add v0 GetWorkflows rpc and workflow/vexec packages Signed-off-by: Rafael Chacon <[email protected]>
Description
This adds an initial attempt at a GetWorkflows rpc to VtctldServer, by way of introducing new
vtctl/workflow
andvtctl/workflow/vexec
packages to eventually share more code between the legacy and new vtctl(d) servers. I also made a more committed effort on that front by reimplementing wrangler's ListWorkflows function using the new vexec.package go/vt/vtctl/workflow
This provides the actual implementation of GetWorkflows. I'm debating of separating the GetWorkflows rpc into a separate "service" in the protobuf file, because it seems weird to me that
workflow.Server
provides the actual implementation. I think this would look something like:This would allow us to assert that
workflow.Server
correctly implements the entirety of the Workflow rpcs, without also needing to implement the rest of VtctldServer by separating out the concerns. But then registering is maybe weird, so idk. All perspectives welcome!!package go/vt/vtctl/workflow/vexec
This attempts to create a more ... encapsulated vexec API by more cleanly separating the query parsing/planning/rewriting phase from the execution phase. Eventually this should include all the functionality of wrangler's
runVexec
, but not yet, because that was a lot.Usage
I got a Reshard running by running up through 301_reshard.sh of the local example, and then:
New CLI / code
Old CLI / code
Related Issue(s)
Checklist
Follow-up Tasks
Deployment Notes
Impacted Areas in Vitess
Components that this PR will affect: