Optimize tsh db ls performance #14092

greedy52 · 2022-07-05T02:01:48Z

fix #14075

tsh db ls execution time depends on number of roles assigned to a user #14075

Changes:

Added a new API, GetCurrentUserRoles, that streams all of the current user's roles
tsh db ls now reuses the proxy client
tsh db ls --all now runs the fetch in parallel, per profile

lib/auth/auth_with_roles.go

lib/auth/grpcserver.go

lib/services/role.go

tool/tsh/db.go

smallinsky

I wonder if we can cover this flow somehow in the tsh test.
For example, we could inject our own dialer with a custom timeout into the tsh test to simulate high latency between the tsh client and the Teleport proxy.

espadolini

Have we considered the idea of having the allowed database users retrieved and returned as part of ListDatabases instead, to avoid the extra roundtrips to also fetch the rolesets?

Side note: we don't seem to show the list of database users when outputting databases in JSON format, so perhaps we could skip the extra fetches in that case?

lib/services/role.go

lib/auth/grpcserver.go

lib/auth/auth_with_roles.go

tool/tsh/db.go

espadolini · 2022-07-06T11:12:53Z

additional question: do we absolutely need all the changes to parallelize ListDatabases and ConnectToProxy/FetchAllClusterRoles instead of opening one less ProxyClient and using the same one for databases and roles?

espadolini

Additional point raised by @rosstimothy: we should be careful about parallelizing connections. A cluster with a lot of leaf clusters can cause a massive number of outbound connections when doing listDatabasesAllClusters.

smallinsky

Nice, I have left one comment, but apart from that the fix looks good to me.

tool/tsh/db.go

rosstimothy

Looks good to me. We probably want to consider adding some tracing bits to the new stuff added here. It might also be worthwhile to capture traces before and after this change. That would really show the performance gains and also help identify any areas that could still be improved upon.

greedy52 · 2022-07-07T17:53:10Z

> Looks good to me. We probably want to consider adding some tracing bits to the new stuff added here. It might also be worthwhile to capture traces before and after this change. That would really show the performance gains and also help identify any areas that could still be improved upon.

@rosstimothy tracing for the new changes works out of the box. The old code path, services.FetchRoles, does not take a context, so I had to make some local changes to capture its trace. I will leave instrumenting the old code for a separate change.

Here is the comparison. Tested against my cluster in AWS.

Before this change

ConnectToProxy is the most expensive call, and we were doing it twice.
GetRole is called n times, once for each role my user has. In the original issue, each GetRole call takes 100ms~200ms against Teleport Cloud, so they add up quickly.

With this change

ConnectToProxy once
No longer calling GetSites to figure out current cluster name.
A single GetCurrentUserRoles

Really love the tracing!


rosstimothy · 2022-07-07T19:29:09Z

Looks good!

I think that listDatabasesAllClusters might potentially benefit from a span per profile, but that could be added later too I suppose.


github-actions · 2022-07-09T20:23:41Z

@greedy52 See the table below for backport results.

Branch	Result
branch/v10	Create PR

greedy52 added 2 commits July 4, 2022 21:59

Optimize tsh db ls performance

114c681

remove debug log

fe140ca

greedy52 added the WIP label Jul 5, 2022

greedy52 requested a review from smallinsky July 5, 2022 02:48

fix UT

3a21023

greedy52 force-pushed the STeve/14075_improve_tsh_db_ls_performance branch from 78315ad to 3a21023 Compare July 5, 2022 02:49

smallinsky reviewed Jul 5, 2022

lib/auth/auth_with_roles.go (outdated)

lib/auth/grpcserver.go

lib/services/role.go (outdated)

tool/tsh/db.go (outdated)

smallinsky reviewed Jul 5, 2022

tool/tsh/db.go (outdated)

GavinFrazar mentioned this pull request Jul 5, 2022

Improve db connection errors #13824

Closed

rosstimothy requested a review from espadolini July 5, 2022 19:03

smallinsky reviewed Jul 6, 2022


espadolini reviewed Jul 6, 2022


russjones requested a review from rosstimothy July 6, 2022 17:19

greedy52 added 4 commits July 6, 2022 23:30

refactor

27f2e26

revert show databases changes

85c709f

run parrallel for tsh db ls --all

2d98293

add missing go.mod files

e90f6f4

greedy52 removed the WIP label Jul 7, 2022

greedy52 requested review from espadolini and smallinsky July 7, 2022 04:59

greedy52 marked this pull request as ready for review July 7, 2022 04:59

github-actions bot added the tsh label Jul 7, 2022

github-actions bot requested review from atburke and r0mant July 7, 2022 05:00

smallinsky reviewed Jul 7, 2022

tool/tsh/db.go (outdated)

espadolini requested changes Jul 7, 2022

tool/tsh/db.go (outdated)

tool/tsh/db.go (outdated)

rosstimothy reviewed Jul 7, 2022

tool/tsh/db.go (outdated)

reuse proxy client for tsh db ls --all

319911d

greedy52 requested review from espadolini, rosstimothy and smallinsky July 7, 2022 15:50

espadolini approved these changes Jul 7, 2022


greedy52 added the merge-for-v10 label Jul 7, 2022

rosstimothy approved these changes Jul 7, 2022


Merge branch 'master' into STeve/14075_improve_tsh_db_ls_performance

6a950e4

greedy52 enabled auto-merge (squash) July 7, 2022 17:57

greedy52 added the backport/branch/v10 label Jul 7, 2022

greedy52 added 4 commits July 7, 2022 14:24

fix ut

5e70e4e

Merge branch 'master' of github.com:gravitational/teleport into STeve…

639218d

…/14075_improve_tsh_db_ls_performance

Merge branch 'STeve/14075_improve_tsh_db_ls_performance' of github.co…

20dcdf6

…m:gravitational/teleport into STeve/14075_improve_tsh_db_ls_performance

Merge branch 'master' into STeve/14075_improve_tsh_db_ls_performance

c3e3cb8

greedy52 added 3 commits July 7, 2022 21:17

Merge branch 'master' of github.com:gravitational/teleport into STeve…

691e85f

…/14075_improve_tsh_db_ls_performance

Merge branch 'STeve/14075_improve_tsh_db_ls_performance' of github.co…

35308de

…m:gravitational/teleport into STeve/14075_improve_tsh_db_ls_performance

Merge branch 'master' into STeve/14075_improve_tsh_db_ls_performance

0f57afd

smallinsky approved these changes Jul 8, 2022


Merge branch 'master' into STeve/14075_improve_tsh_db_ls_performance

87b32e0

greedy52 merged commit 13abca6 into master Jul 9, 2022

This was referenced Jul 9, 2022

[v10] Optimize tsh db ls performance #14284

Merged

[v9] Optimize "tsh db ls" performance #14287

Merged

[v8] Optimize "tsh db ls" performance #14288

Merged

greedy52 deleted the STeve/14075_improve_tsh_db_ls_performance branch July 11, 2022 13:40

greedy52 mentioned this pull request Jul 11, 2022

Fix "tsh db ls --cluster cluster" to use the correct cluster name #14334

Merged

ravicious mentioned this pull request Aug 19, 2022

lib/services/role CurrentUserRoleGetter: Fix comment #13923

Closed
