Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Batch polling of DescribeVolumesModifications #1965

Merged
merged 2 commits into from
Apr 2, 2024

Conversation

AndrewSirenko
Copy link
Contributor

@AndrewSirenko AndrewSirenko commented Mar 14, 2024

Is this a bug fix or adding new feature?
Feature

What is this PR about? / Why do we need it?
Coalesces EC2 DescribeVolumesModifications (DVM) polling across ControllerExpandVolume/ControllerModifyVolume/ModifyVolume RPCs by volumeID.

This decreases the likelihood of being exceeding one's Non-mutating API request token limit when modifying many volumes at once, which may impact all driver RPCs (ie being throttled for making too manyDescribe* calls).

Note: We do not batch any DVM call that we are unsure if a volume modification has been made for, due to undocumented behavior of a an error being returned if any volume queried for has never been modified.

What testing is done?
CI

Rested that VAC based modify volume batches DescribeVolumeModifications at scale. See Data/Impact section below.


Data/Impact

900 Modified PVCs over 60 seconds across 52 nodes (100Gi increase)

Note: All examples included 900 non-batchable DescribeVolumesModification calls.

Unbatched 300ms 500ms 1s
DVM Calls Made (excluding non-batchable 900) 5500 231 189 114
Rate-Limiting? Yes No No No

200 Modified PVCs over 60 seconds across 12 nodes (100Gi increase)

Note: Both examples include 200 non-batchable DescribeVolumesModification calls.

Unbatched 500ms
DVM Calls Made (excluding non-batchable 200) 624 81

@k8s-ci-robot k8s-ci-robot added do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Mar 14, 2024
@k8s-ci-robot k8s-ci-robot added the size/L Denotes a PR that changes 100-499 lines, ignoring generated files. label Mar 14, 2024
Copy link

github-actions bot commented Mar 14, 2024

Code Coverage Diff

File Old Coverage New Coverage Delta
github.com/kubernetes-sigs/aws-ebs-csi-driver/pkg/cloud/cloud.go 84.4% 85.4% 1.0

@AndrewSirenko AndrewSirenko changed the title [WIP] Batch polling of DescribeVolumesModifications Batch polling of DescribeVolumesModifications Mar 14, 2024
@k8s-ci-robot k8s-ci-robot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Mar 14, 2024
@AndrewSirenko
Copy link
Contributor Author

/hold

Holding to avoid merge conflicts for #1963

@k8s-ci-robot k8s-ci-robot added the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Mar 14, 2024
@torredil
Copy link
Member

Reviewing after PR is rebased post SDK changes.

@AndrewSirenko
Copy link
Contributor Author

Reviewing after PR is rebased post SDK changes.

Response to my question about when I should expect a review on this PR.

@k8s-ci-robot k8s-ci-robot added size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. and removed size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Mar 29, 2024
@AndrewSirenko AndrewSirenko force-pushed the batch-modify branch 2 times, most recently from 2cfbcb3 to d167578 Compare April 1, 2024 20:28
@k8s-ci-robot k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. labels Apr 1, 2024
pkg/cloud/cloud.go Outdated Show resolved Hide resolved
pkg/cloud/cloud.go Outdated Show resolved Hide resolved
pkg/cloud/cloud.go Outdated Show resolved Hide resolved
pkg/cloud/cloud.go Outdated Show resolved Hide resolved
Copy link
Contributor

@ConnorJC3 ConnorJC3 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

mostly lgtm, see below

pkg/cloud/cloud.go Outdated Show resolved Hide resolved
pkg/cloud/cloud.go Show resolved Hide resolved
pkg/cloud/cloud.go Outdated Show resolved Hide resolved
@k8s-ci-robot k8s-ci-robot added size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. and removed size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Apr 2, 2024
@k8s-ci-robot k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. labels Apr 2, 2024
@AndrewSirenko
Copy link
Contributor Author

/unhold

@k8s-ci-robot k8s-ci-robot removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Apr 2, 2024
Copy link
Member

@torredil torredil left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

/lgtm

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Apr 2, 2024
@ConnorJC3
Copy link
Contributor

/approve

@k8s-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ConnorJC3

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@k8s-ci-robot k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Apr 2, 2024
@k8s-ci-robot k8s-ci-robot merged commit 137c30b into kubernetes-sigs:master Apr 2, 2024
19 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. lgtm "Looks good to me", indicates that a PR is ready to be merged. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants