
Allow overriding node attach limits as driver CLI parameter #347

Closed
gnufied opened this issue Aug 15, 2019 · 16 comments
Labels
lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed.

Comments

@gnufied
Contributor

gnufied commented Aug 15, 2019

Given all the complexities around how limits will be calculated on the node, it would be a good idea to simply allow an admin to override the limits when starting the csi-ebs driver.

@leakingtapan
Contributor

Not sure how this is going to work if the worker nodes are of different instance types, each with a different attach limit. I think this should be a per-node attribute.

@gnufied
Contributor Author

gnufied commented Aug 15, 2019

The CSINode object where limits are actually saved is created on a per-node basis, so we should be able to support this as a per-node attribute.

Are you referring to how this will be set at deploy time?

@gnufied
Contributor Author

gnufied commented Aug 15, 2019

The flow of this code is:

csi-ebs-driver-on-node --> store limit --> CSINode --> scheduler uses the CSINode object when counting volumes.

So it is perfectly possible to support this on a per-node basis. How tools like kops will surface this configuration parameter is another problem, though.
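
A minimal sketch (not the driver's actual code) of how a per-node CLI override could flow through that path: whatever the driver reports in NodeGetInfo's MaxVolumesPerNode field is what kubelet writes into the node's CSINode object and what the scheduler later counts against. The --volume-attach-limit flag name and the detectAttachLimit fallback below are illustrative assumptions.

```go
package main

import (
	"context"
	"flag"
	"fmt"

	csi "github.com/container-storage-interface/spec/lib/go/csi"
)

// volumeAttachLimit is an illustrative flag name, not necessarily the one the
// driver would end up exposing.
var volumeAttachLimit = flag.Int64("volume-attach-limit", -1,
	"value to report as the node's maximum attachable volume count (-1 = auto-detect)")

type nodeService struct {
	nodeID string
}

// NodeGetInfo is where the limit leaves the driver: kubelet copies
// MaxVolumesPerNode into the node's CSINode object, and the scheduler reads
// CSINode when counting volumes.
func (d *nodeService) NodeGetInfo(ctx context.Context, req *csi.NodeGetInfoRequest) (*csi.NodeGetInfoResponse, error) {
	limit := detectAttachLimit() // instance-type heuristic, elided here
	if *volumeAttachLimit >= 0 {
		limit = *volumeAttachLimit // admin override from the CLI wins
	}
	return &csi.NodeGetInfoResponse{
		NodeId:            d.nodeID,
		MaxVolumesPerNode: limit,
	}, nil
}

// detectAttachLimit stands in for whatever heuristic the driver already uses.
func detectAttachLimit() int64 { return 39 }

func main() {
	flag.Parse()
	d := &nodeService{nodeID: "i-0123456789abcdef0"}
	resp, _ := d.NodeGetInfo(context.Background(), &csi.NodeGetInfoRequest{})
	fmt.Println("reporting MaxVolumesPerNode =", resp.MaxVolumesPerNode)
}
```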

@leakingtapan
Contributor

Maybe I missed something. How could a cluster operator set this new CLI parameter differently for drivers on different nodes? Since the driver is deployed as a DaemonSet, I am assuming the pods in the DaemonSet are identical.

@gnufied
Contributor Author

gnufied commented Aug 27, 2019

Could this be solved by running one DaemonSet whose selector targets one kind of node and another DaemonSet whose selector targets the other kind? I do not think it should be too difficult for a kube admin who is running into this kind of issue...
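
A rough sketch of that approach, assuming a node label (here ebs.csi.aws/attach-limit-group) that the operator applies per node group and the proposed --volume-attach-limit flag; none of these names are confirmed by the driver. Each DaemonSet copy targets one group of nodes and passes that group's limit:

```go
package main

import (
	"fmt"

	appsv1 "k8s.io/api/apps/v1"
	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// nodeDaemonSet builds a DaemonSet that only schedules onto nodes labeled
// with the given group and passes that group's attach-limit override.
func nodeDaemonSet(group string, attachLimit int) *appsv1.DaemonSet {
	labels := map[string]string{"app": "ebs-csi-node", "group": group}
	return &appsv1.DaemonSet{
		ObjectMeta: metav1.ObjectMeta{Name: "ebs-csi-node-" + group, Namespace: "kube-system"},
		Spec: appsv1.DaemonSetSpec{
			Selector: &metav1.LabelSelector{MatchLabels: labels},
			Template: corev1.PodTemplateSpec{
				ObjectMeta: metav1.ObjectMeta{Labels: labels},
				Spec: corev1.PodSpec{
					// Only nodes carrying this (hypothetical) label get this copy.
					NodeSelector: map[string]string{"ebs.csi.aws/attach-limit-group": group},
					Containers: []corev1.Container{{
						Name:  "ebs-plugin",
						Image: "amazon/aws-ebs-csi-driver:latest",
						Args:  []string{"node", fmt.Sprintf("--volume-attach-limit=%d", attachLimit)},
					}},
				},
			},
		},
	}
}

func main() {
	// e.g. one group of nodes reports 25 attachable volumes, another 39
	small := nodeDaemonSet("small", 25)
	large := nodeDaemonSet("large", 39)
	fmt.Println(small.Name, large.Name)
}
```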

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 25, 2019
@leakingtapan
Contributor

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 25, 2019
@otterley

On EC2 Nitro instances, the number of attachable EBS volumes is not a fixed value that can be calculated once when the CSI driver is launched and remain correct for the lifetime of the node. Because the formula is MaxEbsAttachableVolumes = MaxAttachments - AttachedENIs, where MaxAttachments depends on the EC2 instance type, the value must be recalculated each time an ENI is attached to or detached from the instance.
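
For concreteness, a small sketch of the recalculation this implies; the MaxAttachments value here is an assumed per-instance-type constant, and the ENI count would come from instance metadata or EC2 APIs in a real driver:

```go
package main

import "fmt"

// maxEbsAttachableVolumes applies MaxEbsAttachableVolumes = MaxAttachments - AttachedENIs.
// It has to be re-evaluated whenever the number of attached ENIs changes.
func maxEbsAttachableVolumes(maxAttachments, attachedENIs int) int {
	limit := maxAttachments - attachedENIs
	if limit < 0 {
		limit = 0
	}
	return limit
}

func main() {
	const maxAttachments = 28 // assumed total attachment slots for some Nitro type
	for _, enis := range []int{1, 3, 5} {
		fmt.Printf("with %d ENIs attached, EBS limit = %d\n", enis, maxEbsAttachableVolumes(maxAttachments, enis))
	}
}
```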

@leakingtapan
Contributor

Agree that this proposal is very suboptimal

@gnufied
Contributor Author

gnufied commented Dec 15, 2019

Agree that this proposal is very suboptimal

Designing a generic mechanism to calculate the number of attachments as a shared resource is a tricky problem. There are various ways overrides can be configured, such as pre-allocating a number of ENIs or a percentage of attachments (and subtracting it from the max). It would be nice if AWS had an API for determining the MaxEbsAttachableVolumes an instance supports, so that we could remove the hacks in this driver and in-tree.
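
A hedged sketch of what such overrides could look like as CLI parameters; the flag names (--reserved-enis, --reserved-attachment-percent) are illustrative only, not a proposed interface:

```go
package main

import (
	"flag"
	"fmt"
)

var (
	reservedENIs    = flag.Int("reserved-enis", 0, "attachment slots to hold back for ENIs")
	reservedPercent = flag.Int("reserved-attachment-percent", 0, "percentage of max attachments to hold back")
)

// effectiveLimit subtracts whichever reservation the admin configured from the
// instance's maximum attachment count.
func effectiveLimit(maxAttachments int) int {
	limit := maxAttachments - *reservedENIs
	limit -= maxAttachments * *reservedPercent / 100
	if limit < 0 {
		limit = 0
	}
	return limit
}

func main() {
	flag.Parse()
	fmt.Println("effective EBS attach limit:", effectiveLimit(28)) // 28 is an assumed max
}
```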

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Mar 14, 2020
@leakingtapan
Contributor

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Apr 13, 2020
@rfranzke
Contributor

rfranzke commented Jun 3, 2020

Given that prior to CSI there was no way to override the volume limits per node, only globally for the whole cluster (https://v1-18.docs.kubernetes.io/docs/concepts/storage/storage-limits/#custom-limits), and given the complexities discussed above: would it make sense to proceed with the suggested CLI flag for now, until a more sophisticated solution can be implemented? This way, at least the existing clusters that were leveraging the KUBE_MAX_PD_VOLS method can continue to work as before when they migrate to CSI. It wouldn't make things worse. WDYT?

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Sep 1, 2020
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Oct 1, 2020
@gnufied
Contributor Author

gnufied commented Oct 1, 2020

This is fixed by #522.

@gnufied gnufied closed this as completed Oct 1, 2020