csi: allow more than 1 writer claim for multi-writer mode #9040

tgross · 2020-10-07T12:50:51Z

Fixes a bug where CSI volumes with the MULTI_NODE_MULTI_WRITER access mode
were using the same logic as MULTI_NODE_SINGLE_WRITER to determine whether
the volume had writer claims available for scheduling.

vercel · 2020-10-07T12:50:54Z

This pull request is being automatically deployed with Vercel (learn more).
To see the status of your deployment, click below or on the icon next to each commit.

🔍 Inspect: https://vercel.com/hashicorp/nomad/ijjeq018w
✅ Preview: https://nomad-git-b-csi-multiwriter-count.hashicorp.vercel.app

Fixes a bug where CSI volumes with the `MULTI_NODE_MULTI_WRITER` access mode were using the same logic as `MULTI_NODE_SINGLE_WRITER` to determine whether the volume had writer claims available for scheduling.

notnoop · 2020-10-07T13:39:03Z

nomad/structs/csi.go

+		// the CSI spec doesn't allow for setting a max number of writers.
+		// we track node resource exhaustion through v.ResourceExhausted
+		// which is checked in WriteSchedulable
+		return true


That's a bit unfortunate - but it makes sense. Does it make sense to have an higher level/integration/e2e test to test the semantics?

The reason it wasn't originally E2E tested was because very few CSI plugins actually allow this mode (typically on-prem solutions), so we can't feasibly do an E2E test for it. But we could probably add this dimension to one of the existing integration-style tests in the nomad package. Let me take a look at that.

I dug into this a bit and that node resource exhaustion check isn't being made during the claim but during scheduling (in the feasibility checker), which in retrospect makes sense as we only validate arguments in an RPC and you can add resources to a cluster after the job submission has been made to make it feasible.

I've swapped one of the volumes in the existing feasibility checker tests to multiwriter and verified this code path is getting hit. I've also extended one of the existing RPC tests a bit to make sure the logic for ReadFreeClaims is being checked better while I'm in here. Our CSI E2E tests could use some work in general, but adding exhaustion checking would be a good idea for future work there.

…9040) Fixes a bug where CSI volumes with the `MULTI_NODE_MULTI_WRITER` access mode were using the same logic as `MULTI_NODE_SINGLE_WRITER` to determine whether the volume had writer claims available for scheduling. Extends CSI claim endpoint test to exercise multi-reader and make sure `WriteFreeClaims` is exercised for multi-writer in feasibility test.

github-actions · 2022-12-15T02:19:11Z

I'm going to lock this pull request because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active contributions.
If you have found a problem that seems related to this change, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

tgross added this to the 0.13 milestone Oct 7, 2020

tgross requested review from notnoop and shoenig October 7, 2020 13:25

tgross force-pushed the b-csi-multiwriter-count branch from 617c0e7 to ba2eac6 Compare October 7, 2020 13:26

vercel bot deployed to Preview October 7, 2020 13:26 View deployment

tgross force-pushed the b-csi-multiwriter-count branch from ba2eac6 to 51007f5 Compare October 7, 2020 13:36

vercel bot temporarily deployed to Preview October 7, 2020 13:37 Inactive

csi: allow more than 1 writer claim for multi-writer mode

87f62b5

Fixes a bug where CSI volumes with the `MULTI_NODE_MULTI_WRITER` access mode were using the same logic as `MULTI_NODE_SINGLE_WRITER` to determine whether the volume had writer claims available for scheduling.

tgross force-pushed the b-csi-multiwriter-count branch from 51007f5 to 87f62b5 Compare October 7, 2020 13:38

vercel bot deployed to Preview October 7, 2020 13:38 View deployment

notnoop approved these changes Oct 7, 2020

View reviewed changes

extend CSI claim endpoint test to exercise multi-reader

b3342ad

vercel bot deployed to Preview October 7, 2020 14:09 View deployment

make sure WriteFreeClaims is exercised for multiwriter

8e8e6c7

vercel bot deployed to Preview October 7, 2020 14:23 View deployment

tgross merged commit 7cff2de into master Oct 7, 2020

tgross deleted the b-csi-multiwriter-count branch October 7, 2020 14:43

tgross mentioned this pull request Oct 7, 2020

multi-writer volume does not calculate free claims correctly #8968

Closed

github-actions bot locked as resolved and limited conversation to collaborators Dec 15, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

csi: allow more than 1 writer claim for multi-writer mode #9040

csi: allow more than 1 writer claim for multi-writer mode #9040

tgross commented Oct 7, 2020

vercel bot commented Oct 7, 2020 •

edited

Loading

notnoop Oct 7, 2020

tgross Oct 7, 2020

tgross Oct 7, 2020

github-actions bot commented Dec 15, 2022

csi: allow more than 1 writer claim for multi-writer mode #9040

csi: allow more than 1 writer claim for multi-writer mode #9040

Conversation

tgross commented Oct 7, 2020

vercel bot commented Oct 7, 2020 • edited Loading

notnoop Oct 7, 2020

Choose a reason for hiding this comment

tgross Oct 7, 2020

Choose a reason for hiding this comment

tgross Oct 7, 2020

Choose a reason for hiding this comment

github-actions bot commented Dec 15, 2022

vercel bot commented Oct 7, 2020 •

edited

Loading