csi: unpublish workflow ID mismatches #7626
After the release I want to refactor some of these names to disambiguate them and prevent this sort of error in the future, but for now this will do the job.
}
targetCSIInfo, ok := targetNode.CSINodePlugins[args.plug.ID]
if !ok {
	return args.nodeClaims, fmt.Errorf("Failed to find NodeInfo for node: %s", targetNode.ID)
It seems like this failure could be temporary. Will we retry if the node plugin is in the process of coming back up? Also, is the CSI Node ID constant when the plugin comes back up, or is it generated by the node plugin every time it starts?
The GC job will get requeued (which works fine for us with #7632, where Job.Deregister will use the same GC path).
As far as I can tell, the CSI Node ID is implementation-specific, but in practice it has to be a fixed identifier of the underlying host. For example, in the EBS case it's the EC2 instance ID, because if it weren't, the controller wouldn't be able to attach the external volume to it.
#7632 includes all these changes, so we can either merge that in to pick these all up or merge this separately and rebase it from master once that's done.
Closing as these changes were rolled into #7632.
I'm going to lock this pull request because it has been closed for 120 days ⏳. This helps our maintainers find and focus on the active contributions.
Fixes for #7628