Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add option for AWS node-groups to run in a single subnet/AZ #1428

Merged
merged 9 commits into from
Jan 27, 2023

Conversation

iameskild
Copy link
Member

Fixes | Closes | Resolves #1388

Please remove anything marked as optional that you don't need to fill in.
Choose one of the keywords preceding to refer to the issue this PR solves, followed by the issue number (e.g Fixes # 666).
If there is no issue, remove the line. Remove this note after reading.

Changes introduced in this PR:

  • This adds a node-group option for it to be created using a single subnet (ie single AZ). At the moment, this will be the default behavior for the worker node-group but there might be interest in having this be the default for all node-groups.
    • Note: this only affects AWS.

Types of changes

What types of changes does your PR introduce?

Put an x in the boxes that apply

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds a feature)
  • Breaking change (fix or feature that would cause existing features to not work as expected)
  • Documentation Update
  • Code style update (formatting, renaming)
  • Refactoring (no functional changes, no API changes)
  • Build related changes
  • Other (please describe):

Testing

Requires testing

  • Yes
  • No

I have manually tested this but more testing might be need to ensure that we are not breaking other features.

In case you checked yes, did you write tests?

  • Yes
  • No

Documentation

Does your contribution include breaking changes or deprecations?
If so have you updated the documentation?

  • Yes, docstrings
  • Yes, main documentation
  • Yes, deprecation notices

Further comments (optional)

If this is a relatively large or complex change, kick off the discussion by explaining why you chose the solution you did and what alternatives you considered and more.

@brl0
Copy link
Contributor

brl0 commented Sep 30, 2022

@iameskild, just a quick heads-up that this change does have the effect of limiting available IP addresses, which limits the number of concurrent workers.

@iameskild
Copy link
Member Author

Thanks for the feedback @brl0! This was a concern that I had and I'm glad we were able to confirm one way or the other.

@iameskild
Copy link
Member Author

@brl0 we recently added an AWS EKS addon (aws-ebs-csi-driver) and I believe there is an addon that will allow us to reduce the number of IP addresses assigned to each of the pods (if I remember correctly its currently ~30 per pod).

I recall researching this before and indeed I did haha - #828 (comment)
That said, I will likely read through this a bit more but this is promising :)

Have a look here for more info: https://github.com/aws/amazon-vpc-cni-k8s

@iameskild iameskild added this to the Release 2022.12.1 milestone Nov 28, 2022
@iameskild iameskild requested review from costrouc and aktech November 28, 2022 11:23
@iameskild iameskild changed the base branch from main to release/2022.12.1 November 28, 2022 11:25
@iameskild iameskild added the needs: review 👀 This PR is complete and ready for reviewing label Nov 28, 2022
@brl0
Copy link
Contributor

brl0 commented Dec 11, 2022

@iameskild, I think this same configuration needs to also be applied to the Node Group for user instances as well, since there is significant data transfer between user instances and workers when computing large dataframes etc, because if this change is not applied to user instances additional fees could be incurred for transfers between those them and the workers.

Also, just curious, but do you know if the subnet mask size can be increased to avoid unnecessarily low worker count constraint this change otherwise implies.

@iameskild iameskild changed the base branch from release/2022.12.1 to develop January 10, 2023 00:36
@iameskild
Copy link
Member Author

@costrouc @viniciusdc this is the last PR that needs to be included in the January release 🎉

Copy link
Member

@costrouc costrouc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No issues with this PR. Will be great to finally run under one subnet as an option. This will remove a lot of the ways in which aws kubernetes behaves weird.

@costrouc costrouc merged commit b2446ec into develop Jan 27, 2023
@costrouc costrouc deleted the aws_single_subnet branch January 27, 2023 17:05
@iameskild iameskild mentioned this pull request Jan 28, 2023
11 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
needs: review 👀 This PR is complete and ready for reviewing provider: AWS type: enhancement 💅🏼 New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[ENH] - Add ability to spinup dask workers in a single AZ (AWS)
4 participants