Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clarification on vSphere DRS and HA for EKS-A #3586

Closed
echel0n opened this issue Oct 6, 2022 · 7 comments
Closed

Clarification on vSphere DRS and HA for EKS-A #3586

echel0n opened this issue Oct 6, 2022 · 7 comments
Labels
external An issue, bug or feature request filed from outside the AWS org priority/p2 Backlog item stale triage/accepted
Milestone

Comments

@echel0n
Copy link

echel0n commented Oct 6, 2022

Does EKS-A work with a vSphere cluster running DRS and HA services enabled and is it recommended or preferred enabled/disabled ?

I am asking as I could not find anything in the EKS-A documentation about it nor could I find any other people talking on it that use EKS-A and just wanted to know if there is any more information surrounding it, I currently have both disabled right now.

Thanks!

@vincentni vincentni added the external An issue, bug or feature request filed from outside the AWS org label Oct 7, 2022
@vincentni
Copy link
Member

Thanks for reaching out to us! It seems requires vMotion which hasn't been tested yet. So the current recommendation is to disable both.

@echel0n
Copy link
Author

echel0n commented Oct 7, 2022

@vincentni thanks for the recommendation on that, I had run both HA and DRS and found pods getting stuck in creation due to PVC's already being in use, is it also recommend to shutdown the EKS VM before moving it over to a different vSphere node ?

@vincentni
Copy link
Member

We haven't tested EKS-A with a vSphere cluster running DRS and HA, so don't expect users to do so. Unfortunately, I can't give any recommendations on unsupported use cases, but what you plan to do sounds reasonable.

@echel0n
Copy link
Author

echel0n commented Oct 7, 2022

@vincentni I bring this up cause when performing a cluster update, it places the new VM in vSphere randomly it would seem, this can cause resource issues if to many of the VMs end up on the same vSphere node, is there a way to prevent this, like anti-affinity rules or ?

@abhinavmpandey08
Copy link
Member

Hi @echel0n! Thanks for reaching out!
EKS-A can utilize vSphere's DRS and HA services.
You can use a resourcePool that spans across multiple ESXI hosts to ensure the VMs are distributed evenly among your ESXI hosts.

@echel0n
Copy link
Author

echel0n commented Oct 10, 2022

Well I tried using DRS again and not sure if its related or not to #896 but I started having volume attachment issues on cluster upgrades and randomly when pods restarted after an upgrade, I've turned off DRS and manually moved VMs around for now.

@drewvanstone drewvanstone added this to the backlog milestone Nov 10, 2022
@github-actions
Copy link

There has been no activity on this issue for 60 days. Labeling as stale and closing in 7 days if no further activity.

@github-actions github-actions bot added the stale label Jan 10, 2023
@github-actions github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Jan 18, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
external An issue, bug or feature request filed from outside the AWS org priority/p2 Backlog item stale triage/accepted
Projects
None yet
Development

No branches or pull requests

4 participants