Clarification on vSphere DRS and HA for EKS-A #3586

echel0n · 2022-10-06T02:04:25Z

Does EKS-A work with a vSphere cluster running DRS and HA services enabled and is it recommended or preferred enabled/disabled ?

I am asking as I could not find anything in the EKS-A documentation about it nor could I find any other people talking on it that use EKS-A and just wanted to know if there is any more information surrounding it, I currently have both disabled right now.

Thanks!

vincentni · 2022-10-07T22:30:21Z

Thanks for reaching out to us! It seems requires vMotion which hasn't been tested yet. So the current recommendation is to disable both.

echel0n · 2022-10-07T22:33:46Z

@vincentni thanks for the recommendation on that, I had run both HA and DRS and found pods getting stuck in creation due to PVC's already being in use, is it also recommend to shutdown the EKS VM before moving it over to a different vSphere node ?

vincentni · 2022-10-07T22:44:35Z

We haven't tested EKS-A with a vSphere cluster running DRS and HA, so don't expect users to do so. Unfortunately, I can't give any recommendations on unsupported use cases, but what you plan to do sounds reasonable.

echel0n · 2022-10-07T22:47:09Z

@vincentni I bring this up cause when performing a cluster update, it places the new VM in vSphere randomly it would seem, this can cause resource issues if to many of the VMs end up on the same vSphere node, is there a way to prevent this, like anti-affinity rules or ?

abhinavmpandey08 · 2022-10-07T23:55:58Z

Hi @echel0n! Thanks for reaching out!
EKS-A can utilize vSphere's DRS and HA services.
You can use a resourcePool that spans across multiple ESXI hosts to ensure the VMs are distributed evenly among your ESXI hosts.

echel0n · 2022-10-10T23:16:11Z

Well I tried using DRS again and not sure if its related or not to #896 but I started having volume attachment issues on cluster upgrades and randomly when pods restarted after an upgrade, I've turned off DRS and manually moved VMs around for now.

github-actions · 2023-01-10T01:32:53Z

There has been no activity on this issue for 60 days. Labeling as stale and closing in 7 days if no further activity.

vincentni added the external An issue, bug or feature request filed from outside the AWS org label Oct 7, 2022

drewvanstone added triage/accepted priority/p2 Backlog item labels Nov 10, 2022

drewvanstone added this to the backlog milestone Nov 10, 2022

github-actions bot added the stale label Jan 10, 2023

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Jan 18, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clarification on vSphere DRS and HA for EKS-A #3586

Clarification on vSphere DRS and HA for EKS-A #3586

echel0n commented Oct 6, 2022 •

edited

Loading

vincentni commented Oct 7, 2022

echel0n commented Oct 7, 2022

vincentni commented Oct 7, 2022

echel0n commented Oct 7, 2022

abhinavmpandey08 commented Oct 7, 2022

echel0n commented Oct 10, 2022 •

edited

Loading

github-actions bot commented Jan 10, 2023

Clarification on vSphere DRS and HA for EKS-A #3586

Clarification on vSphere DRS and HA for EKS-A #3586

Comments

echel0n commented Oct 6, 2022 • edited Loading

vincentni commented Oct 7, 2022

echel0n commented Oct 7, 2022

vincentni commented Oct 7, 2022

echel0n commented Oct 7, 2022

abhinavmpandey08 commented Oct 7, 2022

echel0n commented Oct 10, 2022 • edited Loading

github-actions bot commented Jan 10, 2023

echel0n commented Oct 6, 2022 •

edited

Loading

echel0n commented Oct 10, 2022 •

edited

Loading