fix: Updating the vmsize for e2e cilium to avoid resource scarcity #2014

vipul-21 · 2023-06-12T22:44:58Z

Reason for Change:

Updating the vm size used for e2e test for cilium

Issue Fixed:

Improves the pipeline failure due to unavailability of resources(memory ) in case of cilium e2e

Requirements:

uses conventional commit messages
includes documentation
adds unit tests
relevant PR labels added

Notes:

vipul-21 · 2023-06-13T16:46:44Z

/azp run

azure-pipelines · 2023-06-13T16:46:56Z

Azure Pipelines successfully started running 2 pipeline(s).

rbtr · 2023-06-13T18:02:33Z

.pipelines/singletenancy/cilium-overlay/cilium-overlay-e2e-step-template.yaml

@@ -32,7 +32,7 @@ steps:
        mkdir -p ~/.kube/
        echo "Create AKS Overlay cluster"
        make -C ./hack/swift azcfg AZCLI=az REGION=$(REGION_OVERLAY_CLUSTER_TEST)
-        make -C ./hack/swift overlay-no-kube-proxy-up AZCLI=az REGION=$(REGION_OVERLAY_CLUSTER_TEST) SUB=$(SUB_AZURE_NETWORK_AGENT_TEST) CLUSTER=${{ parameters.clusterName }}-$(make revision)
+        make -C ./hack/swift overlay-no-kube-proxy-up AZCLI=az REGION=$(REGION_OVERLAY_CLUSTER_TEST) SUB=$(SUB_AZURE_NETWORK_AGENT_TEST) CLUSTER=${{ parameters.clusterName }}-$(make revision) VM_SIZE=Standard_DS4_v2


Standard_DS4_v2 is a HUGE jump up in VM size and cost (12x!). I originally picked B2s as the cheapest viable option - did you evaluate any other SKUs? Maybe we could start at Standard_B2ms which has double the mem at only double the cost?

I tried/tested with Standard_D4_v3 for the perf dashboard( which use goldpinger) that why choose a similar one for our e2e tests.
Will try to test with Standard_B2ms and see if that works.
Thanks !

The Standard_B2ms worked without the resource limit. @rbtr . Thanks for the help
Will try another run to be sure.

rbtr · 2023-06-13T18:04:07Z

test/integration/manifests/goldpinger/deployment.yaml

@@ -76,3 +76,10 @@ spec:
              port: 8080
            initialDelaySeconds: 5
            periodSeconds: 5
+          resources:


The resource block is omitted intentionally so that the goldpinger Pods will overprovision on the nodes. When you add this, only (node mem)/100Mi Goldpinger pods can be scheduled on the Node.

So if the node mem is 8gb( in case Standard_B2ms) the number of pods that can be schedule on that node would be 80. And we need to scale the pods till 100.
Do you recommend adding a smaller limit to accommodate approx ~110 pods OR remove the limit altogether so that we can overprovision on that node

vipul-21 · 2023-06-13T19:23:50Z

/azp run

azure-pipelines · 2023-06-13T19:24:09Z

Azure Pipelines successfully started running 2 pipeline(s).

…2014) CI: Testing the e2e test for cilium

* fix: assume invalid semver CNI has the required dump state command (#2078) * fix: Updating the vmsize for e2e cilium to avoid resource scarcity (#2014) CI: Testing the e2e test for cilium --------- Co-authored-by: Vipul Singh <[email protected]>

vipul-21 added ci Infra or tooling. do-not-merge work-in-progress labels Jun 12, 2023

vipul-21 force-pushed the singhvipul/ciliume2e branch 6 times, most recently from dbbab4b to 2550c63 Compare June 13, 2023 15:50

vipul-21 force-pushed the singhvipul/ciliume2e branch from 2550c63 to 81108b4 Compare June 13, 2023 17:36

vipul-21 added fix Fixes something. and removed do-not-merge work-in-progress labels Jun 13, 2023

vipul-21 marked this pull request as ready for review June 13, 2023 17:42

vipul-21 requested a review from a team as a code owner June 13, 2023 17:42

vipul-21 requested a review from isaac-dasan June 13, 2023 17:42

vipul-21 changed the title ~~Testing the e2e test for cilium~~ fix: Updating the vmsize and goldpinger resource limit for e2e cilium to avoid resource scarcity Jun 13, 2023

rbtr requested changes Jun 13, 2023

View reviewed changes

vipul-21 added the work-in-progress label Jun 13, 2023

vipul-21 force-pushed the singhvipul/ciliume2e branch 2 times, most recently from 7dccfc5 to 13e9c22 Compare June 13, 2023 18:23

vipul-21 changed the title ~~fix: Updating the vmsize and goldpinger resource limit for e2e cilium to avoid resource scarcity~~ fix: Updating the vmsize for e2e cilium to avoid resource scarcity Jun 13, 2023

vipul-21 removed the work-in-progress label Jun 13, 2023

vipul-21 requested a review from rbtr June 13, 2023 20:15

rbtr approved these changes Jun 13, 2023

View reviewed changes

vipul-21 enabled auto-merge (squash) June 13, 2023 21:43

CI: Testing the e2e test for cilium

4b495eb

vipul-21 force-pushed the singhvipul/ciliume2e branch from 13e9c22 to 4b495eb Compare June 13, 2023 22:59

vipul-21 merged commit 6b805e5 into master Jun 14, 2023

vipul-21 deleted the singhvipul/ciliume2e branch June 14, 2023 01:17

thatmattlong pushed a commit that referenced this pull request Jul 27, 2023

fix: Updating the vmsize for e2e cilium to avoid resource scarcity (#…

02bd47f

…2014) CI: Testing the e2e test for cilium

thatmattlong pushed a commit that referenced this pull request Jul 28, 2023

fix: Updating the vmsize for e2e cilium to avoid resource scarcity (#…

0c9666c

…2014) CI: Testing the e2e test for cilium

thatmattlong mentioned this pull request Jul 28, 2023

Release candidate v1.4.44.4 #2094

Merged

4 tasks

jpayne3506 mentioned this pull request Aug 8, 2023

ci: [CNI] [NPM] Add NPM|CNI integration test to load-test pipeline. #2105

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: Updating the vmsize for e2e cilium to avoid resource scarcity #2014

fix: Updating the vmsize for e2e cilium to avoid resource scarcity #2014

vipul-21 commented Jun 12, 2023 •

edited

Loading

vipul-21 commented Jun 13, 2023

azure-pipelines bot commented Jun 13, 2023

rbtr Jun 13, 2023

vipul-21 Jun 13, 2023

vipul-21 Jun 13, 2023

rbtr Jun 13, 2023

vipul-21 Jun 13, 2023

vipul-21 commented Jun 13, 2023

azure-pipelines bot commented Jun 13, 2023

fix: Updating the vmsize for e2e cilium to avoid resource scarcity #2014

fix: Updating the vmsize for e2e cilium to avoid resource scarcity #2014

Conversation

vipul-21 commented Jun 12, 2023 • edited Loading

vipul-21 commented Jun 13, 2023

azure-pipelines bot commented Jun 13, 2023

rbtr Jun 13, 2023

Choose a reason for hiding this comment

vipul-21 Jun 13, 2023

Choose a reason for hiding this comment

vipul-21 Jun 13, 2023

Choose a reason for hiding this comment

rbtr Jun 13, 2023

Choose a reason for hiding this comment

vipul-21 Jun 13, 2023

Choose a reason for hiding this comment

vipul-21 commented Jun 13, 2023

azure-pipelines bot commented Jun 13, 2023

vipul-21 commented Jun 12, 2023 •

edited

Loading