Increase default node cpu size from 1 to 4. #335

josephburnett · 2018-03-09T18:18:24Z

Previously we were requesting 2.5% of a CPU for Revisions which is not enough to run any meaningful load. And the scheduler will happily pack way too many Pods into each node and therefore the cluster autoscaler won't kick in when needed.

The Elafros autoscaler commit (#229) increases the CPU request per Pod to 1 full CPU. This was necessary to make autoscaling work correctly. However the default, tiny cluster of 3, 1-core nodes simply couldn't fit the Elafros control plane (Istio, Controllers, Webhooks) and two Revisions. Even 2-core machines are a little small.

So I'm increasing the default to 4 cores and autoscaling will bring the cluster down to 1 node when not used.

Each Revision needs most of a CPU. Conformance tests need two Revisions. And we have all the control plane stuff running. So we need bigger nodes.

josephburnett · 2018-03-09T18:48:00Z

#291 (comment) has some more context on why this change is necessary.

bobcatfish · 2018-03-09T20:26:43Z

What do you think about adding some docs at https://github.com/elafros/elafros#latest-release about minimum cluster requirements as well?

grantr · 2018-03-09T20:36:55Z

I created a new cluster with n1-standard-4 nodes, and conformance tests now consistently pass (on this branch). 🎉

grantr · 2018-03-09T23:42:26Z

docs/creating-a-kubernetes-cluster.md

@@ -29,6 +29,7 @@ To use a k8s cluster running in GKE:
      --cluster-version=1.9.2-gke.1 \
      --zone=us-east1-d \
      --scopes=cloud-platform \
+      --machine-type=n1-standard-4 \


Since this will be significantly more expensive for users, can we add a comment explaining why this machine type is necessary?

I added a comment below explaining why this is recommended.

bobcatfish · 2018-03-10T00:39:18Z

/retest

grantr · 2018-03-10T00:45:55Z

docs/creating-a-kubernetes-cluster.md

@@ -29,6 +29,7 @@ To use a k8s cluster running in GKE:
      --cluster-version=1.9.2-gke.1 \
      --zone=us-east1-d \
      --scopes=cloud-platform \
+      --machine-type=n1-standard-4 \


I added a comment below explaining why this is recommended.

bobcatfish · 2018-03-10T00:52:09Z

I'd like to get this merged now so we can be in a state where folks can follow the setup instructions and have passing conformance tests, but @josephburnett could you follow up with some docs or an issue to create docs about minimum system requirements for folks that want to use Elafros?

josephburnett · 2018-03-12T15:21:42Z

@bobcatfish. Ack. I'll add those docs.

…aster (knative#335)

josephburnett added 2 commits March 8, 2018 11:17

Increase node size to n1-standard-2.

f1764eb

Each Revision needs most of a CPU. Conformance tests need two Revisions. And we have all the control plane stuff running. So we need bigger nodes.

increase node cpu count to 4

74767f6

josephburnett requested a review from a team March 9, 2018 18:18

google-prow-robot added the size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. label Mar 9, 2018

josephburnett mentioned this pull request Mar 9, 2018

Autoscaler changes broke deployment in non-default namespace #291

Closed

grantr reviewed Mar 9, 2018

View reviewed changes

Add a note about why this machine type

c3ed18d

grantr approved these changes Mar 10, 2018

View reviewed changes

bobcatfish merged commit 884ba3a into master Mar 10, 2018

bobcatfish mentioned this pull request Mar 10, 2018

Disable autoscaling in non-default namespaces. #325

Closed

yanweiguo mentioned this pull request Mar 12, 2018

Reduce nodes requirement of samples #299

Closed

mattmoor deleted the josephburnett-patch-1 branch March 20, 2018 13:12

markusthoemmes pushed a commit to markusthoemmes/knative-serving that referenced this pull request Nov 20, 2019

🤖 Triggering CI on branch 'release-next' after synching to upstream/m…

c20c4e8

…aster (knative#335)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Increase default node cpu size from 1 to 4. #335

Increase default node cpu size from 1 to 4. #335

josephburnett commented Mar 9, 2018

josephburnett commented Mar 9, 2018

bobcatfish commented Mar 9, 2018

grantr commented Mar 9, 2018

grantr Mar 9, 2018

grantr Mar 10, 2018

bobcatfish commented Mar 10, 2018

grantr Mar 10, 2018

bobcatfish commented Mar 10, 2018

josephburnett commented Mar 12, 2018

Increase default node cpu size from 1 to 4. #335

Increase default node cpu size from 1 to 4. #335

Conversation

josephburnett commented Mar 9, 2018

josephburnett commented Mar 9, 2018

bobcatfish commented Mar 9, 2018

grantr commented Mar 9, 2018

grantr Mar 9, 2018

Choose a reason for hiding this comment

grantr Mar 10, 2018

Choose a reason for hiding this comment

bobcatfish commented Mar 10, 2018

grantr Mar 10, 2018

Choose a reason for hiding this comment

bobcatfish commented Mar 10, 2018

josephburnett commented Mar 12, 2018