New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Blog Post: Out of the Clouds onto the Ground: How to Make Kubernetes Production Grade Anywhere #9716

Merged

k8s-ci-robot merged 4 commits into kubernetes:master from kbarnard10:blog

Aug 3, 2018

Contributor

kbarnard10 commented Aug 2, 2018

Adding blog post.

kbarnard10 added 2 commits

August 1, 2018 18:39


          adding blog post

eeb1132


          adding blog post

69e85fc

k8s-ci-robot added the cncf-cla: yes label

k8s-ci-robot requested review from natekartchner and SarahKConway

August 2, 2018 02:10

k8s-ci-robot added the size/L label


          Revert "adding blog post"

de68f5e

This reverts commit eeb1132.

Contributor Author

kbarnard10 commented Aug 2, 2018

/assign @zacharysarah
/assign @natekartchner

k8s-ci-robot assigned natekartchner and zacharysarah

Collaborator

k8sio-netlify-preview-bot commented Aug 2, 2018

Deploy preview for kubernetes-io-master-staging ready!

Built with commit 69e85fc

https://deploy-preview-9716--kubernetes-io-master-staging.netlify.com

Collaborator

k8sio-netlify-preview-bot commented Aug 2, 2018 •

edited

Loading

Deploy preview for kubernetes-io-master-staging ready!

Built with commit fef53fc

https://deploy-preview-9716--kubernetes-io-master-staging.netlify.com

neolit123 reviewed

View reviewed changes

Member

neolit123 left a comment

@kbarnard10 @cantbewong @embano1
thanks for the writeup! 👍

i've added some comments and found a couple of typos.

content/en/blog/_posts/2018-08-03-make-kubernetes-production-grade-anywhere.md Outdated


		Authors: Steven Wong (VMware), Michael Gasch (VMware)

		This blog offers some guidelines for running a production-grade Kubernetes cluster in an environment like an on-premise data center or edge location.

Member

neolit123 Aug 2, 2018

consistency for production grade

content/en/blog/_posts/2018-08-03-make-kubernetes-production-grade-anywhere.md Outdated


		This article is directed at on-premise Kubernetes deployments on a hypervisor or bare-metal platform, facing finite backing resources compared to the expansibility of the major public clouds. However, some of these recommendations may also be useful in a public cloud if budget constraints limit the resources you choose to consume.

		A single node bare-metal Minikube deployment may be cheap and easy, but is not production grade. Conversely, you’re not likely to achieve Google’s Borg experience in a retail store, branch office, or edge location -- nor are you likely to need it.

Member

neolit123 Aug 2, 2018

-- nor -> , nor

content/en/blog/_posts/2018-08-03-make-kubernetes-production-grade-anywhere.md Outdated


		A single node bare-metal Minikube deployment may be cheap and easy, but is not production grade. Conversely, you’re not likely to achieve Google’s Borg experience in a retail store, branch office, or edge location -- nor are you likely to need it.

		This blog offers some guidance on achieving a production-worthy Kubernetes deployment, even when dealing with some resource constraints.

Member

neolit123 Aug 2, 2018

possibly production-worthy -> production worthy

content/en/blog/_posts/2018-08-03-make-kubernetes-production-grade-anywhere.md Outdated


		![api server](/images/blog/2018-08-03-make-kubernetes-production-grade-anywhere/api-server.png)

		Typically the API server, Controller Manager and Scheduler components are co-located within multiple instances of control plane (aka Master) nodes. Master nodes usually include etcd too – although there are high availability and large cluster scenarios that call for running etcd on independent hosts. The components can be run as containers, and optionally be supervised by Kubernetes, i.e., running as statics pods. The latter requires the kubelet agent on the control plane nodes.

Member

neolit123 Aug 2, 2018

FYI there were recent discussion to move away from the term master in k8s, yet we do have this everywhere in the docs and in the code base.

etcd too – although -> etcd too, although

Member

neolit123 Aug 2, 2018

, i.e., running -> - i.e. running

Member

neolit123 Aug 2, 2018

The latter requires the kubelet agent on the control plane nodes.

i think the latter it's not very clear, also the kubelet runs on every node, not only CP nodes.
i would omit the last sentence.

content/en/blog/_posts/2018-08-03-make-kubernetes-production-grade-anywhere.md Outdated


		![kubernetes components HA](/images/blog/2018-08-03-make-kubernetes-production-grade-anywhere/kubernetes-components-ha.png)

		Risks to these components include hardware failures, software bugs, bad updates, human errors, network outages, and overloaded systems resulting in resource exhaustion. Redundancy can mitigate the impact of many of these hazards. In addition, the resource scheduling and high availability features of a hypervisor platform can be useful to surpass what can be achieved using the Linux operating system, Kubernetes, and container runtime alone.

Member

neolit123 Aug 2, 2018

, and container runtime -> and a container runtime

content/en/blog/_posts/2018-08-03-make-kubernetes-production-grade-anywhere.md Outdated


		## Security

		Every Kubernetes cluster has a cluster root Certificate Authority (CA). Master, Kubelet, and Kubectl certs need to be generated and installed. If you use an install tool or a distribution this may be handled for you. A manual process is described [here](https://github.com/kelseyhightower/kubernetes-the-hard-way/blob/master/docs/04-certificate-authority.md). You should be prepared to reinstall certificates in the event of node replacements or expansions.

Member

neolit123 Aug 2, 2018

Master, Kubelet, and Kubectl

this is more valid and also aligns with Kelsey's guide:

The Controller Manager, API Server, Scheduler, kubelet client, kube-proxy and administrator certificates

content/en/blog/_posts/2018-08-03-make-kubernetes-production-grade-anywhere.md Outdated

+              * Consider physical security, especially when deploying to edge or remote office locations that may be unattended. Include storage encryption to limit exposure from stolen devices and protection from attachment of malicious devices like USB keys.
+              * Protect Kubernetes plain-text cloud provider credentials (access keys, tokens, passwords, etc.)
+              Kubernetes [secret](https://kubernetes.io/docs/concepts/configuration/secret/) objects are appropriate for holding small amounts of sensitive data. These are retained within etcd. These can be readily used to hold credentials for the Kubernetes API but there are times when a workload or an extension of the cluster itself needs a more full-featured solution. The HashiCorp Vault project is is a popular solution if you need more than the built-in secret objects can provide.

Member

neolit123 Aug 2, 2018

is is a popular has double is.

content/en/blog/_posts/2018-08-03-make-kubernetes-production-grade-anywhere.md Outdated


		Backing up an etcd cluster can be accomplished with etcd’s [built-in](https://coreos.com/etcd/docs/latest/op-guide/recovery.html) snapshot mechanism, and copying the resulting file to storage in a different failure domain. The snapshot file contains all the Kubernetes states and critical information. In order to keep the sensitive Kubernetes data safe, encrypt the snapshot files.

		Using disk volume based snapshot recovery of etcd can have issues; see https://github.com/kubernetes/kubernetes/issues/40027. API-based backup solutions (e.g., [Ark](https://github.com/heptio/ark)) can offer more granular recovery than a etcd snapshot, but also can be slower. You could utilize both snapshot and API-based backups, but you should do one form of etcd backup as a minimum.

Member

neolit123 Aug 2, 2018

issues; see kubernetes/kubernetes#40027

to:

issues. See #40027.

website won't map the link same way as github does.

content/en/blog/_posts/2018-08-03-make-kubernetes-production-grade-anywhere.md Outdated

+              ## Considerations for your production workloads
+              Anti-affinity specifications can be used to split clustered services across backing hosts, but at this time the settings are used only when the pod is scheduled. This means that Kubernetes can restart a failed node of your clustered application, but does not have a native mechanism to rebalance after a fail back. This is a topic worthy of a separate blog, but supplemental logic might be useful to achieve optimal workload placements after host or worker node recoveries or expansions. The [Pod Priority and Preemption feature](https://kubernetes.io/docs/concepts/configuration/pod-priority-preemption/) can be used to specify a preferred triage in the event of resource shortages caused by failures or bursting workloads.
+              For stateful services, external attached volume mounts are the standard Kubernetes recommendation for a non-clustered service (e.g., a typical SQL database). At this time Kubernetes managed snapshots of these external volumes is in the category of a [roadmap feature request](https://docs.google.com/presentation/d/1dgxfnroRAu0aF67s-_bmeWpkM1h2LCxe6lB1l1oS0EQ/edit#slide=id.g3ca07c98c2_0_47), likely to align with the Container Storage Interface (CSI) integration. Thus performing backups of such a service would involve application specific, in-pod activity that is beyond the scope of this document. While awaiting better Kubernetes support for a snapshot and backup workflow, running your database service in a VM rather than a container, and exposing it it to your Kubernetes workload may be worth consideration.

Member

neolit123 Aug 2, 2018

it it to your has double it

Member

neolit123 Aug 2, 2018

worth consideration -> worth considering ?

content/en/blog/_posts/2018-08-03-make-kubernetes-production-grade-anywhere.md Outdated


		Buying a ticket on a commercial airline is convenient and safe. But when you travel to a remote location with a short runway, that commercial Airbus A320 flight isn’t an option. This doesn’t mean that air travel is off the table. It does mean that some compromises are necessary.

		The adage in aviation is that on a single engine aircraft, an engine failure means you crash. With twin engines, at the very least, you get more choices of where you crash. Kubernetes on a small number of hosts is sort of like this. And if your business case justifies it, you might scale up to a larger fleet of mixed large and small vehicles (e.g., FedEx, Amazon).

Member

neolit123 Aug 2, 2018

is sort of like this. And if -> is similar, and if

Member

neolit123 commented Aug 3, 2018 •

edited

Loading

FYI, i will submit a copy edit commit for this later today, as discussed with @kbarnard10.

Member

neolit123 commented Aug 3, 2018

@kbarnard10
Github tells me that i don't have push access to the kbarnard10:blog branch.

if you still want me to help with the edits, you need to grant me permission for the branch:
https://help.github.com/articles/enabling-branch-restrictions/

though, i need to leave in a couple of hours and i can do the rest on Sunday or Monday. 👍

Contributor

cantbewong commented Aug 3, 2018

@neolit123 thanks for the edits, all your changes LGTM


          Update 2018-08-03-make-kubernetes-production-grade-anywhere.md

fef53fc

Contributor Author

kbarnard10 commented Aug 3, 2018

@neolit123 I made your suggested edits for time's sake. But will add you to future blog posts.

natekartchner commented Aug 3, 2018

/lgtm
/approve

k8s-ci-robot added the lgtm label

Contributor

k8s-ci-robot commented Aug 3, 2018

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: natekartchner

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~content/en/blog/OWNERS~~ [natekartchner]
~~static/images/blog/OWNERS~~ [natekartchner]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot added the approved label

k8s-ci-robot merged commit 4af1c1c into kubernetes:master

Member

neolit123 commented Aug 3, 2018

@kbarnard10 awesome, thanks.

embano1 mentioned this pull request

REQUEST: New membership for embano1 kubernetes/org#671

Closed

6 tasks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved cncf-cla: yes lgtm size/L