
Design load-balancing and high-availability of services #1788

Open
NicolasT opened this issue Sep 26, 2019 · 3 comments
Labels
kind:design Solution design choices kind:enhancement New feature or request state:question Further information is requested topic:networking Networking-related issues

Comments

@NicolasT
Contributor

Currently, services exposed using an Ingress object can be load-balanced and are highly available, because the nginx-ingress controller is exposed on ports 80 and 443 of every node's workload-plane interface. As such, if a client is able to fail over to other server(s) (for HA), and clients connect randomly or round-robin to nodes in the cluster (for LB), we achieve both goals.
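
For context, a rough sketch of the kind of DaemonSet port configuration that makes the ingress controller reachable on every node; the names, namespace and image are placeholders, not MetalK8s' actual manifests:

```yaml
# Illustrative sketch only -- not MetalK8s' actual manifest. The point is the
# hostPort bindings: every node running this DaemonSet answers on 80/443.
apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: nginx-ingress-controller   # hypothetical name
  namespace: metalk8s-ingress      # hypothetical namespace
spec:
  selector:
    matchLabels:
      app: nginx-ingress
  template:
    metadata:
      labels:
        app: nginx-ingress
    spec:
      containers:
      - name: controller
        image: nginx-ingress-controller:placeholder  # placeholder image
        ports:
        - name: http
          containerPort: 80
          hostPort: 80
        - name: https
          containerPort: 443
          hostPort: 443
```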

However, many clients insist that a single IP address (e.g. one received from DNS) must keep working, at least until some caches get invalidated, or they are unable to balance access across multiple endpoints.

As such, we need to provide virtual IPs with failover for them, as well as (potentially) load-balancing.

#779 states MetalLB needs to be deployed. Indeed, this allows creating Services of type LoadBalancer: a VIP is then taken from a pool of available addresses, and this VIP is at all times announced via ARP by a single node in the cluster (when using L2 mode, at least; BGP changes the picture). However, this implies that this one node is a bottleneck for all traffic (even though this traffic is indeed forwarded to one of the backing service instances, so load is still balanced at the backend). There's metallb/metallb#439 (comment), but even when implemented, this doesn't seem to fit nicely into the Kubernetes Service model.
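
For reference, a minimal sketch of what that looks like with MetalLB's legacy (pre-0.13) ConfigMap-based L2 configuration; the address range and Service names are assumptions:

```yaml
# Minimal sketch, assuming MetalLB's legacy (pre-0.13) ConfigMap-based
# configuration; the address range must come from the workload-plane subnet.
apiVersion: v1
kind: ConfigMap
metadata:
  namespace: metallb-system
  name: config
data:
  config: |
    address-pools:
    - name: default
      protocol: layer2
      addresses:
      - 10.200.0.240-10.200.0.250
---
# A Service of type LoadBalancer then gets its VIP from the pool above; in L2
# mode that VIP is announced via ARP by a single elected node at a time.
apiVersion: v1
kind: Service
metadata:
  name: ingress-lb            # hypothetical name
  namespace: metalk8s-ingress # hypothetical namespace
spec:
  type: LoadBalancer
  selector:
    app: nginx-ingress
  ports:
  - name: https
    port: 443
    targetPort: 443
```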

This ticket is meant to discuss alternative approaches and eventually come up with a design and implementation. Ideally this is not limited to Ingress-fronted services.

@NicolasT NicolasT added kind:enhancement New feature or request state:question Further information is requested kind:design Solution design choices topic:networking Networking-related issues moonshot labels Sep 26, 2019
@jsecchiero

For L2, soft anti-affinity would be ideal.
Being able to select a preferred pool of workers on which to place the IPs, so they can be dedicated to high-traffic workloads, would also be great.
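
For what it's worth, a sketch of how later, CRD-based MetalLB releases (0.13+) can express the "preferred pool of workers" part by restricting which nodes may announce a given pool; the names, addresses and node label are assumptions, and this API postdates this comment:

```yaml
# Sketch only: CRD-based MetalLB (0.13+) lets an L2Advertisement select the
# nodes allowed to announce a pool. Names, addresses and the node label are
# assumptions.
apiVersion: metallb.io/v1beta1
kind: IPAddressPool
metadata:
  name: high-traffic
  namespace: metallb-system
spec:
  addresses:
  - 10.200.0.251-10.200.0.254
---
apiVersion: metallb.io/v1beta1
kind: L2Advertisement
metadata:
  name: high-traffic-l2
  namespace: metallb-system
spec:
  ipAddressPools:
  - high-traffic
  nodeSelectors:
  - matchLabels:
      workload: high-traffic   # hypothetical node label
```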

@gdemonet gdemonet added this to the MetalK8s 2.4.1 milestone Oct 31, 2019
NicolasT added a commit that referenced this issue Dec 6, 2019
We no longer need this since we provide in-cluster HA for
`kube-apiserver` access. If this is desired for out-of-cluster access,
we can provide this using a `LoadBalancer` `Service` once we have the
infrastructure to support this in place.

This also removed the optional deployment of `keepalived`.

See: #2103
See: #1788
@NicolasT
Contributor Author

NicolasT commented Mar 6, 2020

While it's not exactly what I have in mind, https://github.com/redhat-cop/keepalived-operator goes in this direction to a certain extent. This could be a project we contribute to in order to get done what we want to achieve.

@NicolasT
Contributor Author

NicolasT commented Mar 7, 2020

I experimented a bit:

  • Set up Calico route reflectors on all infra nodes in my cluster
  • Peered these infra nodes with an 'external' router (a Linux VM running BIRD)
  • Enabled ClusterIP advertisement from Calico to this router, while filtering out all Pod IPs

Then, executing HTTP requests from either the router host, or from another VM whose routing table pointed to the router for the ClusterIP network, caused these connections to be spread across multiple backend Pods, as expected. Also, traffic to LoadBalancer Services (at least, to their ClusterIP) with externalTrafficPolicy set to Local was properly sent only to hosts on which an Endpoint Pod was running, and as such did not create any mesh effects.

So, in whatever design we come up with, we should consider the ability to BGP-peer with routers for L3 balancing. This could be done using Calico if we stick to ClusterIPs, or with MetalLB in BGP mode (untested).
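
For reference, a rough sketch of the Calico side of such a setup; the AS numbers, peer IP, service CIDR and node label are assumptions and must match the environment:

```yaml
# Sketch of the Calico resources involved in such an experiment; peer IP, AS
# numbers, the service CIDR and the node label are assumptions.
apiVersion: projectcalico.org/v3
kind: BGPConfiguration
metadata:
  name: default
spec:
  asNumber: 64512
  # Advertise Service ClusterIPs to BGP peers; Pod IP ranges are not listed
  # here, matching the "filter out all Pod IPs" step above.
  serviceClusterIPs:
  - cidr: 10.96.0.0/12
---
apiVersion: projectcalico.org/v3
kind: BGPPeer
metadata:
  name: external-router
spec:
  # Only nodes carrying this (hypothetical) label peer with the external BIRD
  # router; in the experiment those are the infra nodes acting as route
  # reflectors.
  nodeSelector: "has(route-reflector)"
  peerIP: 192.0.2.1
  asNumber: 64513
```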

@gdemonet gdemonet removed this from the MetalK8s 2.6.0 milestone Mar 9, 2020