You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is to propose the Kubernetes-node style health-check mechanisms for the MachinePools. It shall circumvent the issues related to degraded health of the previously Ready MachinePools.
The possible solution to such issues could involve:
Heartbeats from the MachinePoollet to APIServer/MachinePool.
Machinepool-controller, declaring MachinePool to be Unknown, NotReady based on pre-determined configurations, when beats are missing.
This is similar to how Kubelet updates the ReadyNode.Status.NodeCondtion[] regularly, missing of which leads Node-controller to declare Nodes to be Unknown/NodeReady.
The possible consumer for this could be the Scheduler, which can prevent further workload from being scheduled on the affected Machinepool, while also eviction-controllers being able to evict workloads if needed.
Considering this is a bigger epic, it's highly recommended to prepare an Enhancement proposal first.
This can also have possible touch-points with the Node-problem-detector like design with MachinePool, which is better discussed separately.
The text was updated successfully, but these errors were encountered:
Summary
This is to propose the Kubernetes-node style health-check mechanisms for the MachinePools. It shall circumvent the issues related to degraded health of the previously
Ready
MachinePools.The possible solution to such issues could involve:
Unknown
,NotReady
based on pre-determined configurations, when beats are missing.This is similar to how Kubelet updates the
Ready
Node.Status.NodeCondtion[]
regularly, missing of which leadsNode-controller
to declare Nodes to beUnknown
/NodeReady
.The possible consumer for this could be the Scheduler, which can prevent further workload from being scheduled on the affected Machinepool, while also eviction-controllers being able to evict workloads if needed.
Basic example
Motivation
To enhance the means the disaster recovery.
Note
Considering this is a bigger epic, it's highly recommended to prepare an Enhancement proposal first.
This can also have possible touch-points with the
Node-problem-detector
like design with MachinePool, which is better discussed separately.The text was updated successfully, but these errors were encountered: