The design doc of PodGroup Phase in Status. #533
Conversation
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: k82cn The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing `/approve` in a comment.
@k82cn: GitHub didn't allow me to request PR reviews from the following users: bsalamat, Zyqsempai, JeffWan, MaciekPytel. Note that only kubernetes-sigs members and repo collaborators can review this PR, and authors cannot review their own PRs. In response to this: Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
doc/design/podgroup-status.md (Outdated)
* there are nodes in the cluster that have been underutilized for an extended period of time and their pods can be placed on other existing nodes.

For the first scenario, the Cluster-Autoscaler needs to check the PodGroup's `status.State` to know whether there are sufficient resources to run the whole group of pods.
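As a rough illustration of that check, here is a minimal sketch; the `PodGroup` type, `PodGroupState`, and the state values are assumptions drawn from this design doc's proposal, not an existing kube-batch or CA API:

```go
// Hypothetical sketch of the check described above. The PodGroup type
// and its Status.State field follow this design doc's proposal; none
// of these names are a finalized API.
package podgroup

type PodGroupState string

const (
	// PodGroupPending: the group has been accepted, but the cluster
	// cannot currently run the whole group of pods.
	PodGroupPending PodGroupState = "Pending"
	// PodGroupRunning: enough pods of the group are running.
	PodGroupRunning PodGroupState = "Running"
)

type PodGroupStatus struct {
	State PodGroupState
}

type PodGroup struct {
	Status PodGroupStatus
}

// NeedsScaleUp reports whether an autoscaler should consider adding
// nodes for this group: a Pending group signals that the cluster lacks
// resources to run the whole group.
func NeedsScaleUp(pg PodGroup) bool {
	return pg.Status.State == PodGroupPending
}
```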
This is not how CA works. It doesn't compute required resources at all.
Very roughly, the way it works is:

0. (Import scheduler predicates from the k/k repo.)
1. If there are pending pods (pods with the PodScheduled condition set to false, with reason Unschedulable):
   1.1 Create an in-memory node object that represents a new node that would result from scale-up.
   1.2 Run the imported scheduler predicates on the pending pods to "schedule" them on the in-memory node.
   1.3 If some pods remain pending, go to 1.1.
This way the autoscaler automatically takes into account all scheduling constraints: resources, taints, affinity, storage, etc. Unfortunately, it also means it's fundamentally incompatible with any form of custom scheduling. If you want CA to work with kube-batch, you need to either encapsulate the kube-batch logic in the form of predicates or else duplicate it in CA somehow. It certainly won't be a simple fix.
It would get even worse for things like gang scheduling. You can swap predicates for something different, but ultimately CA expects a black box that takes a (pod, node) pair and tells whether the pod could be scheduled on a given node (see the sketch below).
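A minimal sketch of that loop, assuming the black-box `(pod, node) -> bool` predicate shape described above; all names here are illustrative, not Cluster-Autoscaler's real code, and the real CA tracks per-node capacity through the scheduler's bookkeeping rather than a stateless predicate:

```go
// Illustrative sketch of the scale-up simulation described above.
// The Predicate is the black box CA expects: given a (pod, node)
// pair, can the pod be scheduled there?
package main

import "fmt"

type Pod struct{ Name string }
type Node struct{ Name string }

type Predicate func(pod Pod, node Node) bool

// simulateScaleUp repeatedly "adds" an in-memory copy of the node
// template (step 1.1), runs the predicate to place pending pods on it
// (step 1.2), and loops while pods remain pending (step 1.3). It is
// simplified: it ignores per-node capacity bookkeeping and caps the
// loop at maxNodes to stay bounded.
func simulateScaleUp(pending []Pod, template Node, fits Predicate, maxNodes int) (added int, unschedulable []Pod) {
	for len(pending) > 0 && added < maxNodes {
		node := template // step 1.1: in-memory node from scale-up
		added++
		var remaining []Pod
		for _, p := range pending {
			if !fits(p, node) { // step 1.2: run the predicate
				remaining = append(remaining, p)
			}
		}
		if len(remaining) == len(pending) {
			// Nothing fits even a fresh node; more nodes will not help.
			return added - 1, pending
		}
		pending = remaining // step 1.3: loop if pods remain pending
	}
	return added, pending
}

func main() {
	pods := []Pod{{"a"}, {"b"}, {"c"}}
	// Toy predicate standing in for the imported scheduler predicates.
	fits := func(p Pod, n Node) bool { return p.Name != "c" }
	n, left := simulateScaleUp(pods, Node{"template"}, fits, 10)
	fmt.Printf("nodes to add: %d, still unschedulable: %d\n", n, len(left))
}
```

Swapping `fits` for kube-batch's gang logic is exactly the hard part: a gang decision depends on the whole group, not on a single (pod, node) pair.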
What will happen if pods cannot be scheduled on the new node because of a customized plugin? Will CA continue to create new nodes?
I'm wondering whether CA can keep adding new nodes (with some rate limit and restrictions) if the PodGroup is pending.
CA will check what the default scheduler would do with the pending pods. If CA thinks the pods can be scheduled on existing nodes, it will not add new nodes.
That would make it hard to build plugins that are compatible with CA in the future. In another thread, you mentioned that marking pods unschedulable is just the tip of the iceberg. Do you think we can extract some common criteria from CA for other custom schedulers to meet? For example, the pod condition is just one of them.
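For reference, the pod-condition criterion mentioned here is the one CA keys on today: `PodScheduled=False` with reason `Unschedulable`. Here is a sketch of how a custom scheduler could set it, using the standard `k8s.io/api` types (the helper name is made up):

```go
package scheduler

import (
	v1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// markUnschedulable is a hypothetical helper: it records the condition
// Cluster-Autoscaler looks for when deciding a pod is pending. A custom
// scheduler that wants CA to react would set the same condition (and,
// in practice, persist it via the pod's status subresource).
func markUnschedulable(pod *v1.Pod, message string) {
	pod.Status.Conditions = append(pod.Status.Conditions, v1.PodCondition{
		Type:               v1.PodScheduled,
		Status:             v1.ConditionFalse,
		Reason:             v1.PodReasonUnschedulable, // "Unschedulable"
		Message:            message,
		LastTransitionTime: metav1.Now(),
	})
}
```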
Hmm. I'm thinking of treating CA support as out-of-scope for now, as CA needs to work together with the default scheduler. We can consider enhancing that in a later release.
Signed-off-by: Da K. Ma <[email protected]>
/lgtm
Added PodGroup Phase in Status.
Signed-off-by: Da K. Ma [email protected]
Which issue(s) this PR fixes (optional, in `fixes #<issue number>(, fixes #<issue_number>, ...)` format, will close the issue(s) when PR gets merged): Part of #521
Release note: