
Improve unschedulable task warning messages by integrating with the autoscaler #18724

Merged: 50 commits into ray-project:master on Sep 24, 2021

Conversation

@ericl (Contributor) commented Sep 17, 2021

Why are these changes needed?

Today, we raise warnings when tasks / actors are not immediately schedulable. These warnings are confusing since they don't take possible future autoscaling into account, and hence can be false positives. False positives are bad since:

  • The user is often confused ("shouldn't my cluster autoscale?" "this warning doesn't make sense")
  • We can't raise exceptions since it could be a false positive ("user ignores warning and is confused when their app hangs")

Library users like Serve today disable these warnings by using placement groups, which is not ideal.

This PR eliminates these false positives via integration with the autoscaler. Instead of the raylet printing messages when resources are not schedulable, it defers to the autoscaler. The autoscaler can determine if a task will be infeasible even after autoscaling. This PR:

  • Makes the autoscaler always active (in "readonly" mode for laptop / manually set up clusters)
  • Defers responsibility for scheduler warning messages to autoscaler

In future PRs, we can close the loop by raising exceptions for "permanently infeasible" tasks. This would require the autoscaler to send statuses back to the scheduler about what task types are infeasible.
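
For illustration, a minimal sketch of this deferral (hypothetical names, assuming a simple per-node-type feasibility check; this is not Ray's actual implementation):

```python
# Hypothetical sketch of the warning deferral described above. The names
# ResourceDemand, fits_on_some_node_type, and warn_on_infeasible are
# illustrative, not Ray's internal APIs.
from typing import Dict, List

ResourceDemand = Dict[str, float]


def fits_on_some_node_type(demand: ResourceDemand,
                           node_types: List[Dict[str, float]]) -> bool:
    """True if some launchable (or already present) node type can
    satisfy the demand on its own."""
    return any(
        all(node.get(resource, 0.0) >= amount
            for resource, amount in demand.items())
        for node in node_types)


def warn_on_infeasible(demands: List[ResourceDemand],
                       node_types: List[Dict[str, float]]) -> None:
    # Warn only for demands that stay infeasible even after autoscaling;
    # demands that are merely waiting on a scale-up produce no warning.
    for demand in demands:
        if not fits_on_some_node_type(demand, node_types):
            print(f"No available node types can fulfill resource "
                  f"request {demand}.")
```

With `node_types = [{"CPU": 16.0}]` (matching the 4 x 16-CPU laptop cluster in the sample below), the `{'CPU': 30.0}` demand would be flagged while `{'CPU': 4.0}` would not.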

PRD doc: https://docs.google.com/document/d/1OT6m4xQDN8UtsBgnAMpX6nhXpNAfdeHJVve-iGhw1WI/edit#

Sample `ray status` output on a laptop (autoscaler output format is unchanged):

======== Autoscaler status: 2021-09-23 17:45:56.525566 ========
Node status
---------------------------------------------------------------
Healthy:
 1 node_777cd260045578b90970679657460908a1ef8285ed248a093e79cc72
 1 node_2509c7c51cfb659b77700cb34c2035df2cf016a67a1868864da6d4b4
 1 node_50973ed2e3eb5a30f64a6e107ec76d9aec46b7cfdd502a66135aa2a4
 1 node_a7cf546146ef87737902b042ed0fb73619913d197617bca7899beb73
Pending:
 (no pending nodes)
Recent failures:
 (no failures)

Resources
---------------------------------------------------------------
Usage:
 60.0/64.0 CPU
 0.00/106.382 GiB memory
 0.00/0.586 GiB object_store_memory

Demands:
 {'CPU': 4.0}: 92+ pending tasks/actors
 {'CPU': 3.0}: 193+ pending tasks/actors
 {'CPU': 1.0, 'foo': 1.0}: 2+ pending tasks/actors
 {'CPU': 30.0}: 2+ pending tasks/actors
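
The report above comes from the `ray status` CLI mentioned later in this thread; running it against a live cluster is all that's needed:

```bash
# Print the autoscaler status report shown above for the current cluster.
ray status
```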

Related issue number

Closes #15933: [core] Better error message for task/actors when unschedulable (integrate with autoscaler)

TODO:

  • Add integration tests for emitted log messages
  • Add integration test for ray status with readonly provider
  • Update unit tests for resource demand scheduler
  • Add unit test for use of readonly node provider

@sasha-s (Contributor) left a comment

Reviewed 17 of 17 files at r1, all commit messages.
Reviewable status: all files reviewed, 3 unresolved discussions (waiting on @AmeerHajAli, @DmitriGekhtman, @ericl, @ijrsvt, @pcmoritz, @raulchen, @robertnishihara, and @wuisawesome)


python/ray/worker.py, line 1091 at r1 (raw file):

                yield ("Tip: use `ray status` to view detailed "
                       "cluster status. To disable these "
                       "messages, set RAY_SCHEDULER_EVENTS=0.")

Right now AUTOSCALER_EVENTS is not documented.
Do we want to document RAY_SCHEDULER_EVENTS?
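
For context, a minimal usage sketch of the flag from the snippet above (the script name is a placeholder):

```bash
# Disable the scheduler warning messages, per the tip quoted above.
# my_script.py stands in for any Ray program.
RAY_SCHEDULER_EVENTS=0 python my_script.py
```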


python/ray/autoscaler/_private/autoscaler.py, line 255 at r1 (raw file):

        def schedule_node_termination(node_id: NodeID,
                                      reason_opt: Optional[str]) -> None:
            if self.provider.is_readonly():

It is a bit confusing that we allow mutating operations on readonly providers and silently ignore them.
Also, I think we re-check is_readonly() below, which becomes redundant if we skip here.
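
To make the concern concrete, a simplified sketch of the pattern under discussion (only `provider.is_readonly()` comes from the diff; the class shell and comments are illustrative):

```python
# Illustrative only, not the PR's actual code: an early-return guard that
# makes any later is_readonly() re-check unreachable.
from typing import Optional


class StandardAutoscaler:  # minimal stand-in for the real class
    def __init__(self, provider):
        self.provider = provider

    def schedule_node_termination(self, node_id: str,
                                  reason_opt: Optional[str]) -> None:
        if self.provider.is_readonly():
            # Readonly providers (laptop / manually managed clusters)
            # cannot terminate nodes, so the request is dropped here.
            return
        # ... real termination logic would go here; an is_readonly()
        # check past this point can never fire.
```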


python/ray/autoscaler/_private/monitor.py, line 237 at r1 (raw file):

        mirror_node_types = {}
        resource_deadlock = False

resource_deadlock sounds like a bug; maybe rename it to something like
not_enough_resources?

@ericl (Contributor, Author) commented Sep 23, 2021

Agreed on supporting the legacy logs for now; I'll add a feature flag.

@ericl (Contributor, Author) commented Sep 24, 2021

Done with a pass over the comments. @AmeerHajAli, I attached a `ray status` output in the PR description for readonly cluster status (autoscaler status is unchanged). You can also check out the asserts in test_cli.py and test_output.py.

@ericl removed the @author-action-required label (Sep 24, 2021)
@DmitriGekhtman (Contributor) left a comment

Looks great!
There are some tests to patch up.

@DmitriGekhtman added the @author-action-required label (Sep 24, 2021)
@ericl merged commit 11a2dfc into ray-project:master (Sep 24, 2021)
@ericl (Contributor, Author) commented Sep 24, 2021

Windows build seems to not trigger correctly, but yolo
