`transition_processing_memory` optimizations, etc. #4487

jakirkham · 2021-02-06T05:19:43Z

Closes #xxxx
Tests added / passed
Passes black distributed / flake8 distributed

Goes through transition_processing_memory and optimizes it more carefully. In particular simplifies the code around starts tops, durations, and occupancies. Also types variables to get more performant Cython code. Makes use of intermediate variables to group work and allow easier reuse in later expressions. Tries to simplify data generated and collected to cutdown on general overhead associated. Additionally really cuts down on the work needed to produce messages for clients and workers.

Also includes various small optimizations like using .get(...), typing variables, dropping unneeded intermediate variables, constructing values once, using typed attributes, etc.

jakirkham · 2021-02-08T17:49:43Z

Planning on merging tomorrow if no comments

distributed/scheduler.py

jakirkham · 2021-02-09T05:21:15Z

Seeing an unrelated test failure. Am able to reproduce without these changes. Filed as issue ( #4493 )

Edit: This should be resolved now that PR ( dask/dask#7194 ) is in

jakirkham · 2021-02-10T02:15:56Z

Looks like CI has a different issue unrelated to this PR. Fixing with PR ( #4499 )

Edit: Restarting CI now that that fix is in

Edit 2: Seems like GH Actions doesn't do a merge commit when testing. So was still seeing the old failure. Went ahead and rebased. Should clear out that issue

Avoids checking for the value and then retrieving it by simply trying to get the associated value. If it is not found, it is `None`, which is fast to check after and handle.

Allows Cython to optimize operations on this object.

This should allow faster access (particularly in Cython).

This is a holdover from before the `SchedulerState` refactor. Now this is properly typed and can be used efficiently in Cython anyways.

Avoids duplicate retrieval. Also handles the coercion to `double` once.

Avoid collecting an intermediate list of client keys and instead use them to produce messages directly.

This makes it easier to type this value and thus for Cython to optimize this value throughout.

Should knock out type checking when the function is called, which should simplify the work needed in later steps.

This allows us to only create the `set` `s` when we know we need it. Otherwise we just get the `set` previously contained in the `dict`.

Instead of creating an empty `set` when `s` is not defined, just check that `s` is non-trivial (either `None` or empty). This should be a bit faster and avoid the unnecessary creation step.

As `defaultdict`s are difficult for Cython to optimize and usually a `dict` will suffice, change `_unknown_duractions` to a `dict` to allow Cython to use Python C API specific to `dict`s.

This makes it easier to annotate these more accurately and benefit from Cython's optimizations around these `dict`-typed variables. Requires a very small amount of checking for keys and setting their values if not present. Though this remains efficient in both Python & Cython as well as compact.

These are annotated local variables, which are holdovers from before the `SchedulerState` refactor. We can now drop these and use `SchedulerState` to access these directly.

Cython will turn this into a very efficient `switch...case` statement. So this cuts down on the overhead of comparisons and checks (paying them once when computing the length). Then focuses on just checking this C typed integral value for which `case` to run.

jakirkham · 2021-02-10T16:28:00Z

Planning on merging tomorrow if no comments

jakirkham force-pushed the misc_opts branch 2 times, most recently from 3a7151d to 7df87de Compare February 6, 2021 05:52

jakirkham changed the title ~~[WIP] Miscellaneous optimizations~~ [WIP] transition_processing_memory optimizations Feb 6, 2021

jakirkham force-pushed the misc_opts branch 2 times, most recently from fd0991e to 087e0f0 Compare February 6, 2021 06:27

jakirkham changed the title ~~[WIP] transition_processing_memory optimizations~~ transition_processing_memory optimizations Feb 6, 2021

jakirkham marked this pull request as ready for review February 6, 2021 06:42

jakirkham force-pushed the misc_opts branch from 20ae6ab to f563e1e Compare February 7, 2021 00:37

jakirkham force-pushed the misc_opts branch from 5d6b834 to d79e0cd Compare February 9, 2021 00:17

jakirkham changed the title ~~transition_processing_memory optimizations~~ transition_processing_memory optimizations, etc. Feb 9, 2021

jakirkham force-pushed the misc_opts branch 2 times, most recently from bf67cd1 to 63d109e Compare February 9, 2021 03:22

quasiben reviewed Feb 9, 2021

View reviewed changes

distributed/scheduler.py Show resolved Hide resolved

quasiben reviewed Feb 9, 2021

View reviewed changes

distributed/scheduler.py Outdated Show resolved Hide resolved

jakirkham force-pushed the misc_opts branch 2 times, most recently from b7a2894 to bc3f76a Compare February 9, 2021 03:44

jakirkham force-pushed the misc_opts branch 3 times, most recently from c70475b to df6d8a4 Compare February 10, 2021 01:37

jakirkham added 8 commits February 9, 2021 20:02

Use .get to retrieve alias

3fd6d60

Avoids checking for the value and then retrieving it by simply trying to get the associated value. If it is not found, it is `None`, which is fast to check after and handle.

Assign WorkerState object

6b0aa24

Allows Cython to optimize operations on this object.

Use _address attribute in host

70cd1ff

This should allow faster access (particularly in Cython).

Drop bandwidth intermediate variable

5c1709f

This is a holdover from before the `SchedulerState` refactor. Now this is properly typed and can be used efficiently in Cython anyways.

Drop total_nthreads intermediate variable

37401ea

Drop total_occupancy intermediate variable

7e69bd7

Drop bandwidth intermediate variable

54ebab1

Annotate s as a set

d294aa2

jakirkham added 18 commits February 9, 2021 20:02

Assign double to variable

16e769c

Avoids duplicate retrieval. Also handles the coercion to `double` once.

Use parent to get stealing extension quickly

ebb93d3

Move ClientState annotation into else

d472fd1

Use dict comprehension in _task_to_client_msgs

3245147

Create client msgs from keys directly

1996117

Avoid collecting an intermediate list of client keys and instead use them to produce messages directly.

Annotate couple variables in attribute functions

7a0745b

Type duration as double w/-1 default

118b06b

This makes it easier to type this value and thus for Cython to optimize this value throughout.

Tidy up new_task

530ea35

Annotate arguments in new_task

ef94471

Should knock out type checking when the function is called, which should simplify the work needed in later steps.

Use None with pop to defer object creation

f2f35b3

This allows us to only create the `set` `s` when we know we need it. Otherwise we just get the `set` previously contained in the `dict`.

Just check s is non-trivial

2eda0b0

Instead of creating an empty `set` when `s` is not defined, just check that `s` is non-trivial (either `None` or empty). This should be a bit faster and avoid the unnecessary creation step.

Make _unknown_durations an ordinary dict

137f65c

As `defaultdict`s are difficult for Cython to optimize and usually a `dict` will suffice, change `_unknown_duractions` to a `dict` to allow Cython to use Python C API specific to `dict`s.

Assign messages to client_msgs for clarity

f84cd7a

Annotate for-loops in valid_workers for perf

c418993

Annotate "addresses" field as set

903bf0e

Assign to dw after sw is filled

6a19e55

Drop some unneeded intermediate variables

9ca3f60

These are annotated local variables, which are holdovers from before the `SchedulerState` refactor. We can now drop these and use `SchedulerState` to access these directly.

jakirkham force-pushed the misc_opts branch 3 times, most recently from 55e4b75 to eb3e6c3 Compare February 10, 2021 06:32

jakirkham added 5 commits February 9, 2021 22:44

Use wws for a looping variable

aff99cf

Assign ws None and return

c621451

Assign ws result and return

5dade08

Join ifs and cleanup spacing

0e1fe3d

jakirkham force-pushed the misc_opts branch from eb3e6c3 to 441b744 Compare February 10, 2021 06:49

jakirkham merged commit de7cf0a into dask:master Feb 11, 2021

jakirkham deleted the misc_opts branch February 11, 2021 16:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`transition_processing_memory` optimizations, etc. #4487

`transition_processing_memory` optimizations, etc. #4487

jakirkham commented Feb 6, 2021 •

edited

Loading

jakirkham commented Feb 8, 2021

jakirkham commented Feb 9, 2021 •

edited

Loading

jakirkham commented Feb 10, 2021 •

edited

Loading

jakirkham commented Feb 10, 2021

transition_processing_memory optimizations, etc. #4487

transition_processing_memory optimizations, etc. #4487

Conversation

jakirkham commented Feb 6, 2021 • edited Loading

jakirkham commented Feb 8, 2021

jakirkham commented Feb 9, 2021 • edited Loading

jakirkham commented Feb 10, 2021 • edited Loading

jakirkham commented Feb 10, 2021

`transition_processing_memory` optimizations, etc. #4487

`transition_processing_memory` optimizations, etc. #4487

jakirkham commented Feb 6, 2021 •

edited

Loading

jakirkham commented Feb 9, 2021 •

edited

Loading

jakirkham commented Feb 10, 2021 •

edited

Loading