Explicit-comms-shuffle: fine control of task scheduling #1025
Conversation
I only added a minor code-duplication reduction; apart from that, LGTM. Thanks @madsbk !
My only comment was similar to @pentschev's; I don't know a better way to force the location of the compute.
Co-authored-by: Peter Andreas Entschev <[email protected]>
Codecov Report — Base: 0.00% // Head: 0.00% // No change to project coverage 👍

@@ Coverage Diff @@
##           branch-22.12   #1025    +/-   ##
==============================================
  Coverage          0.00%   0.00%
==============================================
  Files                25      17      -8
  Lines              3315    2216   -1099
==============================================
+ Misses             3315    2216   -1099

☔ View full report at Codecov.
rerun tests

Thanks @madsbk !

@gpucibot merge
In shuffle, use `Client.submit()` to control where tasks are executed and release temporary dataframes ASAP.

Context

In the final step of the explicit-comms shuffle, we call `getitem()` to extract the final dataframe partitions from the result of the local shuffle. In some cases, these `getitem()` tasks would not run on the worker that ran the local shuffle, which would result in extra communication and spilling. We now use `submit(..., worker=...)` to make sure that the worker running the local shuffle also runs the `getitem()` task afterwards.

Is it possible to do this without the use of `submit()`, to avoid the overhead of creating a `Future` for each dataframe partition?
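The pinning technique described above can be sketched with plain `dask.distributed` primitives. This is a minimal illustration, not the actual dask-cuda implementation: the `local_shuffle` and `getitem` helpers here are hypothetical stand-ins, and the sketch uses the `workers=` keyword of `Client.submit()` to keep the extraction task on the worker that holds the shuffled data.

```python
# Sketch: pin a follow-up task to the worker that produced its input,
# so no inter-worker transfer is needed. Assumes a local test cluster;
# local_shuffle/getitem are illustrative stand-ins, not dask-cuda code.
from distributed import Client

def local_shuffle():
    # Pretend this worker produced a dict of shuffled partitions.
    return {0: [1, 2], 1: [3, 4]}

def getitem(parts, i):
    # Extract one partition from the local-shuffle result.
    return parts[i]

client = Client(processes=False, n_workers=2, threads_per_worker=1)
workers = sorted(client.scheduler_info()["workers"])
target = workers[0]

# Run the local shuffle on a specific worker...
shuffled = client.submit(local_shuffle, workers=[target])
# ...and pin the extraction task to that same worker.
part = client.submit(getitem, shuffled, 0, workers=[target])

result = part.result()
print(result)
client.close()
```

The trade-off the question raises is visible here: each `submit()` call creates a `Future` per partition, which adds scheduler overhead compared with building the same operations into a single task graph.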