Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[REVIEW] Unpin dask & distributed for development #10623

Merged
merged 10 commits into from
Apr 11, 2022

Conversation

galipremsagar
Copy link
Contributor

This PR unpins dask & distributed for development.

@codecov
Copy link

codecov bot commented Apr 8, 2022

Codecov Report

Merging #10623 (0d716e1) into branch-22.06 (bf4ffc9) will increase coverage by 0.02%.
The diff coverage is 100.00%.

@@               Coverage Diff                @@
##           branch-22.06   #10623      +/-   ##
================================================
+ Coverage         86.33%   86.36%   +0.02%     
================================================
  Files               140      140              
  Lines             22289    22289              
================================================
+ Hits              19244    19250       +6     
+ Misses             3045     3039       -6     
Impacted Files Coverage Δ
python/dask_cudf/dask_cudf/backends.py 86.44% <100.00%> (ø)
python/dask_cudf/dask_cudf/sorting.py 93.03% <100.00%> (ø)
python/cudf/cudf/core/column/string.py 89.10% <0.00%> (+0.12%) ⬆️
python/cudf/cudf/core/groupby/groupby.py 91.72% <0.00%> (+0.22%) ⬆️
python/cudf/cudf/core/tools/datetimes.py 84.49% <0.00%> (+0.30%) ⬆️
python/cudf/cudf/core/column/lists.py 92.70% <0.00%> (+1.28%) ⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 28aa895...0d716e1. Read the comment docs.

Comment on lines +85 to +86
gpuci_logger "gpuci_mamba_retry install conda-forge::dask>=2022.03.0 conda-forge::distributed>=2022.03.0 conda-forge::dask-core>=2022.03.0 --force-reinstall"
gpuci_mamba_retry install conda-forge::dask>=2022.03.0 conda-forge::distributed>=2022.03.0 conda-forge::dask-core>=2022.03.0 --force-reinstall
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've been wondering if a less cumbersome way of achieving this would be to manually remove dask/label/dev from our list of channels here? Maybe something like

Suggested change
gpuci_logger "gpuci_mamba_retry install conda-forge::dask>=2022.03.0 conda-forge::distributed>=2022.03.0 conda-forge::dask-core>=2022.03.0 --force-reinstall"
gpuci_mamba_retry install conda-forge::dask>=2022.03.0 conda-forge::distributed>=2022.03.0 conda-forge::dask-core>=2022.03.0 --force-reinstall
gpuci_logger "gpuci_mamba_retry install dask>=2022.03.0 --force-reinstall"
gpuc_conda_retry config --remove channels dask/label/dev
gpuci_mamba_retry install dask>=2022.03.0 --force-reinstall

Not sure if removing the channel would persist in other runs though, cc @ajschmidt8 @jakirkham if this solution makes sense

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I remember trying something similar previously but didn't seem to work, testing it now again in CI

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah agree. Recall Prem tried several things, but don't recall if this one (and if so whether we encountered any issues with that)

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If this doesn't work, happy to merge in as is since the current setup is working

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will this actually install Dask main or just the latest release?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We use nightlies from the Dask channel in some cases. Depends if the channel dask/label/dev is used

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

And do nightlies really mean "once every night", or does it mean it's a new package after every merge?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Every merge. They are equivalent to using main, but also include Dask's dependencies

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sounds good, thanks for confirming.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@galipremsagar galipremsagar added the 5 - DO NOT MERGE Hold off on merging; see PR for details label Apr 8, 2022
rapids-bot bot pushed a commit to rapidsai/dask-cuda that referenced this pull request Apr 8, 2022
ci/gpu/build.sh Outdated Show resolved Hide resolved
ci/benchmark/build.sh Outdated Show resolved Hide resolved
ci/benchmark/build.sh Outdated Show resolved Hide resolved
ci/gpu/build.sh Outdated Show resolved Hide resolved
@@ -294,6 +294,6 @@ def sort_values(
df4 = df3.map_partitions(sort_function, **sort_kwargs)
if not isinstance(divisions, gd.DataFrame) and set_divisions:
# Can't have multi-column divisions elsewhere in dask (yet)
df4.divisions = methods.tolist(divisions)
df4.divisions = tuple(methods.tolist(divisions))
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This fix is required due to this upstream change: dask/dask#8806

@galipremsagar galipremsagar removed the 5 - DO NOT MERGE Hold off on merging; see PR for details label Apr 11, 2022
Copy link
Member

@charlesbluca charlesbluca left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM; one small question that can be addressed outside of this PR:

python/dask_cudf/dask_cudf/backends.py Show resolved Hide resolved
@galipremsagar galipremsagar added 5 - Ready to Merge Testing and reviews complete, ready to merge and removed 3 - Ready for Review Ready for review by team labels Apr 11, 2022
@galipremsagar
Copy link
Contributor Author

@gpucibot merge

@rapids-bot rapids-bot bot merged commit 97ac349 into rapidsai:branch-22.06 Apr 11, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
5 - Ready to Merge Testing and reviews complete, ready to merge improvement Improvement / enhancement to an existing function non-breaking Non-breaking change Python Affects Python cuDF API.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants