Perhaps related to #402, we see slowdowns with groupby-aggregations when using UCX. This is particularly relevant when using split_out, which is often necessary for at-scale groupby-aggregations with many unique keys.
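For readers unfamiliar with split_out, here is a rough mental model in plain Python. It is a toy sketch, not Dask's or cuDF's implementation: the function names (`partial_sums`, `groupby_sum`) and the data are made up for illustration, and it assumes the aggregated result is spread over `split_out` output partitions by hashing the group keys. Under that assumption, every input partition can send a piece to every output partition, so the number of transfers grows with split_out.

```python
from collections import defaultdict


def partial_sums(chunk):
    """Per-chunk groupby-sum: one {key: sum} dict per input partition."""
    out = defaultdict(int)
    for key, value in chunk:
        out[key] += value
    return dict(out)


def groupby_sum(chunks, split_out=1):
    """Combine per-chunk results into `split_out` output partitions.

    Each key is routed to output partition hash(key) % split_out, so a
    single input chunk may contribute a piece to every output partition.
    """
    outputs = [defaultdict(int) for _ in range(split_out)]
    pieces_moved = 0  # count of non-empty chunk -> output transfers
    for chunk in chunks:
        shards = [dict() for _ in range(split_out)]
        for key, total in partial_sums(chunk).items():
            shards[hash(key) % split_out][key] = total
        for i, shard in enumerate(shards):
            if shard:
                pieces_moved += 1
                for key, total in shard.items():
                    outputs[i][key] += total
    return [dict(o) for o in outputs], pieces_moved


# Hypothetical toy data: 4 input partitions, integer keys 0..7.
chunks = [[(k, 1) for k in range(8)] for _ in range(4)]
single, moved_1 = groupby_sum(chunks, split_out=1)
split, moved_4 = groupby_sum(chunks, split_out=4)
print(moved_1, moved_4)
```

In this toy model the result is the same either way, but the split_out=4 run moves more, smaller pieces between partitions, which is the kind of traffic where the choice of communication protocol could matter.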
beckernick changed the title from "Dask cuDF groupby-aggregations slower with UCX, particularly with split_out" to "[BUG] Dask cuDF groupby-aggregations slower with UCX, particularly with split_out" on Jan 29, 2020
This indeed used to be an issue but I believe it was resolved by UCX upstream. New numbers with RAPIDS 0.20 and UCX 1.9:
TCP: 1.07s
UCX: 1.36s
TCP split_out=16: 1.97s
UCX split_out=16: 1.95s
It seems that for this particular workflow there isn't much gain with UCX, but the underlying issue seems to be resolved. Closing this for now, but please reopen if you encounter that again.
I get these loose time results on a DGX-2:
TCP
UCX + NVLink
TCP
UCX + NVLink
Code to reproduce:
Relevant Conda Environment: