You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The next milestone for UCXX is become the default UCX comms interface in Distributed. Currently there are two ways to use UCX:
protocol="ucx": Uses the legacy UCX-Py code, as has been the case for the past several years;
protocol="ucxx": Uses the UCXX library, which requires the distributed-ucxx package to be installed.
Since now the implementation of the comms protocol is part of the UCXX repo, it is also a standalone package and doesn't come anymore with the installation of the distributed packages via conda, PyPI or source, the user must install distributed-ucxx from the packaging repo or source.
Migration options:
Add a warning to UCX-Py telling users to migrate as it will be deprecated/removed soon and simply remove protocol="ucx" and force users to switch to protocol="ucxx" when the time comes; or
Introduce a proxy comms protocol="ucx" (replacing the current one which directly points to UCX-Py) which then chooses UCXX if distributed-ucxx is installed (as if protocol="ucxx" was specified), otherwise fallback to UCX-Py and warns the user to install distributed-ucxx. This is the most transparent way and allows us to control the time of final switch, plus RAPIDS already has distributed-ucxx installed so it should not be noticed by users, only those who pick a subset of packages will be affected and warned.
Once the migration is complete we will archive the UCX-Py repository and remove the UCX-Py implementation from distributed along with its tests, which should help alleviate overhead from maintainers in the Distributed repo.
The text was updated successfully, but these errors were encountered:
The next milestone for UCXX is become the default UCX comms interface in Distributed. Currently there are two ways to use UCX:
protocol="ucx"
: Uses the legacy UCX-Py code, as has been the case for the past several years;protocol="ucxx"
: Uses the UCXX library, which requires thedistributed-ucxx
package to be installed.Since now the implementation of the comms protocol is part of the UCXX repo, it is also a standalone package and doesn't come anymore with the installation of the
distributed
packages via conda, PyPI or source, the user must installdistributed-ucxx
from the packaging repo or source.Migration options:
protocol="ucx"
and force users to switch toprotocol="ucxx"
when the time comes; orprotocol="ucx"
(replacing the current one which directly points to UCX-Py) which then chooses UCXX ifdistributed-ucxx
is installed (as ifprotocol="ucxx"
was specified), otherwise fallback to UCX-Py and warns the user to installdistributed-ucxx
. This is the most transparent way and allows us to control the time of final switch, plus RAPIDS already hasdistributed-ucxx
installed so it should not be noticed by users, only those who pick a subset of packages will be affected and warned.From previous conversations, primarily with @charlesbluca and @rjzamora , option 2 is preferred with a PoC of the necessary changes in Distributed here.
Once the migration is complete we will archive the UCX-Py repository and remove the UCX-Py implementation from distributed along with its tests, which should help alleviate overhead from maintainers in the Distributed repo.
The text was updated successfully, but these errors were encountered: