Add dask serialization of CUDA objects #3482

jakirkham · 2020-02-15T01:06:10Z

This implements "dask" serialization for CUDA objects. All serializers work by first performing "cuda" serialization on the different objects and then performing a device-to-host transfer. The deserializers perform a host-to-device transfer before running the respective "cuda" deserializer for the object considered. Should provide an alternative way to do TCP transfers with CUDA objects that does not rely on pickling.

Note: Just like with UCX transfers, we preferentially add objects to the RMM pool if RMM is available. Otherwise we fallback to using Numba.

This should make room for Dask serializers to also be specified and added.

To make TCP a bit more performant with RMM, provide Dask serializers to allow going to and from host memory.

mrocklin · 2020-02-15T05:02:30Z

Thanks @jakirkham ! For testing, what do you think about calling serialize on a few objects, like a cupy array, and verifying that some of the frames are memoryview objects?

jakirkham · 2020-02-15T05:06:16Z

Yep, agree this needs some tests. Those sound like reasonable suggestions. Will work on those and update when they are up. Thanks Matt! 😄

To make sure that different CUDA objects can use different serialization protocols, test with each one individual and ensure it completes. In particular test both "cuda" and "dask". Where supported also test "pickle", but skip it when it is not (like with Numba).

To make sure Dask can handle transmission of the frames serialized, test they match the type expected by the protocol used. With "cuda" ensure we get something that supports `__cuda_array_interface__`. With "dask" make sure we get a `memoryview`.

jakirkham · 2020-02-18T04:06:19Z

Have parametrized tests around serializers so this tests cuda, dask, and pickle (where possible). Also have tested that frames are acceptable. Please let me know if there is anything else. 🙂

Planning on merging in 24hrs if no comments.

jakirkham added 6 commits February 14, 2020 16:58

Run isort on CUDA protocol imports

e06671f

Align CuPy serialize/deserialize function names

1fe5f2e

Prefix CUDA serializers with cuda_

d96dca1

This should make room for Dask serializers to also be specified and added.

Add Dask serializers for RMM DeviceBuffers

c5d4c52

To make TCP a bit more performant with RMM, provide Dask serializers to allow going to and from host memory.

Add Dask serializers for Numba DeviceNDArrays

081d640

Add Dask serializers for CuPy ndarrays

f990f93

jakirkham added 3 commits February 17, 2020 19:22

Merge dask/master into jakirkham/add_dask_cuda_serializers

aa57011

Check frames are the expected type

f73b614

To make sure Dask can handle transmission of the frames serialized, test they match the type expected by the protocol used. With "cuda" ensure we get something that supports `__cuda_array_interface__`. With "dask" make sure we get a `memoryview`.

jakirkham force-pushed the add_dask_cuda_serializers branch from e8ec4c6 to f73b614 Compare February 18, 2020 03:28

jakirkham merged commit b5e95ed into dask:master Feb 19, 2020

jakirkham deleted the add_dask_cuda_serializers branch February 19, 2020 03:56

jakirkham mentioned this pull request Feb 22, 2020

Using "dask" serialization protocol in spilling rapidsai/dask-cuda#242

Closed

quasiben mentioned this pull request May 26, 2020

[DOC] CUDA serialization #3820

Open

jakirkham mentioned this pull request Jul 1, 2020

Evaluate further serialization performance improvements rapidsai/dask-cuda#106

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add dask serialization of CUDA objects #3482

Add dask serialization of CUDA objects #3482

jakirkham commented Feb 15, 2020 •

edited

Loading

mrocklin commented Feb 15, 2020

jakirkham commented Feb 15, 2020

jakirkham commented Feb 18, 2020

Add dask serialization of CUDA objects #3482

Add dask serialization of CUDA objects #3482

Conversation

jakirkham commented Feb 15, 2020 • edited Loading

mrocklin commented Feb 15, 2020

jakirkham commented Feb 15, 2020

jakirkham commented Feb 18, 2020

jakirkham commented Feb 15, 2020 •

edited

Loading