Add `__cuda_array_interface__` to `Buffer` #4023

jakirkham · 2020-01-31T07:16:43Z

Adds the __cuda_array_interface__ to Buffer. This simplifies its coercion to Numba arrays, which is then leveraged to get away from some legacy code in RMM around array handling.

To make it easier to coerce `Buffer` to Numba or CuPy arrays as needed, add the `__cuda_array_interface__` to `Buffer`.

This winds up being quite a bit faster to build. As that is important to us for serialization purposes, switch Numba for CuPy in this case. Note that when it comes to copying the data back to host, other things start dominating the time spent.

python/cudf/cudf/core/buffer.py

python/cudf/cudf/core/column/column.py

python/cudf/cudf/core/buffer.py

To simplify generic handling of GPU array-like data, coerce it to CuPy first (known to be fast) and then coerce it to NumPy (same time as using Numba). This should keep things flexible when working with other GPU arrays while maintaining performance.

python/cudf/cudf/core/buffer.py

python/cudf/cudf/core/column/column.py

Seems that `gdf_dtype_from_dtype` isn't able to handle unsigned values at the moment. So switch back to signed 1-byte integers for now. Should clear up some failures.

Works around an issue where a CuPy scalar may be passed. Previously `__array__` would be called by NumPy, but CuPy will throw there. So this calls `int` on the object first. If it is a CuPy array, it will actually coerce itself to a Python `int`. Then we can coerce it to a NumPy `int32` value.

jakirkham · 2020-02-01T03:30:37Z

python/cudf/cudf/tests/test_binops.py

@@ -377,7 +377,7 @@ def test_reflected_ops_scalar(func, dtype, obj_class):
    ps_result = func(random_series)

    # verify
-    np.testing.assert_allclose(ps_result, gs_result)
+    utils.assert_eq(ps_result, gs_result)


We seem to be hitting some sort of edge case here. Tried switching to the utility function used elsewhere, but it seems to have issues comparing. Have a rough idea of the issue, but additional pointers would be welcome. 🙂

______________ test_reflected_ops_scalar[<lambda>-int160-Series] _______________ func = <function <lambda> at 0x7f531193f320>, dtype = <class 'numpy.int16'> obj_class = 'Series' @pytest.mark.parametrize("obj_class", ["Series", "Index"]) @pytest.mark.parametrize("func, dtype", list(product(_reflected_ops, _dtypes))) def test_reflected_ops_scalar(func, dtype, obj_class): # create random series np.random.seed(12) random_series = utils.gen_rand(dtype, 100, low=10) # gpu series gs = Series(random_series) # class typing if obj_class == "Index": gs = as_index(gs) gs_result = func(gs) # class typing if obj_class == "Index": gs = Series(gs) # pandas ps_result = func(random_series) # verify > utils.assert_eq(ps_result, gs_result) cudf/tests/test_binops.py:380: _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ self = 0 True 1 True 2 True 3 True 4 True ... 95 True 96 True 97 True 98 True 99 True Length: 100, dtype: bool def __nonzero__(self): raise ValueError( "The truth value of a {0} is ambiguous. " "Use a.empty, a.bool(), a.item(), a.any() or a.all().".format( > self.__class__.__name__ ) ) E ValueError: The truth value of a Series is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all(). /conda/envs/gdf/lib/python3.7/site-packages/pandas/core/generic.py:1555: ValueError

I'm guessing we're somehow getting down to this condition: https://github.com/rapidsai/cudf/blob/branch-0.13/python/cudf/cudf/tests/utils.py#L77

What are the types of ps_result and gs_result?

Sorry for the slow reply.

Actually it is getting stuck here. The comparison works, but the result is a Pandas Series, which can't be converted to a bool.

Backing up a bit, gs_result is a cuDF Series. However ps_result appears to be a NumPy array. Is the latter expected?

FWIW it appears this has always been the case. Not sure what caused us to now stumble into that.

Should add forcing ps_result to a Pandas Series fixes the issue.

Do you know what op this is that's causing the issue? It's a bit surprising that your PR changes are causing this to change behavior.

I tried locally with just lambda x: 1 + x, but would guess any of them have this issue.

Are we sure this is suppose to be gen_rand and not gen_rand_series though? ( #4091 )

🤷‍♂ not sure but if gen_rand_series fixes it that sounds good to me. This test is already pretty janky to test both indexes and series.

Yeah me neither honestly. So I could easily be wrong 🙂

Not that we should pick up more things here, but would it makes sense to break this test into two separate ones to handle Series and Index? If so, happy to raise an issue.

Possibly in the future, but the level of work for the reward is low right now. All of our index binaryops are implemented by casting to/from Series so once we fix that we can possibly address this.

python/cudf/cudf/core/column/column.py

If the underlying object contains a singleton value, treat it as if it is a 1-D array with 1 value. Should fix some issues where a 0-D array is passed.

stale

Use an RMM-backed CuPy array instead of `rmm.device_array`.

python/cudf/cudf/core/column/column.py

jakirkham · 2020-02-10T18:45:48Z

Given the __cuda_array_interface__ bits are in and this now is mostly a PR about using CuPy, have extracted those bits into a fresh draft PR ( #4110 ). We can then consider how we want to proceed with that on its own merits.

jakirkham added 2 commits January 30, 2020 23:08

Add __cuda_array_interface__ to Buffer

d502931

To make it easier to coerce `Buffer` to Numba or CuPy arrays as needed, add the `__cuda_array_interface__` to `Buffer`.

Use __cuda_array_interface__ to copy to host

67ebf11

jakirkham requested a review from a team as a code owner January 31, 2020 07:16

Coerce Buffers to Numba arrays directly

1fe1cde

jakirkham force-pushed the add_buf_cai branch 2 times, most recently from fe2f7a3 to e778560 Compare January 31, 2020 07:39

jakirkham added 2 commits January 30, 2020 23:40

View Buffer as CuPy array instead

5a38016

This winds up being quite a bit faster to build. As that is important to us for serialization purposes, switch Numba for CuPy in this case. Note that when it comes to copying the data back to host, other things start dominating the time spent.

Note addition in changelog

aeb3978

jakirkham force-pushed the add_buf_cai branch from e778560 to aeb3978 Compare January 31, 2020 07:40

jakirkham mentioned this pull request Jan 31, 2020

Dask-cudf multi partition merge slows down with ucx rapidsai/ucx-py#402

Closed

jakirkham commented Jan 31, 2020

View reviewed changes

python/cudf/cudf/core/buffer.py Outdated Show resolved Hide resolved

jakirkham commented Jan 31, 2020

View reviewed changes

python/cudf/cudf/core/column/column.py Outdated Show resolved Hide resolved

jakirkham commented Jan 31, 2020

View reviewed changes

python/cudf/cudf/core/buffer.py Outdated Show resolved Hide resolved

kkraus14 reviewed Jan 31, 2020

View reviewed changes

python/cudf/cudf/core/buffer.py Outdated Show resolved Hide resolved

kkraus14 reviewed Jan 31, 2020

View reviewed changes

python/cudf/cudf/core/column/column.py Outdated Show resolved Hide resolved

kkraus14 reviewed Jan 31, 2020

View reviewed changes

python/cudf/cudf/core/column/column.py Outdated Show resolved Hide resolved

kkraus14 reviewed Jan 31, 2020

View reviewed changes

python/cudf/cudf/core/column/column.py Outdated Show resolved Hide resolved

kkraus14 added 2 - In Progress Currently a work in progress Python Affects Python cuDF API. labels Jan 31, 2020

jakirkham added 5 commits January 31, 2020 12:20

Use CuPy in to_host_array for consistency

c00a313

Drop unneeded cp.asarray calls

c2b70e5

Update docstrings to mention CuPy ndarrays

04834c8

Switch back from |u1 to |i1

6026408

Seems that `gdf_dtype_from_dtype` isn't able to handle unsigned values at the moment. So switch back to signed 1-byte integers for now. Should clear up some failures.

Cast data array to the correct type

149e2b9

kkraus14 previously approved these changes Jan 31, 2020

View reviewed changes

kkraus14 added 5 - Ready to Merge Testing and reviews complete, ready to merge and removed 2 - In Progress Currently a work in progress labels Jan 31, 2020

jakirkham added 3 commits January 31, 2020 15:13

Import cupy as cp

66ed7f7

Coerce Column's data views to NumPy via CuPy

66986ea

jakirkham commented Feb 1, 2020

View reviewed changes

python/cudf/cudf/core/column/column.py Outdated Show resolved Hide resolved

jakirkham added 5 commits February 4, 2020 14:09

Handle __cuda_array_interface__ with singleton

9fa5bd9

If the underlying object contains a singleton value, treat it as if it is a 1-D array with 1 value. Should fix some issues where a 0-D array is passed.

Get pointer from CuPy array

9420d64

Add missing import cupy

185ce85

Coerce CuPy arrays to NumPy in tests

bb6faf5

Merge rapidsai/branch-0.13 into jakirkham/add_buf_cai

12d13be

kkraus14 self-requested a review February 5, 2020 06:14

kkraus14 added 2 - In Progress Currently a work in progress and removed 5 - Ready to Merge Testing and reviews complete, ready to merge labels Feb 5, 2020

jakirkham added 5 commits February 6, 2020 17:02

Allocate a CuPy array in Dataframe.as_gpu_matrix

b4968e9

Use an RMM-backed CuPy array instead of `rmm.device_array`.

Coerce CuPy arrays to NumPy arrays

7b4b70c

Handle CuPy and Numba arrays in copy_array

f8e3569

Coerce CuPy array to NumPy array

e058eb3

Merge rapidsai/branch-0.13 into jakirkham/add_buf_cai

15007e5

kkraus14 reviewed Feb 7, 2020

View reviewed changes

python/cudf/cudf/core/column/column.py Outdated Show resolved Hide resolved

jakirkham mentioned this pull request Feb 7, 2020

Redux serialize Buffer directly with __cuda_array_interface__ #4101

Merged

Merge rapidsai/branch-0.13 into jakirkham/add_buf_cai

4be73bc

jakirkham changed the title ~~Add __cuda_array_interface__ to Buffer~~ Use CuPy for array views Feb 10, 2020

jakirkham changed the title ~~Use CuPy for array views~~ Add __cuda_array_interface__ to Buffer Feb 10, 2020

jakirkham mentioned this pull request Feb 10, 2020

[WIP] Use CuPy array views [skip ci] #4110

Closed

jakirkham closed this Feb 10, 2020

jakirkham deleted the add_buf_cai branch February 10, 2020 18:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `__cuda_array_interface__` to `Buffer` #4023

Add `__cuda_array_interface__` to `Buffer` #4023

jakirkham commented Jan 31, 2020

jakirkham Feb 1, 2020

kkraus14 Feb 3, 2020

jakirkham Feb 7, 2020

jakirkham Feb 7, 2020

kkraus14 Feb 7, 2020

jakirkham Feb 7, 2020

kkraus14 Feb 7, 2020

jakirkham Feb 7, 2020

kkraus14 Feb 7, 2020

jakirkham commented Feb 10, 2020

Add __cuda_array_interface__ to Buffer #4023

Add __cuda_array_interface__ to Buffer #4023

Conversation

jakirkham commented Jan 31, 2020

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jakirkham commented Feb 10, 2020

Add `__cuda_array_interface__` to `Buffer` #4023

Add `__cuda_array_interface__` to `Buffer` #4023