[FEA] Zero-copy nested types with other GPU libraries (like Awkward array) #14959

shwina · 2024-02-02T21:21:47Z

In a conversation with @martindurant and @jpivarski, it came up that there's no supported way to exchange data zero copy between cuDF and Awkward Array (which has GPU support).

The standard 0-copy mechanisms like dlpack and __cuda_array_interface__ don't support nested types like lists or structs. And our to/from_arrow() methods convert to and from host data so they're not useful when we want to 0-copy device data.

Option 1

We support a gpu=True (or similar) keyword argument in to_arrow() which would then return a PyArrow array backed by device data. Now, PyArrow does not seemingly support it, but it's possible to create a PyArrow array backed by device data:

In [5]: a = cp.asarray([1, 2, 3])

In [6]: buf = pa.foreign_buffer(a.data.ptr, a.nbytes, a)

In [7]: type(buf)
Out[7]: pyarrow.lib.Buffer

In [8]: print(buf)
<pyarrow.Buffer address=0x7f2f6fa00200 size=24 is_cpu=True is_mutable=False>

The problem (as can be seen above) is that PyArrow thinks this is a CPU-backed buffer. So attempting to do anything with it segfaults:

In [9]: arr = pa.Array.from_buffers(pa.int64(), len(a), buffers=[None, buf])

In [10]: print(arr)  # segfault

Option 2

We could expose new Series.to_buffers() and Series.from_buffers() functions that would produce and consume GPU buffers (along with a schema), presumably in the same order as arrow's from_buffers and buffers methods. We could use CuPy arrays to represent the buffers.

Curious what folks think? Interested also in @kkraus14's thoughts here if any.

The text was updated successfully, but these errors were encountered:

vyasr · 2024-02-02T21:36:01Z

I think #14926 is pretty relevant here.

kkraus14 · 2024-02-02T21:49:25Z

I agree that this is kinda the exact use case that #14926 is designed for. Along with something like a PyCapsule based protocol.

jpivarski · 2024-02-03T22:01:36Z

I should add here that, from the Awkward Array side, any format that preserves all of the information is equally good. If given CuPy arrays (option 2), we might internally convert them to a format that follows a pyarrow array's Buffers so that we can reuse code that makes the adjustments between Arrow and Awkward, but that's our business.

I suggested option 1, making a pyarrow array that would segfault if you touch it, because this works for us (we'll be careful to not dereference the GPU pointers) and if pyarrow ever does add the infrastructure to interpret it correctly, the same interface on cuDF will work for both Awkward and Arrow.

martindurant · 2024-02-09T16:00:00Z

Ping on this, @shwina ; I gather work is ongoing in the linked issue, but I would appreciate a brief summary here of status and what we can expect for awkward integration.

shwina · 2024-02-09T20:40:05Z

Thanks, @martindurant - I believe we should see a PR up for #14926 soon. At that point, we would be very grateful if you could provide feedback or perhaps do some early testing!

martindurant · 2024-02-09T20:42:08Z

Certainly, just let us know

kkraus14 · 2024-02-09T21:19:49Z

Just a note that #14926 will first yield the C++ level functions and C structs, and there would likely need to be a follow up in implementing the Python protocol around it. The issue tracking that work in Arrow is here: apache/arrow#38325

shwina · 2024-02-10T00:48:29Z

Wouldn't nanoarrow provide a way to access the DeviceArray from Cython?

shwina · 2024-02-16T10:57:15Z

Wouldn't nanoarrow provide a way to access the DeviceArray from Cython?

Would be very grateful if @paleolimbot could advise here!

paleolimbot · 2024-02-16T14:47:09Z

It's been touched on here, but I think the intention is ( apache/arrow#38325 ) to add a protocol __arrow_c_device_array__() to mirror how __arrow_c_array__() works but with explicit non-CPU support ( https://arrow.apache.org/docs/format/CDataInterface/PyCapsuleInterface.html ).

When nanoarrow for Python has matured a bit it might be able to help export (and test), but the Cython needed to make the required Capsule is pretty compact and any library doing exporting should probably just copy it (or translate it to pybind11 or nanobind): https://github.com/apache/arrow-nanoarrow/blob/main/python/src/nanoarrow/_lib.pyx#L112-L127 .

shwina · 2024-02-26T20:20:31Z

@martindurant just an update here that I'm waiting for #15047 to take some shape before I try and kick the tires with accessing from Python.

martindurant · 2024-02-26T20:25:04Z

@jpivarski , can you please link the experimental conversions code you wrote in awkward?

jpivarski · 2024-02-26T20:27:51Z

This is the script that I used to test conversion of CuDF's Arrow data into Awkward. (The other direction should be even easier.)

https://github.com/scikit-hep/awkward/blob/main/studies/cudf-to-awkward.py

vyasr · 2024-03-21T23:23:38Z

Just wanted to provide a quick status update here. I've put together a prototype of the device data capsule protocols in #15370. It's not usable yet for a number of reasons, largely boiling down to the need for a D2D copy at the moment (although that may still be enough of an improvement over the current D2H2D that our existing to/from_arrow methods do that you'd still find it useful for testing), but we should be able to make some progress on that soon. I've started the discussion on how best to proceed here.

shwina added feature request New feature or request Python Affects Python cuDF API. labels Feb 2, 2024

shwina self-assigned this Feb 2, 2024

GregoryKimball mentioned this issue Feb 9, 2024

[FEA] Produce and Consume ArrowDeviceArray struct from cudf::table / cudf::column #14926

Closed

vyasr added this to cuDF Python Nov 5, 2024

github-project-automation bot moved this to Todo in cuDF Python Nov 5, 2024

vyasr mentioned this issue Dec 5, 2024

[FEA] A Python wrapper for to_arrow_device. #17528

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Zero-copy nested types with other GPU libraries (like Awkward array) #14959

[FEA] Zero-copy nested types with other GPU libraries (like Awkward array) #14959

shwina commented Feb 2, 2024

vyasr commented Feb 2, 2024

kkraus14 commented Feb 2, 2024

jpivarski commented Feb 3, 2024

martindurant commented Feb 9, 2024

shwina commented Feb 9, 2024

martindurant commented Feb 9, 2024

kkraus14 commented Feb 9, 2024

shwina commented Feb 10, 2024

shwina commented Feb 16, 2024

paleolimbot commented Feb 16, 2024

shwina commented Feb 26, 2024

martindurant commented Feb 26, 2024

jpivarski commented Feb 26, 2024

vyasr commented Mar 21, 2024

[FEA] Zero-copy nested types with other GPU libraries (like Awkward array) #14959

[FEA] Zero-copy nested types with other GPU libraries (like Awkward array) #14959

Comments

shwina commented Feb 2, 2024

Option 1

Option 2

vyasr commented Feb 2, 2024

kkraus14 commented Feb 2, 2024

jpivarski commented Feb 3, 2024

martindurant commented Feb 9, 2024

shwina commented Feb 9, 2024

martindurant commented Feb 9, 2024

kkraus14 commented Feb 9, 2024

shwina commented Feb 10, 2024

shwina commented Feb 16, 2024

paleolimbot commented Feb 16, 2024

shwina commented Feb 26, 2024

martindurant commented Feb 26, 2024

jpivarski commented Feb 26, 2024

vyasr commented Mar 21, 2024