Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Python][Docs] Improve the Python user guide on the CUDA integration (pyarrow.cuda) #41666

Open
jorisvandenbossche opened this issue May 15, 2024 · 0 comments

Comments

@jorisvandenbossche
Copy link
Member

jorisvandenbossche commented May 15, 2024

We have a page in the user guide about the CUDA integration with the pyarrow.cuda module: https://arrow.apache.org/docs/15.0/python/integration/cuda.html. But this page is quite brief and outdated, even for the current state of the CUDA functionality (for example it only briefly shows buffers, but doesn't mention anything about having an Array or RecordBatch on the CUDA device, or copying reading/writing directly from/to IPC, etc).

A list of ideas:

  • Show reading/writing IPC
  • Show how to copy full Array or RecordBatch to/from host (depending on improvements in [Python] Add bindings for Device and MemoryManager classes and related methods #41126)
  • Expand section on interoperability with other tools (right now it only explains interop with numba, but could also add interop with eg pytorch or cupy, cudf, nanoarrow, etc)
  • Add guide on installing pyarrow with CUDA enabled (from binaries, not to have to build yourself, e.g. this is possible through conda-forge)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant