Remove cudf._lib.interop in favor of inlining pylibcudf #17555

mroeschke · 2024-12-09T22:20:35Z

Description

Contributes to #17317

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

…nterop

vyasr · 2024-12-12T23:53:58Z

python/cudf/cudf/core/column/column.py

+            result = cls.from_pylibcudf(plc.interop.from_arrow(array))
+            # TODO: cudf_dtype_from_pa_type may be less necessary for some types
+            return result._with_type_metadata(
+                cudf_dtype_from_pa_type(array.type)
+            )


This is the kind of thing I was thinking of in the other PR. Having a standardized entrypoint of some sort (maybe per-class?) into pylibcudf from cudf Python would help us collect common functionality like _with_type_metadata that we otherwise add piecemeal as we find bugs and incompatibilities with pandas.

vyasr · 2024-12-13T00:00:20Z

python/cudf/cudf/core/column/column.py

+        if isinstance(array, pa.ChunkedArray):
+            array = array.combine_chunks()


Do we always have to combine chunks? IIRC the existing implementation works without combining in most cases, and I don't think combining is free performance-wise so we should avoid it if we can. I could be wrong though, or misremembering an earlier state of the code.

Ah right. Yeah this will make a copy on the CPU side.

I see now in libcudf side we only support returning tables (and not columns) from an arrow stream. I was hoping to avoid the dance of putting the chunked array in a pyarrow table but I think the dance is worth avoiding a CPU copy

Thanks yeah I think this is the right call for now. We could generalize the libcudf APIs in the future if that helps.

…nterop

mroeschke · 2024-12-17T23:01:41Z

/merge

Remove cudf._lib.interop in favor of inlining pylibcudf

a5923d7

mroeschke added Python Affects Python cuDF API. improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Dec 9, 2024

mroeschke self-assigned this Dec 9, 2024

mroeschke requested a review from a team as a code owner December 9, 2024 22:20

mroeschke requested review from wence- and vyasr December 9, 2024 22:20

github-actions bot added the CMake CMake build issue label Dec 9, 2024

mroeschke added 2 commits December 9, 2024 15:24

Merge remote-tracking branch 'upstream/branch-25.02' into cudf/_lib/i…

70379d2

…nterop

Merge remote-tracking branch 'upstream/branch-25.02' into cudf/_lib/i…

dada097

…nterop

vyasr reviewed Dec 13, 2024

View reviewed changes

mroeschke added 4 commits December 12, 2024 18:30

Merge remote-tracking branch 'upstream/branch-25.02' into cudf/_lib/i…

add48f0

…nterop

Go back to using pyarrow table

f706244

Merge remote-tracking branch 'upstream/branch-25.02' into cudf/_lib/i…

cbba8bd

…nterop

Merge remote-tracking branch 'upstream/branch-25.02' into cudf/_lib/i…

deed917

…nterop

vyasr approved these changes Dec 16, 2024

View reviewed changes

Merge remote-tracking branch 'upstream/branch-25.02' into cudf/_lib/i…

0852374

…nterop

rapids-bot bot merged commit b9760ac into rapidsai:branch-25.02 Dec 17, 2024
105 checks passed

mroeschke deleted the cudf/_lib/interop branch December 17, 2024 23:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove cudf._lib.interop in favor of inlining pylibcudf #17555

Remove cudf._lib.interop in favor of inlining pylibcudf #17555

mroeschke commented Dec 9, 2024

vyasr Dec 12, 2024

vyasr Dec 13, 2024

mroeschke Dec 13, 2024

vyasr Dec 16, 2024

mroeschke commented Dec 17, 2024

		if isinstance(array, pa.ChunkedArray):
		array = array.combine_chunks()

Remove cudf._lib.interop in favor of inlining pylibcudf #17555

Remove cudf._lib.interop in favor of inlining pylibcudf #17555

Conversation

mroeschke commented Dec 9, 2024

Description

Checklist

vyasr Dec 12, 2024

Choose a reason for hiding this comment

vyasr Dec 13, 2024

Choose a reason for hiding this comment

mroeschke Dec 13, 2024

Choose a reason for hiding this comment

vyasr Dec 16, 2024

Choose a reason for hiding this comment

mroeschke commented Dec 17, 2024