In spark-rapids, we use cudf::pack (in reality we call cudf::contiguous_split with empty splits) to lay out a cudf table, with its data, validity, and offsets buffers and any children, into a single contiguous buffer described by metadata produced by cuDF. This has been a key operation for us to turn a table into something that we can trivially move to host or disk (spill). Because tables can be quite large, we have found that requiring them to be contiguous in memory adds quite a bit of memory pressure and fragments the memory pool. Some tables can be GBs in size, so in order to satisfy the allocation needed for pack to return a contiguous buffer, we can actually introduce even more spill.
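For reference, a minimal sketch of the existing, non-chunked path (the helper name pack_for_spill is ours, and the header shown is where these declarations land after the refactor listed in the plan below; before that they live in cudf/copying.hpp):

```cpp
#include <cudf/contiguous_split.hpp>  // pre-refactor, these declarations live in cudf/copying.hpp
#include <cudf/table/table_view.hpp>

// Pack a table into one contiguous device buffer plus host-side metadata.
// cudf::pack is equivalent to cudf::contiguous_split with no split points.
cudf::packed_columns pack_for_spill(cudf::table_view const& tbl)
{
  cudf::packed_columns packed = cudf::pack(tbl);
  // packed.metadata : host vector describing the column layout
  // packed.gpu_data : a single contiguous rmm::device_buffer (the spillable payload)
  return packed;
}
```

It is that single contiguous gpu_data allocation that creates the memory pressure described above.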
Lately we have been making more use of the spill framework as we are now able to retry some cuDF operations: NVIDIA/spark-rapids#7252. We would like to be able to turn cuDF tables into "spillable tables" without requiring an up-front pack, instead performing the pack in chunks only when it is actually needed. This means we want to call pack with a bounce buffer, ensuring that no large allocations happen during this process.
After the first invocation of pack, the bounce buffer contents are copied to host (where an allocation equal to the original contiguous size is waiting to be filled). We then keep calling pack iteratively, not unlike the other chunked interfaces in cuDF.
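As a sketch of what that loop could look like (the helper name chunked_pack_to_host and the bounce_buffer_size parameter are illustrative; the chunked_pack factory/iteration calls follow the API proposed in the plan below, so treat the exact signatures as approximate):

```cpp
#include <cudf/contiguous_split.hpp>
#include <cudf/table/table_view.hpp>
#include <cudf/utilities/span.hpp>

#include <rmm/cuda_stream_view.hpp>
#include <rmm/device_buffer.hpp>
#include <rmm/mr/device/per_device_resource.hpp>

#include <cuda_runtime.h>
#include <cstdint>
#include <vector>

// Pack `tbl` through a fixed-size device bounce buffer, copying each filled
// chunk into a host allocation sized up front to the full contiguous size.
// The bounce buffer is the only device memory this path needs.
std::vector<uint8_t> chunked_pack_to_host(cudf::table_view const& tbl,
                                          std::size_t bounce_buffer_size)
{
  auto mr     = rmm::mr::get_current_device_resource();
  auto packer = cudf::chunked_pack::create(tbl, bounce_buffer_size, mr);

  rmm::device_buffer bounce(bounce_buffer_size, rmm::cuda_stream_default, mr);
  std::vector<uint8_t> host_buffer(packer->get_total_contiguous_size());

  std::size_t offset = 0;
  while (packer->has_next()) {
    // Each call fills at most one bounce buffer's worth of the packed layout.
    auto const bytes_written = packer->next(cudf::device_span<uint8_t>(
      static_cast<uint8_t*>(bounce.data()), bounce_buffer_size));
    cudaMemcpy(host_buffer.data() + offset, bounce.data(), bytes_written,
               cudaMemcpyDeviceToHost);
    offset += bytes_written;
  }
  // packer->build_metadata() yields the host-side metadata needed to
  // reconstruct the table later.
  return host_buffer;
}
```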
I've been working on this together with @nvdbaranec. I am going to post a series of PRs that get us there, but overall here's the plan:
- Refactor the contiguous_split APIs out of copying.hpp into a new header contiguous_split.hpp: Refactor contiguous_split API into contiguous_split.hpp #13186
- Introduce a new metadata_builder that is used by the prior code, but allows us to create metadata without having to generate cudf::column to begin with: Add metadata_builder helper class #13232
- Implement the chunked_pack API: Implement a chunked_pack API #13260
- Add the chunked_pack JNI changes: JNI api for cudf::chunked_pack #13278
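Once the host copy needs to be brought back, the existing cudf::unpack overload that takes raw metadata and device pointers can rebuild a table_view over a restored contiguous device buffer. A hypothetical unspill helper, assuming the packed bytes have already been copied back to the device in one piece:

```cpp
#include <cudf/contiguous_split.hpp>
#include <cudf/table/table_view.hpp>

#include <rmm/device_buffer.hpp>

#include <cstdint>
#include <vector>

// Rebuild a (non-owning) table_view over the restored device buffer using the
// metadata produced at spill time.
cudf::table_view unspill(std::vector<uint8_t> const& metadata,
                         rmm::device_buffer const& restored_gpu_data)
{
  return cudf::unpack(metadata.data(),
                      static_cast<uint8_t const*>(restored_gpu_data.data()));
}
```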