
Fix chunked reads of Parquet delta encoded pages #14921

Merged
33 commits merged into rapidsai:branch-24.04 from etseidl:delta_chunked on Mar 1, 2024

Conversation

etseidl
Contributor

@etseidl etseidl commented Jan 29, 2024

Description

The chunked Parquet reader currently does not properly estimate the sizes of string pages that are delta encoded. This PR modifies gpuDecodeTotalPageStringSize() to take into account the new encodings.

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

copy-pr-bot bot commented Jan 29, 2024

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

@github-actions bot added the libcudf (Affects libcudf (C++/CUDA) code) label Jan 29, 2024
@etseidl
Contributor Author

etseidl commented Jan 29, 2024

@nvdbaranec I'd like your opinion on how to handle the sizes. Do you want me to decode the delta lengths to get exact sizes or do you think the estimates are good enough?

@etseidl etseidl marked this pull request as ready for review January 29, 2024 23:29
@etseidl etseidl requested a review from a team as a code owner January 29, 2024 23:29
@nvdbaranec
Contributor

nvdbaranec commented Jan 30, 2024

> @nvdbaranec I'd like your opinion on how to handle the sizes. Do you want me to decode the delta lengths to get exact sizes or do you think the estimates are good enough?

For the output chunking, the granularity we care about right now is just at the page level. The individual row string sizes don't matter. Is it possible to know total page string size without the full decode? Also, how good would the estimate be?

Comment on lines 464 to 472
size_type str_bytes = 0;
if (s->page.encoding == Encoding::DELTA_BYTE_ARRAY) {
// this must be called by all threads
str_bytes = gpuDeltaPageStringSize(s, t);
} else if (t < warp_size) {
// single warp
str_bytes = gpuDecodeTotalPageStringSize(s, t);
}
if (t == 0) { s->page.str_bytes = str_bytes; }
Contributor

Suggest refactoring gpuDecodeTotalPageStringSize as the single entry point and doing this check inside of it.

Contributor Author

that was the other option :)

Another option is to break out delta_byte_array as its own kernel and only call it when necessary (like we do with the decoding kernels). Then we don't take the shared memory hit unless we have to.
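
For illustration, here is a minimal sketch of what that single-entry-point shape could look like, assuming the gpuDeltaPageStringSize helper from the diff above; the non-delta helper name and the exact signature are hypothetical, not the actual cudf code.

// Hypothetical sketch only: fold the encoding check into one entry point so
// the caller no longer branches on the encoding. gpuDeltaPageStringSize is the
// helper from the diff above; gpuNonDeltaPageStringSize is a made-up name for
// the existing warp-level path.
__device__ size_type gpuDecodeTotalPageStringSize(page_state_s* s, int t)
{
  if (s->page.encoding == Encoding::DELTA_BYTE_ARRAY) {
    // this path must be called by all threads in the block
    return gpuDeltaPageStringSize(s, t);
  }
  // the remaining string encodings only need a single warp
  return t < warp_size ? gpuNonDeltaPageStringSize(s, t) : 0;
}

// the call site then reduces to:
//   size_type const str_bytes = gpuDecodeTotalPageStringSize(s, t);
//   if (t == 0) { s->page.str_bytes = str_bytes; }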

@etseidl
Contributor Author

etseidl commented Jan 30, 2024

> For the output chunking, the granularity we care about right now is just at the page level. The individual row string sizes don't matter. Is it possible to know total page string size without the full decode?

For DELTA_LENGTH_BYTE_ARRAY, it's much like plain encoding, so if we know the size of the encoded lengths, we can subtract that from the page size. Finding the end of the encoded lengths isn't terribly expensive, so it's probably worthwhile to do that correctly.

For DELTA_BYTE_ARRAY it's another story. To get the string size we need to decode the prefix and suffix lengths and sum them; estimating a size any other way is pretty much impossible, since a column with a lot of repetition might explode in size when decoded. I'd have to do some timings, but I don't know yet whether decoding the lengths is ruinously expensive (probably the same order of magnitude as the current time to traverse a plain-encoded page). The decoder does add 2k to the shared memory budget, though.

Edit:
Did a quick check of PLAIN vs DICT vs DELTA_LENGTH_BYTE_ARRAY, and found that for a file with 76M rows of small strings, gpuComputePageSizes takes ~160ms for PLAIN, ~40ms for DICT, and about 5ms for DELTA_LENGTH_BYTE_ARRAY, so no concern there. Now to whip up a file with DELTA_BYTE_ARRAY...

Edit 2:
DELTA_BYTE_ARRAY is closer to DICT than to PLAIN: ~32ms.
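
For reference, a rough sketch of the two size computations described earlier in this comment; the helper and stream types below (find_end_of_lengths, length_stream and its has_next()/next() methods) are illustrative stand-ins, not the actual cudf delta decoder API.

// DELTA_LENGTH_BYTE_ARRAY: a delta-encoded block of lengths followed by the
// concatenated character data, so the total string bytes are simply the page
// size minus the size of the encoded-lengths block.
__device__ size_t delta_length_page_string_size(uint8_t const* page_start,
                                                uint8_t const* page_end)
{
  uint8_t const* char_data = find_end_of_lengths(page_start, page_end);
  return static_cast<size_t>(page_end - char_data);
}

// DELTA_BYTE_ARRAY: each string is stored as (prefix length, suffix length,
// suffix chars), so the decoded size is the sum of both length streams and
// cannot be known without decoding them.
__device__ size_t delta_byte_array_string_size(length_stream prefixes,
                                               length_stream suffixes)
{
  size_t total = 0;
  while (prefixes.has_next() && suffixes.has_next()) {
    total += prefixes.next() + suffixes.next();
  }
  return total;
}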

@nvdbaranec
Contributor

Sounds like doing it the accurate way is the way to go. We could live with an approximation if it were conservative (an overestimate), but if the estimate can be too small we risk actually producing too much output (which is only really an issue when output chunking is used to limit column lengths, but that's an important case).

Contributor

@hyperbolic2346 hyperbolic2346 left a comment

Nothing but nits and a couple of questions for clarification.

cpp/src/io/parquet/decode_preprocess.cu (outdated review thread, resolved)
switch (s->page.encoding) {
case Encoding::PLAIN_DICTIONARY:
case Encoding::RLE_DICTIONARY:
// TODO: make this block-based instead of just 1 warp
Contributor

Is this TODO for this PR or do we need a tracking issue for it?

Contributor Author

It's an old TODO I moved from outside the function. @nvdbaranec do you think this TODO is still relevant?

Contributor

I think it's fine to delete. At this point any optimization effort needs to start at the top and look at the whole picture anyway.

Comment on lines 159 to 161
// TODO: since this is really just an estimate, we could just return
// s->dict_size (overestimate) or
// s->dict_size - sizeof(int) * s->page.num_input_values (underestimate)
Contributor

Am I mistaken, or was it discussed and decided that we'd find the real size?

Contributor Author

@etseidl etseidl Jan 31, 2024

In my mind the discussion was more about the delta encodings... this is more a note that there could be a faster way to do this (especially since the plain string size calculation is agonizingly slow). Unfortunately, at this point we don't yet know how many values are present, so we can't get an exact number of string bytes. But if an overestimate is OK, we could just use dict_size and save a lot of time in this step. Not sure if that's something to address in this PR.

Edit: actually, for V2 headers, we do know how many values there are. I'm going to change this and get rid of the TODO.
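
For concreteness, a hedged sketch of that exact computation: with a V2 page header the null count is known, so the 4-byte length prefixes of a PLAIN-encoded string page can be subtracted exactly. The field names mirror the snippet above, but the real cudf structs may differ.

// Sketch only, assuming dict_size is the size of the PLAIN-encoded page data
// and the V2 header reports num_nulls, so the number of values actually
// written to the page is known.
__device__ size_type plain_page_string_size_v2(page_state_s const* s)
{
  int const num_written = s->page.num_input_values - s->page.num_nulls;
  // PLAIN byte arrays are stored as a 4-byte length followed by the chars
  return s->dict_size - static_cast<size_type>(sizeof(int32_t)) * num_written;
}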

@etseidl etseidl requested a review from nvdbaranec January 31, 2024 17:48
@vuule
Contributor

vuule commented Feb 6, 2024

/ok to test

@vuule added the 5 - Ready to Merge (Testing and reviews complete, ready to merge) label Feb 6, 2024
@hyperbolic2346
Contributor

/ok to test

@hyperbolic2346
Contributor

/ok to test

@hyperbolic2346
Contributor

/ok to test

@vuule
Contributor

vuule commented Feb 27, 2024

/ok to test

@vuule
Contributor

vuule commented Feb 28, 2024

/ok to test

@etseidl
Contributor Author

etseidl commented Feb 28, 2024

Hope springs eternal 🥲

@vuule
Contributor

vuule commented Feb 29, 2024

> Hope springs eternal 🥲

A fix has been merged, so there is hope. I'm just not sure if it's too early.

@vuule
Contributor

vuule commented Feb 29, 2024

/ok to test

@vuule
Contributor

vuule commented Mar 1, 2024

/merge

@rapids-bot rapids-bot bot merged commit 56a3b8f into rapidsai:branch-24.04 Mar 1, 2024
76 checks passed
@etseidl etseidl deleted the delta_chunked branch March 1, 2024 02:21
Labels
  • 5 - Ready to Merge: Testing and reviews complete, ready to merge
  • bug: Something isn't working
  • cuIO: cuIO issue
  • libcudf: Affects libcudf (C++/CUDA) code
  • non-breaking: Non-breaking change