-
Notifications
You must be signed in to change notification settings - Fork 93
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Read multiple row groups in Parquet files correctly (#3950)
* Read multiple row groups correctly Iterate through the column's row groups while maintaining a count of the total items read, and terminate the loop when the specified number of items have been read. Signed-off-by: John H. Hartman <[email protected]> * Skip values and count values read properly The variable skipIdx contains the number of values to be skipped in the column prior to reading values. Skipping is done one row group at a time, so this value must be updated as each row group is skipped. Also, readColumnDbFl and readColumnIrregularBitWidth now return the number of values read, so that ReadColumn increments the index into the output array properly. --------- Signed-off-by: John H. Hartman <[email protected]> Co-authored-by: John H. Hartman <[email protected]> Co-authored-by: ajpotts <[email protected]>
- Loading branch information
1 parent
e825f70
commit 091b8dd
Showing
1 changed file
with
25 additions
and
16 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters