[FEA] Add encoding_stats
footer metadata to the parquet writer
#15313
Labels
Milestone
encoding_stats
footer metadata to the parquet writer
#15313
Is your feature request related to a problem? Please describe.
The
parquet-cpp-arrow
writer includes ColumnChunkencoding_stats
after the ColumnChunk statistics in the Parquet file footer. The encoding stats are useful for providing a total page count, trackingRLE_DICTIONARY
fallback toPLAIN
encoding, and verifying optional V2 encodings such asDELTA_BYTE_ARRAY
andDELTA_LENGTH_BYTE_ARRAY
.Parquet-tools is a simple command line interface to learn more about a parquet file.
Here is an example of the
encoding_stats
data from the writerparquet-cpp-arrow version 14.0.2
parquet-tools inspect --detail cpp-arrow.pq
parquet-tools inspect --detail cudf.pq
The text was updated successfully, but these errors were encountered: