Is your feature request related to a problem? Please describe.
BYTE_STREAM_SPLIT is the only remaining Parquet encoding that cuDF does not support. The encoding was previously limited to FLOAT and DOUBLE columns, but a current proposal (apache/parquet-format#229, and jira) would extend it to all fixed-width data types. When coupled with compression, this encoding can provide significant space savings, and its implementation is less CPU intensive than DELTA_BINARY_PACKED encoding.
Describe the solution you'd like
Implement BYTE_STREAM_SPLIT encoding and decoding in cuDF.
Describe alternatives you've considered
Not supporting this encoding.
Additional context
This encoding is a fairly straightforward reimplementation of PLAIN encoding, but it may lead to memory performance issues because the input/output buffers must be accessed with a large stride.
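For reference, here is a minimal CPU-side sketch of the layout in Python, to show where the strided access comes from. It is not cuDF code: the function names `byte_stream_split_encode` and `byte_stream_split_decode` are hypothetical, and it assumes the input is a PLAIN-encoded buffer of fixed-width little-endian values. Byte `k` of every value goes into stream `k`, and the streams are concatenated.

```python
import struct


def byte_stream_split_encode(values: bytes, width: int) -> bytes:
    """Scatter byte k of each fixed-width value into stream k, then concatenate."""
    if width <= 0 or len(values) % width != 0:
        raise ValueError("buffer length must be a multiple of the value width")
    # values[k::width] reads byte k of every value: a strided read over the input.
    return b"".join(values[k::width] for k in range(width))


def byte_stream_split_decode(encoded: bytes, width: int) -> bytes:
    """Inverse of the encoder: gather byte k of each value back from stream k."""
    if width <= 0 or len(encoded) % width != 0:
        raise ValueError("buffer length must be a multiple of the value width")
    count = len(encoded) // width
    out = bytearray(len(encoded))
    for k in range(width):
        # Stream k is contiguous in the input but is written at stride `width`.
        out[k::width] = encoded[k * count:(k + 1) * count]
    return bytes(out)


# Round trip over three FLOAT values stored as PLAIN (little-endian, 4 bytes each).
plain = struct.pack("<3f", 1.0, 2.0, 3.0)
encoded = byte_stream_split_encode(plain, 4)
assert byte_stream_split_decode(encoded, 4) == plain
```

In this sketch the encoder reads with a stride of `width` bytes and the decoder writes with that same stride. A GPU implementation would presumably need to arrange its reads and writes so that this stride does not hurt memory throughput.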