[FEA] Respect set_output_as_binary in ORC writer #14375
Labels: 0 - Backlog, cuIO, feature request, libcudf
Is your feature request related to a problem? Please describe.
The cuDF ORC writer does not respect the set_output_as_binary option, which was added for the Parquet writer to write a string column with binary type.
Describe the solution you'd like
The ORC writer should respect the set_output_as_binary option (added in #6816) as well.
Describe alternatives you've considered
We could copy the table to host and use the Arrow ORC writer or similar instead, but this would mean more copying.
Additional context
Quick demo using libcudf 23.10: https://github.com/lidavidm/cudf-orc-binary-feature-request