Describe the bug
While reading an ORC file with a nested struct column, the dtype is missing the child column name.
Thank you @isVoid for reporting the issue.
Steps/Code to reproduce bug
    import pandas as pd, pyarrow as pa, cudf

    # Create a nested struct type column
    table = pa.Table.from_pandas(pd.DataFrame([{'col': {'a': {'b': 42}}}]))
    pa.orc.write_table(table, './test.orc')

    # Reads it
    df = cudf.read_orc('./test.orc')
    df['col']._column.dtype
    # StructDtype({'a': StructDtype({'0': dtype('int64')})})
Expected behavior
The dtype should include the child column name as well: the inner field should be reported as 'b', not as the positional placeholder '0'.
Environment overview (please complete the following information)
Fixes missing child column name in dtype while reading ORC file. (#9134)
e222584
The parent column was being processed before its child column, so the child column was missing the dtype information. This has been fixed.
closes #9133
Authors:
- Ram (Ramakrishna Prabhu) (https://github.com/rgsl888prabhu)
Approvers:
- Ashwin Srinath (https://github.com/shwina)
URL: #9134
rgsl888prabhu
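The ordering problem described in the commit message can be illustrated with a small, self-contained sketch. This is not cuDF's actual reader code: the `Node` class and the `build_dtype` helper are hypothetical names invented here, and plain dicts stand in for `StructDtype`. It only shows the general idea that a nested struct's field names are available to the parent when each child is resolved before the parent is assembled.

```python
from dataclasses import dataclass, field


# Hypothetical schema node for illustration only; not a cuDF data structure.
@dataclass
class Node:
    name: str
    kind: str  # "struct" or a leaf type name such as "int64"
    children: list = field(default_factory=list)


def build_dtype(node):
    """Resolve every child first (post-order), then assemble the parent.

    A plain dict stands in for a struct dtype, keyed by child column name.
    """
    if node.kind != "struct":
        return node.kind
    return {child.name: build_dtype(child) for child in node.children}


# Same shape as the reproducer: col -> a -> b (int64)
schema = Node("col", "struct", [Node("a", "struct", [Node("b", "int64")])])
dtype = build_dtype(schema)
print(dtype)
```

With child-first resolution, the inner field keeps its real name `b` instead of a positional placeholder.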