Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] While reading ORC file with nested struct column, dtype is missing child column name #9133

Closed
rgsl888prabhu opened this issue Aug 27, 2021 · 0 comments · Fixed by #9134
Assignees
Labels
bug Something isn't working

Comments

@rgsl888prabhu
Copy link
Contributor

Describe the bug
While reading ORC file with nested struct column, dtype is missing child column name.
Thank you @isVoid for reporting the issue.

Steps/Code to reproduce bug

import pandas as pd, pyarrow as pa, cudf

# Create a nested struct type column
table = pa.Table.from_pandas(pd.DataFrame([{'col': {'a': {'b': 42}}}]))
pa.orc.write_table(table, './test.orc')

# Reads it
df = cudf.read_orc('./test.orc')
df['col']._column.dtype # StructDtype({'a': StructDtype({'0': dtype('int64')})})

Expected behavior
Should have child column name as well.

Environment overview (please complete the following information)

  • Environment location: Bare-metal
  • Method of cuDF install: from source
@rgsl888prabhu rgsl888prabhu added bug Something isn't working Needs Triage Need team to review and classify labels Aug 27, 2021
@rgsl888prabhu rgsl888prabhu self-assigned this Aug 27, 2021
rapids-bot bot pushed a commit that referenced this issue Aug 27, 2021
Parent column was being processed first and then child column, so child column was missing the dtype information. It has been fixed.

closes #9133

Authors:
  - Ram (Ramakrishna Prabhu) (https://github.com/rgsl888prabhu)

Approvers:
  - Ashwin Srinath (https://github.com/shwina)

URL: #9134
@bdice bdice removed the Needs Triage Need team to review and classify label Mar 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants