Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEA] Support reading binary data types from Parquet as binary (not strings) #5416

Closed
tgravescs opened this issue May 3, 2022 · 4 comments · Fixed by #6161
Closed

[FEA] Support reading binary data types from Parquet as binary (not strings) #5416

tgravescs opened this issue May 3, 2022 · 4 comments · Fixed by #6161
Assignees
Labels
cudf_dependency An issue or PR with this label depends on a new feature in cudf feature request New feature or request

Comments

@tgravescs
Copy link
Collaborator

Is your feature request related to a problem? Please describe.
Currently we support reading binary as String from parquet if the schema is specifically specified and the parquet file schema is binary type. But CUDF doesn't support reading those binary columns as binary. Once CUDF supports it we should add support and tests on the plugin side.

@tgravescs tgravescs added feature request New feature or request ? - Needs Triage Need team to review and classify cudf_dependency An issue or PR with this label depends on a new feature in cudf labels May 3, 2022
@tgravescs
Copy link
Collaborator Author

cudf issue: rapidsai/cudf#10733

@sameerz sameerz removed the ? - Needs Triage Need team to review and classify label May 3, 2022
@viadea
Copy link
Collaborator

viadea commented Jun 3, 2022

The use case needs the Array[Binary] reading support as well. If I need to open a separate feature request , please feel free to let me know.

@tgravescs
Copy link
Collaborator Author

Note filed rapidsai/cudf#11044 to be able to read binary as binary and not strings

@tgravescs tgravescs changed the title [FEA] Support reading binary data types from Parquet [FEA] Support reading binary data types from Parquet as binary (not strings) Jul 19, 2022
@razajafri razajafri self-assigned this Jul 28, 2022
@razajafri
Copy link
Collaborator

Closed as part of #6292

There is a follow-on #6480

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cudf_dependency An issue or PR with this label depends on a new feature in cudf feature request New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

4 participants