You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Closes#8850
Adds ability to select specific children of a nested column. The python API mimics pyarrow and the format is
```python
cudf.read_parquet("test.parquet", columns=["struct1.child1.grandchild2", "struct1.child2"])
```
The C++ API takes each path as a vector
```c++
cudf::io::parquet_reader_options read_args =
cudf::io::parquet_reader_options::builder(cudf::io::source_info(filepath))
.columns({{"struct1", "child1", "grandchild2"},
{"struct1", "child2"}});
```
Authors:
- Devavret Makkar (https://github.com/devavret)
Approvers:
- GALI PREM SAGAR (https://github.com/galipremsagar)
- Vukasin Milovanovic (https://github.com/vuule)
- Christopher Harris (https://github.com/cwharris)
URL: #8933
Similar to #8848 , we should also allow nested column pruning in parquet reader. It has also been requested here: #7248 (comment)
The text was updated successfully, but these errors were encountered: