-
Notifications
You must be signed in to change notification settings - Fork 222
Reading parquet file with multiple row groups and nested nullable struct types panics #1249
Comments
From #1248:
|
A minimal reproducible example of arrow2/parquet2 writing corrupt files:
|
I believe that this is a bug in the writer. Specifically, we cannot make the call here: https://github.com/jorgecarleitao/arrow2/blob/main/src/io/parquet/write/primitive/nested.rs#L34 without identifying which elements represents null slots of a struct array. |
File: 6570499c-3be5-4de5-beb4-73b11c15ea39.parquet.zip
File was generated after making the following fix to translate a deeply nested avro file to a corresponding parquet file using parquet2/arrow2:
#1248
Note: This file works with the official
parquet-mr
Java reader, but also seems to break the pyarrow reader in addition to arrow2.error:
The text was updated successfully, but these errors were encountered: