You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
arrow-rs$ cargo test -p arrow-ipc --lib reader::tests::test_123
Finished test [unoptimized + debuginfo] target(s) in 0.05s
Running unittests src/lib.rs (target/debug/deps/arrow_ipc-b6339780ea47b538)
running 1 test
test reader::tests::test_123 ... FAILED
failures:
---- reader::tests::test_123 stdout ----
thread 'reader::tests::test_123' panicked at arrow-ipc/src/reader.rs:1862:14:
called `Result::unwrap()` on an `Err` value: ParseError("Unable to get root as footer: Utf8Error { error: Utf8Error { valid_up_to: 1, error_len: Some(1) }, range: 208..40208, error_trace: ErrorTrace([TableField { field_name: \"value\", position: 200 }, VectorElement { index: 0, position: 96 }, TableField { field_name: \"custom_metadata\", position: 88 }, TableField { field_name: \"schema\", position: 24 }]) }")
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
failures:
reader::tests::test_123
test result: FAILED. 0 passed; 1 failed; 0 ignored; 0 measured; 36 filtered out; finished in 0.00s
error: test failed, to rerun pass `-p arrow-ipc --lib`
Expected behavior
Should be able to read file successfully.
Additional context
Though error likely lies upstream with flatbuffers, maybe there is a way we can allow the ipc reader to ignore invalid custom_metadata via user configuration?
The text was updated successfully, but these errors were encountered:
This sounds like the old issue of python extension types storing pickle data in a UTF-8 field without escaping it. This is an upstream bug IMO, but I would not be adverse to finding some way to skip such data
Describe the bug
Given a feather written file from PyArrow, when using the arrow-ipc reader to read this file, a flatbuffers ParseError is thrown due to invalid UTF8
To Reproduce
Given this ORC file:
https://github.com/apache/orc/blob/fa627ec6d7c72289c8a83632e6a43ae48603fc4b/examples/TestOrcFile.metaData.orc
When using PyArrow 15.0.0 to read and write it out to feather:
Then trying to read this file with arrow-ipc:
It throws error:
Expected behavior
Should be able to read file successfully.
Additional context
Though error likely lies upstream with flatbuffers, maybe there is a way we can allow the ipc reader to ignore invalid custom_metadata via user configuration?
The text was updated successfully, but these errors were encountered: