Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Parquet] Add benchmark and test for writing NaNs to Parquet #6955

Merged
merged 2 commits into from
Jan 8, 2025

Conversation

adamreeve
Copy link
Contributor

Which issue does this PR close?

This doesn't close an issue but adds a benchmark that should be improved when #6952 is fixed.

Rationale for this change

Splits out the benchmark from the fix for #6952 to make it easier to compare performance, as requested in #6953 (comment)

What changes are included in this PR?

  • Adds a unit test for round-tripping floating point NaNs via Parquet
  • Adds a benchmark for writing floating point values to Parquet where 50% of values are NaN

Are there any user-facing changes?

No

@github-actions github-actions bot added parquet Changes to the parquet crate arrow Changes to the arrow crate labels Jan 8, 2025
Copy link
Contributor

@alamb alamb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @adamreeve 🙏 -- looks good to me

@alamb alamb merged commit d0260fc into apache:main Jan 8, 2025
26 checks passed
@adamreeve adamreeve deleted the nan-dict-encode-benchmark branch January 8, 2025 22:24
CurtHagenlocher pushed a commit to CurtHagenlocher/arrow-rs that referenced this pull request Jan 13, 2025
…6955)

* Add test and benchmarks for writing floats with NaNs

* Remove extra benchmark with no NaNs
svencowart pushed a commit to elastiflow/arrow-rs that referenced this pull request Jan 14, 2025
…6955)

* Add test and benchmarks for writing floats with NaNs

* Remove extra benchmark with no NaNs
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
arrow Changes to the arrow crate parquet Changes to the parquet crate
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants