Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-50492][SS] Fix java.util.NoSuchElementException when event time column is dropped after dropDuplicatesWithinWatermark #49065

Closed

Conversation

liviazhu-db
Copy link
Contributor

What changes were proposed in this pull request?

Update DeduplicateWithinWatermark references to include all attributes that could be the watermarking column.

Why are the changes needed?

Fix java.util.NoSuchElementException due to ColumnPruning.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Added unit test

Was this patch authored or co-authored using generative AI tooling?

No.

@HeartSaVioR
Copy link
Contributor

Could you please sync your fork with OSS and rebase your PR to be on top of recent master branch? It's on top of the commit which was 3 weeks ago. GA failure looks to be related to the out of sync.

Copy link
Contributor

@HeartSaVioR HeartSaVioR left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Only nits. Nice finding and thanks for the fix!

@liviazhu-db liviazhu-db changed the title [SPARK-50492] [SS] Fix java.util.NoSuchElementException when watermark column is dropped after dropDuplicatesWithinWatermark [SPARK-50492] [SS] Fix java.util.NoSuchElementException when event time column is dropped after dropDuplicatesWithinWatermark Dec 5, 2024
Copy link
Contributor

@HeartSaVioR HeartSaVioR left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

@HeartSaVioR HeartSaVioR changed the title [SPARK-50492] [SS] Fix java.util.NoSuchElementException when event time column is dropped after dropDuplicatesWithinWatermark [SPARK-50492][SS] Fix java.util.NoSuchElementException when event time column is dropped after dropDuplicatesWithinWatermark Dec 6, 2024
@HeartSaVioR
Copy link
Contributor

Thanks! Merging to master/3.5.

HeartSaVioR pushed a commit that referenced this pull request Dec 6, 2024
…e column is dropped after dropDuplicatesWithinWatermark

### What changes were proposed in this pull request?

Update `DeduplicateWithinWatermark` references to include all attributes that could be the watermarking column.

### Why are the changes needed?

Fix `java.util.NoSuchElementException` due to ColumnPruning.

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

Added unit test

### Was this patch authored or co-authored using generative AI tooling?

No.

Closes #49065 from liviazhu-db/liviazhu-db/dedup-watermark-fix.

Authored-by: Livia Zhu <[email protected]>
Signed-off-by: Jungtaek Lim <[email protected]>
(cherry picked from commit 851f5f2)
Signed-off-by: Jungtaek Lim <[email protected]>
@LuciferYang
Copy link
Contributor

@LuciferYang
Copy link
Contributor

LuciferYang commented Dec 6, 2024

Try fix: #49087

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants