Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

upgrade beam to 2.46 #227

Merged
merged 2 commits into from
Mar 24, 2023
Merged

upgrade beam to 2.46 #227

merged 2 commits into from
Mar 24, 2023

Conversation

ohnorobo
Copy link
Collaborator

@ohnorobo ohnorobo commented Mar 23, 2023

Our data dropping problems are potentially related to apache/beam#24535 and fixed by apache/beam#25101 on 2023-01-20

Fixed Python BigQuery Batch Load write may truncate valid data when deposition sets to WRITE_TRUNCATE and incoming data is large (Python) [#24623].

That fix was released in Release-2.45.0 on Feb 15th. This updates to the latest release.

The earliest report of the bug is 2022-09-20 here

Running a full backfill job using this version here which reported 17,294,627,206 rows at the bq write stage, and saw that the corresponding table had the same number of rows.

@ohnorobo ohnorobo requested a review from agiix March 24, 2023 09:27
@ohnorobo ohnorobo merged commit fc6c2a0 into master Mar 24, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants