-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
🐛 Source S3: timestamp parquet data #6613
Conversation
Maksym Pavlenok seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account. You have signed the CLA already but the status is still pending? Let us recheck it. |
/test connector=connectors/source-s3
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good technically, but maybe it could be slightly better structured to be more readable? Feels like this is creating another location where parquet types are being defined on top of the PARQUET_TYPES map. Could this all be unified to be a bit more clear + some comments/docstrings on this logic?
...-integrations/connectors/source-s3/source_s3/source_files_abstract/formats/parquet_parser.py
Outdated
Show resolved
Hide resolved
...-integrations/connectors/source-s3/source_s3/source_files_abstract/formats/parquet_parser.py
Show resolved
Hide resolved
...-integrations/connectors/source-s3/source_s3/source_files_abstract/formats/parquet_parser.py
Outdated
Show resolved
Hide resolved
…es_abstract/formats/parquet_parser.py Co-authored-by: George Claireaux <[email protected]>
/test connector=connectors/source-s3
|
@Phlair , I've moved all type mappings (pyarrow, json) to a single place. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Like it, nice one!
* fix datetime parquet data * Update airbyte-integrations/connectors/source-s3/source_s3/source_files_abstract/formats/parquet_parser.py Co-authored-by: George Claireaux <[email protected]> * aggregate pyarrow types Co-authored-by: Maksym Pavlenok <[email protected]> Co-authored-by: George Claireaux <[email protected]>
What
Describe what the change is solving
It helps to add screenshots if it affects the frontend.
How
Describe the solution
Recommended reading order
x.java
y.python
Pre-merge Checklist
Expand the relevant checklist and delete the others.
Updating a connector
Community member or Airbyter
airbyte_secret
./gradlew :airbyte-integrations:connectors:<name>:integrationTest
.README.md
bootstrap.md
. See description and examplesdocs/integrations/<source or destination>/<name>.md
including changelog. See changelog exampleAirbyter
If this is a community PR, the Airbyte engineer reviewing this PR is responsible for the below items.
/test connector=connectors/<name>
command is passing./publish
command described here