Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

openlineage, gcs: add openlineage methods for GcsToGcsOperator #31350

Merged
merged 1 commit into from
Jul 27, 2023

Conversation

mobuchowski
Copy link
Contributor

@mobuchowski mobuchowski commented May 17, 2023

This PR adds OpenLineage support for GcsToGcsOperator.

Copy link
Collaborator

@sunank200 sunank200 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we should add documentation around this along with the following changes

airflow/providers/google/cloud/transfers/gcs_to_gcs.py Outdated Show resolved Hide resolved
return None

return OperatorLineage(
inputs=[
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is this not required @mobuchowski ?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think it's required here

@mobuchowski
Copy link
Contributor Author

@sunank200

Add run facet which captures the error as per https://github.com/OpenLineage/OpenLineage/blob/main/spec/OpenLineage.md#run-facets
Add test case for error scenario where runfacet is populated

In general case it's hard to capture errors now, since on_task_instance_failed does not pass actual error message.
We could store it somewhere on task_instance maybe.

@mobuchowski mobuchowski force-pushed the openlineage-gcs-operator branch 2 times, most recently from 9af4382 to edf62f9 Compare May 25, 2023 12:24
@mobuchowski mobuchowski force-pushed the openlineage-gcs-operator branch 2 times, most recently from 65afad8 to cf86fb7 Compare June 15, 2023 15:02
@pankajkoti pankajkoti removed their request for review July 12, 2023 12:22
@mobuchowski mobuchowski force-pushed the openlineage-gcs-operator branch 4 times, most recently from 26d29bb to 0b83011 Compare July 24, 2023 11:52
return None

return OperatorLineage(
inputs=[
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is this not required @mobuchowski ?

Copy link
Collaborator

@sunank200 sunank200 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Once test passes it should be good to be merged.

@eladkal eladkal merged commit b733667 into apache:main Jul 27, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area:providers provider:google Google (including GCP) related issues provider:openlineage AIP-53
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants