You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Spark Structured Streaming runner supports Datasets that already have Schema information. This is used by Spark to optimize jobs (via Catalyst). This issue is to implement optimized translations of the transforms for the runner so we can benefit of the performance improvements internally done by Spark.
Notice that we also may need to map Beam's core internal representations like WindowedValue so we can have intermediary optimizations.
Imported from Jira BEAM-9451. Original Jira may contain additional context.
Reported by: iemejia.
The text was updated successfully, but these errors were encountered:
Spark Structured Streaming runner supports Datasets that already have Schema information. This is used by Spark to optimize jobs (via Catalyst). This issue is to implement optimized translations of the transforms for the runner so we can benefit of the performance improvements internally done by Spark.
Notice that we also may need to map Beam's core internal representations like WindowedValue so we can have intermediary optimizations.
Imported from Jira BEAM-9451. Original Jira may contain additional context.
Reported by: iemejia.
The text was updated successfully, but these errors were encountered: