-
Notifications
You must be signed in to change notification settings - Fork 14.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SparkSubmit Connection Extras can be overridden #36151
SparkSubmit Connection Extras can be overridden #36151
Conversation
cc @eladkal |
8337473
to
8c8d7bf
Compare
@@ -204,8 +213,8 @@ def _resolve_connection(self) -> dict[str, Any]: | |||
|
|||
# Determine optional yarn queue from the extra field | |||
extra = conn.extra_dejson | |||
conn_data["queue"] = extra.get("queue") | |||
conn_data["deploy_mode"] = extra.get("deploy-mode") | |||
conn_data["queue"] = self._queue if self._queue else extra.get("queue") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Override the values if passed otherwise fetch from Spark's Connection Extras
8c8d7bf
to
b65d985
Compare
ed36a39
to
681002a
Compare
4108233
to
38c45e6
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Might be worth a rebase to see if that failing integration is flaky, an actual issue, or resolved by another commit.
38c45e6
to
f7bf58b
Compare
377cf34
to
f7bf58b
Compare
This reverts commit 8c8d7bf27575df69b03747688b23a1a292206c5a.
f7bf58b
to
820b2fa
Compare
closes: #35911
Description
Currently there are some arguments which are being provided using Spark Connection,
but there is no way to override them in SparkSubmitOperator and SparkSubmitHook
eg.
--queue: option specifies the YARN queue to which the application should be submitted.
--deploy-mode: option specified deploy mode client/cluster
more - https://spark.apache.org/docs/3.2.0/running-on-yarn.html
Use case/motivation
These use-cases are particularly useful in a multi-tenant environment where different users or groups have allocated resources in specific YARN queues, or want to use different deploy mode in each spark submit job which might be different from option provided in Spark Connection Extras.