
Settings in ExecuteQueryParams are ignored by Ballista's scheduler.execute_query(), causing a wrong partition count #1848

Closed
mingmwang opened this issue Feb 17, 2022 · 7 comments
Labels
bug Something isn't working

Comments

@mingmwang
Contributor

Describe the bug

The issue is caused by the changes in #1677, which always use the ExecutionContext from the SchedulerServer.

Before the change, running TPC-H benchmark Q1 on Ballista:

[2022-02-16T08:47:59Z INFO ballista_scheduler] Adding stage 1 with 1 pending tasks
[2022-02-16T08:47:59Z INFO ballista_scheduler] Adding stage 2 with 2 pending tasks
[2022-02-16T08:47:59Z INFO ballista_scheduler] Adding stage 3 with 1 pending tasks

After the change:

[2022-02-16T08:44:57Z INFO ballista_scheduler] Adding stage 1 with 1 pending tasks
[2022-02-16T08:44:57Z INFO ballista_scheduler] Adding stage 2 with 8 pending tasks
[2022-02-16T08:44:57Z INFO ballista_scheduler] Adding stage 3 with 1 pending tasks.


To Reproduce
Run TPC-H benchmark Q1 on Ballista (as in the logs above) and compare the per-stage task counts reported by the scheduler before and after #1677.

Expected behavior

SchedulerServer should honor the configuration settings from ExecuteQueryParams.


@mingmwang
Contributor Author

@thinkharderdev Please take a look.

@mingmwang
Contributor Author

I think we need to introduce session-level state to hold any session-specific configurations instead of a globally shared ExecutionContext/ExecutionContextState. We might have a shared Ballista scheduler, and different users might submit SQL queries with different SQL configurations or shuffle settings.
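
For illustration, a minimal sketch of this session-level-state idea, using hypothetical names rather than the actual Ballista types: the scheduler keeps one set of settings per session instead of relying on a single shared ExecutionContext.

use std::collections::HashMap;
use std::sync::{Arc, RwLock};

// Hypothetical per-session settings carried with a query (illustrative subset).
#[derive(Clone, Debug)]
struct SessionConfig {
    target_partitions: usize,
    batch_size: usize,
}

// Scheduler-side registry: one config per session id instead of one global context.
#[derive(Default)]
struct SessionRegistry {
    sessions: RwLock<HashMap<String, Arc<SessionConfig>>>,
}

impl SessionRegistry {
    // Register or update the settings submitted with a query.
    fn upsert(&self, session_id: &str, config: SessionConfig) {
        self.sessions
            .write()
            .unwrap()
            .insert(session_id.to_string(), Arc::new(config));
    }

    // Look up the session config, falling back to the scheduler defaults.
    fn config_for(&self, session_id: &str, defaults: &Arc<SessionConfig>) -> Arc<SessionConfig> {
        self.sessions
            .read()
            .unwrap()
            .get(session_id)
            .cloned()
            .unwrap_or_else(|| Arc::clone(defaults))
    }
}

fn main() {
    let defaults = Arc::new(SessionConfig { target_partitions: 8, batch_size: 8192 });
    let registry = SessionRegistry::default();
    // Settings from ExecuteQueryParams override the shared defaults for that session only.
    registry.upsert("session-a", SessionConfig { target_partitions: 2, batch_size: 8192 });
    assert_eq!(registry.config_for("session-a", &defaults).target_partitions, 2);
    assert_eq!(registry.config_for("session-b", &defaults).target_partitions, 8);
}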

@thinkharderdev
Contributor

Will do. I think there are a couple of different ways we can approach this:

  1. Have the client specify a namespace in the request and use an ExecutionContext per namespace on the scheduler. We could then dynamically create new contexts whenever a new namespace comes in.
  2. Have the scheduler dynamically set the target partitions based on executor statistics (e.g. the number of available task slots). This would, I think, require a way to set the target partitions explicitly when creating a SQL plan. So maybe add a new method to ExecutionContext like

pub async fn sql(&mut self, sql: &str, target_partitions: usize) -> Result<Arc<dyn DataFrame>>

Or both. Option 1 may be necessary anyway to support multi-tenancy, but within a single namespace we may still want to allow specifying shuffle settings on a per-query basis.
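
To make option 2 concrete, here is a hedged sketch of how an explicit partition count could be approximated with the DataFusion API current at the time (ExecutionConfig/ExecutionContext); the exact paths and signatures are assumptions and may differ between versions.

use std::sync::Arc;

use datafusion::error::Result;
use datafusion::prelude::*;

// Plan a SQL query with an explicit target partition count by building a
// context configured for this request only (sketch; API names assumed).
async fn plan_with_target_partitions(
    sql: &str,
    target_partitions: usize,
) -> Result<Arc<dyn DataFrame>> {
    let config = ExecutionConfig::new().with_target_partitions(target_partitions);
    let mut ctx = ExecutionContext::with_config(config);
    ctx.sql(sql).await
}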

@thinkharderdev
Contributor

Also, good catch! Apologies for overlooking this.

@mingmwang
Contributor Author

> Will do. I think there are a couple of different ways we can approach this:
>
>   1. Have the client specify a namespace in the request and use an ExecutionContext per namespace on the scheduler. We could then dynamically create new contexts whenever a new namespace comes in.
>   2. Have the scheduler dynamically set the target partitions based on executor statistics (e.g. the number of available task slots). This would, I think, require a way to set the target partitions explicitly when creating a SQL plan. So maybe add a new method to ExecutionContext like
>
> pub async fn sql(&mut self, sql: &str, target_partitions: usize) -> Result<Arc<dyn DataFrame>>
>
> Or both. Option 1 may be necessary anyway to support multi-tenancy, but within a single namespace we may still want to allow specifying shuffle settings on a per-query basis.

I would prefer to let users choose the target partition count at the current phase. The target partition count should not change too dynamically; otherwise the distributed physical plan produced at runtime will not be stable and could introduce additional shuffle exchanges. In the future we might add some kind of adaptive method to adjust the target partition count based on input/output data volume.
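
For context, the client already has a way to choose this explicitly: roughly as in the TPC-H benchmark, it builds a BallistaConfig with the shuffle partition setting, and those settings are what ExecuteQueryParams carries to the scheduler. A hedged sketch, assuming the Ballista client API of that time (exact signatures may differ between versions):

use ballista::prelude::*;

// Client-side sketch: the user picks the shuffle partition count explicitly;
// this setting is what ExecuteQueryParams should carry to the scheduler.
fn shuffle_config(partitions: usize) -> BallistaConfig {
    BallistaConfig::builder()
        .set(BALLISTA_DEFAULT_SHUFFLE_PARTITIONS, &partitions.to_string())
        .build()
        .expect("valid Ballista configuration")
}

The resulting config is what the client ships with the query; the scheduler honoring it (e.g. planning 2 tasks for stage 2 of Q1 when partitions = 2) is exactly the expected behavior described above.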

@mingmwang
Contributor Author

Besides the target partition count, I think there are a couple of other configuration options that could be specified by users and changed dynamically, for example batch_size, parquet_pruning, repartition_windows, etc.

I searched the open issues and found a couple of configuration-related issues that are still open:

#138
#682

I think it is time to resolve those and come up with a more extensible configuration design.
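
As a starting point for such a design, here is a minimal sketch (hypothetical names, not an existing API) in which every option is declared once with a default and per-query settings are validated against the declared entries:

use std::collections::HashMap;

// One declared configuration option with its default value.
struct ConfigEntry {
    key: &'static str,
    default: &'static str,
    description: &'static str,
}

// The full set of options the scheduler understands (illustrative subset).
fn known_entries() -> Vec<ConfigEntry> {
    vec![
        ConfigEntry { key: "target_partitions", default: "8", description: "shuffle partition count" },
        ConfigEntry { key: "batch_size", default: "8192", description: "record batch size" },
        ConfigEntry { key: "parquet_pruning", default: "true", description: "enable Parquet row-group pruning" },
    ]
}

// Merge user-supplied settings over the declared defaults, rejecting unknown keys.
fn resolve(user: &HashMap<String, String>) -> Result<HashMap<String, String>, String> {
    let entries = known_entries();
    for key in user.keys() {
        if !entries.iter().any(|e| e.key == key) {
            return Err(format!("unknown configuration key: {}", key));
        }
    }
    let mut resolved: HashMap<String, String> = entries
        .iter()
        .map(|e| (e.key.to_string(), e.default.to_string()))
        .collect();
    for (k, v) in user {
        resolved.insert(k.clone(), v.clone());
    }
    Ok(resolved)
}

fn main() {
    let mut user = HashMap::new();
    user.insert("target_partitions".to_string(), "2".to_string());
    let resolved = resolve(&user).expect("only known keys");
    assert_eq!(resolved["target_partitions"], "2");
    assert_eq!(resolved["batch_size"], "8192");
}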

@mingmwang
Contributor Author

The issue is fixed.
