Ballista: Implement scalable distributed joins #634

andygrove · 2021-06-27T15:43:19Z

Which issue does this PR close?

Closes #63.

This PR removes previous hacks around partitioning and now faithfully translates the DataFusion query plan, including RepartitionExec. I have tested with TPC-H query 12 and see consistent results between DataFusion and Ballista with the 100GB data set, where each table has 8 partitions. I have tested with multiple executors as well as single executors.

There is more work to do but I think this is at a good point to merge since it fixes some correctness issues.

Rationale for this change

Ballista cannot scale well without this because work is duplicated across all partitions to load the entire left side of the join into memory currently.

What changes are included in this PR?

Enables RepartitionExec in Ballista query plans and translate them to shuffles
Removes previous hacks intended to detect changes in partitioning

Are there any user-facing changes?

Query plans will change.

andygrove · 2021-07-03T13:49:22Z

@edrevo fyi

Dandandan · 2021-07-03T15:04:59Z

ballista/rust/core/src/utils.rs

-        .with_repartition_joins(false)
-        .with_repartition_aggregations(false)
-        .with_physical_optimizer_rules(rules);
+    let config = ExecutionConfig::new().with_concurrency(2); // TODO: this is hack to enable partitioned joins


What is the idea here for later? I guess the repartitioning needs to be applied with concurrency=1 too to avoid inefficient plans?

I filed https://github.com/apache/arrow-datafusion/issues/661 to discuss this

Dandandan

Amazing 😎😎😎

…ery plan output

jorgecarleitao

Ready to merge; very neat solution! 💯

github-actions bot added ballista datafusion Changes in the datafusion crate labels Jun 27, 2021

andygrove force-pushed the ballista-scalable-join branch from f597c0c to 8acdd12 Compare June 30, 2021 12:31

Refactor Ballista planner to support RepartitionExec

6f4cfd8

andygrove force-pushed the ballista-scalable-join branch from 8acdd12 to 6f4cfd8 Compare July 3, 2021 13:44

andygrove changed the title ~~Ballista: Implement scalable distributed joins [DRAFT]~~ Ballista: Implement scalable distributed joins Jul 3, 2021

andygrove marked this pull request as ready for review July 3, 2021 13:48

andygrove requested review from Dandandan, jorgecarleitao and alamb July 3, 2021 13:49

Dandandan reviewed Jul 3, 2021

View reviewed changes

Dandandan approved these changes Jul 3, 2021

View reviewed changes

Improve tests and replace MergeExec with CoalescePartitionsExec in qu…

b54a351

…ery plan output

jorgecarleitao approved these changes Jul 3, 2021

View reviewed changes

Dandandan merged commit 9314dbb into apache:master Jul 4, 2021

houqp added the enhancement New feature or request label Jul 29, 2021

andygrove deleted the ballista-scalable-join branch February 6, 2022 17:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ballista: Implement scalable distributed joins #634

Ballista: Implement scalable distributed joins #634

andygrove commented Jun 27, 2021 •

edited

Loading

andygrove commented Jul 3, 2021

Dandandan Jul 3, 2021

andygrove Jul 3, 2021

Dandandan left a comment

jorgecarleitao left a comment

Ballista: Implement scalable distributed joins #634

Ballista: Implement scalable distributed joins #634

Conversation

andygrove commented Jun 27, 2021 • edited Loading

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are there any user-facing changes?

andygrove commented Jul 3, 2021

Dandandan Jul 3, 2021

Choose a reason for hiding this comment

andygrove Jul 3, 2021

Choose a reason for hiding this comment

Dandandan left a comment

Choose a reason for hiding this comment

jorgecarleitao left a comment

Choose a reason for hiding this comment

andygrove commented Jun 27, 2021 •

edited

Loading