[BUG] Regression in NDSv2 of 4% because of spillable broadcast #6708

abellina · 2022-10-05T22:54:59Z

We found a regression of ~4% for our NDSv2 benchmark in our performance cluster after this change went in: #6604

The issue is that we acquire the semaphore too early, before the stream side has materialized its first batch. If the first stream batch requires data from a data source (like a Parquet table), we hold the semaphore while we do all of the I/O to materialize the stream side, which is not ideal. The issue is very similar to what was fixed in #4539, so the proposed fix is very similar, just for broadcasts.
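To illustrate the ordering problem, here is a minimal sketch in Python rather than the plugin's own code. Every name in it (`gpu_semaphore`, `stream_batches`, `join_acquire_first`, `join_acquire_late`) is hypothetical, and a plain `threading.Semaphore` stands in for the real semaphore. It only shows that pulling the first stream batch before acquiring moves that batch's I/O outside the held region.

```python
import threading

# Hypothetical stand-in for the GPU semaphore; not the plugin's actual API.
gpu_semaphore = threading.Semaphore(1)


def stream_batches(log, n=2):
    """Simulated stream side: each batch needs I/O before it is available."""
    for i in range(n):
        log.append(f"io:{i}")  # pretend this is a slow Parquet read
        yield i


def join_acquire_first(stream, log):
    """Regressed ordering: the semaphore is held during all stream-side I/O."""
    gpu_semaphore.acquire()
    log.append("acquire")
    try:
        return [batch * 2 for batch in stream]
    finally:
        gpu_semaphore.release()
        log.append("release")


def join_acquire_late(stream, log):
    """Proposed ordering: materialize the first batch, then acquire."""
    it = iter(stream)
    first = next(it, None)  # first batch's I/O happens without the semaphore
    if first is None:
        return []  # empty stream side: the semaphore is never taken
    gpu_semaphore.acquire()
    log.append("acquire")
    try:
        out = [first * 2]
        out.extend(batch * 2 for batch in it)
        return out
    finally:
        gpu_semaphore.release()
        log.append("release")


early_log, late_log = [], []
early_result = join_acquire_first(stream_batches(early_log), early_log)
late_result = join_acquire_late(stream_batches(late_log), late_log)
print(early_log)  # acquire comes before io:0
print(late_log)   # io:0 comes before acquire
```

Both variants produce the same result. Only the position of `acquire` relative to the first `io:` entry differs, which is the gap the proposed fix targets.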

abellina added the bug (Something isn't working) and ? - Needs Triage (Need team to review and classify) labels Oct 5, 2022

abellina added this to the Sep 26 - Oct 7 milestone Oct 5, 2022

abellina self-assigned this Oct 5, 2022

abellina added the P0 (Must have for release) label Oct 5, 2022

abellina mentioned this issue Oct 5, 2022

Take semaphore after first stream batch is materialized (broadcast) #6709

Merged

sameerz added the performance (A performance related task/issue) label and removed the ? - Needs Triage (Need team to review and classify) and bug (Something isn't working) labels Oct 5, 2022

tgravescs closed this as completed in #6709 Oct 6, 2022
