Improve the performance of filter functions #16681

hackeryang · 2023-03-23T07:07:47Z

Description

In the original implementation of ArrayFilterFunction.filter*, an additional copy of the filtered block is adopted. The speed, especially for some nested column types such as RowType is not perfect.

So we improved the implementation to record the block index position, then directly call Block.copyPositions() after the filtering is completed, in order to reduce unnecessary block copies such as Type.appendTo() and BlockBuilder.build().

Additional context and related issues

Before the change

Benchmark                             (name)  Mode  Cnt   Score   Error  Units  
BenchmarkArrayFilter.benchmark        filter  avgt   20  22.543 ± 0.979  ns/op  
BenchmarkArrayFilter.benchmarkObject  filter  avgt   20  42.045 ± 2.088  ns/op

After the change

Benchmark                             (name)  Mode  Cnt   Score   Error  Units  
BenchmarkArrayFilter.benchmark        filter  avgt   20  13.576 ± 0.849  ns/op  
BenchmarkArrayFilter.benchmarkObject  filter  avgt   20  32.051 ± 0.268  ns/op

Release notes

( ) This is not user-visible or docs only and no release notes are required.
(x) Release notes are required, please propose a release note for me.
( ) Release notes are required, with the following suggested text:

# General
*  Improve performance of `filter` function on arrays. ({issue}`16681`)

hackeryang · 2023-03-23T09:53:17Z

Hello @findepi @ebyhr @sopel39 , can you please take a review when you have time, thank you~

core/trino-main/src/test/java/io/trino/operator/scalar/BenchmarkArrayFilterObject.java

core/trino-main/src/main/java/io/trino/operator/scalar/ArrayFilterFunction.java

core/trino-main/src/test/java/io/trino/operator/scalar/BenchmarkArrayFilterObject.java

raunaqmorarka · 2023-03-23T17:49:59Z

core/trino-main/src/main/java/io/trino/operator/scalar/ArrayFilterFunction.java

            if (TRUE.equals(keep)) {
-                elementType.appendTo(arrayBlock, position, resultBuilder);
+                positions[length++] = position;
            }


Could you check if writing this part as

positions[length] = position; length += TRUE.equals(keep) ? 1 : 0;

would give better results ?

I reverted to the if clause and benchmarked again, the result of if clause was below:

Benchmark (name) Mode Cnt Score Error Units BenchmarkArrayFilter.benchmark filter avgt 20 13.320 ± 0.610 ns/op BenchmarkArrayFilter.benchmarkObject filter avgt 20 32.699 ± 0.945 ns/op

The result of ? : clause was below:

Benchmark (name) Mode Cnt Score Error Units BenchmarkArrayFilter.benchmark filter avgt 20 13.576 ± 0.849 ns/op BenchmarkArrayFilter.benchmarkObject filter avgt 20 32.051 ± 0.268 ns/op

It seems that the performance didn't have too much improvement contrast to the if clause, but i still changed to this way because the code is cleaner~

I think whether you see improvement depends on whether the input data has a significant fraction of randomly generated nulls

I think whether you see improvement depends on whether the input data has a significant fraction of randomly generated nulls

@raunaqmorarka Good question, you reminded me~ So I temporarily changed the nullRate to 0.8F in BlockAssertions#createRandomBlockForType(the former null rate was 20%):

Then i benchmarked again, below was the result(i.e. with 80% null values):

Before our improvement

Benchmark (name) Mode Cnt Score Error Units BenchmarkArrayFilter.benchmarkObject filter avgt 20 7.694 ± 0.345 ns/op

After our improvement

Benchmark (name) Mode Cnt Score Error Units BenchmarkArrayFilter.benchmarkObject filter avgt 20 7.301 ± 2.077 ns/op

So our improvement will only have a little speed up, if there are most null values in the column.

But the less null values are, the faster our improvement will be.

core/trino-main/src/test/java/io/trino/operator/scalar/BenchmarkArrayFilterObject.java

Before the change: Benchmark (name) Mode Cnt Score Error Units BenchmarkArrayFilter.benchmark filter avgt 20 22.543 ± 0.979 ns/op BenchmarkArrayFilter.benchmarkObject filter avgt 20 42.045 ± 2.088 ns/op After the change: Benchmark (name) Mode Cnt Score Error Units BenchmarkArrayFilter.benchmark filter avgt 20 13.327 ± 0.359 ns/op BenchmarkArrayFilter.benchmarkObject filter avgt 20 34.443 ± 1.943 ns/op

hackeryang · 2023-03-28T04:09:13Z

@raunaqmorarka Thank you for your advice above very much, i have modified some implementations, please review again when you have time~

cla-bot bot added the cla-signed label Mar 23, 2023

hackeryang self-assigned this Mar 23, 2023

hackeryang added the performance label Mar 23, 2023

hackeryang requested review from findepi, sopel39 and ebyhr March 23, 2023 07:11

sopel39 requested a review from raunaqmorarka March 23, 2023 10:59

findepi requested review from lukasz-stec and dain and removed request for findepi March 23, 2023 13:41

raunaqmorarka reviewed Mar 23, 2023

View reviewed changes

hackeryang added 2 commits March 28, 2023 11:44

Add benchmark for array filter object

861e25a

hackeryang force-pushed the improve_filter_object_performance branch from 9cb9d4d to 4745664 Compare March 28, 2023 03:44

hackeryang changed the title ~~Improve the performance of filter function for object column types~~ Improve the performance of filter functions Mar 28, 2023

hackeryang requested a review from raunaqmorarka March 28, 2023 04:07

raunaqmorarka approved these changes Mar 29, 2023

View reviewed changes

raunaqmorarka merged commit 500a04a into trinodb:master Mar 29, 2023

raunaqmorarka mentioned this pull request Mar 29, 2023

Release notes for 411 #16453

Closed

hackeryang deleted the improve_filter_object_performance branch March 29, 2023 07:31

github-actions bot added this to the 411 milestone Mar 29, 2023

colebow mentioned this pull request Mar 29, 2023

Add Trino 411 release notes #16552

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve the performance of filter functions #16681

Improve the performance of filter functions #16681

hackeryang commented Mar 23, 2023 •

edited by raunaqmorarka

Loading

hackeryang commented Mar 23, 2023

raunaqmorarka Mar 23, 2023

hackeryang Mar 28, 2023 •

edited

Loading

raunaqmorarka Mar 29, 2023

hackeryang Mar 29, 2023 •

edited

Loading

hackeryang commented Mar 28, 2023

Improve the performance of filter functions #16681

Improve the performance of filter functions #16681

Conversation

hackeryang commented Mar 23, 2023 • edited by raunaqmorarka Loading

Description

Additional context and related issues

Before the change

After the change

Release notes

hackeryang commented Mar 23, 2023

raunaqmorarka Mar 23, 2023

Choose a reason for hiding this comment

hackeryang Mar 28, 2023 • edited Loading

Choose a reason for hiding this comment

raunaqmorarka Mar 29, 2023

Choose a reason for hiding this comment

hackeryang Mar 29, 2023 • edited Loading

Choose a reason for hiding this comment

Before our improvement

After our improvement

hackeryang commented Mar 28, 2023

hackeryang commented Mar 23, 2023 •

edited by raunaqmorarka

Loading

hackeryang Mar 28, 2023 •

edited

Loading

hackeryang Mar 29, 2023 •

edited

Loading