[Lens] improve percentile agg optimizations #145303

drewdaemon · 2022-11-15T22:14:02Z

Summary

Unfiltered single-percentile agg configs were already optimized: they are collapsed into a single percentiles agg, even when their percentiles differ.

As of this PR, filtered single-percentile aggs that use the same percentile (and other args) are also collapsed into one agg config.

Unfortunately, esaggs doesn't currently support combining filtered single-percentile aggs with different percentiles into a single agg config, so that optimization still applies only to unfiltered single-percentile aggs.
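To make the deduplication rule concrete, here is a minimal sketch of grouping single-percentile columns by a key built from their arguments. The `PercentileColumn` shape and the `groupKey` and `dedupePercentiles` helpers are hypothetical names for illustration only; they are not the actual Lens types or the `getGroupByKey` implementation used in this PR.

```typescript
// Hypothetical, simplified column shape for illustration; not the real Lens types.
interface PercentileColumn {
  id: string;
  field: string;
  percentile: number;
  kql?: string; // undefined or '' means the column is unfiltered
}

// Every argument that affects the result goes into the key, so two columns
// with equal keys can safely share one agg config.
function groupKey(col: PercentileColumn): string {
  return JSON.stringify([col.field, col.percentile, col.kql ?? '']);
}

interface DedupeResult {
  unique: PercentileColumn[]; // one representative column per key
  aliasOf: Record<string, string>; // column id -> id of its representative
}

function dedupePercentiles(columns: PercentileColumn[]): DedupeResult {
  const firstIdByKey: Record<string, string> = {};
  const unique: PercentileColumn[] = [];
  const aliasOf: Record<string, string> = {};
  for (const col of columns) {
    const key = groupKey(col);
    let target: string | undefined = firstIdByKey[key];
    if (target === undefined) {
      // First column seen with this key becomes the representative.
      target = col.id;
      firstIdByKey[key] = target;
      unique.push(col);
    }
    aliasOf[col.id] = target;
  }
  return { unique, aliasOf };
}

// Five filtered percentiles over two distinct filters.
const filteredColumns: PercentileColumn[] = [
  { id: 'c1', field: 'bytes', percentile: 2, kql: 'geo.dest : "AL"' },
  { id: 'c2', field: 'bytes', percentile: 2, kql: 'geo.dest : "AL"' },
  { id: 'c3', field: 'bytes', percentile: 2, kql: 'geo.dest : "BA"' },
  { id: 'c4', field: 'bytes', percentile: 2, kql: 'geo.dest : "BA"' },
  { id: 'c5', field: 'bytes', percentile: 2, kql: 'geo.dest : "BA"' },
];
const dedupeResult = dedupePercentiles(filteredColumns);
```

In this sketch `dedupeResult.unique` holds two columns, one per filter, and every duplicate maps back to its representative through `aliasOf`.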

Testing

Filtered percentiles with same value now deduplicated

Add a dimension with the following formula:

percentile(bytes, percentile=2, kql='geo.dest : "AL" ') + 
percentile(bytes, percentile=2, kql='geo.dest : "AL" ') + 
percentile(bytes, percentile=2, kql='geo.dest : "BA" ') + 
percentile(bytes, percentile=2, kql='geo.dest : "BA" ') + 
percentile(bytes, percentile=2, kql='geo.dest : "BA" ')

Check the request. It should contain only two aggs, one for each filter.

Unfiltered percentiles with different values still collapsed

Add a dimension with the following formula:

percentile(bytes, percentile=2) + 
percentile(bytes, percentile=3) +
percentile(bytes, percentile=4) +
percentile(bytes, percentile=4) +
percentile(bytes, percentile=5)

Check the request. The agg should look like this:

"0": {
  "percentiles": {
    "field": "bytes",
    "percents": [
      2,
      3,
      4,
      5
    ],
    "keyed": false
  }
}
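The collapse of unfiltered percentiles into one request body can be sketched as follows. This is an illustrative assumption about the shape of the logic, not the code from this PR; `collapsePercentiles` and `PercentilesAgg` are hypothetical names, and keeping first-seen order for `percents` is a choice made only for this example.

```typescript
// Shape of the Elasticsearch percentiles agg body shown in the request above.
interface PercentilesAgg {
  percentiles: {
    field: string;
    percents: number[];
    keyed: boolean;
  };
}

// Hypothetical helper: merge the percentiles requested for one field into a
// single agg, dropping repeated values and keeping first-seen order.
function collapsePercentiles(field: string, requested: number[]): PercentilesAgg {
  const percents: number[] = [];
  for (const p of requested) {
    if (percents.indexOf(p) === -1) {
      percents.push(p);
    }
  }
  return { percentiles: { field, percents, keyed: false } };
}

// The five terms of the formula above request 2, 3, 4, 4 and 5.
const collapsedAgg = collapsePercentiles('bytes', [2, 3, 4, 4, 5]);
```

Here `collapsedAgg.percentiles.percents` contains each requested percentile once, matching the single agg expected in the request.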

drewdaemon · 2022-11-18T14:47:05Z

@elasticmachine merge upstream

elasticmachine · 2022-11-30T01:21:37Z

Pinging @elastic/kibana-visualizations @elastic/kibana-visualizations-external (Team:Visualizations)

drewdaemon · 2022-11-30T01:22:52Z

@elasticmachine merge upstream

drewdaemon · 2022-11-30T01:31:31Z

x-pack/plugins/lens/public/datasources/form_based/to_expression.ts

@@ -142,7 +142,7 @@ function getExpressionForLayer(
      if (def.input !== 'fullReference' && def.input !== 'managedReference') {
        const aggId = String(index);

-        const wrapInFilter = Boolean(def.filterable && col.filter);
+        const wrapInFilter = Boolean(def.filterable && col.filter?.query);


This change prevents us from wrapping this agg in a filter when the query is just an empty string. It makes no difference to the Elasticsearch query, but it ensures that if someone removes a KQL filter from a dimension, that agg can be collapsed into other matching aggs instead of being passed over because it looks like it has a query.
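The difference between the two checks can be shown with a small sketch. The `Query` and `Column` shapes below are simplified assumptions, not the real Lens types, and the `'kuery'` language value is only illustrative; the two functions mirror the old and new expressions in the diff.

```typescript
// Simplified, hypothetical shapes for illustration only.
interface Query {
  query: string;
  language: string;
}
interface Column {
  filter?: Query;
}

// Old check: any filter object counts, even one with an empty query string.
function wrapInFilterBefore(filterable: boolean, col: Column): boolean {
  return Boolean(filterable && col.filter);
}

// New check: only a non-empty query string counts.
function wrapInFilterAfter(filterable: boolean, col: Column): boolean {
  return Boolean(filterable && col.filter?.query);
}

// A dimension whose KQL filter was cleared but whose filter object remains.
const clearedColumn: Column = { filter: { query: '', language: 'kuery' } };
const filteredColumn: Column = { filter: { query: 'geo.dest : "AL"', language: 'kuery' } };

const wrapClearedBefore = wrapInFilterBefore(true, clearedColumn); // true
const wrapClearedAfter = wrapInFilterAfter(true, clearedColumn); // false
const wrapFilteredAfter = wrapInFilterAfter(true, filteredColumn); // true
```

With the new check, the cleared column is treated as unfiltered and so becomes eligible for collapsing with other unfiltered aggs.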

kibana-ci · 2022-11-30T02:24:58Z

💚 Build Succeeded

Buildkite Build
Commit: 04407c4

Metrics [docs]

Async chunks

Total size of all lazy-loaded chunks that will be downloaded as the user navigates the app

id	before	after	diff
`lens`	1.3MB	1.3MB	-239.0B

Unknown metric groups

ESLint disabled in files

id	before	after	diff
`osquery`	1	2	+1

ESLint disabled line counts

id	before	after	diff
`enterpriseSearch`	19	21	+2
`fleet`	59	65	+6
`osquery`	109	115	+6
`securitySolution`	442	448	+6
total			+20

Total ESLint disabled count

id	before	after	diff
`enterpriseSearch`	20	22	+2
`fleet`	68	74	+6
`osquery`	110	117	+7
`securitySolution`	519	525	+6
total			+21

History

💚 Build #90155 succeeded 1a7742d
💚 Build #89256 succeeded b465b68

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

dej611

Tested locally on Safari 👍

I also tried a mixed formula like:

defaults(percentile(bytes, percentile=2, kql='extension.keyword : CSS'), 0) + 
defaults(percentile(bytes, percentile=2, kql='extension.keyword : CSS'), 0) +
percentile(bytes, percentile=4) +
percentile(bytes, percentile=4) +
percentile(bytes, percentile=5)

Both the 4th and 2nd percentiles were each aggregated into their own bucket.
Adding an extra percentile(bytes, percentile=2) correctly adds the 2nd percentile metric to the non-filtered bucket and groups the other similar ones in the filtered bucket.

drewdaemon added 6 commits October 29, 2022 15:07

use textual expressions in percentile test

1bd48c3

reuse duplicate finding logic

6370b7d

Merge branch 'main' of github.com:elastic/kibana into 135265/improve-percentile-agg-deduplication

66fa1b3

improve types, skip filtered groups

0be46ec

rely on getGroupByKey for deduplication

c590e35

remove circular import

fd32f03

Merge branch 'main' into 135265/improve-percentile-agg-deduplication

b465b68

elastic deleted a comment from kibana-ci Nov 18, 2022

drewdaemon added 3 commits November 22, 2022 15:09

Merge branch 'main' of github.com:elastic/kibana into 135265/improve-percentile-agg-deduplication

8fe45f5

Merge branch '135265/improve-percentile-agg-deduplication' of github.com:andrewctate/kibana into 135265/improve-percentile-agg-deduplication

114a07b

don't wrap agg unless there's actually a query

1a7742d

drewdaemon changed the title ~~improve percentile agg optimizations~~ [Lens] improve percentile agg optimizations Nov 30, 2022

drewdaemon added the Team:Visualizations, Feature:Lens, and release_note:skip labels Nov 30, 2022

drewdaemon marked this pull request as ready for review November 30, 2022 01:21

drewdaemon requested a review from a team as a code owner November 30, 2022 01:21

Merge branch 'main' into 135265/improve-percentile-agg-deduplication

04407c4

dej611 approved these changes Nov 30, 2022

drewdaemon merged commit 61a2df6 into elastic:main Nov 30, 2022

kibanamachine added the v8.7.0 and backport:skip labels Nov 30, 2022

dej611 mentioned this pull request Mar 24, 2023

[Lens] [EsAggs][Meta] Optimize esaggs requests when possible #153629

Open
