-
-
Notifications
You must be signed in to change notification settings - Fork 684
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
percentiles metric aggregation #1763
Comments
Algorithm Requirements
Elastic SearchElastic Search uses 2 algorithms: T-Digest and HDR Histogram Crates in Rust
|
Here are some insights from a comparison of different algorithms
Worse than just storing
If there are not many values, only keeping the array is preferable COUNT=[1_000], TDIGEST_BATCH=500, TDIGEST_MAX_SIZE=300, HDR_SIGFIG=3, DDSketch2Err=0.01
COUNT=[1_000_000], TDIGEST_BATCH=500, TDIGEST_MAX_SIZE=300, HDR_SIGFIG=3, DDSketch2Err=0.01
COUNT=[1_000_000], TDIGEST_BATCH=500, TDIGEST_MAX_SIZE=300, HDR_SIGFIG=3, DDSketch2Err=0.01
While HDRHistogram seems to be doing better, it has a severe limitation, it only operates on |
@PSeitz I naively assumed that tdigest would have a footprint close to TDIGEST_MAX_SIZE x SOMECONSTANTCLOSE_TO_8. |
@fulmicoton I use allocator hooks to track peak allocation PSeitz/stats_alloc@d925d3c. There's was a bugfix missing in the measurement. T-Digest handles updates in batches, the previous number of I update the tables above with the new measurements and added a serialized column (serialized with |
another challenger: https://arxiv.org/pdf/1908.10693.pdf ddsketch... |
I updated the table to include two ddsketch implementations Full results and benchmark source code is here |
percentiles aggregation returns the nth percentile for each interval (75th, 85th, 95th, and 99th percentile)
Percentiles show the point at which a certain percentage of observed values occur. For example, the 95th percentile is the value which is greater than 95% of the observed values.
https://www.elastic.co/guide/en/elasticsearch/reference/current/search-aggregations-metrics-percentile-aggregation.html
The text was updated successfully, but these errors were encountered: