[ML] Prevent duplicate notifications about the same anomaly result #91485

darnautov · 2021-02-16T13:54:47Z

Summary

Related issue #88940.

The alerting framework allows configuring when to get notified. By default, it's "Only on status change", which means that in the case of the ML Anomaly detection alert schedules Anomaly score matched the condition action on each execution multiple times in a row, the user will be notified only on the first status change, so receiving duplicates won't be an issue.
But setting it to "Every time alert is active" might result in multiple notifications for the same anomaly, depending on the check interval and the result bucket span. This PR adds a check to the ML alert executor for the existing alert instance with anomaly_score_match action group in .kibana-event-log-* index that helps to avoid it.

The alert instance key generated to check for duplicates is of the form

Buckets alerts: jobId_highestRecordTimestamp
Influencer alerts: jobId_highestRecordTimestamp_influencerFieldName_influencerFieldValue
Record alerts: jobId_highestRecordTimestamp_detectorIndex_function_entityFieldName_entityFieldValue
e.g. for a record alert instance: ecommerce_high_sum_total_sales_1613584800000_0_high_sum_customer_full_name.keyword_Rabbia Al Powell

Also, another scenario is possible when the check interval is significantly bigger than the result bucket span we look back during the alert condition execution. In that case, we risk missing the anomaly, hence using previousStartedAt time from the previous execution helps to detect the check gap and use this interval as a time range for querying anomalies.

How to test

Create the Anomaly detection alert and set Notify to Every time alert is active.
Set a frequent check interval so the alert executor check the same bucket multiple times

You should get notified based on the selected action only once for a particular anomaly result.

Checklist

Documentation was added for features that require explanation or tutorials
Unit or functional tests were updated or added to match the most common scenarios

elasticmachine · 2021-02-16T13:54:49Z

Pinging @elastic/ml-ui (:ml)

x-pack/plugins/ml/server/lib/alerts/alerting_service.ts

alvarezmelissa87 · 2021-02-16T22:24:33Z

Code LGTM aside from Pete's comment. Will test when PR is updated 👌

peteharverson · 2021-02-17T16:04:36Z

x-pack/plugins/ml/server/lib/alerts/alerting_service.ts

+    } else if (source.result_type === ANOMALY_RESULT_TYPE.RECORD) {
+      const fieldName = getEntityFieldName(source);
+      const fieldValue = getEntityFieldValue(source);
+      alertInstanceKey += `_${source.detector_index}_${source.function}_${fieldName}_${fieldValue}`;


source.detector_index is undefined in the key my test generated. Is this available in the source ?

forgot to include it into the source, fixed in 61764c5

peteharverson

Tested latest edits and LGTM

alvarezmelissa87

LGTM ⚡

kibanamachine · 2021-02-17T18:26:21Z

💚 Build Succeeded

continuous-integration/kibana-ci/pull-request
Commit: 61764c5

Metrics [docs]

Page load bundle

Size of the bundles that are downloaded on every page load. Target size is below 100kb

id	before	after	diff
`ml`	68.1KB	68.0KB	-32.0B

History

💔 Build #107297 failed 3e910c2
💔 Build #107288 failed a3d3d13
💚 Build #106815 succeeded f393a0f

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

…lastic#91485) * [ML] check kibana even logs for existing alert instance * [ML] create alert instance key, add check for alert id * [ML] use anomaly_utils, check interval gap * [ML] add detector index * [ML] fix unit test * [ML] include detector_index into source

kibanamachine · 2021-02-17T18:28:52Z

💚 Backport successful

✅ 7.x / #91720

Successful backport PRs will be merged automatically after passing CI.

…91485) (#91720) * [ML] check kibana even logs for existing alert instance * [ML] create alert instance key, add check for alert id * [ML] use anomaly_utils, check interval gap * [ML] add detector index * [ML] fix unit test * [ML] include detector_index into source Co-authored-by: Dima Arnautov <[email protected]>

darnautov added 2 commits February 16, 2021 12:11

[ML] check kibana even logs for existing alert instance

897415d

[ML] create alert instance key, add check for alert id

f393a0f

darnautov added release_note:enhancement :ml Feature:Anomaly Detection ML anomaly detection Feature:Alerting v8.0.0 v7.12.0 labels Feb 16, 2021

darnautov requested review from walterra, alvarezmelissa87 and peteharverson February 16, 2021 13:54

darnautov self-assigned this Feb 16, 2021

darnautov requested a review from a team as a code owner February 16, 2021 13:54

peteharverson reviewed Feb 16, 2021

View reviewed changes

x-pack/plugins/ml/server/lib/alerts/alerting_service.ts Outdated Show resolved Hide resolved

darnautov added 2 commits February 17, 2021 10:01

[ML] use anomaly_utils, check interval gap

a3d3d13

[ML] add detector index

3e910c2

darnautov requested a review from peteharverson February 17, 2021 15:14

darnautov added 2 commits February 17, 2021 16:52

[ML] fix unit test

224b3b3

[ML] include detector_index into source

61764c5

peteharverson reviewed Feb 17, 2021

View reviewed changes

peteharverson approved these changes Feb 17, 2021

View reviewed changes

darnautov enabled auto-merge (squash) February 17, 2021 17:17

darnautov mentioned this pull request Feb 17, 2021

[ML] Initial Alerting and Action integration #88940

Closed

10 tasks

darnautov added the auto-backport Deprecated - use backport:version if exact versions are needed label Feb 17, 2021

alvarezmelissa87 approved these changes Feb 17, 2021

View reviewed changes

darnautov merged commit c84047b into elastic:master Feb 17, 2021

kibanamachine mentioned this pull request Feb 17, 2021

[7.x] [ML] Prevent duplicate notifications about the same anomaly result (#91485) #91720

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ML] Prevent duplicate notifications about the same anomaly result #91485

[ML] Prevent duplicate notifications about the same anomaly result #91485

darnautov commented Feb 16, 2021 •

edited

Loading

elasticmachine commented Feb 16, 2021

alvarezmelissa87 commented Feb 16, 2021

peteharverson Feb 17, 2021

darnautov Feb 17, 2021

peteharverson left a comment

alvarezmelissa87 left a comment

kibanamachine commented Feb 17, 2021

kibanamachine commented Feb 17, 2021

[ML] Prevent duplicate notifications about the same anomaly result #91485

[ML] Prevent duplicate notifications about the same anomaly result #91485

Conversation

darnautov commented Feb 16, 2021 • edited Loading

Summary

How to test

Checklist

elasticmachine commented Feb 16, 2021

alvarezmelissa87 commented Feb 16, 2021

peteharverson Feb 17, 2021

Choose a reason for hiding this comment

darnautov Feb 17, 2021

Choose a reason for hiding this comment

peteharverson left a comment

Choose a reason for hiding this comment

alvarezmelissa87 left a comment

Choose a reason for hiding this comment

kibanamachine commented Feb 17, 2021

💚 Build Succeeded

Metrics [docs]

Page load bundle

History

kibanamachine commented Feb 17, 2021

💚 Backport successful

darnautov commented Feb 16, 2021 •

edited

Loading