[Alerting] Add more rule execution context #117504

dgieselaar · 2021-11-04T12:32:54Z

Closes #113506.

The following changes were made:

- When rule execution starts, set the transaction name to "Executing alerting rule"
- When rule execution starts, update the transaction labels with the rule id
- When the rule is loaded, update the transaction name by appending the rule name
- When the rule is loaded, update the transaction labels with the consumer, rule type id, rule name, rule params and rule tags
- When the rule has executed, set the appropriate outcome
- When the rule has executed, update the transaction labels with the number of active, recovered and new instances
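
The lifecycle listed above can be sketched as follows. This is a rough illustration under stated assumptions, not the code from the PR: `FakeTransaction` is a hypothetical in-memory stand-in for the APM agent's current transaction, `runRule`, `loadRule` and `execute` are made-up names, and the label keys are illustrative. Rule params are left out of the labels here on purpose, since they may contain sensitive values.

```typescript
type Outcome = 'success' | 'failure' | 'unknown';
type LabelValue = string | number | boolean;

// Hypothetical stand-in for the agent's transaction object,
// so the sketch runs without an APM agent installed.
class FakeTransaction {
  name = 'unnamed';
  outcome: Outcome = 'unknown';
  labels: Record<string, LabelValue> = {};

  addLabels(labels: Record<string, LabelValue>): void {
    Object.assign(this.labels, labels);
  }

  setOutcome(outcome: Outcome): void {
    this.outcome = outcome;
  }
}

interface Rule {
  id: string;
  name: string;
  consumer: string;
  ruleTypeId: string;
  tags: string[];
}

interface ExecutionResult {
  activeAlerts: number;
  recoveredAlerts: number;
  newAlerts: number;
}

function runRule(
  tx: FakeTransaction,
  ruleId: string,
  loadRule: (id: string) => Rule,
  execute: (rule: Rule) => ExecutionResult
): void {
  // Execution starts: only the rule id is known at this point.
  tx.name = 'Executing alerting rule';
  tx.addLabels({ alerting_rule_id: ruleId });

  try {
    // Rule loaded: append the rule name and add the descriptive labels.
    const rule = loadRule(ruleId);
    tx.name = `${tx.name}: ${rule.name}`;
    tx.addLabels({
      alerting_rule_consumer: rule.consumer,
      alerting_rule_type_id: rule.ruleTypeId,
      alerting_rule_name: rule.name,
      alerting_rule_tags: rule.tags.join(','),
    });

    // Rule executed: record the counts and the outcome.
    const result = execute(rule);
    tx.addLabels({
      alerting_active_alerts: result.activeAlerts,
      alerting_recovered_alerts: result.recoveredAlerts,
      alerting_new_alerts: result.newAlerts,
    });
    tx.setOutcome('success');
  } catch (err) {
    // Any failure while loading or executing marks the transaction as failed.
    tx.setOutcome('failure');
    throw err;
  }
}
```

If loading the rule fails, the transaction keeps the generic name and the rule id label, so a failed run should still be searchable by rule id.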

x-pack/plugins/alerting/server/task_runner/task_runner.ts

cyrille-leclerc · 2021-11-04T13:02:17Z

The added label and the new transaction name will be very helpful!

cyrille-leclerc · 2021-11-04T14:15:30Z

@dgieselaar Can you please remind us how the different rule actions (email, index, server-log, ServiceNow-itsm, webhook, pagerduty...) are identified as spans?

For example,

For remote systems, it will be helpful to track the destination/URL to get the span identified as a remote call (visualized on the service map, surface the destination as an uninstrumented backend...)
For the write operation in Elasticsearch, it would be interesting to have these actions visualized as generic Elasticsearch access spans
See example of span for CI steps in Jenkins: https://github.com/jenkinsci/opentelemetry-plugin/blob/4706034b0e8a2c7de88ecfe03326a6f2d9c6fef7/src/main/java/io/jenkins/plugins/opentelemetry/job/AbstractGitStepHandler.java#L60

dgieselaar · 2021-11-05T13:59:15Z

@cyrille-leclerc unfortunately the trace waterfall is broken, @cauemarcondes is working on a fix, I'll update the PR with a screenshot for actions if that's fixed before this PR lands. I'm not sure if I get your second point though, can you elaborate?

cyrille-leclerc · 2021-11-05T15:29:39Z

Thanks @dgieselaar

> I'm not sure if I get your second point though, can you elaborate?

For the rule actions (email, index, server-log, servicenow-itsm, webhook, pagerduty...), it would be great to capture span labels that characterize the execution like the URL invoked, the authentication username ... in order to slice and dice traces in any dimension and also enable visualization of the destination on the service map and as an uninstrumented backend.

Here is an example that has a lot of commonalities with the labels we collect on CI/CD pipelines steps like the pipeline checkout step:

Labels
labels.ci_pipeline_run_user	SYSTEM
labels.git_branch	master
labels.git_clone_depth	0
labels.git_clone_shallow	FALSE
labels.git_repository	cyrille-leclerc/my-war
labels.git_username	cyrille-leclerc
labels.host_ip	192.168.1.46
labels.host_name	cyrillerclaptop.localdomain
labels.jenkins_computer_name	#controller#
labels.jenkins_pipeline_step_id	7
labels.jenkins_pipeline_step_name	Check out from version control
labels.jenkins_pipeline_step_plugin_name	workflow-scm-step
labels.jenkins_pipeline_step_plugin_version	2.13
labels.jenkins_pipeline_step_type	checkout
labels.jenkins_url	http://localhost:9600/
labels.service_namespace	jenkins
Trace
trace.id	dd7cc4c4220d72dc85f2a6de6b0c0d69
Span
span.id	a6c33c39e2e3141a
Service
service.name	jenkins

dgieselaar · 2021-11-08T12:08:29Z

> For the rule actions (email, index, server-log, servicenow-itsm, webhook, pagerduty...), it would be great to capture span labels that characterize the execution like the URL invoked, the authentication username ... in order to slice and dice traces in any dimension and also enable visualization of the destination on the service map and as an uninstrumented backend.

Hmm, I don't want to inadvertently leak sensitive data, I'm not sure how to prevent that if we for instance stringify params or config. Any thoughts here @elastic/kibana-alerting-services?

cyrille-leclerc · 2021-11-08T14:32:07Z

> I don't want to inadvertently leak sensitive data,

Good catch @dgieselaar, we sanitized a bunch of attributes in the Jenkins Otel integration, typically parsing URLs and reconstructing them to ensure they don't leak credentials.

Here is an example:
https://github.com/jenkinsci/opentelemetry-plugin/blob/opentelemetry-0.21/src/main/java/io/jenkins/plugins/opentelemetry/job/AbstractGitStepHandler.java#L133-L153
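
The same parse-and-reconstruct idea could be sketched in TypeScript as below. This is an illustration only, not code from this PR or from the Jenkins plugin: `sanitizeUrlForLabel` is a hypothetical helper name, and it assumes the standard `URL` class is available globally (as it is in Node.js and browsers).

```typescript
// Returns the URL rebuilt from scheme, host (including port) and path only.
// Credentials, query string and fragment are dropped.
// Returns undefined when the input is not a parseable absolute URL.
function sanitizeUrlForLabel(rawUrl: string): string | undefined {
  let parsed: URL;
  try {
    parsed = new URL(rawUrl);
  } catch {
    return undefined;
  }
  // Rebuild from known-safe parts rather than blanking fields on the parsed
  // URL, so anything not explicitly listed here cannot leak through.
  return `${parsed.protocol}//${parsed.host}${parsed.pathname}`;
}
```

For example, `sanitizeUrlForLabel('https://user:secret@example.com:8443/hooks/abc?token=xyz')` yields `https://example.com:8443/hooks/abc`. Some webhook URLs carry tokens in the path itself, so this is a partial measure; keeping only the scheme and host would be the more conservative choice.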

dgieselaar · 2021-11-11T10:06:05Z

@elasticmachine merge upstream

xcrzx · 2021-11-15T14:32:03Z

@dgieselaar Is it possible to view latency distribution for all rules of a specific rule type with these changes? Let's say I'm investigating performance issues with siem.queryRule. I used to do that in the following way:

1. Select all transactions of the siem.queryRule rule type
2. On the latency distribution diagram, drag the mouse to select the slowest 10%
3. Examine their spans to identify performance bottlenecks

But with this PR, I cannot select all query rules in a single view, as transactions are now split by rule name. So if I have 100+ activated rules, it becomes tedious to examine them one by one. Or am I missing something?

dgieselaar · 2021-11-15T15:33:33Z

> @dgieselaar Is it possible to view latency distribution for all rules of a specific rule type with these changes?

Not in the APM app. You can use e.g. Lens to gather that data, but it won't allow you to inspect a trace without manually copying trace ids.

Usually we separate transaction groups if they have different performance characteristics, which I would expect to be the case here from rule instance to rule instance.

dgieselaar · 2021-11-15T15:45:36Z

@elasticmachine merge upstream

cyrille-leclerc · 2021-11-15T23:06:15Z

Usually we separate transaction groups if they have different performance characteristics, which I would expect to be the case here from rule instance to rule instance.

From my observations, performance characteristics are homogeneous for Security rules of the same rule type. That's why I think grouping by rule type would be helpful.

Would it make sense to group rules by rule type by default as before and leave the ability to filter by rule name (it is already possible, labels.alerting_rule_name : "Rule Name") for those who need that level of granularity?

That's a very good point: for guided rules, the execution path and the performance characteristics are likely to be very homogeneous and thus it could make sense to have the same transaction group.
This would be different for generic rules that can have completely different performance characteristics and probably different execution path.
For this reason of homogeneity of "guided rules", it could make sense to use the rule type name as the name of the transaction. I'm not sure.

dgieselaar · 2021-11-16T09:28:08Z

> For this reason of homogeneity of "guided rules", it could make sense to use the rule type name as the name of the transaction. I'm not sure.

I'm not sure how to make that distinction (between guided and generic rules) from the alerting framework's perspective. It is something that the security rule types can set themselves in the rule executor. Maybe that's a good compromise?

cyrille-leclerc · 2021-11-16T14:08:36Z

> I'm not sure how to make that distinction (between guided and generic rules) from the alerting framework's perspective. It is something that the security rule types can set themselves in the rule executor. Maybe that's a good compromise?

That could be an interesting starting point

xcrzx · 2021-11-16T16:37:38Z

> I'm not sure how to make that distinction (between guided and generic rules) from the alerting framework's perspective. It is something that the security rule types can set themselves in the rule executor. Maybe that's a good compromise?

Yeah, it looks like we can use apm.setTransactionName to overwrite transaction names in Security to whatever value we want. Well, that sounds ok with me then.
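
The compromise discussed here could look roughly like the sketch below: a rule type wraps its executor so that every run is grouped under the rule type rather than the rule name. `AgentLike` is a hypothetical minimal interface covering only `setTransactionName`, and `withRuleTypeTransactionName` is a made-up helper; neither is taken from the Security Solution code.

```typescript
// Hypothetical minimal surface of the APM agent used by this sketch.
interface AgentLike {
  setTransactionName(name: string): void;
}

// Wraps a rule executor so each run overrides the transaction name with one
// derived from the rule type, putting all rules of that type in one group.
function withRuleTypeTransactionName<P, R>(
  agent: AgentLike,
  ruleTypeId: string,
  executor: (params: P) => R
): (params: P) => R {
  return (params: P): R => {
    agent.setTransactionName(`Executing alerting rule: ${ruleTypeId}`);
    return executor(params);
  };
}
```

Individual rules would then presumably still be reachable through the rule name label, as mentioned earlier in the thread.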

dgieselaar · 2021-11-19T14:36:21Z

@elasticmachine merge upstream

dgieselaar · 2021-11-22T07:30:58Z

@elasticmachine merge upstream

ymao1

LGTM! Left some nits on naming but looks great otherwise. Thanks!

ymao1 · 2021-11-23T15:23:23Z

x-pack/plugins/actions/server/lib/action_executor.ts

@@ -105,7 +105,7 @@ export class ActionExecutor {
        name: `execute_action`,
        type: 'actions',
        labels: {
-          actionId,
+          actions_action_id: actionId,


Following our new terminology, I think this should be actions_connector_id and actions_connector_type_id

Thanks, updated the labels w/ new terminology!

ymao1 · 2021-11-23T15:24:21Z

x-pack/plugins/alerting/server/task_runner/task_runner.ts

@@ -855,6 +881,12 @@ function generateNewAndRecoveredInstanceEvents<
  const recoveredAlertInstanceIds = Object.keys(recoveredAlertInstances);
  const newIds = without(currentAlertInstanceIds, ...originalAlertInstanceIds);

+  if (apm.currentTransaction) {
+    apm.currentTransaction.addLabels({
+      alerting_new_instances: newIds.length,


Following our updated terminology, I believe this should be alerting_new_alerts, alerting_active_alerts, alerting_recovered_alerts

addressed
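
With the renamed labels, the counting step could be sketched as below. This is a simplified illustration, not the PR's code: `alertCountLabels` is a hypothetical helper, and it assumes that "recovered" means "present before this run but not after it", which is a simplification of how the task runner tracks recovered alerts.

```typescript
interface AlertCountLabels {
  alerting_new_alerts: number;
  alerting_active_alerts: number;
  alerting_recovered_alerts: number;
}

// Derives the three counts from the alert ids known before and after a run.
function alertCountLabels(originalIds: string[], currentIds: string[]): AlertCountLabels {
  const original = new Set(originalIds);
  const current = new Set(currentIds);

  // New: active now, but not known before this run.
  const newIds = Array.from(current).filter((id) => !original.has(id));
  // Recovered (assumption): known before this run, but no longer active.
  const recoveredIds = Array.from(original).filter((id) => !current.has(id));

  return {
    alerting_new_alerts: newIds.length,
    alerting_active_alerts: current.size,
    alerting_recovered_alerts: recoveredIds.length,
  };
}
```

The returned object has the flat key/number shape that a label-setting call such as `addLabels` takes.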

kibana-ci · 2021-11-28T12:15:43Z

💚 Build Succeeded

Metrics [docs]

✅ unchanged

History

💚 Build #8258 succeeded cb46022
💚 Build #8003 succeeded d9d0700
💚 Build #6926 succeeded 04a0f6d
💚 Build #6716 succeeded 9c51b78
💚 Build #6099 succeeded 51901da
💚 Build #4956 succeeded b99d0c2

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

kibanamachine · 2021-11-28T12:19:39Z

💚 Backport successful

Status	Branch	Result
✅	8.0
✅	7.16

The backport PRs will be merged automatically after passing CI.

[Alerting] Add more rule execution context

d300657

Closes elastic#113506.

dgieselaar added the v8.0.0, release_note:skip, auto-backport, v7.16.0 and v8.1.0 labels Nov 4, 2021

dgieselaar requested a review from cyrille-leclerc November 4, 2021 12:32

cyrille-leclerc reviewed Nov 4, 2021

x-pack/plugins/alerting/server/task_runner/task_runner.ts

Merge branch 'main' of github.com:elastic/kibana into rule-execution-…

8cfb1c8

…context

dgieselaar marked this pull request as ready for review November 5, 2021 14:02

dgieselaar requested a review from a team as a code owner November 5, 2021 14:02

dgieselaar added 2 commits November 5, 2021 15:48

Fix TM jest test

6ac6e4a

Merge branch 'main' of github.com:elastic/kibana into rule-execution-…

d82047a

…context

dgieselaar added 3 commits November 8, 2021 11:57

Merge branch 'main' of github.com:elastic/kibana into rule-execution-…

ed7a004

…context

Merge branch 'main' of github.com:elastic/kibana into rule-execution-…

ba9995e

…context

Consistent label formatting

b99d0c2

xcrzx mentioned this pull request Nov 10, 2021

[Security Solution] Instrument rule executors with Elastic APM #117672

Merged

1 task

Merge branch 'main' into rule-execution-context

51901da

matschaffer mentioned this pull request Nov 15, 2021

[APM] Add more info to the "Number of items in this trace exceed what is displayed" (xpack.apm.waterfall.exceedsMax) EuiCallOut #118282

Open

dgieselaar added 2 commits November 16, 2021 09:56

Merge branch 'main' of github.com:elastic/kibana into rule-execution-…

39d2a35

…context

Handle active/unknown state

04a0f6d

Merge branch 'main' into rule-execution-context

d9d0700

dgieselaar requested a review from pmuellr November 19, 2021 14:36

Merge branch 'main' into rule-execution-context

cb46022

ymao1 approved these changes Nov 23, 2021

dgieselaar added 2 commits November 28, 2021 11:56

Merge branch 'main' of github.com:elastic/kibana into rule-execution-…

8528192

…context

Review feedback

edf303d

dgieselaar added v7.16.1 and removed v7.16.0 labels Nov 28, 2021

dgieselaar enabled auto-merge (squash) November 28, 2021 11:00

dgieselaar merged commit a0650c7 into elastic:main Nov 28, 2021

kibanamachine mentioned this pull request Nov 28, 2021

[8.0] [Alerting] Add more rule execution context (#117504) #119792

Merged

kibanamachine added a commit to kibanamachine/kibana that referenced this pull request Nov 28, 2021

[Alerting] Add more rule execution context (elastic#117504)

f34d9ce

Co-authored-by: Kibana Machine <[email protected]>

kibanamachine added a commit to kibanamachine/kibana that referenced this pull request Nov 28, 2021

[Alerting] Add more rule execution context (elastic#117504)

ab79a9c

Co-authored-by: Kibana Machine <[email protected]>

kibanamachine mentioned this pull request Nov 28, 2021

[7.16] [Alerting] Add more rule execution context (#117504) #119793

Merged

kibanamachine added a commit that referenced this pull request Nov 28, 2021

[Alerting] Add more rule execution context (#117504) (#119792)

611b422

Co-authored-by: Kibana Machine <[email protected]> Co-authored-by: Dario Gieselaar <[email protected]>

kibanamachine added a commit that referenced this pull request Nov 28, 2021

[Alerting] Add more rule execution context (#117504) (#119793)

26e92de

Co-authored-by: Kibana Machine <[email protected]> Co-authored-by: Dario Gieselaar <[email protected]>

TinLe pushed a commit to TinLe/kibana that referenced this pull request Dec 22, 2021

[Alerting] Add more rule execution context (elastic#117504)

b9c2f9d

Co-authored-by: Kibana Machine <[email protected]>
