[Investigation app] add entities route and investigation Contextual Insight #194432

dominiqueclarke · 2024-09-30T13:05:14Z

Summary

Adds a route that can be used to fetch entities related to an investigation.

The route fetches associated entities by service name, host name, or container id. It then identifies the associated indices and datastreams.

The discovered entities are passed to the contextual insight to inform the LLM.

This PR represents the first step in developing an AI-informed hypothesis at the beginning of the investigation. Over time, further insights will be provided to the LLM to deepen it's investigative analysis and propose a more helpful root cause hypothesis.

Testing

Create some APM data. I'm using the otel demo and triggering a failure via the flagd service. Since this is in flux, you can reach out to me about this workflow. However, you can also create APM data via synth-trace.
Create an custom threshold rule that you expect to trigger an alert. I created mine to using http.response.status_code: 500 / http.response.status_code : * and set a low threshold base on the amount of failures in my current test data. Be sure to also group the alert by service.name
Wait for the alert to fire, then visit the alert details page and start an investigation
notice the contextual insight. Expand it to see more information

elasticmachine · 2024-09-30T13:05:17Z

Pinging @elastic/obs-ux-management-team (Team:obs-ux-management)

obltmachine · 2024-09-30T13:05:27Z

🤖 GitHub comments

Expand to view the GitHub comments

Just comment with:

/oblt-deploy : Deploy a Kibana instance using the Observability test environments.
run docs-build : Re-trigger the docs validation. (use unformatted text in the comment!)

…fix'

cauemarcondes · 2024-09-30T13:29:19Z

...k/plugins/observability_solution/investigate_app/server/clients/create_entities_es_client.ts

+  ): Promise<{ responses: Array<InferSearchResponseOf<TDocument, TSearchRequest>> }>;
+}
+
+export function createEntitiesESClient({


Can't you use https://github.com/elastic/kibana/blob/main/x-pack/packages/observability/observability_utils/es/client/create_observability_es_client.ts#L32?

This is a client specifically for searching through entities indices, but I should be using the observability es client as a dependency. Will update when I can.

dominiqueclarke · 2024-09-30T13:38:03Z

...k/plugins/observability_solution/investigate_app/server/clients/create_entities_es_client.ts

+        .map((params) => {
+          const searchParams: [MsearchMultisearchHeader, MsearchMultisearchBody] = [
+            {
+              index: [SERVICE_ENTITIES_LATEST_ALIAS],


This is copypasta. I'd like to remove the reference to the service alias in particular.

…clarke/kibana into feature/investigation-entities

…/investigation-entities

kdelemme

I just did a quick first pass, will continue

kdelemme · 2024-10-02T16:58:00Z

x-pack/plugins/observability_solution/investigate_app/kibana.jsonc

@@ -28,7 +28,7 @@
      "kibanaReact",
      "kibanaUtils",
    ],
-    "optionalPlugins": [],


It's already in the requiredPlugins

kdelemme · 2024-10-02T17:06:14Z

...servability_solution/investigate_app/public/pages/details/contexts/investigation_context.tsx

+  const alertOriginInvestigation = alertOriginSchema.safeParse(investigation?.origin);
+  const alertId = alertOriginInvestigation.success ? alertOriginInvestigation.data.id : undefined;
+  const { data: alert } = useFetchAlert({ id: alertId });


🍰 nit: this logic is required every time we use useFetchAlert(), maybe we can refactor the hook to encapsulate this logic: the hook itself could use useInvestigation to retrieve the investigation, and we won't need to expose the originated alert in this context. I'm already worried about this context becoming bloated with too many things.

const { data: alert } = useFetchAlertOrigin()

x-pack/plugins/observability_solution/investigate_app/public/hooks/use_fetch_entities.ts

…ooks/use_fetch_entities.ts

kdelemme

Some questions and nits, but otherwise looks good to me.
I guess for testing this I need to setup a genAI connector, do you have a guide for this?

kdelemme · 2024-10-02T19:23:38Z

.../investigate_app/public/pages/details/components/investigation_items/investigation_items.tsx

+      {investigation?.id && (
+        <EuiFlexItem grow={false}>
+          <AssistantHypothesis investigationId={investigation.id} />
+        </EuiFlexItem>
+      )}


🍰 nit: use the context hook useInvestigation() directly from AssistantHypothesis:

Suggested change

{investigation?.id && (

<EuiFlexItem grow={false}>

<AssistantHypothesis investigationId={investigation.id} />

</EuiFlexItem>

)}

<EuiFlexItem grow={false}>

<AssistantHypothesis />

</EuiFlexItem>

I actually had this originally, but it made it so that the investigation was sometimes undefined, and I hated having to handle that all the time. Would you prefer that trade off?

kdelemme · 2024-10-02T19:25:44Z

...k/plugins/observability_solution/investigate_app/server/clients/create_entities_es_client.ts

+});
+export const SERVICE_ENTITIES_HISTORY_ALIAS = entitiesAliasPattern({
+  type: 'service',
+  dataset: ENTITY_HISTORY,


I thought EEM had removed the history?

They did. I'll remove this for now.

kdelemme · 2024-10-02T19:35:59Z

x-pack/plugins/observability_solution/investigate_app/server/services/get_entities.ts

+  hostName,
+  entitiesEsClient,
+}: {
+  context: InvestigateAppRequestHandlerContext;


If possible let's try to not leak route/request details into the services. Here we can replace the whole request handler context with the esClient, and do the wiring in the route handler.

kdelemme · 2024-10-02T19:42:14Z

x-pack/plugins/observability_solution/investigate_app/server/services/get_entities.ts

+  );
+}
+
+const getEntitySource = async ({ index }: { index: IndicesIndexState }) => {


Does it need to be async?

kdelemme · 2024-10-02T19:42:30Z

x-pack/plugins/observability_solution/investigate_app/server/services/get_entities.ts

+  return await Promise.all(
+    Object.values(indices).map(async (index) => {
+      return await getEntitySource({ index });
+    })
+  );


do we need the promise all and await here?

kdelemme · 2024-10-02T19:44:12Z

x-pack/plugins/observability_solution/investigate_app/server/services/get_entities.ts

+        const sourceIndex = entity?.sourceIndex;
+        if (!sourceIndex) return null;
+
+        const indices = await esClient.indices.get({ index: sourceIndex });


🍰 nit: might be probably too early to optimize, but this call is made in a double for-loop. Is there a way to call the esClient.indices.get for all sourceIndex at once?

…-fix'

jloleysens

Other than:

It's already in the requiredPlugins

kibana.jsonc lgtm

dominiqueclarke · 2024-10-03T13:18:39Z

Some questions and nits, but otherwise looks good to me. I guess for testing this I need to setup a genAI connector, do you have a guide for this?

The guide for setting up the connector can be found here https://github.com/elastic/kibana/blob/main/x-pack/plugins/observability_solution/observability_ai_assistant/README.md

You'll also need to start your knowledge base. The easiest way to do that is, after setting up your connector, open the Assistant flyout via the Assistant button on the top right and click the start knowledge base button.

…clarke/kibana into feature/investigation-entities

kibana-ci · 2024-10-04T15:25:40Z

💚 Build Succeeded

Buildkite Build
Commit: f95017d
Kibana Serverless Image: docker.elastic.co/kibana-ci/kibana-serverless:pr-194432-f95017d80e5d

Metrics [docs]

Module Count

Fewer modules leads to a faster build time

id	before	after	diff
`investigateApp`	567	572	+5

Public APIs missing comments

Total count of every public API that lacks a comment. Target amount is 0. Run node scripts/build_api_docs --plugin [yourplugin] --stats comments for more detailed information.

id	before	after	diff
`@kbn/investigation-shared`	73	81	+8

Async chunks

Total size of all lazy-loaded chunks that will be downloaded as the user navigates the app

id	before	after	diff
`investigateApp`	474.6KB	479.6KB	+5.0KB

Page load bundle

Size of the bundles that are downloaded on every page load. Target size is below 100kb

id	before	after	diff
`investigateApp`	6.5KB	6.4KB	-104.0B

Unknown metric groups

API count

id	before	after	diff
`@kbn/investigation-shared`	73	81	+8

History

💔 Build #239456 failed b9de0cadd52c177787c3a10cdcfe4fd0e0521fb8
💔 Build #239309 failed 4ab1a83
💔 Build #239306 failed 2a27f75
💔 Build #239215 failed d9d211b
💔 Build #239170 failed b70caeb

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

kibanamachine · 2024-10-04T17:58:47Z

Starting backport for target branches: 8.x

https://github.com/elastic/kibana/actions/runs/11184673144

kibanamachine · 2024-10-04T18:03:03Z

💔 All backports failed

Status	Branch	Result
❌	8.x	Backport failed because of merge conflicts You might need to backport the following PRs to 8.x: - feat(rca): add screen context into investigation details (#194753)

Manual backport

To create the backport manually run:

node scripts/backport --pr 194432

Questions ?

Please refer to the Backport tool documentation

dominiqueclarke · 2024-10-05T01:15:06Z

💚 All backports created successfully

Status	Branch	Result
✅	8.x

Note: Successful backport PRs will be merged automatically after passing CI.

Questions ?

Please refer to the Backport tool documentation

…nsight (elastic#194432) ## Summary Adds a route that can be used to fetch entities related to an investigation. The route fetches associated entities by service name, host name, or container id. It then identifies the associated indices and datastreams. The discovered entities are passed to the contextual insight to inform the LLM. ![image](https://github.com/user-attachments/assets/855a8d68-b039-4557-ba23-5661cd961021) This PR represents the first step in developing an AI-informed hypothesis at the beginning of the investigation. Over time, further insights will be provided to the LLM to deepen it's investigative analysis and propose a more helpful root cause hypothesis. ### Testing 1. Create some APM data. I'm using the otel demo and triggering a failure via the flagd service. Since this is in flux, you can reach out to me about this workflow. However, you can also create APM data via `synth-trace`. 2. Create an custom threshold rule that you expect to trigger an alert. I created mine to using `http.response.status_code: 500 / http.response.status_code : *` and set a low threshold base on the amount of failures in my current test data. Be sure to also group the alert by `service.name` 3. Wait for the alert to fire, then visit the alert details page and start an investigation 4. notice the contextual insight. Expand it to see more information --------- Co-authored-by: kibanamachine <[email protected]> (cherry picked from commit e4bb435)

…nsight (elastic#194432) ## Summary Adds a route that can be used to fetch entities related to an investigation. The route fetches associated entities by service name, host name, or container id. It then identifies the associated indices and datastreams. The discovered entities are passed to the contextual insight to inform the LLM. ![image](https://github.com/user-attachments/assets/855a8d68-b039-4557-ba23-5661cd961021) This PR represents the first step in developing an AI-informed hypothesis at the beginning of the investigation. Over time, further insights will be provided to the LLM to deepen it's investigative analysis and propose a more helpful root cause hypothesis. ### Testing 1. Create some APM data. I'm using the otel demo and triggering a failure via the flagd service. Since this is in flux, you can reach out to me about this workflow. However, you can also create APM data via `synth-trace`. 2. Create an custom threshold rule that you expect to trigger an alert. I created mine to using `http.response.status_code: 500 / http.response.status_code : *` and set a low threshold base on the amount of failures in my current test data. Be sure to also group the alert by `service.name` 3. Wait for the alert to fire, then visit the alert details page and start an investigation 4. notice the contextual insight. Expand it to see more information --------- Co-authored-by: kibanamachine <[email protected]>

…tual Insight (#194432) (#195158) # Backport This will backport the following commits from `main` to `8.x`: - [[Investigation app] add entities route and investigation Contextual Insight (#194432)](#194432)  ### Questions ? Please refer to the [Backport tool documentation](https://github.com/sqren/backport)  Co-authored-by: Rickyanto Ang <[email protected]>

add entities route

27c0182

dominiqueclarke added release_note:skip Skip the PR/issue when compiling release notes v9.0.0 Team:obs-ux-management Observability Management User Experience Team v8.16.0 labels Sep 30, 2024

dominiqueclarke requested a review from a team as a code owner September 30, 2024 13:05

botelastic bot added the ci:project-deploy-observability Create an Observability project label Sep 30, 2024

[CI] Auto-commit changed files from 'node scripts/lint_ts_projects --…

2d305e7

…fix'

cauemarcondes reviewed Sep 30, 2024

View reviewed changes

dominiqueclarke commented Sep 30, 2024

View reviewed changes

dominiqueclarke added the backport:prev-minor Backport to (8.x) the previous minor version (i.e. one version back from main) label Sep 30, 2024

Merge branch 'main' into feature/investigation-entities

84a3817

mgiota self-requested a review October 1, 2024 09:57

dominiqueclarke added 2 commits October 1, 2024 16:05

add initial contextual insight

dc8c47a

merge main

be4dfb7

dominiqueclarke requested a review from a team as a code owner October 1, 2024 20:14

dominiqueclarke and others added 4 commits October 1, 2024 16:21

adjust entities es client

16f926d

[CI] Auto-commit changed files from 'node scripts/yarn_deduplicate'

72914b4

add entity sources to the assistant prompt

a67953d

Merge branch 'feature/investigation-entities' of github.com:dominique…

f041ec1

…clarke/kibana into feature/investigation-entities

dominiqueclarke changed the title ~~[Investigation app] add entities route~~ [Investigation app] add entities route and investigation Contextual Insight Oct 2, 2024

dominiqueclarke added 2 commits October 2, 2024 11:43

adjust prompt

503b7f8

Merge branch 'main' of https://github.com/elastic/kibana into feature…

552d79c

…/investigation-entities

kdelemme reviewed Oct 2, 2024

View reviewed changes

dominiqueclarke commented Oct 2, 2024

View reviewed changes

x-pack/plugins/observability_solution/investigate_app/public/hooks/use_fetch_entities.ts Outdated Show resolved Hide resolved

Update x-pack/plugins/observability_solution/investigate_app/public/h…

1d65aec

…ooks/use_fetch_entities.ts

kdelemme approved these changes Oct 2, 2024

View reviewed changes

remove unnecessary async keyword

2907990

pass esClient to getEntities

53655e5

dominiqueclarke force-pushed the feature/investigation-entities branch from 34235a6 to 53655e5 Compare October 2, 2024 21:08

dominiqueclarke and others added 2 commits October 2, 2024 17:11

remove entity history references

3fb9019

[CI] Auto-commit changed files from 'node scripts/eslint --no-cache -…

862e98f

…-fix'

jloleysens approved these changes Oct 3, 2024

View reviewed changes

dominiqueclarke added 2 commits October 3, 2024 09:10

adjust alert hook

ed126e4

adjust plugin definition

2d119de

dominiqueclarke and others added 8 commits October 3, 2024 09:34

Merge branch 'feature/investigation-entities' of github.com:dominique…

56b2fb1

…clarke/kibana into feature/investigation-entities

remove imports from ai assistant

1932efe

[CI] Auto-commit changed files from 'node scripts/yarn_deduplicate'

b70caeb

adjust types

c24a620

Merge branch 'feature/investigation-entities' of github.com:dominique…

d9d211b

…clarke/kibana into feature/investigation-entities

merge main

2a27f75

account for missing sources

4ab1a83

adjust types

f95017d

dominiqueclarke force-pushed the feature/investigation-entities branch from b9de0ca to f95017d Compare October 4, 2024 14:27

dominiqueclarke merged commit e4bb435 into elastic:main Oct 4, 2024
23 checks passed

dominiqueclarke deleted the feature/investigation-entities branch October 4, 2024 17:58

kibanamachine mentioned this pull request Oct 4, 2024

[Cloud Security] Vulnerabilities Preview & Refactor CSP Plugin PHASE 2 #193638

Merged

dominiqueclarke mentioned this pull request Oct 5, 2024

[8.x] [Investigation app] add entities route and investigation Contextual Insight (#194432) #195158

Merged

kibanamachine mentioned this pull request Oct 7, 2024

[RCA] Events timeline !! #193265

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Investigation app] add entities route and investigation Contextual Insight #194432

[Investigation app] add entities route and investigation Contextual Insight #194432

dominiqueclarke commented Sep 30, 2024 •

edited by kibanamachine

Loading

elasticmachine commented Sep 30, 2024

obltmachine commented Sep 30, 2024

cauemarcondes Sep 30, 2024

dominiqueclarke Sep 30, 2024

dominiqueclarke Sep 30, 2024

kdelemme left a comment

kdelemme Oct 2, 2024

kdelemme Oct 2, 2024

kdelemme left a comment

kdelemme Oct 2, 2024

dominiqueclarke Oct 2, 2024

kdelemme Oct 2, 2024

dominiqueclarke Oct 2, 2024

kdelemme Oct 2, 2024

kdelemme Oct 2, 2024

kdelemme Oct 2, 2024

kdelemme Oct 2, 2024

jloleysens left a comment

dominiqueclarke commented Oct 3, 2024

kibana-ci commented Oct 4, 2024 •

edited

Loading

API count

kibanamachine commented Oct 4, 2024

kibanamachine commented Oct 4, 2024

dominiqueclarke commented Oct 5, 2024

[Investigation app] add entities route and investigation Contextual Insight #194432

[Investigation app] add entities route and investigation Contextual Insight #194432

Conversation

dominiqueclarke commented Sep 30, 2024 • edited by kibanamachine Loading

Summary

Testing

elasticmachine commented Sep 30, 2024

obltmachine commented Sep 30, 2024

🤖 GitHub comments

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kdelemme left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kdelemme left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jloleysens left a comment

Choose a reason for hiding this comment

dominiqueclarke commented Oct 3, 2024

kibana-ci commented Oct 4, 2024 • edited Loading

💚 Build Succeeded

Metrics [docs]

Module Count

Public APIs missing comments

Async chunks

Page load bundle

API count

History

kibanamachine commented Oct 4, 2024

kibanamachine commented Oct 4, 2024

💔 All backports failed

Manual backport

Questions ?

dominiqueclarke commented Oct 5, 2024

💚 All backports created successfully

Questions ?

dominiqueclarke commented Sep 30, 2024 •

edited by kibanamachine

Loading

kibana-ci commented Oct 4, 2024 •

edited

Loading