[inference] Add cancelation support for chatComplete and output #203108

pgayvallet · 2024-12-05T14:18:36Z

Summary

Fix #200757

Add cancelation support for chatComplete and output, based on an abort signal.

Examples

response mode

import { isInferenceRequestAbortedError } from '@kbn/inference-common';

try {
  const abortController = new AbortController();
  const chatResponse = await inferenceClient.chatComplete({
    connectorId: 'some-gen-ai-connector',
    abortSignal: abortController.signal,
    messages: [{ role: MessageRole.User, content: 'Do something' }],
  });
} catch(e) {
  if(isInferenceRequestAbortedError(e)) {
    // request was aborted, do something
  } else {
    // was another error, do something else
  }
}

// elsewhere
abortController.abort()

stream mode

import { isInferenceRequestAbortedError } from '@kbn/inference-common';

const abortController = new AbortController();
const events$ = inferenceClient.chatComplete({
  stream: true,
  connectorId: 'some-gen-ai-connector',
  abortSignal: abortController.signal,
  messages: [{ role: MessageRole.User, content: 'Do something' }],
});

events$.subscribe({
  next: (event) => {
    // do something
  },
  error: (err) => {
    if(isInferenceRequestAbortedError(e)) {
      // request was aborted, do something
    } else {
      // was another error, do something else
    }
  }
});

abortController.abort();

…est-cancelation

pgayvallet · 2024-12-05T14:18:44Z

/ci

…est-cancelation

pgayvallet · 2024-12-11T11:38:50Z

/ci

…est-cancelation

pgayvallet · 2024-12-11T12:31:41Z

/ci

pgayvallet · 2024-12-11T14:53:26Z

/ci

elasticmachine · 2024-12-11T14:57:16Z

Pinging @elastic/appex-ai-infra (Team:AI Infra)

legrego

LGTM!

legrego · 2024-12-16T15:58:38Z

x-pack/platform/plugins/shared/inference/server/chat_complete/utils/handle_cancellation.ts

+        },
+        complete: () => {
+          if (abortSignal.aborted) {
+            subscriber.error(createInferenceRequestAbortedError('Request was aborted'));


question What other messages do we envision for these aborted request errors? This is the only place it's called today, and we pass in a hard-coded message.

I am not asking you to change this, I'm just curious if you have thoughts about the future.

Yeah, that's a fair point. We probably don't need that message to be passed as a parameter to be honest. I was mostly following the pattern that was used for other inference error types. I will change that

legrego · 2024-12-16T16:01:41Z

x-pack/platform/plugins/shared/inference/server/chat_complete/utils/handle_cancellation.test.ts

+    source$.next(3);
+
+    expect(values).toEqual([1, 2]);
+    expect(thrownError).toBeDefined();


nit: should we assert the correct error instance is thrown?

Suggested change

expect(thrownError).toBeDefined();

expect(thrownError).toBeInstanceOf(InferenceTaskError);

expect(thrownError.code).toBe('requestAborted');

…est-cancelation

elasticmachine · 2024-12-17T13:45:36Z

💚 Build Succeeded

Buildkite Build
Commit: b4cbe24

Metrics [docs]

Public APIs missing comments

Total count of every public API that lacks a comment. Target amount is 0. Run node scripts/build_api_docs --plugin [yourplugin] --stats comments for more detailed information.

id	before	after	diff
`@kbn/inference-common`	43	40	-3

Page load bundle

Size of the bundles that are downloaded on every page load. Target size is below 100kb

id	before	after	diff
`inference`	7.4KB	7.5KB	+74.0B

Unknown metric groups

API count

id	before	after	diff
`@kbn/inference-common`	136	141	+5

History

💛 Build #259375 was flaky f40820a
💚 Build #259269 succeeded a8c2a44
💛 Build #257452 was flaky a108424

kibanamachine · 2024-12-17T15:13:36Z

Starting backport for target branches: 8.x

https://github.com/elastic/kibana/actions/runs/12375857143

…tic#203108) ## Summary Fix elastic#200757 Add cancelation support for `chatComplete` and `output`, based on an abort signal. ### Examples #### response mode ```ts import { isInferenceRequestAbortedError } from '@kbn/inference-common'; try { const abortController = new AbortController(); const chatResponse = await inferenceClient.chatComplete({ connectorId: 'some-gen-ai-connector', abortSignal: abortController.signal, messages: [{ role: MessageRole.User, content: 'Do something' }], }); } catch(e) { if(isInferenceRequestAbortedError(e)) { // request was aborted, do something } else { // was another error, do something else } } // elsewhere abortController.abort() ``` #### stream mode ```ts import { isInferenceRequestAbortedError } from '@kbn/inference-common'; const abortController = new AbortController(); const events$ = inferenceClient.chatComplete({ stream: true, connectorId: 'some-gen-ai-connector', abortSignal: abortController.signal, messages: [{ role: MessageRole.User, content: 'Do something' }], }); events$.subscribe({ next: (event) => { // do something }, error: (err) => { if(isInferenceRequestAbortedError(e)) { // request was aborted, do something } else { // was another error, do something else } } }); abortController.abort(); ``` (cherry picked from commit 0b74f62)

kibanamachine · 2024-12-17T15:18:45Z

💚 All backports created successfully

Status	Branch	Result
✅	8.x

Note: Successful backport PRs will be merged automatically after passing CI.

Questions ?

Please refer to the Backport tool documentation

…#203108) (#204588) # Backport This will backport the following commits from `main` to `8.x`: - [[inference] Add cancelation support for chatComplete and output (#203108)](#203108)  ### Questions ? Please refer to the [Backport tool documentation](https://github.com/sqren/backport)  Co-authored-by: Pierre Gayvallet <[email protected]>

…tic#203108) ## Summary Fix elastic#200757 Add cancelation support for `chatComplete` and `output`, based on an abort signal. ### Examples #### response mode ```ts import { isInferenceRequestAbortedError } from '@kbn/inference-common'; try { const abortController = new AbortController(); const chatResponse = await inferenceClient.chatComplete({ connectorId: 'some-gen-ai-connector', abortSignal: abortController.signal, messages: [{ role: MessageRole.User, content: 'Do something' }], }); } catch(e) { if(isInferenceRequestAbortedError(e)) { // request was aborted, do something } else { // was another error, do something else } } // elsewhere abortController.abort() ``` #### stream mode ```ts import { isInferenceRequestAbortedError } from '@kbn/inference-common'; const abortController = new AbortController(); const events$ = inferenceClient.chatComplete({ stream: true, connectorId: 'some-gen-ai-connector', abortSignal: abortController.signal, messages: [{ role: MessageRole.User, content: 'Do something' }], }); events$.subscribe({ next: (event) => { // do something }, error: (err) => { if(isInferenceRequestAbortedError(e)) { // request was aborted, do something } else { // was another error, do something else } } }); abortController.abort(); ```

pgayvallet added 2 commits December 5, 2024 15:05

WIP

eccd1d1

Merge remote-tracking branch 'upstream/main' into kbn-200757-add-requ…

a108424

…est-cancelation

pgayvallet added 6 commits December 10, 2024 08:18

Merge remote-tracking branch 'upstream/main' into kbn-200757-add-requ…

8fa35d0

…est-cancelation

WIP on request cancellation

766c644

Merge remote-tracking branch 'upstream/main' into kbn-200757-add-requ…

352877b

…est-cancelation

add unit tests

aab3de7

add some tests and tsdoc

b6bea6a

add README doc about cancellation

d2804dc

Merge remote-tracking branch 'upstream/main' into kbn-200757-add-requ…

a8c2a44

…est-cancelation

pgayvallet added release_note:skip Skip the PR/issue when compiling release notes backport:version Backport to applied version labels Team:AI Infra AppEx AI Infrastructure Team v8.18.0 labels Dec 11, 2024

self review

d511a59

pgayvallet marked this pull request as ready for review December 11, 2024 14:57

pgayvallet requested a review from a team as a code owner December 11, 2024 14:57

fixing CODEOWNER file

f40820a

legrego approved these changes Dec 16, 2024

View reviewed changes

pgayvallet added 2 commits December 17, 2024 12:53

Merge remote-tracking branch 'upstream/main' into kbn-200757-add-requ…

2bb360e

…est-cancelation

address review comments

b4cbe24

pgayvallet merged commit 0b74f62 into elastic:main Dec 17, 2024
8 checks passed

kibanamachine added the v9.0.0 label Dec 17, 2024

kibanamachine mentioned this pull request Dec 17, 2024

[8.x] [inference] Add cancelation support for chatComplete and output (#203108) #204588

Merged

kibanamachine mentioned this pull request Dec 17, 2024

[EDR Workflows] CrowdStrike RunScript: Log Actions and UI Output #204044

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[inference] Add cancelation support for chatComplete and output #203108

[inference] Add cancelation support for chatComplete and output #203108

pgayvallet commented Dec 5, 2024 •

edited by kibanamachine

Loading

pgayvallet commented Dec 5, 2024

pgayvallet commented Dec 11, 2024

pgayvallet commented Dec 11, 2024

pgayvallet commented Dec 11, 2024

elasticmachine commented Dec 11, 2024

legrego left a comment

legrego Dec 16, 2024

pgayvallet Dec 17, 2024

legrego Dec 16, 2024

elasticmachine commented Dec 17, 2024

API count

kibanamachine commented Dec 17, 2024

kibanamachine commented Dec 17, 2024

	expect(thrownError).toBeDefined();
	expect(thrownError).toBeInstanceOf(InferenceTaskError);
	expect(thrownError.code).toBe('requestAborted');

[inference] Add cancelation support for chatComplete and output #203108

[inference] Add cancelation support for chatComplete and output #203108

Conversation

pgayvallet commented Dec 5, 2024 • edited by kibanamachine Loading

Summary

Examples

response mode

stream mode

pgayvallet commented Dec 5, 2024

pgayvallet commented Dec 11, 2024

pgayvallet commented Dec 11, 2024

pgayvallet commented Dec 11, 2024

elasticmachine commented Dec 11, 2024

legrego left a comment

Choose a reason for hiding this comment

legrego Dec 16, 2024

Choose a reason for hiding this comment

pgayvallet Dec 17, 2024

Choose a reason for hiding this comment

legrego Dec 16, 2024

Choose a reason for hiding this comment

elasticmachine commented Dec 17, 2024

💚 Build Succeeded

Metrics [docs]

Public APIs missing comments

Page load bundle

API count

History

kibanamachine commented Dec 17, 2024

kibanamachine commented Dec 17, 2024

💚 All backports created successfully

Questions ?

pgayvallet commented Dec 5, 2024 •

edited by kibanamachine

Loading