[ML] Update vCPUs ranges for start model deployment #195617

darnautov · 2024-10-09T14:39:26Z

Summary

Different vCPUs ranges and enabling support for static allocations based on the serverless project type

Each serverless config yml, e.g. search.es.yml now contains parameters required for start model deployment:

xpack.ml.nlp:
  enabled: true
  modelDeployment:
    allowStaticAllocations: true
    vCPURange:
      low:
        min: 0
        max: 2
        static: 2
      medium:
        min: 1
        max: 32
        static: 32
      high:
        min: 1
        max: 512
        static: 512

Note: There will be no static allocations option for serverless O11y and serverless Security.

The minimum values of vCPUs

0 for the Low usage level on both serverless and ESS.
1 for the Medium and High usage levels on both serverless and ESS.

The default vCPUs usage levels

Low in serverless.
Medium in ESS and on-prem

Checklist

Unit or functional tests were updated or added to match the most common scenarios

…sed on nlp settings

elasticmachine · 2024-10-10T14:11:14Z

Pinging @elastic/ml-ui (:ml)

jloleysens

Did not test the serverless projects, updated YAML looks good to me.

x-pack/plugins/ml/public/application/contexts/ml/ml_server_info_context.tsx

peteharverson

LGTM. Tested locally - stateful and serverless search and o11y.

jgowdyelastic

LGTM

kc13greiner

Codeowners review - LGTM!

…values

elasticmachine · 2024-10-14T11:10:05Z

⏳ Build in-progress

Buildkite Build
Commit: 3389784
Cloud Deployment
Kibana Serverless Image: docker.elastic.co/kibana-ci/kibana-serverless:pr-195617-3389784a1bab
Elasticsearch Serverless Deployment

History

cc @darnautov

kibanamachine · 2024-10-14T14:38:43Z

Starting backport for target branches: 8.x

https://github.com/elastic/kibana/actions/runs/11329785623

## Summary #### Different vCPUs ranges and enabling support for static allocations based on the serverless project type - Each serverless config yml, e.g. [search.es.yml](https://github.com/darnautov/kibana/blob/84b3b79a1537fd98b18d1f137b16b532f3f1061f/config/serverless.es.yml#L61) now contains parameters required for start model deployment: ```yml xpack.ml.nlp: enabled: true modelDeployment: allowStaticAllocations: true vCPURange: low: min: 0 max: 2 static: 2 medium: min: 1 max: 32 static: 32 high: min: 1 max: 512 static: 512 ``` Note: _There will be no static allocations option for serverless O11y and serverless Security._ #### The minimum values of vCPUs - 0 for the Low usage level on both serverless and ESS. - 1 for the Medium and High usage levels on both serverless and ESS. #### The default vCPUs usage levels - Low in serverless. - Medium in ESS and on-prem ### Checklist - [x] [Unit or functional tests](https://www.elastic.co/guide/en/kibana/master/development-tests.html) were updated or added to match the most common scenarios (cherry picked from commit 1389708)

kibanamachine · 2024-10-14T14:43:14Z

💚 All backports created successfully

Status	Branch	Result
✅	8.x

Note: Successful backport PRs will be merged automatically after passing CI.

Questions ?

Please refer to the Backport tool documentation

…196156) # Backport This will backport the following commits from `main` to `8.x`: - [[ML] Update vCPUs ranges for start model deployment (#195617)](#195617)  ### Questions ? Please refer to the [Backport tool documentation](https://github.com/sqren/backport)  Co-authored-by: Dima Arnautov <[email protected]>

darnautov added 7 commits October 9, 2024 16:37

model deployment settings for serverless

72ee4a2

wip NLP settings

29ec5f7

MlServerInfoContext and nlpSettings

aee060d

wip serverless functional tests

ed5e49a

update deployment params mapper

b304b55

update deployment params mapper with enforcing adaptive allocation ba…

53ceacf

…sed on nlp settings

set default vCPU level based on env

84b3b79

darnautov marked this pull request as ready for review October 10, 2024 14:10

darnautov requested review from a team as code owners October 10, 2024 14:10

darnautov requested review from peteharverson and jgowdyelastic October 10, 2024 14:10

darnautov self-assigned this Oct 10, 2024

darnautov added release_note:enhancement :ml v9.0.0 Team:ML Team label for ML (also use :ml) v8.16.0 backport:version Backport to applied version labels labels Oct 10, 2024

darnautov added the Feature:3rd Party Models ML 3rd party models label Oct 10, 2024

darnautov added 2 commits October 10, 2024 18:02

update rendering test

04056e4

fix availability of the deploy button

d1a073f

darnautov added ci:cloud-deploy Create or update a Cloud deployment ci:project-deploy-elasticsearch Create an Elasticsearch Serverless project ci:project-deploy-observability Create an Observability project labels Oct 10, 2024

serverless test for security project

fe47394

peteharverson mentioned this pull request Oct 10, 2024

[ML] Increase Test Coverage 8.16.0 #188459

Closed

23 tasks

skip serverless functional tests

1da8f69

jloleysens approved these changes Oct 11, 2024

View reviewed changes

jgowdyelastic reviewed Oct 11, 2024

View reviewed changes

x-pack/plugins/ml/public/application/contexts/ml/ml_server_info_context.tsx Outdated Show resolved Hide resolved

peteharverson approved these changes Oct 11, 2024

View reviewed changes

darnautov added 3 commits October 11, 2024 15:52

update serverless tests with tinyElser

e038546

fix ts errors

90fdf82

rename context

31ac253

kc13greiner self-requested a review October 11, 2024 14:20

darnautov requested a review from jgowdyelastic October 11, 2024 14:20

jgowdyelastic approved these changes Oct 11, 2024

View reviewed changes

sphilipse approved these changes Oct 11, 2024

View reviewed changes

kc13greiner approved these changes Oct 11, 2024

View reviewed changes

darnautov added 2 commits October 14, 2024 11:04

Merge remote-tracking branch 'origin/main' into ml-update-allocation-…

3b939f8

…values

update serverless tests, fix helper text for static

3389784

darnautov removed the ci:cloud-deploy Create or update a Cloud deployment label Oct 14, 2024

darnautov merged commit 1389708 into elastic:main Oct 14, 2024
27 checks passed

darnautov deleted the ml-update-allocation-values branch October 14, 2024 14:38

kibanamachine mentioned this pull request Oct 14, 2024

[8.x] [ML] Update vCPUs ranges for start model deployment (#195617) #196156

Merged

kibanamachine mentioned this pull request Oct 14, 2024

[Cloud Security] Refactoring cloud-security-posture packages' folder structure #196008

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ML] Update vCPUs ranges for start model deployment #195617

[ML] Update vCPUs ranges for start model deployment #195617

darnautov commented Oct 9, 2024 •

edited by kibanamachine

Loading

elasticmachine commented Oct 10, 2024

jloleysens left a comment

peteharverson left a comment

jgowdyelastic left a comment

kc13greiner left a comment

elasticmachine commented Oct 14, 2024 •

edited

Loading

kibanamachine commented Oct 14, 2024

kibanamachine commented Oct 14, 2024

[ML] Update vCPUs ranges for start model deployment #195617

[ML] Update vCPUs ranges for start model deployment #195617

Conversation

darnautov commented Oct 9, 2024 • edited by kibanamachine Loading

Summary

Different vCPUs ranges and enabling support for static allocations based on the serverless project type

The minimum values of vCPUs

The default vCPUs usage levels

Checklist

elasticmachine commented Oct 10, 2024

jloleysens left a comment

Choose a reason for hiding this comment

peteharverson left a comment

Choose a reason for hiding this comment

jgowdyelastic left a comment

Choose a reason for hiding this comment

kc13greiner left a comment

Choose a reason for hiding this comment

elasticmachine commented Oct 14, 2024 • edited Loading

⏳ Build in-progress

History

kibanamachine commented Oct 14, 2024

kibanamachine commented Oct 14, 2024

💚 All backports created successfully

Questions ?

darnautov commented Oct 9, 2024 •

edited by kibanamachine

Loading

elasticmachine commented Oct 14, 2024 •

edited

Loading