-
Notifications
You must be signed in to change notification settings - Fork 8.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ML] Update vCPUs ranges for start model deployment #195617
[ML] Update vCPUs ranges for start model deployment #195617
Conversation
…sed on nlp settings
Pinging @elastic/ml-ui (:ml) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Did not test the serverless projects, updated YAML looks good to me.
x-pack/plugins/ml/public/application/contexts/ml/ml_server_info_context.tsx
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Tested locally - stateful and serverless search and o11y.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Codeowners review - LGTM!
⏳ Build in-progress
History
cc @darnautov |
Starting backport for target branches: 8.x https://github.com/elastic/kibana/actions/runs/11329785623 |
## Summary #### Different vCPUs ranges and enabling support for static allocations based on the serverless project type - Each serverless config yml, e.g. [search.es.yml](https://github.com/darnautov/kibana/blob/84b3b79a1537fd98b18d1f137b16b532f3f1061f/config/serverless.es.yml#L61) now contains parameters required for start model deployment: ```yml xpack.ml.nlp: enabled: true modelDeployment: allowStaticAllocations: true vCPURange: low: min: 0 max: 2 static: 2 medium: min: 1 max: 32 static: 32 high: min: 1 max: 512 static: 512 ``` Note: _There will be no static allocations option for serverless O11y and serverless Security._ #### The minimum values of vCPUs - 0 for the Low usage level on both serverless and ESS. - 1 for the Medium and High usage levels on both serverless and ESS. #### The default vCPUs usage levels - Low in serverless. - Medium in ESS and on-prem ### Checklist - [x] [Unit or functional tests](https://www.elastic.co/guide/en/kibana/master/development-tests.html) were updated or added to match the most common scenarios (cherry picked from commit 1389708)
💚 All backports created successfully
Note: Successful backport PRs will be merged automatically after passing CI. Questions ?Please refer to the Backport tool documentation |
…196156) # Backport This will backport the following commits from `main` to `8.x`: - [[ML] Update vCPUs ranges for start model deployment (#195617)](#195617) <!--- Backport version: 9.4.3 --> ### Questions ? Please refer to the [Backport tool documentation](https://github.com/sqren/backport) <!--BACKPORT [{"author":{"name":"Dima Arnautov","email":"[email protected]"},"sourceCommit":{"committedDate":"2024-10-14T14:38:26Z","message":"[ML] Update vCPUs ranges for start model deployment (#195617)\n\n## Summary\r\n\r\n#### Different vCPUs ranges and enabling support for static allocations\r\nbased on the serverless project type\r\n\r\n- Each serverless config yml, e.g.\r\n[search.es.yml](https://github.com/darnautov/kibana/blob/84b3b79a1537fd98b18d1f137b16b532f3f1061f/config/serverless.es.yml#L61)\r\nnow contains parameters required for start model deployment:\r\n\r\n```yml\r\nxpack.ml.nlp:\r\n enabled: true\r\n modelDeployment:\r\n allowStaticAllocations: true\r\n vCPURange:\r\n low:\r\n min: 0\r\n max: 2\r\n static: 2\r\n medium:\r\n min: 1\r\n max: 32\r\n static: 32\r\n high:\r\n min: 1\r\n max: 512\r\n static: 512\r\n```\r\n\r\nNote: _There will be no static allocations option for serverless O11y\r\nand serverless Security._\r\n\r\n#### The minimum values of vCPUs\r\n\r\n- 0 for the Low usage level on both serverless and ESS.\r\n- 1 for the Medium and High usage levels on both serverless and ESS.\r\n \r\n#### The default vCPUs usage levels\r\n- Low in serverless.\r\n- Medium in ESS and on-prem\r\n\r\n### Checklist\r\n\r\n- [x] [Unit or functional\r\ntests](https://www.elastic.co/guide/en/kibana/master/development-tests.html)\r\nwere updated or added to match the most common scenarios","sha":"13897083dc8c465a16ddb3856d4e4904b9629610","branchLabelMapping":{"^v9.0.0$":"main","^v8.16.0$":"8.x","^v(\\d+).(\\d+).\\d+$":"$1.$2"}},"sourcePullRequest":{"labels":["release_note:enhancement",":ml","v9.0.0","Feature:3rd Party Models","Team:ML","ci:project-deploy-elasticsearch","ci:project-deploy-observability","v8.16.0","backport:version"],"title":"[ML] Update vCPUs ranges for start model deployment ","number":195617,"url":"https://github.com/elastic/kibana/pull/195617","mergeCommit":{"message":"[ML] Update vCPUs ranges for start model deployment (#195617)\n\n## Summary\r\n\r\n#### Different vCPUs ranges and enabling support for static allocations\r\nbased on the serverless project type\r\n\r\n- Each serverless config yml, e.g.\r\n[search.es.yml](https://github.com/darnautov/kibana/blob/84b3b79a1537fd98b18d1f137b16b532f3f1061f/config/serverless.es.yml#L61)\r\nnow contains parameters required for start model deployment:\r\n\r\n```yml\r\nxpack.ml.nlp:\r\n enabled: true\r\n modelDeployment:\r\n allowStaticAllocations: true\r\n vCPURange:\r\n low:\r\n min: 0\r\n max: 2\r\n static: 2\r\n medium:\r\n min: 1\r\n max: 32\r\n static: 32\r\n high:\r\n min: 1\r\n max: 512\r\n static: 512\r\n```\r\n\r\nNote: _There will be no static allocations option for serverless O11y\r\nand serverless Security._\r\n\r\n#### The minimum values of vCPUs\r\n\r\n- 0 for the Low usage level on both serverless and ESS.\r\n- 1 for the Medium and High usage levels on both serverless and ESS.\r\n \r\n#### The default vCPUs usage levels\r\n- Low in serverless.\r\n- Medium in ESS and on-prem\r\n\r\n### Checklist\r\n\r\n- [x] [Unit or functional\r\ntests](https://www.elastic.co/guide/en/kibana/master/development-tests.html)\r\nwere updated or added to match the most common scenarios","sha":"13897083dc8c465a16ddb3856d4e4904b9629610"}},"sourceBranch":"main","suggestedTargetBranches":["8.x"],"targetPullRequestStates":[{"branch":"main","label":"v9.0.0","branchLabelMappingKey":"^v9.0.0$","isSourceBranch":true,"state":"MERGED","url":"https://github.com/elastic/kibana/pull/195617","number":195617,"mergeCommit":{"message":"[ML] Update vCPUs ranges for start model deployment (#195617)\n\n## Summary\r\n\r\n#### Different vCPUs ranges and enabling support for static allocations\r\nbased on the serverless project type\r\n\r\n- Each serverless config yml, e.g.\r\n[search.es.yml](https://github.com/darnautov/kibana/blob/84b3b79a1537fd98b18d1f137b16b532f3f1061f/config/serverless.es.yml#L61)\r\nnow contains parameters required for start model deployment:\r\n\r\n```yml\r\nxpack.ml.nlp:\r\n enabled: true\r\n modelDeployment:\r\n allowStaticAllocations: true\r\n vCPURange:\r\n low:\r\n min: 0\r\n max: 2\r\n static: 2\r\n medium:\r\n min: 1\r\n max: 32\r\n static: 32\r\n high:\r\n min: 1\r\n max: 512\r\n static: 512\r\n```\r\n\r\nNote: _There will be no static allocations option for serverless O11y\r\nand serverless Security._\r\n\r\n#### The minimum values of vCPUs\r\n\r\n- 0 for the Low usage level on both serverless and ESS.\r\n- 1 for the Medium and High usage levels on both serverless and ESS.\r\n \r\n#### The default vCPUs usage levels\r\n- Low in serverless.\r\n- Medium in ESS and on-prem\r\n\r\n### Checklist\r\n\r\n- [x] [Unit or functional\r\ntests](https://www.elastic.co/guide/en/kibana/master/development-tests.html)\r\nwere updated or added to match the most common scenarios","sha":"13897083dc8c465a16ddb3856d4e4904b9629610"}},{"branch":"8.x","label":"v8.16.0","branchLabelMappingKey":"^v8.16.0$","isSourceBranch":false,"state":"NOT_CREATED"}]}] BACKPORT--> Co-authored-by: Dima Arnautov <[email protected]>
Summary
Different vCPUs ranges and enabling support for static allocations based on the serverless project type
Note: There will be no static allocations option for serverless O11y and serverless Security.
The minimum values of vCPUs
The default vCPUs usage levels
Checklist