
[ML] Update the number of allocations per nlp process #86277

Merged: 7 commits into elastic:master from the update-num-allocations branch on May 5, 2022

Conversation

davidkyle (Member) commented:

This is the Java side of elastic/ml-cpp#2258; while the two PRs are out of sync, the naming changes cause internal breakages.

Adds a method to DeploymentManager to update the number of allocations per process, as implemented in elastic/ml-cpp#2258.

Also, PyTorchResults now has an error type rather than treating the error as a special case of the inference result, and the test mutes from #86263 are reverted.
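
To make the shape of the change concrete, here is a minimal, self-contained Java sketch of an "update the number of allocations" entry point on a deployment manager. Everything in it (the class name NlpDeploymentManagerSketch, the ControlMessage and ThreadSettings records, and the ProcessChannel interface) is an illustrative assumption for this sketch, not the actual DeploymentManager API.

import java.util.function.Consumer;

// Minimal sketch only: every type here is an illustrative stand-in,
// not the real Elasticsearch DeploymentManager API.
public class NlpDeploymentManagerSketch {

    /** Hypothetical settings reported back by the native process after an update. */
    public record ThreadSettings(int numAllocations, int threadsPerAllocation) {}

    /** Hypothetical control message written to the pytorch process. */
    public record ControlMessage(String type, int numAllocations) {}

    /** Stand-in for the channel that delivers control messages to the native process. */
    public interface ProcessChannel {
        void writeControlMessage(ControlMessage message,
                                 Consumer<ThreadSettings> onResponse,
                                 Consumer<Exception> onFailure);
    }

    private final ProcessChannel channel;

    public NlpDeploymentManagerSketch(ProcessChannel channel) {
        this.channel = channel;
    }

    /**
     * Ask the native process to change how many allocations it runs, and report
     * the resulting thread settings (or a failure) back to the caller.
     */
    public void updateNumAllocations(int numAllocations,
                                     Consumer<ThreadSettings> onResponse,
                                     Consumer<Exception> onFailure) {
        if (numAllocations <= 0) {
            onFailure.accept(new IllegalArgumentException("numAllocations must be positive"));
            return;
        }
        channel.writeControlMessage(
            new ControlMessage("update_num_allocations", numAllocations),
            onResponse,
            onFailure
        );
    }
}

The real change plumbs the response through an ActionListener<ThreadSettings>, as the diff excerpt below shows; the callback pair here is just a dependency-free stand-in for that listener.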

davidkyle added the :ml Machine learning and v8.3.0 labels on Apr 28, 2022
elasticmachine added the Team:ML (Meta label for the ML team) label on Apr 28, 2022
elasticmachine (Collaborator) commented:

Pinging @elastic/ml-core (Team:ML)

dimitris-athanasiou (Contributor) left a comment:

Looks good. Just a minor suggestion for a potential method extraction.

The inline comments below were left on this excerpt from the diff:

    TimeValue timeout,
    ActionListener<ThreadSettings> listener
) {
    var processContext = getProcessContext(task, listener::onFailure);
dimitris-athanasiou (Contributor) commented on this excerpt:

There seems to be another method we could extract here, something like executePyTorchAction, which takes an AbstractPyTorchAction. We could then reuse it when firing either the inference action or the control message action; it would handle getting the process context and the try-catch around running the action.

I might be missing something that makes it impossible to do this. In any case, just a thought.
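
For what it's worth, a rough sketch of the extraction being suggested. ProcessContext and the simplified AbstractPyTorchAction below are stand-ins so the example compiles on its own; only the shape of the helper is the point.

// Sketch of the suggested extraction: inference actions and control-message actions
// share one helper that resolves the process context and guards execution.
// ProcessContext and AbstractPyTorchAction are simplified stand-ins here.
public class PyTorchActionExecutorSketch {

    /** Stand-in for the per-deployment process context. */
    public static class ProcessContext {}

    /** Simplified stand-in for the real AbstractPyTorchAction. */
    public abstract static class AbstractPyTorchAction {
        abstract void run(ProcessContext context) throws Exception;
        abstract void onFailure(Exception e);
    }

    private final ProcessContext processContext;

    public PyTorchActionExecutorSketch(ProcessContext processContext) {
        this.processContext = processContext;
    }

    /** Shared path: look up the context, then run the action inside a try-catch. */
    public void executePyTorchAction(AbstractPyTorchAction action) {
        ProcessContext context = processContext; // the real code would resolve this per task
        if (context == null) {
            action.onFailure(new IllegalStateException("process context is not available"));
            return;
        }
        try {
            action.run(context);
        } catch (Exception e) {
            action.onFailure(e);
        }
    }
}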

davidkyle (Member, Author) replied:

++ yeah there was an opportunity to refactor here

davidkyle force-pushed the update-num-allocations branch from dee9ed1 to f950f4c on May 5, 2022 at 09:17
dimitris-athanasiou (Contributor) left a comment:

LGTM

davidkyle merged commit 6318be5 into elastic:master on May 5, 2022
davidkyle deleted the update-num-allocations branch on May 5, 2022 at 11:03
dimitris-athanasiou added a commit to dimitris-athanasiou/elasticsearch that referenced this pull request Jun 8, 2022
During elastic#86277 an error was introduced in parsing of pytorch
thread settings. This commit fixes the issue.
dimitris-athanasiou added a commit that referenced this pull request Jun 9, 2022
During #86277 an error was introduced in parsing of pytorch
thread settings. This commit fixes the issue.
dimitris-athanasiou added a commit to dimitris-athanasiou/elasticsearch that referenced this pull request Jun 9, 2022
During elastic#86277 an error was introduced in parsing of pytorch
thread settings. This commit fixes the issue.
elasticsearchmachine pushed a commit that referenced this pull request Jun 9, 2022
During #86277 an error was introduced in parsing of pytorch
thread settings. This commit fixes the issue.
Labels: :ml Machine learning, >non-issue, Team:ML (Meta label for the ML team), v8.3.0
Participants: 3