Npu allocator #437
Conversation
Crashing on tensor destruction. Might have UMD exceptions. Needs further debug. Unknown if values are correct.
df617dd to 966c48a
@@ -50,6 +50,15 @@
 constexpr const char* HIP = "Hip";
 constexpr const char* HIP_PINNED = "HipPinned";
 constexpr const char* OpenVINO_CPU = "OpenVINO_CPU";
 constexpr const char* OpenVINO_GPU = "OpenVINO_GPU";
+constexpr const char* OpenVINO_NPU = "OpenVINO_RT_NPU";
OpenVINO_NPU is a redefinition of OpenVINO_RT_NPU. Remove it if it is not referenced in the code.
It is not referenced, so I removed it in the new PR.
 size_t batch_size = 1;
 Ort::UnownedValue output_tensor =
     GetOutputTensor(context, batch_size, infer_request, std::move(output_name), subgraph_context_.output_names);
 auto mem_info = output_tensor.GetTensorMemoryInfo();
 if (mem_info.GetAllocatorName() == OpenVINO_GPU) {
Check whether this affects the OpenVINO_GPU IOBuffer path.
 if (mem_info.GetAllocatorName() == OpenVINO_GPU) {
   return;
 auto allocator_name = output_tensor.GetTensorMemoryInfo().GetAllocatorName();
 ov_tensor_data_t ov_tensor_data;
Check whether the declaration in StartAsyncInference is redundant.
We require ov_tensor_data to create the input/output tensors before running inference in StartAsyncInference.
@@ -854,6 +862,25 @@ select from 'TF8', 'TF16', 'UINT8', 'FLOAT', 'ITENSOR'. \n)");
 input_names_str_[i] = m.GetInputName(i);
 input_names_[i] = input_names_str_[i].c_str();
 }
Encapsulate the output tensor creation so it runs only when use_device_mem is set.
Made the relevant changes conditional on use_device_mem being true in the new PR.
Description
Draft PR for the remote tensor feature implementation in OVEP.