
Enable QNN HTP support for Node #20576

Merged 19 commits into microsoft:main on May 9, 2024

Conversation

@joncamp (Contributor) commented May 6, 2024

Description

Add support for using Onnx Runtime's QNN execution provider with Node.

Motivation and Context

Onnx Runtime supports the QNN HTP, but does not expose it to Node.js. This adds baseline support for using the Onnx Runtime with Node.

Note that this does not update the officially distributed node packages. It simply patches `onnxruntime.dll` so that 'qnn' can be used as an execution provider.

Testing was done using the existing onnxruntime-node package. The `onnxruntime.dll` and `onnxruntime_binding.node` in `node_modules\onnxruntime-node\bin\napi-v3\win32\arm64` were swapped for the newly built versions, then the various QNN DLLs and .so files were placed next to `onnxruntime.dll`. Testing was performed on a variety of models and applications, but the easiest test is to modify the [node quickstart example](https://github.com/microsoft/onnxruntime-inference-examples/tree/main/js/quick-start_onnxruntime-node).
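As a minimal sketch of what the modified quickstart test looks like: the PR lets the string 'qnn' be passed in `executionProviders` when creating an `InferenceSession` with onnxruntime-node. The fallback ordering below is an assumption for illustration, not something the PR mandates.

```typescript
// Hedged sketch: with the patched binaries in place, 'qnn' can be requested
// as an execution provider. Listing 'cpu' after it is an assumed fallback for
// environments where QNN cannot be used.
function buildSessionOptions(useQnn: boolean): { executionProviders: string[] } {
  const executionProviders = useQnn ? ['qnn', 'cpu'] : ['cpu'];
  return { executionProviders };
}

// Usage (requires the swapped-in binaries described above):
//   const ort = require('onnxruntime-node');
//   const session = await ort.InferenceSession.create('model.onnx', buildSessionOptions(true));
```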

@joncamp (Contributor, Author) commented May 6, 2024 via email

@jywu-msft requested a review from fs-eire May 6, 2024 16:35
@jywu-msft (Member)

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

@jywu-msft (Member)

/azp run Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Android CI Pipeline


Azure Pipelines successfully started running 10 pipeline(s).


Azure Pipelines successfully started running 9 pipeline(s).

@fs-eire (Contributor) commented May 6, 2024

I have a few questions regarding this QNN HTP feature:

  • Does this feature only work (or is it only planned to be supported) on Windows/arm64? Will other OS/CPU architectures also need this feature?
  • Is the QNN EP statically linked (i.e. included in onnxruntime.dll, like DML) or dynamically linked (i.e. in onnxruntime_provider_xxx.dll, like CUDA)?
  • Is it compatible with CPU/DML if a build enables them all? Specifically:
    • If it is running on a non-Qualcomm device (not sure if this scenario exists; please let me know if I made a wrong assumption) or an environment where QNN is not supported, can it still load and run other EPs?
    • Does the binary work with other non-CPU EPs, like DML? (We know CUDA has an issue where it does not work with DML in one build.)
  • Do you want to include QNN HTP support in onnxruntime-node by default? If so:
    • Do we already have a build pipeline for release artifacts in the "Zip-*" pipeline?

@jywu-msft (Member)

> I have a few questions regarding this QNN HTP feature: […]

I can answer some of these questions.

  1. win/arm64 for now; maybe other platforms later, need to see what those are. (QNN itself runs on win/arm64, win/x64, linux/x64, and Android.)
  2. The QNN EP is statically linked into onnxruntime.dll (that wouldn't change anytime soon).
  3. QNN will not run on non-Qualcomm hardware.
  4. Compatible with CPU. DML it should be (but not sure how extensively it's been tested).
  5. I think this PR just enables building from source, but it would be nice to eventually support this in a more official manner (pipelines, default options, etc.).
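Given the answer that this PR targets win/arm64 only, a caller could gate the 'qnn' EP request on the current platform. A minimal sketch, assuming Node's `process.platform` / `process.arch` string values; the helper name is hypothetical:

```typescript
// Hedged sketch: QNN HTP support in this PR is win/arm64 only, so only
// request the 'qnn' EP on that platform combination.
function mayUseQnn(platform: string, arch: string): boolean {
  return platform === 'win32' && arch === 'arm64';
}

// Example: mayUseQnn(process.platform, process.arch)
```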

@mindest added the label ep:QNN (issues related to QNN execution provider) May 7, 2024
@hans00 (Contributor) commented May 7, 2024

>   • QNN will not run on non-Qualcomm hardware

QNN CPU might be OK, but in my testing, performance is worse than XNNPACK + CPU EP.

P.S. QNN CPU seems to be a wrapper of XNNPACK.

@jywu-msft (Member)

/azp run Linux OpenVINO CI Pipeline


Azure Pipelines successfully started running 1 pipeline(s).

@jywu-msft (Member)

@joncamp fyi, https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=1371836&view=logs&j=90af55dc-07cc-5abc-02f4-1cf38a060872&t=7ee52c27-1516-5118-a868-3d2d34beb196

lib/wasm/session-options.ts:121:31 - error TS2339: Property 'preferredLayout' does not exist on type 'QnnExecutionProviderOption'.

121 if (qnnOptions?.preferredLayout) {
~~~~~~~~~~~~~~~

Found 1 error in lib/wasm/session-options.ts:121
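The TS2339 error above says `QnnExecutionProviderOption` has no `preferredLayout` field. A hedged sketch of one possible shape of a fix (the actual change made in the PR may differ): declare the field as optional so the existing guard type-checks.

```typescript
// Assumed interface shape; the real onnxruntime-web definition may differ.
interface QnnExecutionProviderOption {
  readonly name: 'qnn';
  preferredLayout?: 'NCHW' | 'NHWC'; // adding this optional field resolves TS2339
}

// Mirrors the guard from lib/wasm/session-options.ts:121.
function resolveLayout(qnnOptions?: QnnExecutionProviderOption): string {
  if (qnnOptions?.preferredLayout) {
    return qnnOptions.preferredLayout;
  }
  return 'NCHW'; // assumed default when no layout is specified
}
```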

@jywu-msft (Member)

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline


Azure Pipelines successfully started running 10 pipeline(s).

@jywu-msft (Member)

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline


Azure Pipelines successfully started running 10 pipeline(s).

@jywu-msft (Member)

/azp run Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Android CI Pipeline, Linux OpenVINO CI Pipeline


Azure Pipelines successfully started running 10 pipeline(s).

@jywu-msft merged commit 768c793 into microsoft:main May 9, 2024
78 checks passed
poweiw pushed a commit to poweiw/onnxruntime that referenced this pull request Jun 25, 2024
6 participants