[js/web] revise backend registration #18715

fs-eire · 2023-12-06T00:06:39Z

Description

This PR revises the backend registration.

The following describes the expected behavior after this change (**bold** marks changed behavior):

- (ort.min.js - built without webgpu support)
  - loading: do not register 'webgpu' backend
  - creating session without EP list: use default EP list ['webnn', 'cpu', 'wasm']
  - creating session with ['webgpu'] as EP list: should fail with backend not available
- (ort.webgpu.min.js - built with webgpu support)
  - loading: **always register 'webgpu' backend**
    (previous behavior: only register 'webgpu' backend when `navigator.gpu` is available)
  - creating session without EP list: use default EP list ['webgpu', 'webnn', 'cpu', 'wasm']
    - when WebGPU is available (win): use WebGPU backend
    - when WebGPU is unavailable (android): **should fail backend init,** and try to use next backend in the list, 'webnn'
      (previous behavior: does not fail backend init, but fails in JSEP init, which was too late to switch to next backend)
  - creating session with ['webgpu'] as EP list
    - when WebGPU is available (win): use WebGPU backend
    - when WebGPU is unavailable (android): **should fail backend init, and because no more EPs are listed, fail.**

related PRs: #18190 #18144
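
The fallback behavior listed above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `SketchBackend` and `resolveFirstAvailable` are hypothetical names, and the real backend interface in onnxruntime-common has more members.

```typescript
// Hypothetical, simplified backend shape used only for this sketch.
interface SketchBackend {
  // Rejects when the backend cannot be used in the current environment.
  init(name: string): Promise<void>;
}

// Walk the EP list in order and return the first name whose backend initializes.
// A name that was never registered counts as "backend not available".
async function resolveFirstAvailable(
  epList: readonly string[],
  registry: Map<string, SketchBackend>,
): Promise<string> {
  const errors: string[] = [];
  for (const name of epList) {
    const backend = registry.get(name);
    if (backend === undefined) {
      errors.push(`${name}: backend not available`);
      continue;
    }
    try {
      await backend.init(name);
      return name;
    } catch (e) {
      // Failing here, during backend init, is what leaves room to try the next EP.
      errors.push(`${name}: ${e instanceof Error ? e.message : String(e)}`);
    }
  }
  throw new Error(`no available backend found. ${errors.join('; ')}`);
}
```

With a registry where 'webgpu' rejects in `init` and 'wasm' succeeds, the list ['webgpu', 'wasm'] resolves to 'wasm', while the list ['webgpu'] alone rejects because no EP is left to try.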

gyagp · 2023-12-06T06:13:42Z

As I commented in the PR, the description has a problem with EP lists like ['webnn', 'webgpu'].
So the only difference between ort.min.js and ort.webgpu.min.js is webgpu support, due to concerns about package size. For any release, a list of EPs will be served (designated by default or provided by developers), and webgpu will only be initialized when it's chosen.
BTW, for the default EP list, do we have any spec describing the sequence? What's the difference between the cpu EP and the wasm EP? Why do we put cpu before wasm?

fs-eire · 2023-12-06T23:18:53Z

> As I commented in the PR, the description has a problem with EP lists like ['webnn', 'webgpu']. So the only difference between ort.min.js and ort.webgpu.min.js is webgpu support, due to concerns about package size. For any release, a list of EPs will be served (designated by default or provided by developers), and webgpu will only be initialized when it's chosen. BTW, for the default EP list, do we have any spec describing the sequence? What's the difference between the cpu EP and the wasm EP? Why do we put cpu before wasm?

If we create a session with executionProviders, those names are used as the EP list.
If we create a session without executionProviders in the session options, the default EP list is used.
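
As a rough sketch of that rule: `SketchSessionOptions` and `resolveEpNames` below are hypothetical names, entries are simplified to plain strings, and treating an empty list the same as a missing one is an assumption of this sketch rather than something stated in this thread.

```typescript
// Hypothetical, simplified session options; only the field discussed here.
interface SketchSessionOptions {
  executionProviders?: readonly string[];
}

// Use the caller's EP names when given, otherwise fall back to the default list.
// Assumption: an empty list is treated like "not specified".
function resolveEpNames(
  options: SketchSessionOptions | undefined,
  defaultEpList: readonly string[],
): readonly string[] {
  const requested = options === undefined ? undefined : options.executionProviders;
  if (requested === undefined || requested.length === 0) {
    return defaultEpList;
  }
  return requested;
}
```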

The default EP list is built from the backends registered in lib/index.ts, according to the BUILD_DEFS flags of each build.

For example:

    registerBackend('webgpu', wasmBackend, 5);

This registers the object wasmBackend under the backend name "webgpu" with priority 5. The lower number means higher priority.

Technically they are all the same except webgl. For the "backend" concept defined in onnxruntime-common, there are only 2 backends implemented in onnxruntime-web: the WebAssembly backend and the webgl backend. In this PR, the backend name is passed to the init() function so that the WebAssembly backend can do things differently when it is called for a specific backend name.
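
To illustrate the idea of one backend object serving several names, here is a minimal sketch. `SketchWasmBackend` and `registerUnderNames` are hypothetical, the step strings are placeholders, and the real `init()` is asynchronous and does considerably more.

```typescript
// One object stands behind several backend names and branches on the name in init().
class SketchWasmBackend {
  // Returns the setup steps it would perform for the given backend name (placeholder strings).
  init(backendName: string): string[] {
    const steps = ['common wasm setup'];
    if (backendName === 'webgpu') {
      // EP-specific work happens only when the backend is initialized under this name.
      steps.push('webgpu-specific setup');
    }
    return steps;
  }
}

// Register the same backend instance under every given name.
function registerUnderNames(
  backend: SketchWasmBackend,
  names: readonly string[],
): Map<string, SketchWasmBackend> {
  const registry = new Map<string, SketchWasmBackend>();
  for (const name of names) {
    registry.set(name, backend);
  }
  return registry;
}
```

Looking up 'cpu', 'wasm' or 'webgpu' in such a registry yields the same object; only the name passed to `init()` changes what it does.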

The project history may explain why the backend registry/resolve concept is confusing: when we migrated onnx.js to onnxruntime-web, the new concept "execution provider" arrived, and it is similar to, yet different from, the old "backend" concept. Now, "backend" is an internal concept and "execution provider" is the public term used in the API. Also, since the new onnxruntime-web is based on WebAssembly, in the long term every backend will be a wasm backend.

fs-eire · 2023-12-12T02:10:17Z

This PR now includes all changes from #18756.

Initialization is now split into 3 steps for wasm. The last step is for EP-specific initialization (currently webgpu).

The wasm initialization steps are all combined and put into backend.init(), guarded as call_once.

The EP initialization step may be called multiple times, but for each EP name it will run only once.
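
One way to get this "once overall, then once per EP name" behavior is to cache promises. The sketch below is an illustration under that assumption, not the PR's code: `SketchInitializer` and its callbacks are hypothetical, and the combined wasm steps are collapsed into a single `initWasm` callback.

```typescript
// Caches the shared wasm init promise and one init promise per EP name.
class SketchInitializer {
  private wasmInit: Promise<void> | undefined;
  private readonly epInit = new Map<string, Promise<void>>();

  constructor(
    private readonly initWasm: () => Promise<void>,
    private readonly initEp: (epName: string) => Promise<void>,
  ) {}

  // Safe to call repeatedly: wasm init runs once, EP init runs once per name.
  init(epName: string): Promise<void> {
    let wasm = this.wasmInit;
    if (wasm === undefined) {
      wasm = this.initWasm();
      this.wasmInit = wasm;
    }
    let ep = this.epInit.get(epName);
    if (ep === undefined) {
      // EP-specific init is chained after the shared wasm init.
      ep = wasm.then(() => this.initEp(epName));
      this.epInit.set(epName, ep);
    }
    return ep;
  }
}
```

Note that this sketch also caches a rejected promise, so a failed initialization is not retried; whether that matches the actual implementation is not stated here.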

qjia7

Nice work! I like this refactor.

gyagp

LGTM with 2 nits. Thanks for working on this!

guschmue

On top of this we also need to implement onnxruntime.get_available_providers(),
because depending on the provider the app might want to load a different model (say, quantized for wasm, fp16 for webgpu and webnn), so it needs to know before it creates the session.

gyagp · 2023-12-20T03:11:13Z

> On top of this we also need to implement onnxruntime.get_available_providers(), because depending on the provider the app might want to load a different model (say, quantized for wasm, fp16 for webgpu and webnn), so it needs to know before it creates the session.

An EP being available doesn't mean it can fully support a specific model. We may assume developers would test their models with a specific version of onnxruntime-web and set their EP list accordingly.

guschmue · 2023-12-21T19:25:55Z

Yes, and not all GPUs are equal, even if the model works well on GPU in general.
This is hard for developers to deal with. Longer term we should have some utility functions to help app developers with it.

[js/web] revise backend registration

c80b242

fs-eire added 2 commits December 7, 2023 18:38

[js/web] refactor init and proxy in ort-web

8a07eb0

region close

8debdb4

fs-eire mentioned this pull request Dec 8, 2023

[js/web] revise init, wasm-core and proxy in ort-web #18756

Closed

fs-eire added 7 commits December 11, 2023 14:26

Merge remote-tracking branch 'origin/main' into fs-eire/refactor-init-and-proxy

70ea0af

Merge branch 'fs-eire/refactor-init-and-proxy' into fs-eire/allow-flexible-webgpu-backend-selection

b151058

update type for proxy main

a3a2f52

update initialize ort ep

7794cc2

make sure no shared buffer used for test models

a9b4a28

enforce run once

4ca2fa1

Merge remote-tracking branch 'origin/main' into fs-eire/allow-flexible-webgpu-backend-selection

1da3e2f

qjia7 approved these changes Dec 12, 2023

js/web/lib/wasm/proxy-messages.ts

fs-eire commented Dec 14, 2023

js/web/lib/wasm/proxy-wrapper.ts

fs-eire added 2 commits December 14, 2023 18:22

fix abort assignment

fbf41b1

Merge remote-tracking branch 'origin/main' into fs-eire/allow-flexible-webgpu-backend-selection

3123fd5

gyagp approved these changes Dec 19, 2023

js/web/lib/wasm/jsep/init.ts (2 outdated review threads)

satyajandhyala previously approved these changes Dec 19, 2023

fs-eire added 2 commits December 19, 2023 16:17

resolve comments

a4d7d5e

Merge remote-tracking branch 'origin/main' into fs-eire/allow-flexible-webgpu-backend-selection

ae8fb82

fs-eire dismissed satyajandhyala’s stale review via ae8fb82 December 20, 2023 00:18

satyajandhyala approved these changes Dec 20, 2023

guschmue approved these changes Dec 20, 2023

fs-eire merged commit 9a61388 into main Dec 20, 2023
92 of 100 checks passed

fs-eire deleted the fs-eire/allow-flexible-webgpu-backend-selection branch December 20, 2023 22:45

fs-eire mentioned this pull request Jan 17, 2024

Dynamically load ort-wasm*.js according to the EP name #19130

Closed

fs-eire mentioned this pull request Jan 23, 2024

[js/webgpu] Choose best wasm backend #18190

Closed

fs-eire mentioned this pull request Feb 2, 2024

Correct check for WebGPU support #18144

Closed
