
[Web] WebGPU backend fails to load some model due to exception during initialization inside transpose optimizer #15869

Closed
gegogi opened this issue May 9, 2023 · 9 comments
Labels
model:transformer issues related to a transformer model: BERT, GPT2, Hugging Face, Longformer, T5, etc. platform:web issues related to ONNX Runtime web; typically submitted using template

Comments

gegogi commented May 9, 2023

Describe the issue

I am trying to load a model using the WebGPU backend.
I could load the model downloaded from:
https://github.com/onnx/models/blob/main/vision/classification/mobilenet/model/mobilenetv2-12.onnx
But I couldn't load the following model:
https://huggingface.co/runwayml/stable-diffusion-v1-5/tree/onnx/vae_encoder
Both models can be loaded using Python onnxruntime.

To reproduce

Download the model from:
https://huggingface.co/runwayml/stable-diffusion-v1-5/tree/onnx/vae_encoder
and run the following code:

const ort = require('onnxruntime-web/webgpu');

async function main() {
    const modelPath = './models/sd15_vae_encoder_model.onnx';
    // Loading fails here with the WebGPU EP, while the same model loads in Python.
    const session = await ort.InferenceSession.create(modelPath, { executionProviders: ['webgpu'] });
    console.log(session.inputNames); // confirms the model loaded
}

main();

Urgency

No response

ONNX Runtime Installation

Released Package

ONNX Runtime Version or Commit ID

[email protected]

Execution Provider

Other / Unknown

@gegogi gegogi added the platform:web issues related to ONNX Runtime web; typically submitted using template label May 9, 2023
@github-actions github-actions bot added the model:transformer issues related to a transformer model: BERT, GPT2, Hugging Face, Longformer, T5, etc. label May 9, 2023

gegogi commented May 9, 2023

FYI, loading still fails even after conversion to .ort format.
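
For reference, the .ort conversion can be done with the converter that ships in the onnxruntime Python package (assuming the standard tool; the file name matches the repro above):

python -m onnxruntime.tools.convert_onnx_models_to_ort sd15_vae_encoder_model.onnx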

fs-eire commented May 9, 2023

I will take a look

visheratin (Contributor) commented:

The most likely reason is that the VAE encoder graph has operators that are not yet supported by the WebGPU execution provider, e.g., InstanceNormalization, Slice, Reshape.

fs-eire commented May 16, 2023

The operator coverage is a problem, but that should not cause the model loading failure. After debugging the issue, I found that the problem is in the transpose optimizer.

 C:\a\_work\1\s\onnxruntime\core\optimizer\transpose_optimizer\optimizer_api_impl.cc:280 virtual std::vector<uint8_t> onnxruntime::ApiTensor::Data() const [ONNXRuntimeError] : 2 : INVALID_ARGUMENT : 
Yi @ ort.webgpu.min.js:6

Need to dig deeper into the source code. I am debugging it.

@fs-eire fs-eire changed the title [Web] WebGPU backend cannot load some models that Python runtime can load. [Web] WebGPU backend fails to load some model due to exception during initialization inside transpose optimizer May 16, 2023
fs-eire added a commit that referenced this issue May 19, 2023
### Description
Because of #15618, the default allocator changed to the device allocator, which will be GPU instead of CPU. The transpose optimizer expects to read data from initializers, so a CPU allocator is required there.

This change fixes the transpose optimizer on the GPU EP.

Fixes the issue referred to in #15869, #15796.
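
To make the failure mode concrete, here is a hypothetical JavaScript sketch (names are illustrative only, not ONNX Runtime internals): raw tensor bytes can only be read out of host memory, so an optimizer that inspects initializer data must request a CPU allocator rather than the session's default device allocator.

// Hypothetical sketch; not real ONNX Runtime APIs.
// Raw tensor bytes are only readable from host (CPU) memory.
function readTensorBytes(allocatorDevice, bytes) {
    if (allocatorDevice !== 'cpu') {
        // Corresponds to the INVALID_ARGUMENT error in the log above.
        throw new Error('INVALID_ARGUMENT: tensor data must reside in CPU memory');
    }
    return bytes;
}

const data = new Uint8Array([1, 2, 3, 4]);
// Before the fix: the default allocator is now the device (GPU) allocator.
// readTensorBytes('gpu', data);  // throws during session initialization
// After the fix: the optimizer explicitly requests a CPU allocator.
readTensorBytes('cpu', data);     // OK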

fs-eire commented May 22, 2023

@gegogi This issue should have been fixed by the PR mentioned above. Please help validate that it works. Thanks!

gegogi commented May 24, 2023

Could you publish the latest nightly npm build? I tried to build onnxruntime myself but couldn't figure out the compilation errors related to a protobuf version mismatch. It seems the project includes protobuf as a submodule but is trying to include headers from the system directory, which have different signatures.

fs-eire commented May 25, 2023

Please try [email protected]

gabrielgrant commented May 24, 2024

This appears to be fixed for me when running this example: https://gist.github.com/gabrielgrant/cb3e072dec5a416b4fc24f18ae902fb7

...but despite using ort.webgpu.min.js and only specifying executionProviders: ['webgpu'], it still demands that ort.env.wasm.wasmPaths be set, so it's not entirely clear to me whether it's actually using the WebGPU backend instead of WASM. (Is the WASM bundle just needed as a fallback for kernels not yet implemented in WebGPU?)

@gegogi are you able to confirm this is fixed? (this should be in a release now)

@fs-eire:

  1. Can you confirm that the gist example I've put together tests the issue correctly?
  2. Are you confident enough that fix transpose optimizer on GPU EP #15988 fixes this to close this issue?

fs-eire commented May 24, 2024

ONNX Runtime Web depends on C++ code for session, graph, and model execution, which is compiled into WebAssembly. In short, ONNX Runtime Web always needs to load WebAssembly, no matter whether you use the webgpu or wasm (CPU) EP.

However, you don't always have to set ort.env.wasm.wasmPaths. If it is not set, the runtime will try to load the .wasm files from the "current folder" (relative to the URL of the JavaScript file that is currently running); the flag just offers a way to customize the path.
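
As an illustration, here is a minimal sketch of overriding that path (the URL below is a placeholder, not taken from this thread):

const ort = require('onnxruntime-web/webgpu');

// Point the runtime at the directory hosting the .wasm files; by default
// it looks next to the running script. Replace the URL with your own host.
ort.env.wasm.wasmPaths = 'https://example.com/onnxruntime-web/dist/';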

The issue related to "Transpose" is already fixed, so let me close this issue.

@fs-eire fs-eire closed this as completed May 24, 2024