[WebNN EP] Decompose Concat with input number > 4 for CPU backend #18930
Conversation
The WebNN XNNPack backend only supports Concat with at most 4 inputs, so this change decomposes a Concat with more than 4 inputs into multiple WebNN concat ops.
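For illustration, here is a minimal C++ sketch of one way such a decomposition could work: concat the first 4 inputs, then fold the partial result in with up to 3 more inputs per step. `EmitConcat` and the operand-name strings are hypothetical stand-ins, not the EP's actual WebNN graph-builder calls, and the PR's exact grouping strategy may differ.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <iostream>
#include <string>
#include <vector>

// Hypothetical stand-in for emitting one WebNN concat op (at most 4 inputs);
// the real EP goes through the WebNN graph builder, which is not shown here.
std::string EmitConcat(const std::vector<std::string>& inputs, int64_t axis) {
  std::string node = "concat(";
  for (size_t i = 0; i < inputs.size(); ++i) {
    node += inputs[i];
    if (i + 1 < inputs.size()) node += ",";
  }
  node += ")";
  std::cout << "emit " << node << " on axis " << axis << "\n";
  return node;
}

// Decompose a Concat with more than 4 inputs into a chain of concats that
// each take at most 4 operands: concat the first 4, then merge the partial
// result with up to 3 more inputs at a time until everything is consumed.
std::string DecomposeConcat(const std::vector<std::string>& operands, int64_t axis) {
  constexpr size_t kMaxInputs = 4;
  if (operands.size() <= kMaxInputs) return EmitConcat(operands, axis);

  std::vector<std::string> group(operands.begin(), operands.begin() + kMaxInputs);
  std::string partial = EmitConcat(group, axis);
  for (size_t i = kMaxInputs; i < operands.size(); i += kMaxInputs - 1) {
    size_t end = std::min(i + kMaxInputs - 1, operands.size());
    group.assign({partial});
    group.insert(group.end(), operands.begin() + i, operands.begin() + end);
    partial = EmitConcat(group, axis);
  }
  return partial;
}

int main() {
  // 9 inputs -> concat(a,b,c,d), then merge with e,f,g, then with h,i.
  DecomposeConcat({"a", "b", "c", "d", "e", "f", "g", "h", "i"}, /*axis=*/1);
}
```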
/azp run ONNX Runtime Web CI Pipeline
Azure Pipelines successfully started running 1 pipeline(s).
/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline
/azp run Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline
Azure Pipelines successfully started running 9 pipeline(s).
Azure Pipelines successfully started running 7 pipeline(s).
/azp run ONNX Runtime Web CI Pipeline
Azure Pipelines successfully started running 1 pipeline(s).
This is temporary, right? I'm surprised that XNNPack doesn't have a higher limit, like 16/256/.../65536. This approach reminds me of growing a std::vector with linear reallocation: because you also copy all the existing elements each time, calling push_back in a loop actually results in higher-than-linear time complexity (which is why most implementations use a 1.5x or 2x growth factor to avoid this). So models that have 128 concatenated inputs will experience n^2 time o_o.
We definitely don't expect WebNN callers to duplicate this code when calling the CPU backend, so either XNNPack should handle > 4 inputs directly, or the Chromium WebNN implementation should do it (because anything the ORT layer can handle, surely the WebNN front-end can handle directly).
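To make the cost argument concrete, here is a rough, hypothetical cost model comparing linear chaining with grouped (tree) decomposition, assuming every emitted concat copies all elements of its inputs. It illustrates the asymptotics only; it is not measured XNNPack or WebNN behavior.

```cpp
#include <algorithm>
#include <cstddef>
#include <initializer_list>
#include <iostream>

// Assumed cost model: each emitted concat copies all elements of its inputs,
// and every original input holds one element (unit cost).
int main() {
  for (std::size_t n : {8, 32, 128}) {
    // Chained: first concat takes 4 inputs, then each later concat takes the
    // running partial result plus up to 3 new inputs, re-copying everything
    // merged so far -> total copies grow roughly quadratically in n.
    std::size_t chained = 4, merged = 4;
    while (merged < n) {
      std::size_t take = std::min<std::size_t>(3, n - merged);
      merged += take;
      chained += merged;  // copies the partial result plus the new inputs
    }
    // Grouped (tree): concat inputs in groups of up to 4 per level; each
    // level copies all n elements once, and there are about log4(n) levels.
    std::size_t grouped = 0, nodes = n;
    while (nodes > 1) {
      grouped += n;
      nodes = (nodes + 3) / 4;
    }
    std::cout << "n=" << n << "  chained copies=" << chained
              << "  grouped copies=" << grouped << "\n";
  }
}
```

Under these assumptions the chained total grows roughly quadratically with the input count, while the grouped total grows as n·log n, which is the std::vector-growth analogy above.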
cc @huningxin, hope you can address @fdwr's comment.
@Honry, feel free to open a Chromium issue for the WebNN XNNPACK backend. We'll seek feedback from XNNPACK developers and Chromium developers to decide where to implement this feature. Thanks!
Sure. Will do that.
Issue created at https://bugs.chromium.org/p/chromium/issues/detail?id=1519119. |