[js/webgpu] reuse buffer for GpuDataManager #16746
Conversation
This PR is for performance: mobilenetv2 drops from 12.9 ms to 9.58 ms.
/azp run ONNX Runtime Web CI Pipeline
Azure Pipelines successfully started running 1 pipeline(s).
CI pipeline is nagging: run
Done. Please take another look. Thanks.
/azp run ONNX Runtime Web CI Pipeline
No commit pushedDate could be found for PR 16746 in repo microsoft/onnxruntime
/azp run ONNX Runtime Web CI Pipeline
No commit pushedDate could be found for PR 16746 in repo microsoft/onnxruntime
/azp run Windows CPU CI Pipeline, Windows GPU CI Pipeline, Windows GPU TensorRT CI Pipeline, Windows ARM64 QNN CI Pipeline, orttraining-linux-ci-pipeline, orttraining-linux-gpu-ci-pipeline, orttraining-ortmodule-distributed, ONNX Runtime React Native CI Pipeline
No commit pushedDate could be found for PR 16746 in repo microsoft/onnxruntime
/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, Linux OpenVINO CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, Linux QNN CI Pipeline
No commit pushedDate could be found for PR 16746 in repo microsoft/onnxruntime
/azp run ONNX Runtime Web CI Pipeline
No commit pushedDate could be found for PR 16746 in repo microsoft/onnxruntime
/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline
Azure Pipelines successfully started running 9 pipeline(s).
/azp run Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed
No commit pushedDate could be found for PR 16746 in repo microsoft/onnxruntime
/azp run Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed
Azure Pipelines successfully started running 6 pipeline(s).
/azp run ONNX Runtime Web CI Pipeline
Azure Pipelines successfully started running 1 pipeline(s).
Description
Allocating a new GPUBuffer on every session.run is inefficient. Allocation should happen only in the first run; subsequent runs should reuse those buffers.
Motivation and Context
This change improves performance: mobilenetv2 drops from 12.9 ms to 9.58 ms per run.
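The reuse strategy described above can be sketched as a size-keyed free list: released GPU buffers are pooled by byte size, and a later request for the same size takes a pooled buffer instead of allocating. The `BufferCache` name and shape below are illustrative assumptions, not the actual GpuDataManager API in onnxruntime-web:

```typescript
// Minimal sketch of a size-keyed buffer free list, assuming buffers of equal
// byte size are interchangeable. Illustrative only; the real GpuDataManager
// differs in detail (usage flags, upload/download staging, disposal).
class BufferCache<B extends { size: number }> {
  // Map from byte size to released buffers of that size.
  private freeList = new Map<number, B[]>();

  // `create` stands in for device.createBuffer in a real WebGPU backend.
  constructor(private create: (size: number) => B) {}

  // Reuse a released buffer of the requested size if one exists,
  // otherwise fall back to a fresh allocation (the "first run" path).
  acquire(size: number): B {
    const pooled = this.freeList.get(size);
    if (pooled && pooled.length > 0) {
      return pooled.pop()!;
    }
    return this.create(size);
  }

  // Return a buffer to the pool instead of destroying it, so the
  // next run with the same tensor sizes avoids reallocating.
  release(buffer: B): void {
    let pooled = this.freeList.get(buffer.size);
    if (!pooled) {
      pooled = [];
      this.freeList.set(buffer.size, pooled);
    }
    pooled.push(buffer);
  }
}
```

With this shape, the first session.run pays for every allocation, and later runs with the same tensor sizes hit the pool, which is consistent with the speedup the PR reports.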