Introduce reused_buffer_index_per_stream in allocation planner which will be reset after computing the reuse buffer for each stream #19515
Conversation
Thanks for this! Do you know if it works when reverting this change? https://github.com/microsoft/onnxruntime/pull/19481/files If it does, we don't need to disable the streams on the DML EP anymore and can revert it.
Yes, it works. I've tested it in v1.17 without your PR. I will revert your PR once this PR is merged.
/azp run Python format
No pipelines are associated with this pull request.
Introduce reused_buffer_index_per_stream in allocation planner which will be reset after computing the reuse buffer for each stream (#19515). Co-authored-by: Lei Cao <[email protected]>
Description
Introduce reused_buffer_index_per_stream in the allocation planner, which is reset after computing the reuse buffer for each stream. This way, if a NodeArg is an input to several ops across different streams and reuses another NodeArg's buffer, the reused NodeArg is no longer considered when computing the second stream's reuse plan.
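A minimal sketch of the idea, not the actual allocation-planner code: only the container name reused_buffer_index_per_stream comes from this PR, while Stream, OrtValueIndex, and the reuse decision here are simplified placeholders. It shows the reuse map being scoped to a single stream and cleared before the next stream's plan is computed.

```cpp
#include <unordered_map>
#include <vector>

// Hypothetical, simplified stand-ins for the planner's real types.
using OrtValueIndex = int;
struct Stream {
  std::vector<OrtValueIndex> value_indices;  // values produced/consumed on this stream
};

// Sketch: compute reuse per stream, resetting the map between streams so that a
// buffer reused in stream 0 is not treated as a reuse candidate again in stream 1.
void ComputeReusePlan(const std::vector<Stream>& streams) {
  // Maps a value index to the buffer index it reuses, scoped to one stream.
  std::unordered_map<OrtValueIndex, OrtValueIndex> reused_buffer_index_per_stream;

  for (const auto& stream : streams) {
    for (OrtValueIndex value : stream.value_indices) {
      // Placeholder reuse decision; the real planner inspects lifetimes, shapes,
      // and allocation kinds before recording a reuse.
      OrtValueIndex reusable = value - 1;
      if (reusable >= 0) {
        reused_buffer_index_per_stream[value] = reusable;
      }
    }
    // Key point of the fix: clear the map after each stream, so reuse decisions
    // made for this stream do not leak into the next stream's plan.
    reused_buffer_index_per_stream.clear();
  }
}
```

With this scoping, a NodeArg that was already reused on one stream is not offered again as a reuse target when the planner walks the next stream, which is the cross-stream interaction behind the crash in #19480.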
Motivation and Context
This fixes #19480, a crash in the scenario described above.