DirectML Out of Memory Retry #707

NullSenseStudio · 2023-09-18T16:02:00Z

Allows DML execution in submodules to retry running when it runs out of memory. Sometimes this can allow generation to finish, but often it'll occur again on a later step and not recover from it.

Found and fixed some DML issues caused from merging the last branch.

It can run SDXL, but may require offloading depending on what GPU is used. Using half VAE tiling is also effective and maintains good decoded image accuracy.

NullSenseStudio · 2023-09-19T01:10:09Z

Found it helps quite a bit to clear the traceback before retrying. Now it'll more often recover and will usually finish 25 steps of SDXL without requiring offloading. The decoding stage still doesn't complete on my GPU without offloading or tiling though.

That should be it for this PR.

out of memory retry

ab4f295

NullSenseStudio requested a review from carson-katri September 18, 2023 16:04

clear traceback

37c7aa8

carson-katri approved these changes Sep 22, 2023

View reviewed changes

carson-katri merged commit fb1d1d0 into main Oct 9, 2023

carson-katri deleted the directml-oom branch October 9, 2023 21:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DirectML Out of Memory Retry #707

DirectML Out of Memory Retry #707

NullSenseStudio commented Sep 18, 2023

NullSenseStudio commented Sep 19, 2023

DirectML Out of Memory Retry #707

DirectML Out of Memory Retry #707

Conversation

NullSenseStudio commented Sep 18, 2023

NullSenseStudio commented Sep 19, 2023