Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DirectML Out of Memory Retry #707

Merged
merged 2 commits into from
Oct 9, 2023
Merged

DirectML Out of Memory Retry #707

merged 2 commits into from
Oct 9, 2023

Conversation

NullSenseStudio
Copy link
Collaborator

Allows DML execution in submodules to retry running when it runs out of memory. Sometimes this can allow generation to finish, but often it'll occur again on a later step and not recover from it.

Found and fixed some DML issues caused from merging the last branch.

It can run SDXL, but may require offloading depending on what GPU is used. Using half VAE tiling is also effective and maintains good decoded image accuracy.

@NullSenseStudio
Copy link
Collaborator Author

Found it helps quite a bit to clear the traceback before retrying. Now it'll more often recover and will usually finish 25 steps of SDXL without requiring offloading. The decoding stage still doesn't complete on my GPU without offloading or tiling though.

That should be it for this PR.

@carson-katri carson-katri merged commit fb1d1d0 into main Oct 9, 2023
@carson-katri carson-katri deleted the directml-oom branch October 9, 2023 21:08
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants