
Enabling gradient checkpointing in eval() mode #9878

Merged: 7 commits into huggingface:main, Nov 8, 2024

Conversation

MikeTkachuk (Contributor) opened this pull request:

Removed the unnecessary `if self.training ...` check when using gradient checkpointing. (#9850)

@MikeTkachuk (author):

Since all of the module implementations used the same `if self.training and self.gradient_checkpointing:` clause, I wonder if there is a reference template for `def forward` that others are using? We need to make sure future implementations use the fixed clause.

```diff
@@ -452,7 +452,7 @@ def forward(

     # 3. Transformer blocks
     for i, block in enumerate(self.transformer_blocks):
-        if self.training and self.gradient_checkpointing:
+        if self.gradient_checkpointing:
```
Collaborator:

Oh thanks! But why did we also remove the `torch.is_grad_enabled()` check? Gradient checkpointing isn't meaningful without gradients being computed, no?

@MikeTkachuk (author):

Added it back, thanks for pointing it out.
It does not break anything, but I found that it throws an annoying warning when `use_reentrant=True`.

Collaborator:

> but found that it throws an annoying warning when use_reentrant=True

What do you mean by that?

@MikeTkachuk (author):

`use_reentrant` is an argument passed to `torch.utils.checkpoint.checkpoint`.

If it is `True`, one of the internal checks will print this to stderr:

```python
warnings.warn(
    "None of the inputs have requires_grad=True. Gradients will be None"
)
```

But diffusers uses `use_reentrant=False` anyway.

Collaborator:

Oh got it, thanks. So the warning is specific to using gradient checkpointing when gradients are not enabled.

@yiyixuxu (Collaborator) commented Nov 7, 2024:

Hi @MikeTkachuk, unfortunately we have to update the branch and resolve conflicts now...
Would you be able to do that? I'm fine with the change otherwise and can merge once it is synced with main.

Merge commit: …le_grckpt_in_eval

# Conflicts:
#	src/diffusers/models/controlnet_flux.py
#	src/diffusers/models/controlnet_sd3.py
@MikeTkachuk (author):

Done; I also fixed it in a few other places I had missed.


@yiyixuxu (Collaborator) commented Nov 8, 2024:

Can you rebase again?
I saw a bunch of changes that were not made in this PR, and the commit history includes commits that have already been merged into main (from another PR).

@MikeTkachuk (author):

Fixed.

@yiyixuxu (Collaborator) left a review:

thanks!

yiyixuxu merged commit 5b972fb into huggingface:main on Nov 8, 2024. 15 checks passed.
sayakpaul pushed a commit that referenced this pull request on Dec 23, 2024.