torch.compile with stable diffusion #7828
Madhumitha-MCW asked this question in Q&A (unanswered):

I would like to know why every inference example with Stable Diffusion passes only the UNet part of the model into torch.compile(): https://pytorch.org/TensorRT/tutorials/_rendered_examples/dynamo/torch_compile_stable_diffusion.html

Using torch.compile on model.vae takes noticeably longer than compiling just the UNet. So, is it recommended to compile only the UNet, and what is the exact reason?

And if I compile only the UNet and want to execute in graph mode, will all the other uncompiled parts fall back to eager execution?
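For reference, a minimal sketch of the pattern those examples use, assuming the diffusers StableDiffusionPipeline API; the checkpoint name is only illustrative:

```python
import torch
from diffusers import StableDiffusionPipeline

# "runwayml/stable-diffusion-v1-5" is an illustrative checkpoint; any Stable
# Diffusion checkpoint follows the same pattern.
pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

# Compile only the UNet: it runs once per denoising step, so it dominates
# inference time. The VAE and text encoder stay uncompiled and run eagerly.
pipe.unet = torch.compile(pipe.unet, mode="reduce-overhead", fullgraph=True)

# The first call triggers compilation and is slow; subsequent calls reuse
# the compiled graph.
image = pipe("a photo of an astronaut riding a horse").images[0]
```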
- Yes, compiling the VAE takes longer than compiling only the UNet. Thanks for your reply!
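If the VAE is worth compiling in your workload anyway, a common pattern (a sketch reusing the hypothetical `pipe` object from above) compiles its decode method as well:

```python
# Compiling the VAE decoder adds more compilation time up front, but can
# speed up the final latents-to-image decode step.
pipe.vae.decode = torch.compile(pipe.vae.decode, mode="reduce-overhead", fullgraph=True)
```

The trade-off is that the UNet executes at every denoising step while the VAE decodes only once per image, so compiling the UNet gives most of the speedup for less compilation cost.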
- Yes, I was just referring to the first inference time (the compilation step). The final inference is not slow.
- Yes, it would. I first thought the entire inference time was longer, but when I verified it, only the initial call took longer.
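A quick way to verify this yourself (a sketch reusing the hypothetical `pipe` from above): time the first and second calls separately, since only the first call pays the torch.compile overhead.

```python
import time
import torch

def timed(fn):
    # Synchronize so queued GPU work is included in the measurement.
    torch.cuda.synchronize()
    start = time.perf_counter()
    out = fn()
    torch.cuda.synchronize()
    return out, time.perf_counter() - start

prompt = "a castle at sunset"
_, first = timed(lambda: pipe(prompt))   # includes compilation time
_, second = timed(lambda: pipe(prompt))  # steady-state latency
print(f"first call: {first:.1f}s, second call: {second:.1f}s")
```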