Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about Training Loss #9

Open
ddz16 opened this issue Sep 29, 2024 · 1 comment
Open

Question about Training Loss #9

ddz16 opened this issue Sep 29, 2024 · 1 comment

Comments

@ddz16
Copy link

ddz16 commented Sep 29, 2024

Great job! I’d like to ask a question about training loss. In the process of training diffusion model, there are two losses: $L_{LD}$ and $L_{AIR}$ (corresponding to Equation 8 and Equation 10 in the paper). $L_{LD}$ only has gradients for the U-Net of diffusion model, while $L_{AIR}$ has gradients for both U-Net and SCM. I would like to know whether, during the training process of diffusion, only $L_{AIR}$ is used for training, or if the weighted sum of both losses is used for training, or if there are other training methods being employed?

@ddz16
Copy link
Author

ddz16 commented Sep 29, 2024

Second, if the loss $L_{LD}$ is used, then what dose $z_t$ represent? Is it obtained by adding sampled noise to the VQVAE feature of the clean image $I_{gt}$?
image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant