You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello, I am interested in the code you posted, thank you for sharing. What puzzles me is that there is not much discussion of scale factor in the paper.
In SD Style, rho appears to be a learning rate associated with both "grad" and "classification guided effects",as shown below
However, in Face ID, rho is equal to at.sqrt(), as follows:
So, how exactly do we set up RHO, and is there some mathematical theory to support it? Thank you?
The text was updated successfully, but these errors were encountered:
@FromA2Z Thank you for your attention to our work! Regarding rho or the learning rate, it is an important parameter used to control the step size of guidance. Poorly chosen strategies for its setting can result in undesirable generation outcomes. The two strategies you've identified are two of the relatively better ones we've found:
Dynamically adjusting rho based on the length of guidance or score.
In experiments related to human faces, using at.sqrt() is a promising setting. However, it requires stopping the guidance after a certain number of steps, which is what the stop parameter accomplishes.
A detailed description of parameter settings is provided in the camera-ready version. We suggest you experiment with different parameter settings using the code we've provided. This should lead to numerous new discoveries. Thank you~
Hello, I am interested in the code you posted, thank you for sharing. What puzzles me is that there is not much discussion of scale factor in the paper.
![1692454027621](https://private-user-images.githubusercontent.com/108920982/261802475-3d6395eb-deb3-4f9c-b886-1e4cf8a69ade.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk0NzkyODEsIm5iZiI6MTczOTQ3ODk4MSwicGF0aCI6Ii8xMDg5MjA5ODIvMjYxODAyNDc1LTNkNjM5NWViLWRlYjMtNGY5Yy1iODg2LTFlNGNmOGE2OWFkZS5qcGc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjEzJTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxM1QyMDM2MjFaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT00ZWUxMTE0ZjE5OTFiYzcwMjJlZjExMWQxYjk2MThjMzA3N2EwNWE5MDI5OTAyYzRjMWYxN2IwZThjMjY5YjcyJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.aPsJlD216l7Ucr9kl1he6nczHfMyV1IbI6Kth_LWtZc)
![b130ea19a84525988e6f3db1801c4a5](https://private-user-images.githubusercontent.com/108920982/261802548-6cb05a6b-e268-46f2-9969-9e4daa52a896.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk0NzkyODEsIm5iZiI6MTczOTQ3ODk4MSwicGF0aCI6Ii8xMDg5MjA5ODIvMjYxODAyNTQ4LTZjYjA1YTZiLWUyNjgtNDZmMi05OTY5LTllNGRhYTUyYTg5Ni5wbmc_WC1BbXotQWxnb3JpdGhtPUFXUzQtSE1BQy1TSEEyNTYmWC1BbXotQ3JlZGVudGlhbD1BS0lBVkNPRFlMU0E1M1BRSzRaQSUyRjIwMjUwMjEzJTJGdXMtZWFzdC0xJTJGczMlMkZhd3M0X3JlcXVlc3QmWC1BbXotRGF0ZT0yMDI1MDIxM1QyMDM2MjFaJlgtQW16LUV4cGlyZXM9MzAwJlgtQW16LVNpZ25hdHVyZT0wMTlmYTAyNThmNjAwNTk2MGUxOWNkYWIzNWIxM2MyZDMzNTgzNGM3ZmM1ZjY1Zjk5ZjAzY2I1ZmI4ZTUyNTUwJlgtQW16LVNpZ25lZEhlYWRlcnM9aG9zdCJ9.WGRFM0oOJrZU10floyNANRPNimJDiWPDZmTTkAHQaPY)
In SD Style, rho appears to be a learning rate associated with both "grad" and "classification guided effects",as shown below
However, in Face ID, rho is equal to at.sqrt(), as follows:
So, how exactly do we set up RHO, and is there some mathematical theory to support it? Thank you?
The text was updated successfully, but these errors were encountered: