-
Notifications
You must be signed in to change notification settings - Fork 89
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Draft+ for SDXL [draft] #222
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @rohitrango for the contribution!
I've left some comments. Let me know if there are any questions
nemo_aligner/models/mm/stable_diffusion/megatron_sdxl_draftp_model.py
Outdated
Show resolved
Hide resolved
nemo_aligner/models/mm/stable_diffusion/megatron_sdxl_draftp_model.py
Outdated
Show resolved
Hide resolved
nemo_aligner/models/mm/stable_diffusion/megatron_sdxl_draftp_model.py
Outdated
Show resolved
Hide resolved
nemo_aligner/models/mm/stable_diffusion/megatron_sdxl_draftp_model.py
Outdated
Show resolved
Hide resolved
Requires NVIDIA/NeMo#9543 edit: Also requires NVIDIA/NeMo#9654 to have feature in RC branch |
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
sampling instead Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
for more information, see https://pre-commit.ci Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
for more information, see https://pre-commit.ci Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
for more information, see https://pre-commit.ci Signed-off-by: Rohit Jena <[email protected]>
Signed-off-by: Rohit Jena <[email protected]>
for more information, see https://pre-commit.ci Signed-off-by: Rohit Jena <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @rohitrango for adding this new feature!
* leftover commit Signed-off-by: Rohit Jena <[email protected]> * leftover commit Signed-off-by: Rohit Jena <[email protected]> * commits Signed-off-by: Rohit Jena <[email protected]> * update gitignore Signed-off-by: Rohit Jena <[email protected]> * init model for SDXL Signed-off-by: Rohit Jena <[email protected]> * correct path in eos script Signed-off-by: Rohit Jena <[email protected]> * modified generate and log_generate scripts to use diffusion engine sampling instead Signed-off-by: Rohit Jena <[email protected]> * fixed most runtime bugs -- check for logical bugs Signed-off-by: Rohit Jena <[email protected]> * examining mapping between hf and nemo Signed-off-by: Rohit Jena <[email protected]> * writing converter script for unet Signed-off-by: Rohit Jena <[email protected]> * tmp commit (moving to eos) Signed-off-by: Rohit Jena <[email protected]> * more changes to draftp xl Signed-off-by: Rohit Jena <[email protected]> * changed batch scripts Signed-off-by: Rohit Jena <[email protected]> * changed launch scripts Signed-off-by: Rohit Jena <[email protected]> * check adapter control Signed-off-by: Rohit Jena <[email protected]> * clean up lora hotswap debugging Signed-off-by: Rohit Jena <[email protected]> * adding fsdp to draftp training Signed-off-by: Rohit Jena <[email protected]> * FSDP now works for SDXL?! Signed-off-by: Rohit Jena <[email protected]> * added custom rule to enable sharding of decoder Signed-off-by: Rohit Jena <[email protected]> * also shard the clip embeddings Signed-off-by: Rohit Jena <[email protected]> * multinode script created + testing 2048 config Signed-off-by: Rohit Jena <[email protected]> * add activation checkpointing Signed-off-by: Rohit Jena <[email protected]> * added activation checkpointing Signed-off-by: Rohit Jena <[email protected]> * added SFT and PEFT support with Draft+ Signed-off-by: Rohit Jena <[email protected]> * corrected init denoise bug Signed-off-by: Rohit Jena <[email protected]> * added multinode setup script - test it Signed-off-by: Rohit Jena <[email protected]> * enable sharding for sdlora Signed-off-by: Rohit Jena <[email protected]> * added multinode script for OCI Signed-off-by: Rohit Jena <[email protected]> * some more refactoring Signed-off-by: Rohit Jena <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Rohit Jena <[email protected]> * address Terry's comments Signed-off-by: Rohit Jena <[email protected]> * remove + 0 to clone Signed-off-by: Rohit Jena <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Rohit Jena <[email protected]> * add version guard Signed-off-by: Rohit Jena <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Rohit Jena <[email protected]> * added todo to merge these APIs later Signed-off-by: Rohit Jena <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Rohit Jena <[email protected]> --------- Signed-off-by: Rohit Jena <[email protected]> Co-authored-by: Rohit Jena <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: Alexander Bukharin <[email protected]>
* leftover commit Signed-off-by: Rohit Jena <[email protected]> * leftover commit Signed-off-by: Rohit Jena <[email protected]> * commits Signed-off-by: Rohit Jena <[email protected]> * update gitignore Signed-off-by: Rohit Jena <[email protected]> * init model for SDXL Signed-off-by: Rohit Jena <[email protected]> * correct path in eos script Signed-off-by: Rohit Jena <[email protected]> * modified generate and log_generate scripts to use diffusion engine sampling instead Signed-off-by: Rohit Jena <[email protected]> * fixed most runtime bugs -- check for logical bugs Signed-off-by: Rohit Jena <[email protected]> * examining mapping between hf and nemo Signed-off-by: Rohit Jena <[email protected]> * writing converter script for unet Signed-off-by: Rohit Jena <[email protected]> * tmp commit (moving to eos) Signed-off-by: Rohit Jena <[email protected]> * more changes to draftp xl Signed-off-by: Rohit Jena <[email protected]> * changed batch scripts Signed-off-by: Rohit Jena <[email protected]> * changed launch scripts Signed-off-by: Rohit Jena <[email protected]> * check adapter control Signed-off-by: Rohit Jena <[email protected]> * clean up lora hotswap debugging Signed-off-by: Rohit Jena <[email protected]> * adding fsdp to draftp training Signed-off-by: Rohit Jena <[email protected]> * FSDP now works for SDXL?! Signed-off-by: Rohit Jena <[email protected]> * added custom rule to enable sharding of decoder Signed-off-by: Rohit Jena <[email protected]> * also shard the clip embeddings Signed-off-by: Rohit Jena <[email protected]> * multinode script created + testing 2048 config Signed-off-by: Rohit Jena <[email protected]> * add activation checkpointing Signed-off-by: Rohit Jena <[email protected]> * added activation checkpointing Signed-off-by: Rohit Jena <[email protected]> * added SFT and PEFT support with Draft+ Signed-off-by: Rohit Jena <[email protected]> * corrected init denoise bug Signed-off-by: Rohit Jena <[email protected]> * added multinode setup script - test it Signed-off-by: Rohit Jena <[email protected]> * enable sharding for sdlora Signed-off-by: Rohit Jena <[email protected]> * added multinode script for OCI Signed-off-by: Rohit Jena <[email protected]> * some more refactoring Signed-off-by: Rohit Jena <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Rohit Jena <[email protected]> * address Terry's comments Signed-off-by: Rohit Jena <[email protected]> * remove + 0 to clone Signed-off-by: Rohit Jena <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Rohit Jena <[email protected]> * add version guard Signed-off-by: Rohit Jena <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Rohit Jena <[email protected]> * added todo to merge these APIs later Signed-off-by: Rohit Jena <[email protected]> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Signed-off-by: Rohit Jena <[email protected]> --------- Signed-off-by: Rohit Jena <[email protected]> Co-authored-by: Rohit Jena <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
What does this PR do ?
Add a one line overview of what this PR aims to accomplish.
Changelog
Usage
# Add a code snippet demonstrating how to use this
Before your PR is "Ready for review"
Pre checks:
Checklist when contributing a new algorithm
max_steps=-1
andvalidation
?Additional Information