-
Notifications
You must be signed in to change notification settings - Fork 2.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
enables default data step in megatron parallel to operate on a wider variety of tensors - second try #9671
Conversation
6a101eb
to
e7652cd
Compare
e7652cd
to
42c8139
Compare
…variety of tensors coming out of the dataloader Signed-off-by: Jonathan Mitchell <[email protected]>
Signed-off-by: Jonathan Mitchell <[email protected]>
Signed-off-by: jomitchellnv <[email protected]> Signed-off-by: Jonathan Mitchell <[email protected]>
than just dictionaries Signed-off-by: Jonathan Mitchell <[email protected]>
42c8139
to
25a7ccd
Compare
Signed-off-by: jomitchellnv <[email protected]>
@cuichenx , it sounded like @ashors1 wanted you to review and approve this PR? Let me know if someone else is better. While @marcromeyn is out, who should we have review and help us get things we need on the BioNeMo2 side into NeMo2 in general? |
I think it generally depends on the PR, but I can approve this one as well given that Marc approved the original attempt previously. We just need the CI to pass; then we should be able to merge |
…variety of tensors - second try (NVIDIA#9671) * enables default data step in megatron parallel to operate on a wider variety of tensors coming out of the dataloader Signed-off-by: Jonathan Mitchell <[email protected]> * handles the case where a batch is empty Signed-off-by: Jonathan Mitchell <[email protected]> * Apply isort and black reformatting Signed-off-by: jomitchellnv <[email protected]> Signed-off-by: Jonathan Mitchell <[email protected]> * Allows the default data step to operate on more types than just dictionaries Signed-off-by: Jonathan Mitchell <[email protected]> * Apply isort and black reformatting Signed-off-by: jomitchellnv <[email protected]> --------- Signed-off-by: Jonathan Mitchell <[email protected]> Signed-off-by: jomitchellnv <[email protected]> Co-authored-by: jomitchellnv <[email protected]> Co-authored-by: John St. John <[email protected]>
…variety of tensors - second try (#9671) * enables default data step in megatron parallel to operate on a wider variety of tensors coming out of the dataloader Signed-off-by: Jonathan Mitchell <[email protected]> * handles the case where a batch is empty Signed-off-by: Jonathan Mitchell <[email protected]> * Apply isort and black reformatting Signed-off-by: jomitchellnv <[email protected]> Signed-off-by: Jonathan Mitchell <[email protected]> * Allows the default data step to operate on more types than just dictionaries Signed-off-by: Jonathan Mitchell <[email protected]> * Apply isort and black reformatting Signed-off-by: jomitchellnv <[email protected]> --------- Signed-off-by: Jonathan Mitchell <[email protected]> Signed-off-by: jomitchellnv <[email protected]> Co-authored-by: jomitchellnv <[email protected]> Co-authored-by: John St. John <[email protected]> Signed-off-by: Tugrul Konuk <[email protected]>
…variety of tensors - second try (NVIDIA#9671) * enables default data step in megatron parallel to operate on a wider variety of tensors coming out of the dataloader Signed-off-by: Jonathan Mitchell <[email protected]> * handles the case where a batch is empty Signed-off-by: Jonathan Mitchell <[email protected]> * Apply isort and black reformatting Signed-off-by: jomitchellnv <[email protected]> Signed-off-by: Jonathan Mitchell <[email protected]> * Allows the default data step to operate on more types than just dictionaries Signed-off-by: Jonathan Mitchell <[email protected]> * Apply isort and black reformatting Signed-off-by: jomitchellnv <[email protected]> --------- Signed-off-by: Jonathan Mitchell <[email protected]> Signed-off-by: jomitchellnv <[email protected]> Co-authored-by: jomitchellnv <[email protected]> Co-authored-by: John St. John <[email protected]> Signed-off-by: Malay Nagda <[email protected]>
…variety of tensors - second try (NVIDIA#9671) * enables default data step in megatron parallel to operate on a wider variety of tensors coming out of the dataloader Signed-off-by: Jonathan Mitchell <[email protected]> * handles the case where a batch is empty Signed-off-by: Jonathan Mitchell <[email protected]> * Apply isort and black reformatting Signed-off-by: jomitchellnv <[email protected]> Signed-off-by: Jonathan Mitchell <[email protected]> * Allows the default data step to operate on more types than just dictionaries Signed-off-by: Jonathan Mitchell <[email protected]> * Apply isort and black reformatting Signed-off-by: jomitchellnv <[email protected]> --------- Signed-off-by: Jonathan Mitchell <[email protected]> Signed-off-by: jomitchellnv <[email protected]> Co-authored-by: jomitchellnv <[email protected]> Co-authored-by: John St. John <[email protected]> Signed-off-by: tonyjie <[email protected]>
…variety of tensors - second try (#9671) * enables default data step in megatron parallel to operate on a wider variety of tensors coming out of the dataloader Signed-off-by: Jonathan Mitchell <[email protected]> * handles the case where a batch is empty Signed-off-by: Jonathan Mitchell <[email protected]> * Apply isort and black reformatting Signed-off-by: jomitchellnv <[email protected]> Signed-off-by: Jonathan Mitchell <[email protected]> * Allows the default data step to operate on more types than just dictionaries Signed-off-by: Jonathan Mitchell <[email protected]> * Apply isort and black reformatting Signed-off-by: jomitchellnv <[email protected]> --------- Signed-off-by: Jonathan Mitchell <[email protected]> Signed-off-by: jomitchellnv <[email protected]> Co-authored-by: jomitchellnv <[email protected]> Co-authored-by: John St. John <[email protected]>
…variety of tensors - second try (NVIDIA#9671) * enables default data step in megatron parallel to operate on a wider variety of tensors coming out of the dataloader Signed-off-by: Jonathan Mitchell <[email protected]> * handles the case where a batch is empty Signed-off-by: Jonathan Mitchell <[email protected]> * Apply isort and black reformatting Signed-off-by: jomitchellnv <[email protected]> Signed-off-by: Jonathan Mitchell <[email protected]> * Allows the default data step to operate on more types than just dictionaries Signed-off-by: Jonathan Mitchell <[email protected]> * Apply isort and black reformatting Signed-off-by: jomitchellnv <[email protected]> --------- Signed-off-by: Jonathan Mitchell <[email protected]> Signed-off-by: jomitchellnv <[email protected]> Co-authored-by: jomitchellnv <[email protected]> Co-authored-by: John St. John <[email protected]> Signed-off-by: Hainan Xu <[email protected]>
…variety of tensors - second try (NVIDIA#9671) * enables default data step in megatron parallel to operate on a wider variety of tensors coming out of the dataloader Signed-off-by: Jonathan Mitchell <[email protected]> * handles the case where a batch is empty Signed-off-by: Jonathan Mitchell <[email protected]> * Apply isort and black reformatting Signed-off-by: jomitchellnv <[email protected]> Signed-off-by: Jonathan Mitchell <[email protected]> * Allows the default data step to operate on more types than just dictionaries Signed-off-by: Jonathan Mitchell <[email protected]> * Apply isort and black reformatting Signed-off-by: jomitchellnv <[email protected]> --------- Signed-off-by: Jonathan Mitchell <[email protected]> Signed-off-by: jomitchellnv <[email protected]> Co-authored-by: jomitchellnv <[email protected]> Co-authored-by: John St. John <[email protected]>
What does this PR do ?
Add more flexibility to the default data step inside megatron parallel.
Changelog
Usage
# Add a code snippet demonstrating how to use this
GitHub Actions CI
The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.
The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".
Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.
Additional Information