-
Notifications
You must be signed in to change notification settings - Fork 2.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Akoumparouli/mixtral fixes for r2.0.0rc1 #9911
Conversation
12ce03c
to
209a241
Compare
273f6b5
to
f4de842
Compare
…rect dtype Signed-off-by: Alexandros Koumparoulis <[email protected]>
… transforming ckpt Signed-off-by: Alexandros Koumparoulis <[email protected]>
Signed-off-by: Alexandros Koumparoulis <[email protected]>
Signed-off-by: akoumpa <[email protected]>
d221daa
to
48ccf0a
Compare
@@ -124,3 +124,6 @@ def _preprocess_and_split_data( | |||
shutil.rmtree(p) | |||
elif '.jsonl' not in str(p.name): | |||
p.unlink() | |||
|
|||
def reconfigure_limit_batches(self): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we remove this now that a fix has been merged?
Besides the minor comment above, LGTM |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Approving since we agreed to address the comment in a subsequent PR
* nemo-ux-mixtral: use cpu init & skip init when importing; specify correct dtype Signed-off-by: Alexandros Koumparoulis <[email protected]> * nemo-ux-state: handle None in state_dict.keys; disable auto-grad when transforming ckpt Signed-off-by: Alexandros Koumparoulis <[email protected]> * add dummy SquadDataModule.reconfigure_limit_batches Signed-off-by: Alexandros Koumparoulis <[email protected]> * Apply isort and black reformatting Signed-off-by: akoumpa <[email protected]> --------- Signed-off-by: Alexandros Koumparoulis <[email protected]> Signed-off-by: akoumpa <[email protected]> Co-authored-by: akoumpa <[email protected]>
* nemo-ux-mixtral: use cpu init & skip init when importing; specify correct dtype Signed-off-by: Alexandros Koumparoulis <[email protected]> * nemo-ux-state: handle None in state_dict.keys; disable auto-grad when transforming ckpt Signed-off-by: Alexandros Koumparoulis <[email protected]> * add dummy SquadDataModule.reconfigure_limit_batches Signed-off-by: Alexandros Koumparoulis <[email protected]> * Apply isort and black reformatting Signed-off-by: akoumpa <[email protected]> --------- Signed-off-by: Alexandros Koumparoulis <[email protected]> Signed-off-by: akoumpa <[email protected]> Co-authored-by: akoumpa <[email protected]> Signed-off-by: Alexandros Koumparoulis <[email protected]>
* nemo-ux-mixtral: use cpu init & skip init when importing; specify correct dtype * nemo-ux-state: handle None in state_dict.keys; disable auto-grad when transforming ckpt * add dummy SquadDataModule.reconfigure_limit_batches * Apply isort and black reformatting --------- Signed-off-by: Alexandros Koumparoulis <[email protected]> Signed-off-by: akoumpa <[email protected]> Co-authored-by: Alexandros Koumparoulis <[email protected]> Co-authored-by: akoumpa <[email protected]>
* nemo-ux-mixtral: use cpu init & skip init when importing; specify correct dtype * nemo-ux-state: handle None in state_dict.keys; disable auto-grad when transforming ckpt * add dummy SquadDataModule.reconfigure_limit_batches * Apply isort and black reformatting --------- Signed-off-by: Alexandros Koumparoulis <[email protected]> Signed-off-by: akoumpa <[email protected]> Co-authored-by: Alexandros Koumparoulis <[email protected]> Co-authored-by: akoumpa <[email protected]> Signed-off-by: Boxiang Wang <[email protected]>
* nemo-ux-mixtral: use cpu init & skip init when importing; specify correct dtype * nemo-ux-state: handle None in state_dict.keys; disable auto-grad when transforming ckpt * add dummy SquadDataModule.reconfigure_limit_batches * Apply isort and black reformatting --------- Signed-off-by: Alexandros Koumparoulis <[email protected]> Signed-off-by: akoumpa <[email protected]> Co-authored-by: Alexandros Koumparoulis <[email protected]> Co-authored-by: akoumpa <[email protected]> Signed-off-by: Vivian Chen <[email protected]>
* nemo-ux-mixtral: use cpu init & skip init when importing; specify correct dtype * nemo-ux-state: handle None in state_dict.keys; disable auto-grad when transforming ckpt * add dummy SquadDataModule.reconfigure_limit_batches * Apply isort and black reformatting --------- Signed-off-by: Alexandros Koumparoulis <[email protected]> Signed-off-by: akoumpa <[email protected]> Co-authored-by: Alexandros Koumparoulis <[email protected]> Co-authored-by: akoumpa <[email protected]>
* nemo-ux-mixtral: use cpu init & skip init when importing; specify correct dtype * nemo-ux-state: handle None in state_dict.keys; disable auto-grad when transforming ckpt * add dummy SquadDataModule.reconfigure_limit_batches * Apply isort and black reformatting --------- Signed-off-by: Alexandros Koumparoulis <[email protected]> Signed-off-by: akoumpa <[email protected]> Co-authored-by: Alexandros Koumparoulis <[email protected]> Co-authored-by: akoumpa <[email protected]> Signed-off-by: Hainan Xu <[email protected]>
* nemo-ux-mixtral: use cpu init & skip init when importing; specify correct dtype * nemo-ux-state: handle None in state_dict.keys; disable auto-grad when transforming ckpt * add dummy SquadDataModule.reconfigure_limit_batches * Apply isort and black reformatting --------- Signed-off-by: Alexandros Koumparoulis <[email protected]> Signed-off-by: akoumpa <[email protected]> Co-authored-by: Alexandros Koumparoulis <[email protected]> Co-authored-by: akoumpa <[email protected]>
What does this PR do ?
Add a one line overview of what this PR aims to accomplish.
Collection: [Note which collection this PR will affect]
Changelog
Usage
# Add a code snippet demonstrating how to use this
GitHub Actions CI
The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.
The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".
Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.
Additional Information