-
Notifications
You must be signed in to change notification settings - Fork 2.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fixes bugs with loss averaging with validation_drop_last=False
for Megatron GPT
#5329
Conversation
for more information, see https://pre-commit.ci
This pull request introduces 1 alert when merging 08e67cf into b47a467 - view on LGTM.com new alerts:
|
This pull request introduces 1 alert when merging 103a3a3 into b47a467 - view on LGTM.com new alerts:
|
This pull request introduces 1 alert when merging 60a28c0 into 1826b73 - view on LGTM.com new alerts:
|
This pull request introduces 1 alert when merging 7a50d5e into d37db88 - view on LGTM.com new alerts:
|
This pull request introduces 1 alert when merging 15900cd into 83cb869 - view on LGTM.com new alerts:
|
This pull request introduces 1 alert when merging 27c4227 into 31e0440 - view on LGTM.com new alerts:
|
This pull request introduces 1 alert when merging 7a1b78b into 8df11ae - view on LGTM.com new alerts:
|
nemo/collections/nlp/data/language_modeling/megatron/megatron_batch_samplers.py
Outdated
Show resolved
Hide resolved
This pull request introduces 1 alert when merging 1910d93 into 373cbd6 - view on LGTM.com new alerts:
|
This pull request introduces 2 alerts when merging aafb93b into 373cbd6 - view on LGTM.com new alerts:
|
for more information, see https://pre-commit.ci
This pull request introduces 2 alerts when merging 66d4e35 into 265056e - view on LGTM.com new alerts:
|
This pull request introduces 2 alerts when merging 429ab5f into c29d779 - view on LGTM.com new alerts:
|
This pull request introduces 2 alerts when merging 4172a59 into c5c46ba - view on LGTM.com new alerts:
|
This pull request introduces 2 alerts when merging 9a8dd9f into f771f92 - view on LGTM.com new alerts:
|
validation_drop_last=False
for Megatron GPT
This pull request introduces 2 alerts when merging e85456e into 785057e - view on LGTM.com new alerts:
Heads-up: LGTM.com's PR analysis will be disabled on the 5th of December, and LGTM.com will be shut down ⏻ completely on the 16th of December 2022. Please enable GitHub code scanning, which uses the same CodeQL engine ⚙️ that powers LGTM.com. For more information, please check out our post on the GitHub blog. |
This pull request introduces 2 alerts when merging 62093ce into 3f2c1ae - view on LGTM.com new alerts:
Heads-up: LGTM.com's PR analysis will be disabled on the 5th of December, and LGTM.com will be shut down ⏻ completely on the 16th of December 2022. Please enable GitHub code scanning, which uses the same CodeQL engine ⚙️ that powers LGTM.com. For more information, please check out our post on the GitHub blog. |
…Megatron GPT (NVIDIA#5329) * fixing bugs * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * bug fix * bug fix * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Hainan Xu <[email protected]>
…Megatron GPT (NVIDIA#5329) * fixing bugs * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * bug fix * bug fix * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: Hainan Xu <[email protected]>
…Megatron GPT (NVIDIA#5329) * fixing bugs * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * bug fix * bug fix * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]>
…Megatron GPT (NVIDIA#5329) * fixing bugs * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * bug fix * bug fix * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci Co-authored-by: Shanmugam Ramasamy <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Oleksii Kuchaiev <[email protected]> Signed-off-by: andrusenkoau <[email protected]>
What does this PR do ?
Add a one line overview of what this PR aims to accomplish.
Collection: [Note which collection this PR will affect]
Changelog
Usage
# Add a code snippet demonstrating how to use this
Before your PR is "Ready for review"
Pre checks:
PR Type:
If you haven't finished some of the above items you can still open "Draft" PR.
Who can review?
Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.
Additional Information