Fix softmax #259

Closed
wants to merge 3 commits into from


thomasw21 (Member) commented Mar 1, 2022

@hyunwoongko has provided a fix for gpt-neox: EleutherAI/gpt-neox#572. I think the same fix applies to us. I think this might have affected throughput in some way.

@hyunwoongko, feel free to correct me if I'm wrong.

stas00 (Contributor) commented Mar 1, 2022

Thank you for detecting that we missed that upstream fix, @thomasw21.

Since I originally applied the first batch of these fixes, I tried to see what else was missing and discovered a lot more of them, so I'm trying a different approach here: #260

thomasw21 (Member, Author) commented Mar 1, 2022

Thanks @stas00. I think we can merge these fixes into your branch if you prefer, or into master. (I think they are relevant and not covered by your PR.)

stas00 (Contributor) commented Mar 2, 2022

Yes, and there are more fixes in this file besides what you added - I'm checking what else might need to go in there and updating my PR.

I'm working backwards: taking the Meg-LM version as master, checking whether we made any changes to it, and syncing those changes if need be.

thomasw21 (Member, Author) commented

Closing in favor of #260

thomasw21 closed this on Mar 7, 2022
thomasw21 deleted the thomas/fix_softmax branch on July 4, 2022 08:50
adammoody pushed a commit to adammoody/Megatron-DeepSpeed that referenced this pull request Dec 18, 2023