
Fix printing momentum for non-deepspeed optimizer #464

Merged · 2 commits · Oct 7, 2020

Conversation

@Zixxy (Contributor) commented Oct 7, 2020

Fix printing momentum for non-deepspeed optimizer

@tjruwase (Contributor) commented Oct 7, 2020

@Zixxy Thanks for creating this PR. Please consider the following comments:

  1. Which optimizer (or use case) motivated this PR?

  2. 'betas' is not from DeepSpeed, but rather from the Adam-style optimizers of Torch: Adam, AdamW, Adamax, etc.

  3. It would be great to support optimizers that name their momentum something other than 'betas', but the current fix will break optimizers that use 'betas' yet are not DeepSpeed optimizers. How about changing get_mom() to take an optional string parameter for the momentum name, e.g. def get_mom(self, name=None) or def get_mom(self, name='betas')? (A rough sketch follows below.)

Will (3) work for your scenario?
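
For reference, a minimal sketch of what (3) could look like. The class, attribute names, and method body below are illustrative assumptions, not DeepSpeed's actual engine code:

```python
import torch

# Hypothetical engine-like wrapper: get_mom() takes the name of the
# param-group key that stores the momentum value, defaulting to the
# Adam-style 'betas'. This mirrors suggestion (3), not the real engine.
class EngineSketch:
    def __init__(self, optimizer):
        self.optimizer = optimizer

    def get_mom(self, name='betas'):
        # Return the momentum-like entry of every param group under `name`.
        return [group[name] for group in self.optimizer.param_groups]

params = [torch.nn.Parameter(torch.zeros(1))]
print(EngineSketch(torch.optim.Adam(params, lr=1e-3)).get_mom())
# -> [(0.9, 0.999)]
print(EngineSketch(torch.optim.SGD(params, lr=1e-3, momentum=0.9)).get_mom('momentum'))
# -> [0.9]
```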

fix momentum access for Adam
@Zixxy (Contributor, Author) commented Oct 7, 2020

  1. torch.optim.SGD / torch.optim.RMSprop
  2. Ah right, fixed.
  3. It is a fix for your _report_progress function: https://github.com/microsoft/DeepSpeed/blob/2efea6944616c7be9e35874adc37dbaf150ea05e/deepspeed/runtime/engine.py#L1007. To call get_mom(name="betas") there, I would have to determine the optimizer type inside that function. Let me know if you would prefer that anyway.
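
To make the failure mode concrete, here is a small, self-contained illustration (not the actual PR diff) of why a hard-coded 'betas' lookup breaks for these optimizers: Adam-style optimizers store their momentum terms under the 'betas' key of each param group, while SGD and RMSprop use a 'momentum' key.

```python
import torch

params = [torch.nn.Parameter(torch.zeros(1))]
adam = torch.optim.Adam(params, lr=1e-3)
sgd = torch.optim.SGD(params, lr=1e-3, momentum=0.9)

def momentum_for_report(optimizer):
    # Read whichever key the optimizer's param groups actually carry;
    # group['betas'] alone would raise a KeyError for SGD/RMSprop.
    return [
        group['betas'] if 'betas' in group else group['momentum']
        for group in optimizer.param_groups
    ]

print(momentum_for_report(adam))  # [(0.9, 0.999)]
print(momentum_for_report(sgd))   # [0.9]
```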

@tjruwase (Contributor) commented Oct 7, 2020

Oh I see, you are fixing the call within the engine code. For some reason, I was thinking of the client calling engine.get_mom(). In that case, your fix works just fine. Thanks so much.

It is exciting to see that optimizers other than Adam and Lamb seem to work out-of-the-box with DeepSpeed.

@tjruwase tjruwase merged commit c39a76f into deepspeedai:master Oct 7, 2020