Pass lr_scheduler to Accelerator.prepare #301

Merged: 6 commits merged into main on Mar 31, 2022

Conversation

sgugger (Collaborator) commented Mar 30, 2022

This PR adds support for passing the learning rate scheduler to Accelerator.prepare. This only concerns learning rate schedulers that are called at each optimizer step, not schedulers that are called at the end of each epoch.

Why?

There are three reasons motivating this change:

  • Users don't always remember that the scheduler has to be defined after the training dataloader has been prepared (since preparation changes its length). Passing the scheduler to prepare removes that pitfall.
  • The scheduler step should stay synchronized with the optimizer step: when the optimizer step is skipped (for instance under mixed precision), the scheduler step should be skipped too. With this PR, that is now the case.
  • This unlocks more situations where the optimizer step is skipped, such as having Accelerate automatically handle gradient accumulation.
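
For concreteness, here is a minimal usage sketch of the workflow this enables. The model, dataset, and scheduler choices below are illustrative (not taken from the PR or its tests); the point is only that the scheduler is now passed through prepare alongside the other objects.

```python
import torch
from torch.utils.data import DataLoader, TensorDataset
from accelerate import Accelerator

accelerator = Accelerator()

model = torch.nn.Linear(10, 2)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)
dataset = TensorDataset(torch.randn(64, 10), torch.randint(0, 2, (64,)))
dataloader = DataLoader(dataset, batch_size=8)

# The scheduler can be created before prepare(); after prepare(), Accelerate
# keeps its stepping consistent with the optimizer, including skipping the
# scheduler step whenever the optimizer step is skipped.
scheduler = torch.optim.lr_scheduler.OneCycleLR(
    optimizer, max_lr=1e-3, total_steps=len(dataloader) * 3
)

model, optimizer, dataloader, scheduler = accelerator.prepare(
    model, optimizer, dataloader, scheduler
)

for epoch in range(3):
    for inputs, labels in dataloader:
        outputs = model(inputs)
        loss = torch.nn.functional.cross_entropy(outputs, labels)
        accelerator.backward(loss)
        optimizer.step()
        scheduler.step()  # safe to call every iteration
        optimizer.zero_grad()
```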

HuggingFaceDocBuilderDev commented Mar 30, 2022

The documentation is not available anymore as the PR was closed or merged.

muellerzr (Collaborator) left a comment

Great work on this! Left a few suggestions for documentation consistency and a copy/paste victim 😉

src/accelerate/accelerator.py (review thread, resolved)
Comment on lines +478 to +481:

    # Match the scheduler to the prepared optimizer it was created with
    for opt in self._optimizers:
        if getattr(scheduler, "optimizer", None) == opt.optimizer:
            optimizer = opt
            break

Clever! I had wondered how you'd do this 😄
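
For context, a hedged sketch of the idea behind that matching: once the scheduler knows which prepared optimizer it belongs to, its wrapper can skip the scheduler step whenever that optimizer skipped its own step. The class and flag names below (SkipAwareScheduler, step_was_skipped) are illustrative, not the PR's exact implementation, which lives in src/accelerate/scheduler.py.

```python
# Illustrative sketch only; not the PR's actual wrapper.
import torch


class SkipAwareScheduler:
    """Step a scheduler only when its optimizer actually stepped."""

    def __init__(self, scheduler, optimizers):
        self.scheduler = scheduler
        self.optimizers = optimizers

    def step(self, *args, **kwargs):
        # Under mixed precision, the gradient scaler may skip an optimizer
        # step (inf/nan gradients); a hypothetical `step_was_skipped` flag on
        # the wrapped optimizer lets the scheduler skip in lockstep.
        if any(getattr(opt, "step_was_skipped", False) for opt in self.optimizers):
            return
        self.scheduler.step(*args, **kwargs)


# Usage with a plain PyTorch optimizer (which never sets the flag, so the
# wrapper behaves exactly like the underlying scheduler):
model = torch.nn.Linear(2, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
scheduler = SkipAwareScheduler(
    torch.optim.lr_scheduler.StepLR(optimizer, step_size=10), [optimizer]
)
optimizer.step()
scheduler.step()
```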

src/accelerate/checkpointing.py (review thread, resolved)
src/accelerate/scheduler.py (review thread, resolved)
LysandreJik (Member) left a comment

As said offline, LGTM!

sgugger merged commit 2c554b0 into main on Mar 31, 2022
sgugger deleted the lr_scheduler_prepare branch on March 31, 2022 at 13:55