FEAT: Add LOMO optimizer #2695
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
This is a very good start, left some design comments that are more in tune with what I'd expect for Accelerate :)
src/accelerate/accelerator.py (Outdated)
if is_lomo_available():
    from lomo_optim import AdaLomo, Lomo
This should be done at the top of the file
Doing that leads to an error due to circular imports 😢 This is because lomo_optim imports from transformers, which itself imports from accelerate. Added a comment there.
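For illustration, a minimal sketch of the lazy-import workaround being described (it assumes the `is_lomo_available` helper this PR adds to `accelerate.utils`; the standalone function name is purely illustrative):

```python
# Sketch only: defer the lomo_optim import until it is actually needed.
# A module-level import would trigger lomo_optim -> transformers -> accelerate
# and fail with a circular import.
from accelerate.utils import is_lomo_available  # availability check assumed from this PR


def optimizer_is_lomo(optimizer) -> bool:
    """Hypothetical helper: True if `optimizer` is a Lomo/AdaLomo instance."""
    if not is_lomo_available():
        return False
    # Imported lazily, inside the function, to avoid the circular import above.
    from lomo_optim import AdaLomo, Lomo

    return isinstance(optimizer, (Lomo, AdaLomo))
```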
src/accelerate/accelerator.py (Outdated)
elif learning_rate is not None and self._has_lomo_optimizer:
    self._lomo_backward(loss, learning_rate)
Does this need to interact with self.scaler for AMP? That may be important!
Hmm, not sure here actually 🤯 I need to investigate a bit.
if is_lomo_available():
    from lomo_optim import AdaLomo, Lomo
Same thing as earlier, but let's move the detection for Lomo to happen in here rather than in the Accelerator. Then we can make an attribute self.is_lomo_optimizer.
see: #2695 (comment)
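A hedged sketch of how that detection could sit on the optimizer wrapper rather than on the Accelerator (a minimal stand-in class, not Accelerate's actual AcceleratedOptimizer; the attribute name follows the suggestion above):

```python
import torch

from accelerate.utils import is_lomo_available  # assumed helper from this PR


class OptimizerWrapperSketch:
    """Minimal stand-in for AcceleratedOptimizer showing where LOMO detection could live."""

    def __init__(self, optimizer: torch.optim.Optimizer):
        self.optimizer = optimizer
        self.is_lomo_optimizer = False
        if is_lomo_available():
            # Lazy import, same circular-import reasoning as above.
            from lomo_optim import AdaLomo, Lomo

            self.is_lomo_optimizer = isinstance(optimizer, (Lomo, AdaLomo))
```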
src/accelerate/accelerator.py (Outdated)
@@ -2031,6 +2041,8 @@ def backward(self, loss, **kwargs):
             return
         elif self.scaler is not None:
             self.scaler.scale(loss).backward(**kwargs)
+        elif learning_rate is not None and self._has_lomo_optimizer:
Eventually, if we have more optimizers needing this, we may want to abstract this to a backward_func.
Yes makes sense!
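Purely for illustration, one possible shape of such a backward_func abstraction (names and structure are assumptions, not the PR's code):

```python
# Sketch only: choose the backward behaviour once, instead of growing the
# if/elif chain inside Accelerator.backward() for every special optimizer.
from typing import Callable, Optional


def make_backward_func(scaler=None, lomo_backward: Optional[Callable] = None) -> Callable:
    if lomo_backward is not None:
        # LOMO fuses the parameter update into the backward pass, so the
        # learning rate must be threaded through to it.
        return lambda loss, learning_rate=None, **kwargs: lomo_backward(loss, learning_rate)
    if scaler is not None:
        # AMP path: scale the loss before backpropagating.
        return lambda loss, **kwargs: scaler.scale(loss).backward(**kwargs)
    # Default path: plain backward.
    return lambda loss, **kwargs: loss.backward(**kwargs)
```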
src/accelerate/accelerator.py (Outdated)
@@ -2031,6 +2041,8 @@ def backward(self, loss, **kwargs):
             return
         elif self.scaler is not None:
             self.scaler.scale(loss).backward(**kwargs)
+        elif learning_rate is not None and self._has_lomo_optimizer:
+            self._lomo_backward(loss, learning_rate)
Let's make this lomo_backwards instead of hiding it. Also, I'd rather see us raise an error if a learning rate isn't part of kwargs and this is True, rather than silently failing.
Makes total sense, refactored things a bit in 3a518dc, LMK what you think!
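A rough sketch of the explicit failure being asked for (a guess at the shape, not the code from 3a518dc):

```python
def check_lomo_backward_args(has_lomo_optimizer: bool, learning_rate) -> None:
    """Hypothetical guard: refuse to silently skip the LOMO update."""
    if has_lomo_optimizer and learning_rate is None:
        raise ValueError(
            "A LOMO optimizer was prepared, but no `learning_rate` was passed to "
            "`backward()`. LOMO applies its parameter update during the backward "
            "pass and needs the learning rate to do so."
        )
```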
src/accelerate/accelerator.py (Outdated)
if is_lomo_available():
    from lomo_optim import AdaLomo, Lomo
else:
    raise ValueError("`lomo_optim` package is needed to call backward on LOMO optimizers")
This should already be caught in the prepare_optimizer portion, so it should be fine.
Makes sense, removed it!
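For reference, a hedged sketch of the kind of check that would live in the prepare step instead (illustrative only; the function and its argument are assumptions):

```python
# Sketch only: if a LOMO optimizer is requested but lomo_optim is missing,
# the prepare step is the natural place to fail rather than backward().
from accelerate.utils import is_lomo_available  # assumed helper from this PR


def ensure_lomo_installed_if_requested(optimizer_name: str) -> None:
    if optimizer_name.lower() in ("lomo", "adalomo") and not is_lomo_available():
        raise ImportError(
            "You asked for a LOMO optimizer but the `lomo_optim` package is not "
            "installed. Install it with `pip install lomo-optim`."
        )
```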
Nice! This is looking fantastic! The last bit I'd like is a quick example in examples/ using LOMO, but for the sake of today's release we can get this in.
Nice, thanks so much! OK, I will work on an example that uses accelerate + LOMO and open a follow-up PR!
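While that follow-up example is not part of this PR, here is a hedged sketch of what a minimal accelerate + LOMO loop might look like. It assumes the `learning_rate` keyword this PR adds to `Accelerator.backward`, and that `Lomo` takes the model itself (since it fuses the update into backward); neither is taken from the merged example script.

```python
# Sketch only: minimal accelerate + LOMO loop, not the promised examples/ script.
import torch
from torch.utils.data import DataLoader, TensorDataset

from accelerate import Accelerator
from lomo_optim import Lomo  # assumed constructor: Lomo(model, lr=...)

accelerator = Accelerator()
model = torch.nn.Linear(16, 1)
optimizer = Lomo(model, lr=1e-3)  # assumption: LOMO wraps the model, not model.parameters()

dataset = TensorDataset(torch.randn(128, 16), torch.randn(128, 1))
dataloader = DataLoader(dataset, batch_size=8)

model, optimizer, dataloader = accelerator.prepare(model, optimizer, dataloader)

for x, y in dataloader:
    loss = torch.nn.functional.mse_loss(model(x), y)
    # LOMO applies its parameter update during the backward pass, so the
    # learning rate is passed here; no separate optimizer.step() is shown.
    accelerator.backward(loss, learning_rate=1e-3)
```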
# transformers & accelerate
from lomo_optim import AdaLomo, Lomo

self.has_lomo_optimizer = isinstance(optimizer, (Lomo, AdaLomo))
For my understanding, we assume that the accelerator could have multiple self._optimizers, with some of them LOMO and others not. Would that not create the issue that self.has_lomo_optimizer takes the value based on whatever the last optimizer is? Would we not have to set: self.has_lomo_optimizer |= isinstance(optimizer, (Lomo, AdaLomo))?
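To make the concern concrete, a small self-contained sketch of the difference (illustrative, not the PR's code):

```python
# Sketch only: with plain assignment the flag reflects whichever optimizer was
# processed last; accumulating with |= keeps it True if any optimizer is LOMO.
import torch

try:
    from lomo_optim import AdaLomo, Lomo
    lomo_classes = (Lomo, AdaLomo)
except ImportError:  # keep the sketch importable without lomo_optim installed
    lomo_classes = ()

model = torch.nn.Linear(4, 1)
optimizers = [torch.optim.AdamW(model.parameters(), lr=1e-3)]  # imagine a Lomo instance here too

has_lomo_optimizer = False
for optimizer in optimizers:
    # last-wins:  has_lomo_optimizer = isinstance(optimizer, lomo_classes)
    has_lomo_optimizer |= isinstance(optimizer, lomo_classes)  # any-wins
```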
WIP - on par with huggingface/transformers#30178