Lightning Lite Examples #9987
Conversation
The design document isn't visible to people outside of grid.ai, so it's hard to know the context for this.
Hey @ananthsub. Find the document here: https://docs.google.com/document/d/1b10LMNqnv1ellVTAEIlJFV5KvBuxIlFCTnNB3SYIFok/edit#heading=h.jl44rslqge7e. Best,
Co-authored-by: Jirka Borovec <[email protected]>
from collections.abc import Sized
from typing import Union

def __len__(self) -> Union[int, float]:
    # Return the wrapped dataloader's length if it defines one; otherwise treat it as unbounded.
    if isinstance(self._dataloader, Sized):
        return len(self._dataloader)
    return float("inf")
This does not belong in this PR. Why did we add this?
Needs to be addressed in #10297
What does this PR do?
This is V1 of the new Lightning Lite package. It bundles all major changes together, but the work will be split into individual PRs for merging (e.g., #10175, #10176).
Planned to be released as part of 1.5.
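For readers without access to the design document, here is a minimal sketch of the usage pattern this PR is aiming for. The class name, method names, and constructor arguments are assumptions based on the description and TODO list in this PR, not the final API.

import torch
from torch.utils.data import DataLoader, TensorDataset

from pytorch_lightning.lite import LightningLite  # assumed import path


class Lite(LightningLite):
    def run(self, dataset):
        model = torch.nn.Linear(32, 2)
        optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

        # setup() is assumed to move model/optimizer to the right device and
        # wrap them for the selected strategy (see the TODO list below).
        model, optimizer = self.setup(model, optimizer)
        dataloader = self.setup_dataloaders(DataLoader(dataset, batch_size=4))

        model.train()
        for batch, target in dataloader:
            optimizer.zero_grad()
            loss = torch.nn.functional.cross_entropy(model(batch), target)
            # Backward goes through Lite so strategies like DeepSpeed can intercept it.
            self.backward(loss)
            optimizer.step()


dataset = TensorDataset(torch.randn(64, 32), torch.randint(0, 2, (64,)))
# The constructor mirrors the accelerator/strategy/devices pattern discussed below.
Lite(accelerator="cpu", devices=1).run(dataset)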
Demo
TODOs
Precision support
Plugin support
Move data to device automatically
Move model to device automatically
Allow only one model per setup() call
DataLoader setup: Currently, there is no distributed sampler (see the sketch after this list).
Resolve miscellaneous TODOs in the code base
Fix changes that broke Lightning tests
Make self.setup() take model and optimizers positionally.
Unit testing, parity tests
Typing (mypy)
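As a reference for the DataLoader/distributed-sampler item above, here is a rough sketch of the kind of re-instantiation a setup_dataloaders-style helper could perform. The helper name and the set of attributes copied over are assumptions for illustration, not the code in this PR.

from torch.utils.data import DataLoader
from torch.utils.data.distributed import DistributedSampler


def _reinstantiate_with_distributed_sampler(dataloader: DataLoader, rank: int, world_size: int) -> DataLoader:
    # Hypothetical helper: rebuild the dataloader with a DistributedSampler so each
    # process sees a distinct shard of the dataset; shuffling moves into the sampler.
    sampler = DistributedSampler(dataloader.dataset, num_replicas=world_size, rank=rank, shuffle=True)
    return DataLoader(
        dataloader.dataset,
        batch_size=dataloader.batch_size,
        sampler=sampler,
        num_workers=dataloader.num_workers,
        collate_fn=dataloader.collate_fn,
        pin_memory=dataloader.pin_memory,
        drop_last=dataloader.drop_last,
    )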
Discussions
LightningLite constructor arguments: We are currently changing the Trainer constructor arguments to support a new pattern:
Trainer(accelerator="cpu/tpu/gpu", strategy="ddp/deepspeed/...", devices=X)
Should we start promoting this directly in the LightningLite API?
DeepSpeed API for backward: The user can't call loss.backward(); it needs to be called on the model. Which API do we want to offer? In both cases, the user needs to change their code if they switch from one strategy to the next.
DeepSpeed API for optimization step: Plain DeepSpeed requires a call to model.step() as opposed to the usual optimizer.step(). Since we wrap the user's optimizers anyway, we could still offer optimizer.step() and redirect to model.step(). It would mean fewer code changes for users switching between plugins, but it might be confusing for DeepSpeed users! (A rough sketch of this redirection is at the end of this page.)
Related work:
Part of #1 (it's a lie, this is just here to avoid noisy GitHub bot)
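To make the DeepSpeed discussion points above more concrete, here is a rough sketch of how a wrapped optimizer could hide the engine-specific calls. The wrapper class and the way the engine object is passed in are assumptions for illustration, not code from this PR; DeepSpeed's engine does expose backward(loss) and step(), which is what the redirection would target.

from typing import Any, Optional

import torch


class _LiteOptimizer:
    # Hypothetical wrapper that Lite could return from setup() instead of the raw optimizer.
    def __init__(self, optimizer: torch.optim.Optimizer, engine: Optional[Any] = None) -> None:
        self.optimizer = optimizer
        self._engine = engine  # e.g. a deepspeed.DeepSpeedEngine when that strategy is active

    def step(self, *args: Any, **kwargs: Any) -> None:
        if self._engine is not None:
            # DeepSpeed owns the optimization step (and gradient handling), so redirect to it.
            self._engine.step()
        else:
            self.optimizer.step(*args, **kwargs)

    def __getattr__(self, name: str) -> Any:
        # Fall through to the wrapped optimizer for zero_grad(), state_dict(), etc.
        return getattr(self.optimizer, name)

With such a wrapper, user code keeps calling optimizer.step() under every strategy; the open question is whether hiding model.step() this way helps or confuses DeepSpeed users. The same idea applies to backward: a self.backward(loss) hook can call engine.backward(loss) under DeepSpeed and plain loss.backward() otherwise.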