Generalize internal checks for precision plugin type, training type, accelerator type #10821

awaelchli · 2021-11-29T19:19:24Z

Proposed refactor

Internally, our checks against the type of Accelerator, Precision type, strategy is not robust towards custom instances passed in by the user.

Motivation

Internally, some operations in the optimization, logging, etc. need a different code path depending on 1) Accelerator type (cpu, gpu) or 2) Precision type (apex, native) or 3) strategy type (ddp, ddp-spawn, ...). Currently we have this pattern:

if trainer._device_type == DeviceType.CPU:
    # do something only for cpu


if trainer._amp_backend == AMPType.Apex:
    # do something differently for apex

Pitch

Change these to

if isinstance(trainer.accelerator, CPUAccelerator):
    # do something only for cpu


if isinstance(trainer.precision_plugin, ApexPrecisionPlugin):
    # do something differently for apex

This has the benefits:

User passes in custom plugins (subclasses of our plugins)
Encapsulation: Protected members _device_type, _strategy_type, won't be abusively accessed publicly anymore. They remain an implementation detail of AcceleratorConnector
Minimally simplifies AcceleratorConnector logic

Additional context

Discusson started in #10596

If you enjoy Lightning, check out our other projects! ⚡

Metrics: Machine learning metrics for distributed, scalable PyTorch applications.
Lite: enables pure PyTorch users to scale their existing code on any kind of device while retaining full control over their own loops and optimization logic.
Flash: The fastest way to get a Lightning baseline! A collection of tasks for fast prototyping, baselining, fine-tuning, and solving problems with deep learning.
Bolts: Pretrained SOTA Deep Learning models, callbacks, and more for research and production with PyTorch Lightning and PyTorch.
Lightning Transformers: Flexible interface for high-performance research using SOTA Transformers leveraging Pytorch Lightning, Transformers, and Hydra.

cc @Borda @justusschock @awaelchli @rohitgr7 @kaushikb11 @akihironitta @ananthsub

stale · 2022-01-03T20:28:45Z

This issue has been automatically marked as stale because it hasn't had any recent activity. This issue will be closed in 7 days if no further activity occurs. Thank you for your contributions, Pytorch Lightning Team!

tchaton · 2022-01-04T09:40:54Z

This seems cleaner and easier to debug. Let's do it.

carmocca · 2022-02-14T16:11:28Z

Since this is invisible to the user, this doesn't need to happen strictly before 1.6 so I'll move it out and we can update the milestone whenever this gets done.

carmocca · 2022-02-14T16:20:36Z

NVM my previous comment, @justusschock will take it

carmocca · 2022-08-05T13:16:40Z

@justusschock Do you think you could finish this for 1.8?

justusschock · 2022-08-05T14:19:09Z

Definitely!

awaelchli · 2023-01-21T16:08:56Z

You failed 😄

awaelchli · 2023-01-21T16:12:28Z

Sorry, couldn't resist with the stupid comment 😄

But in all seriousness, I think we completed this in the meantime already. It seems all enum types got removed. Or do you see anything left to do?

awaelchli added the refactor label Nov 29, 2021

awaelchli mentioned this issue Nov 29, 2021

2/n Move Precision Plugin into strategy - move optimizer related logics #10596

Merged

12 tasks

awaelchli added accelerator plugin labels Nov 29, 2021

four4fish mentioned this issue Dec 9, 2021

1/n Generalize internal checks for Accelerator in Trainer - remove trainer._device_type #11001

Closed

12 tasks

awaelchli mentioned this issue Dec 9, 2021

Remove trainer._device_type in favor of check Accelerator class #11002

Closed

four4fish mentioned this issue Dec 13, 2021

3/n Move accelerator into Strategy #11022

Merged

12 tasks

stale bot added the won't fix This will not be worked on label Jan 3, 2022

awaelchli added this to the 1.6 milestone Jan 4, 2022

stale bot removed the won't fix This will not be worked on label Jan 4, 2022

tchaton added the let's do it! approved to implement label Jan 4, 2022

This was referenced Feb 7, 2022

Rewrite accelerator_connector #11448

Merged

[Tracker] Remaining tasks for Strategy stable version #11812

Closed

carmocca added the good first issue Good for newcomers label Feb 14, 2022

carmocca modified the milestones: 1.6, future Feb 14, 2022

carmocca assigned carmocca and unassigned carmocca Feb 14, 2022

carmocca assigned justusschock Feb 14, 2022

carmocca modified the milestones: future, 1.6 Feb 14, 2022

carmocca added this to Frameworks Planning Feb 14, 2022

carmocca moved this to Todo in Frameworks Planning Feb 14, 2022

awaelchli mentioned this issue Feb 21, 2022

Remove Trainer._device_type #11992

Merged

11 tasks

justusschock mentioned this issue Feb 23, 2022

Unify checks #12069

Closed

12 tasks

justusschock moved this from Todo to In Review in Frameworks Planning Mar 1, 2022

carmocca modified the milestones: 1.6, 1.7 Mar 28, 2022

carmocca removed good first issue Good for newcomers let's do it! approved to implement labels Mar 28, 2022

carmocca modified the milestones: 1.6, 1.7 Mar 28, 2022

carmocca unassigned justusschock Jul 19, 2022

carmocca added the help wanted Open to be worked on label Jul 19, 2022

carmocca modified the milestones: pl:1.7, pl:future Jul 19, 2022

carmocca assigned justusschock Jul 19, 2022

carmocca removed the help wanted Open to be worked on label Jul 19, 2022

carmocca modified the milestones: pl:future, pl:1.8 Aug 5, 2022

Borda moved this from In Review to Blocked in Frameworks Planning Sep 1, 2022

carmocca modified the milestones: v1.8, future Oct 13, 2022

awaelchli closed this as completed Jan 21, 2023

github-project-automation bot moved this from Blocked to Done in Frameworks Planning Jan 21, 2023

carmocca modified the milestones: future, 2.0 Jan 25, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generalize internal checks for precision plugin type, training type, accelerator type #10821

Generalize internal checks for precision plugin type, training type, accelerator type #10821

awaelchli commented Nov 29, 2021 •

edited by github-actions bot

Loading

stale bot commented Jan 3, 2022

tchaton commented Jan 4, 2022

carmocca commented Feb 14, 2022

carmocca commented Feb 14, 2022

carmocca commented Aug 5, 2022

justusschock commented Aug 5, 2022

awaelchli commented Jan 21, 2023 •

edited

Loading

awaelchli commented Jan 21, 2023 •

edited

Loading

Generalize internal checks for precision plugin type, training type, accelerator type #10821

Generalize internal checks for precision plugin type, training type, accelerator type #10821

Comments

awaelchli commented Nov 29, 2021 • edited by github-actions bot Loading

Proposed refactor

Motivation

Pitch

Additional context

If you enjoy Lightning, check out our other projects! ⚡

stale bot commented Jan 3, 2022

tchaton commented Jan 4, 2022

carmocca commented Feb 14, 2022

carmocca commented Feb 14, 2022

carmocca commented Aug 5, 2022

justusschock commented Aug 5, 2022

awaelchli commented Jan 21, 2023 • edited Loading

awaelchli commented Jan 21, 2023 • edited Loading

awaelchli commented Nov 29, 2021 •

edited by github-actions bot

Loading

awaelchli commented Jan 21, 2023 •

edited

Loading

awaelchli commented Jan 21, 2023 •

edited

Loading