
Deprecate trainer.num_processe/trainer.num_gpus and remove incorrect tests #11624

Closed
four4fish opened this issue Jan 26, 2022 · 3 comments
four4fish commented Jan 26, 2022

Proposed refactor

Trainer.num_processes and Trainer.num_gpus are not used in the code base and only exist in tests. I propose deprecating/removing these properties and removing the incorrect/unnecessary tests.

Motivation

  1. Simplify code and reduce confusion. Strategy.num_processes != Trainer.num_processes, and Trainer.num_processes is only called in tests (confusion raised in Lazy initialize Strategy.parallel_devices #11572).
  2. Simplify the accelerator_connector rewrite (Rewrite Accelerator_connector and follow up tasks #11449). The current accelerator_connector has a lot of logic related to num_processes that is unnecessary and confusing; removing it first will simplify the refactor.

Pitch

Steps:

  1. Deprecate trainer.num_processes
    https://github.com/PyTorchLightning/pytorch-lightning/blob/fe34bf2a653ebd50e6a3a00be829e3611f820c3c/pytorch_lightning/trainer/trainer.py#L1969-L1971

  2. Remove trainer.num_processes tests
    https://github.com/PyTorchLightning/pytorch-lightning/blob/fe34bf2a653ebd50e6a3a00be829e3611f820c3c/tests/trainer/test_trainer.py#L2093-L2111

  3. Do not carry self.num_processes logic over to the accelerator_connector rewrite.
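The deprecation step could look roughly like the sketch below (a hypothetical minimal Trainer using the stdlib `warnings` module rather than Lightning's internal deprecation helpers; names and messages are illustrative only):

```python
import warnings


class Trainer:
    """Hypothetical minimal Trainer, used only to illustrate the deprecation pattern."""

    def __init__(self, devices: int = 1) -> None:
        self.devices = devices

    @property
    def num_processes(self) -> int:
        # Warn on access but keep the old behavior for one release cycle,
        # then drop the property entirely in a later release.
        warnings.warn(
            "`Trainer.num_processes` is deprecated and will be removed in a "
            "future release. Use `Trainer.devices` instead.",
            DeprecationWarning,
            stacklevel=2,
        )
        return self.devices
```

Accessing `Trainer(devices=4).num_processes` would still return 4 but emit a `DeprecationWarning`, giving users a migration window before removal.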

Note: this won't impact Strategy.num_processes, as it is unrelated. Strategy.num_processes is calculated based on parallel_devices, which is neither equal nor related to trainer.num_processes.

The same applies to trainer.num_gpus.
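To illustrate why the two process counts are unrelated, here is a minimal sketch (a hypothetical class, not the real Strategy API) of a strategy-side count derived purely from its device list:

```python
class Strategy:
    """Hypothetical sketch: the process count is derived from the device list."""

    def __init__(self, parallel_devices=None):
        # Device identifiers, e.g. ["cuda:0", "cuda:1"]; the real Lightning
        # class holds torch.device objects here.
        self.parallel_devices = list(parallel_devices or [])

    @property
    def num_processes(self):
        # One process per parallel device; no Trainer-level flag is consulted.
        return len(self.parallel_devices)
```

In this model, deprecating a Trainer-level property cannot change the strategy's count, since it only ever reads its own `parallel_devices`.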

Additional context



cc @justusschock @awaelchli @akihironitta @rohitgr7 @kaushikb11 @Borda @ananthsub @ninginthecloud @jjenniferdai

@four4fish four4fish self-assigned this Jan 26, 2022
@four4fish four4fish changed the title Deprecate trainer.num_processes and remove incorrect tests Deprecate trainer.num_processe/trainer.num_gpus and remove incorrect tests Jan 26, 2022
@awaelchli awaelchli added this to the 1.6 milestone Jan 29, 2022
@awaelchli
Contributor

This is reasonable imo, considering the generic Trainer(devices=x) argument was recently introduced; num_processes does not follow that terminology anyway. When deprecating num_processes, num_gpus, etc., we should introduce num_devices imo.
Other than that, I think we can go ahead with this change.

@carmocca
Contributor

Is this finished?

@rohitgr7
Contributor

Looks like the deprecation is done already. Closing this.

Repository owner moved this from Todo to Done in Frameworks Planning Aug 16, 2022