How to access `LightningDataModule` in `LightningModule` #10492

adamjstewart · 2021-11-11T21:36:39Z

adamjstewart
Nov 11, 2021

In TorchGeo, we use PyTorch Lightning to organize reproducible benchmarks for geospatial datasets. Currently, we have a set of LightningDataModules for each dataset and a much smaller number of LightningModules for each task (semantic segmentation, classification, regression, etc.). Each Dataset defines its own plot() method that describes how to plot images and masks.

During training/validation steps, we would like to plot a few examples to see how training is progressing. However, the LightningModule doesn't seem to know anything about the LightningDataModule/DataLoader/Dataset. Because of this, if we want to perform dataset-specific plotting during training or validation steps, we're forced to create a separate LightningModule for each dataset, increasing code duplication and defeating the whole purpose of PyTorch Lightning (example).

Is there an easy way for a LightningModule to tell which DataModule/DataLoader/Dataset is being used and call its dataset.plot() method?

@calebrob6 @isaaccorley

@tchaton this is slightly related to #10469 but different enough that I wanted to start a separate discussion about it.

Answered by akihironitta

Nov 11, 2021

@adamjstewart There is a reference to datamodule via trainer from LightningModule, but would that solve your issue?

self.trainer.datamodule

View full answer

akihironitta · 2021-11-11T23:35:20Z

akihironitta
Nov 11, 2021

@adamjstewart There is a reference to datamodule via trainer from LightningModule, but would that solve your issue?

self.trainer.datamodule

2 replies

adamjstewart Nov 12, 2021
Author

Ooh, that should work. Thanks!

akihironitta Nov 15, 2021

Hope it resolves your issue :]

For findability, let me change the title of this discussion from:

Dataset-specific plotting in a LightningModule

to

How to access LightningDataModule in LightningModule

ananthsub · 2021-11-15T05:59:02Z

ananthsub
Nov 15, 2021

This dependence sounds like the data isn't as separable from the model/loop.

Relying on self.trainer.datamodule is not foolproof.

Someone could use your lightning module but pass the data loaders directly to the trainer.fit function. In this case, there is no datamodule provided, and the module could fail unless it checks against this

1 reply

adamjstewart Dec 17, 2021
Author

I actually have a question related to this. During unit testing, we instantiate a LightningModule and LightningDataModule directly and try to test these components, but much of the pytorch-lightning library doesn't seem to work correctly. For instance, self.trainer is None. What's the correct way to handle this? Do we have to create a pl.Trainer every time? This process seems very time-intensive, making it unsuited for unit testing.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to access `LightningDataModule` in `LightningModule` #10492

{{title}}

Replies: 2 comments 3 replies

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

Select a reply

How to access LightningDataModule in LightningModule #10492

adamjstewart Nov 11, 2021

Replies: 2 comments · 3 replies

akihironitta Nov 11, 2021

adamjstewart Nov 12, 2021 Author

akihironitta Nov 15, 2021

ananthsub Nov 15, 2021

adamjstewart Dec 17, 2021 Author

How to access `LightningDataModule` in `LightningModule` #10492

adamjstewart
Nov 11, 2021

Replies: 2 comments 3 replies

akihironitta
Nov 11, 2021

adamjstewart Nov 12, 2021
Author

ananthsub
Nov 15, 2021

adamjstewart Dec 17, 2021
Author