[BUG] Loading losses with modules that have no parameters #1593

matteobettini · 2023-10-02T15:00:09Z

When loading a loss that has a neural network with no parameters, the reloading fails

  model = torch.nn.Tanh() # does not work
  # model = torch.nn.Linear(1, 1) works
  value = QValueActor(module=model, in_keys="obs", action_space="one_hot")
  loss = DQNLoss(value_network=model, action_space="one_hot")
  state = loss.state_dict()

  loss = DQNLoss(value_network=model, action_space="one_hot")
  loss.load_state_dict(state)

Traceback (most recent call last):
  File "/Users/matbet/PycharmProjects/rl/prova.py", line 16, in <module>
    loss.load_state_dict(state)
  File "/Users/matbet/miniconda3/envs/torchrl/lib/python3.9/site-packages/torch/nn/modules/module.py", line 2027, in load_state_dict
    load(self, state_dict)
  File "/Users/matbet/miniconda3/envs/torchrl/lib/python3.9/site-packages/torch/nn/modules/module.py", line 2015, in load
    load(child, child_state_dict, child_prefix)
  File "/Users/matbet/miniconda3/envs/torchrl/lib/python3.9/site-packages/torch/nn/modules/module.py", line 2009, in load
    module._load_from_state_dict(
  File "/Users/matbet/PycharmProjects/tensordict/tensordict/nn/params.py", line 792, in _load_from_state_dict
    self.data.load_state_dict(data)
  File "/Users/matbet/PycharmProjects/tensordict/tensordict/tensordict.py", line 834, in load_state_dict
    raise RuntimeError(
RuntimeError: Cannot load state-dict because the key sets don't match: got state_dict extra keys 
set()
 and tensordict extra keys
{'module'}

an example use case is the VDN module in MARL which is just a sum of the input and will cause this in the QMixerLoss

The text was updated successfully, but these errors were encountered:

vmoens · 2023-10-04T17:18:15Z

Thanks
I think moving to torch.func.functional_call will solve this issue. For this, pytorch/tensordict#526 needs to be mature

vmoens · 2024-02-01T16:31:58Z

I would suggest to use

import tensordict
sd = tensordict.TensorDict.from_module(loss)
sd.to_module(loss)

matteobettini · 2024-02-01T16:40:37Z

I see, thanks.

For BC-compatibility and interchangability with other components I still need to use the state_dict() interface though.

vmoens · 2024-02-01T16:45:35Z

yes i'm on it, but it's more efficient, faster and safer to serialize with tensordict

matteobettini added the bug Something isn't working label Oct 2, 2023

matteobettini assigned vmoens Oct 2, 2023

matteobettini mentioned this issue Nov 25, 2023

[Refactor] Refactor functional calls in losses #1707

Merged

vmoens mentioned this issue Feb 1, 2024

[BugFix] Loading phantom state-dicts pytorch/tensordict#650

Merged

vmoens closed this as completed in pytorch/tensordict#650 Feb 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Loading losses with modules that have no parameters #1593

[BUG] Loading losses with modules that have no parameters #1593

matteobettini commented Oct 2, 2023 •

edited

Loading

vmoens commented Oct 4, 2023

vmoens commented Feb 1, 2024

matteobettini commented Feb 1, 2024

vmoens commented Feb 1, 2024

[BUG] Loading losses with modules that have no parameters #1593

[BUG] Loading losses with modules that have no parameters #1593

Comments

matteobettini commented Oct 2, 2023 • edited Loading

vmoens commented Oct 4, 2023

vmoens commented Feb 1, 2024

matteobettini commented Feb 1, 2024

vmoens commented Feb 1, 2024

matteobettini commented Oct 2, 2023 •

edited

Loading