check shape #859

Closed
MordehayM opened this issue Feb 24, 2022 · 9 comments · Fixed by #864
Labels: question (Further information is requested)

Comments

@MordehayM

https://github.com/PyTorchLightning/metrics/blob/21ba6502418f537a7ca3618be0b19f617f83a062/torchmetrics/functional/audio/pit.py#L148
Hi,
I don't understand why the target shape and the pred shape must be equal.
This question arises because the loss can be categorical cross-entropy with multiple outputs (one per speaker, for instance), in which case this constraint does not hold (see categorical cross-entropy in PyTorch).
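For illustration, a minimal sketch of the shapes involved when each speaker's output is scored with categorical cross-entropy; the sizes B, num_speaker, C, and T below are hypothetical and only meant to show that pred and target cannot have equal shapes in this setting:

```python
import torch
import torch.nn.functional as F

# Hypothetical sizes: batch, speakers, classes, frames
B, num_speaker, C, T = 4, 2, 10, 100

preds = torch.randn(B, num_speaker, C, T)           # [B, num_speaker, C, T]
target = torch.randint(0, C, (B, num_speaker, T))   # [B, num_speaker, T]

# F.cross_entropy expects the class dim right after the batch dim,
# so the speaker dim is folded into the batch dim here.
loss = F.cross_entropy(
    preds.reshape(B * num_speaker, C, T),
    target.reshape(B * num_speaker, T),
)

print(preds.shape, target.shape, loss.item())  # shapes differ, loss is still well-defined
```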

@github-actions

Hi! Thanks for your contribution, great first issue!

@SkafteNicki
Member

cc: @quancs

SkafteNicki added the question (Further information is requested) label on Feb 25, 2022
@quancs
Member

quancs commented Feb 25, 2022

Hi, you are right, this constraint is not designed correctly for all possible use cases. Could you give some example input and output shapes for your use case?

@quancs
Member

quancs commented Feb 27, 2022

I don't understand why the target shape and pred shape must be equal?

  1. The batch dim of pred and target should be the same by nature.
  2. The speaker dim should be the same, as required by PIT.
  3. For speech separation, metrics like SDR, SI-SDR, and PESQ require pred and target to have the same shape along the time dim.

I guess it is (3) that cannot be applied to other audio sub-domains. Is that right? @MordehayM
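As a point of reference, here is a minimal sketch of the speech-separation case that the current check targets, where pred and target share all three dims; the import path and function names follow the torchmetrics functional audio API and may differ slightly between versions:

```python
import torch
from torchmetrics.functional.audio import (
    permutation_invariant_training,
    scale_invariant_signal_distortion_ratio,
)

B, num_speaker, T = 4, 2, 16000  # batch, speakers, time samples
preds = torch.randn(B, num_speaker, T)
target = torch.randn(B, num_speaker, T)

# SI-SDR compares pred and target sample by sample, so pred and target
# must also have the same shape along the time dim.
best_metric, best_perm = permutation_invariant_training(
    preds, target, scale_invariant_signal_distortion_ratio, eval_func="max"
)
print(best_metric.shape, best_perm.shape)  # torch.Size([4]), torch.Size([4, 2])
```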

@MordehayM
Author

Yes, exactly.
When the loss is categorical cross-entropy, target and pred do not have the same shape: the target's shape is [B, num_speaker, d1..dk], while the pred's shape is [B, num_speaker, C, d1..dk], where C is the number of categories.

@quancs
Member

quancs commented Feb 27, 2022

Yes, exactly. When the loss is categorical cross-entropy, target and pred do not have the same shape: the target's shape is [B, num_speaker, d1..dk], while the pred's shape is [B, num_speaker, C, d1..dk], where C is the number of categories.

@MordehayM So, it works for your case if we just check the first two dimensions, batch and speaker?
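For illustration only, a relaxed check along those lines might look like the following; this is a sketch of the proposal in this comment, not necessarily the check that ended up in #864:

```python
from torch import Tensor


def _check_batch_and_speaker_dims(preds: Tensor, target: Tensor) -> None:
    """Hypothetical relaxed validation: only batch and speaker dims must match."""
    if preds.ndim < 2 or target.ndim < 2:
        raise ValueError("preds and target must be at least 2D: [batch, spk, ...]")
    if preds.shape[:2] != target.shape[:2]:
        raise ValueError(
            f"preds and target should have the same batch and speaker dims, "
            f"but got {tuple(preds.shape[:2])} and {tuple(target.shape[:2])}"
        )
```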

@MordehayM
Author

Sorry, but I didn't check that. I chose another implementation of PIT.
In my opinion it should work.

@quancs
Member

quancs commented Feb 27, 2022

In my opinion it should work.

Thanks for your opinion. I will fix this problem.

Sorry, but I didn't check that. I chose another implementation of PIT.

Just a little bit curious, could you tell me which one? If we find it's faster, we could implement it in TorchMetrics ^^

@MordehayM
Author

This one:
https://github.com/asteroid-team/pytorch-pit/blob/master/torch_pit/pit_wrapper.py
