
Some metrics don't work on CPU using float16 #57

Closed

Llannelongue opened this issue Dec 31, 2020 · 6 comments

@Llannelongue commented Dec 31, 2020

🐛 Bug

It looks like some metrics, such as PrecisionRecallCurve, don't work on CPU when using float16, perhaps because the underlying PyTorch operations lack float16 support on CPU.

Please reproduce using the BoringModel

https://colab.research.google.com/drive/1xDv043rRi5WBshP4m5aoxTt2ChlfxjIk?usp=sharing
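
For reference, a minimal sketch of the failure outside the notebook (assuming the metrics API of pytorch-lightning 1.1.x; the exact error message may vary by PyTorch build):

```python
import torch
from pytorch_lightning.metrics import PrecisionRecallCurve

# float16 predictions on CPU, binary integer targets
preds = torch.rand(10).half()
target = torch.randint(0, 2, (10,))

metric = PrecisionRecallCurve(pos_label=1)
# On PyTorch 1.7 this raises a RuntimeError on CPU, since torch.flip
# has no float16 CPU kernel; casting preds to float32 first works.
precision, recall, thresholds = metric(preds, target)
```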

Expected behavior

The metrics should work in half precision on CPU as well.

Environment

  • CUDA:
    • GPU:
      • Tesla T4
    • available: True
    • version: 10.1
  • Packages:
    • numpy: 1.19.4
    • pyTorch_debug: True
    • pyTorch_version: 1.7.0+cu101
    • pytorch-lightning: 1.1.2
    • tqdm: 4.41.1
@Borda (Member) commented Dec 31, 2020

@SkafteNicki mind having a look? :]

@SkafteNicki (Member)

Definitely a lack of support on the PyTorch side, but it seems to be on track to be solved:
Issue: pytorch/pytorch#49889
PR with fix: pytorch/pytorch#49895
I can try to see whether the code can be rewritten using operations that do support float16.
We should probably also add float16 tests for the metrics in general.
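
A sketch of what such a test could look like (illustrative, not the repo's actual test suite; assuming the functional API of pytorch-lightning 1.1.x):

```python
import pytest
import torch
from pytorch_lightning.metrics.functional import mean_squared_error

@pytest.mark.parametrize("dtype", [torch.float16, torch.float32])
def test_mean_squared_error_dtype_cpu(dtype):
    preds = torch.rand(100).to(dtype)
    target = torch.rand(100).to(dtype)
    # The metric should run and roughly agree with the float32 result;
    # with float16 inputs this currently fails on CPU (torch.pow).
    result = mean_squared_error(preds, target)
    expected = mean_squared_error(preds.float(), target.float())
    assert torch.allclose(result.float(), expected, atol=1e-2)
```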

@SkafteNicki (Member)

Just a follow-up: the following metrics do not work with float16 on CPU because the PyTorch operations in parentheses lack float16 CPU support:

  • SSIM (torch.arange)
  • PSNR (torch.pow)
  • MeanSquaredLogError (torch.log1p & torch.pow)
  • MeanSquaredError (torch.pow)
  • ExplainedVariance (torch.pow)
  • PrecisionRecallCurve (torch.flip)
  • AveragePrecision (torch.flip)

@Borda how do we move forward with this? I think some of these can be rewritten with operations that do support float16 on CPU (pow can be replaced with multiplication, as sketched below), but I am not sure about all of them.
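
For instance, a hypothetical rewrite of a squared-error accumulation (illustrative only, not the actual metric internals):

```python
import torch

def sum_squared_error_pow(preds: torch.Tensor, target: torch.Tensor) -> torch.Tensor:
    # Fails for float16 on CPU in PyTorch 1.7: torch.pow has no Half CPU kernel.
    return torch.sum(torch.pow(preds - target, 2))

def sum_squared_error_mul(preds: torch.Tensor, target: torch.Tensor) -> torch.Tensor:
    # Same computation via multiplication, which float16 on CPU does support.
    diff = preds - target
    return torch.sum(diff * diff)
```

torch.flip (used by PrecisionRecallCurve and AveragePrecision) has no equally obvious substitute; indexing with a reversed int64 index tensor might be a workaround, since integer indexing of float16 tensors is supported on CPU.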

@edenlightning (Contributor)

@Borda

Borda transferred this issue from Lightning-AI/pytorch-lightning on Mar 12, 2021
@github-actions

Hi! Thanks for your contribution, great first issue!

Borda added the bug / fix (Something isn't working) and help wanted (Extra attention is needed) labels on Mar 17, 2021
Borda added this to the 0.3 milestone on Mar 25, 2021
@edenlightning (Contributor)

Looks like this was fixed in pytorch/pytorch@5d93e2b.
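
A quick way to check whether a given PyTorch build supports the previously failing ops for float16 on CPU (illustrative):

```python
import torch

x = torch.rand(4, dtype=torch.float16)  # float16 tensor on CPU
checks = {
    "flip": lambda: torch.flip(x, dims=(0,)),
    "pow": lambda: torch.pow(x, 2),
    "log1p": lambda: torch.log1p(x),
    "arange": lambda: torch.arange(0, 1, 0.1, dtype=torch.float16),
}
for name, fn in checks.items():
    try:
        fn()
        print(f"{name}: supported")
    except RuntimeError as err:
        print(f"{name}: unsupported ({err})")
```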
