
Computing metrics per-class for imbalanced data #204

Merged: 54 commits merged into Lightning-AI:master on Jun 14, 2021

Conversation

@AnselmC (Contributor) commented Apr 27, 2021

Before submitting

  • ✅ Was this discussed/approved via a GitHub issue? (no need for typos and docs improvements)
  • ✅ Did you read the contributor guideline, Pull Request section?
  • ✅ Did you make sure to update the docs?
  • ✅ Did you write any new necessary tests?

What does this PR do?

Draft PR as discussed in #174.
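
To make the intent concrete, here is a rough sketch of the per-class usage this PR is aiming at (toy data; argument names follow the existing `average`/`num_classes` conventions of the functional API, so treat it as illustrative rather than the final interface):

```python
import torch
from torchmetrics.functional import accuracy, precision

# Imbalanced 3-class toy data: class 2 is rare.
preds  = torch.tensor([0, 0, 1, 1, 2, 0, 0, 1])
target = torch.tensor([0, 0, 1, 1, 2, 2, 0, 1])

# A single micro-averaged number looks healthy (7/8 correct = 0.875)...
overall = accuracy(preds, target)

# ...while per-class scores expose that the rare class 2 is the one
# being missed.
per_class_acc = accuracy(preds, target, average="none", num_classes=3)
per_class_prec = precision(preds, target, average="none", num_classes=3)
```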

PR review

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues, there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

@pep8speaks (bot) commented Apr 27, 2021

Hello @AnselmC! Thanks for updating this PR.

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2021-06-14 12:17:10 UTC

@AnselmC marked this pull request as draft April 27, 2021 11:07
@Borda (Member) commented Apr 28, 2021

@AnselmC How is it going here? Ready for review?

@AnselmC (Contributor, Author) commented Apr 29, 2021

@Borda Hi, I believe it's at least ready for an initial discussion.
One point I was struggling with/unsure about is how to handle multi-dimensional multi-class data (and its different accumulations). As I understand it (please correct me if I'm mistaken), this means that a sample may belong to multiple classes, encoded by a multi-dimensional binary vector. Hence, when mdmc_average="global", these multi-dim vectors need to be treated as a label; this seems analogous to how I've dealt with integer labels.
In the samplewise case, however, each sample is treated as a separate "batch", the metrics are computed per "batch", and the results are then averaged. I'm not sure how to correctly address this, so I'm inclined to keep the existing behaviour for this case.

I'd appreciate some input on this, or pointers to more documentation for these scenarios.
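
To make the distinction concrete, here is a small sketch of the two accumulation modes as I currently understand them, using the existing `mdmc_average` argument of the functional metrics (toy data; expected values worked out by hand):

```python
import torch
from torchmetrics.functional import precision

# Multi-dim multi-class: 2 samples, 4 positions each (e.g. pixels),
# integer labels from 3 classes.
preds  = torch.tensor([[0, 1, 2, 0],
                       [0, 1, 2, 2]])
target = torch.tensor([[0, 2, 1, 1],
                       [0, 1, 2, 2]])

# "global": flatten the extra dimension, treat every position as an
# independent prediction, then reduce once over all 8 positions.
# Hand-computed: per-class precisions (2/3, 1/2, 2/3) -> macro ~0.611.
p_global = precision(preds, target, average="macro",
                     mdmc_average="global", num_classes=3)

# "samplewise": compute the metric within each sample, then average
# the per-sample scores. Hand-computed: sample macros are 1/6 and 1.0
# -> ~0.583, so the two accumulation modes genuinely diverge.
p_samplewise = precision(preds, target, average="macro",
                         mdmc_average="samplewise", num_classes=3)
```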

@codecov (bot) commented May 4, 2021

Codecov Report

Merging #204 (5d7a937) into master (ad5e360) will increase coverage by 0.03%.
The diff coverage is 100.00%.


@@            Coverage Diff             @@
##           master     #204      +/-   ##
==========================================
+ Coverage   96.78%   96.82%   +0.03%     
==========================================
  Files          94       94              
  Lines        3084     3115      +31     
==========================================
+ Hits         2985     3016      +31     
  Misses         99       99              
Flag         Coverage Δ
Linux        78.70% <70.17%> (-0.31%) ⬇️
Windows      78.70% <70.17%> (-0.31%) ⬇️
cpu          78.70% <70.17%> (-18.02%) ⬇️
gpu          96.78% <100.00%> (+0.03%) ⬆️
macOS        78.70% <70.17%> (-18.02%) ⬇️
pytest       96.82% <100.00%> (+0.03%) ⬆️
python3.6    ?
python3.8    ?
python3.9    ?
torch1.3.1   ?
torch1.4.0   ?
torch1.8.1   ?

Flags with carried forward coverage won't be shown.

Impacted Files                                           Coverage Δ
torchmetrics/classification/accuracy.py                  96.15% <ø> (ø)
torchmetrics/classification/f_beta.py                    100.00% <100.00%> (ø)
torchmetrics/classification/precision_recall.py          100.00% <100.00%> (ø)
torchmetrics/classification/specificity.py               100.00% <100.00%> (ø)
torchmetrics/classification/stat_scores.py               100.00% <100.00%> (ø)
torchmetrics/functional/classification/accuracy.py       94.36% <100.00%> (+0.42%) ⬆️
torchmetrics/functional/classification/f_beta.py         100.00% <100.00%> (ø)
...rics/functional/classification/precision_recall.py    100.00% <100.00%> (ø)
...chmetrics/functional/classification/specificity.py    100.00% <100.00%> (ø)

Continue to review the full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ad5e360...5d7a937.

@Borda added the "enhancement (New feature or request)" and "Important milestonish" labels May 11, 2021
@Borda added this to the v0.4 milestone May 11, 2021
@Borda marked this pull request as ready for review May 11, 2021 07:40
@SkafteNicki (Member) left a comment

Overall LGTM.

torchmetrics/functional/classification/f_beta.py: 2 outdated review threads (resolved)
@mergify (bot) removed the "has conflicts" label Jun 8, 2021
@AnselmC (Contributor, Author) commented Jun 13, 2021

@Borda Sorry, I was out on vacation for a week. I fixed the valid issues from DeepSource, but I'm not sure how to deal with "torch.tensor is not callable" (see here).
Anything else you need from me on this at the moment?
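
For reference, the "torch.tensor is not callable" warning is a known pylint false positive (E1102 / not-callable, which DeepSource surfaces) rather than a real bug; the usual workaround is an inline suppression, sketched below rather than taken from this PR:

```python
import torch

# pylint (and tools built on it, such as DeepSource) can flag
# torch.tensor as "not callable"; this is a known false positive
# and can be silenced inline:
t = torch.tensor([1.0, 2.0])  # pylint: disable=not-callable
```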

@Borda (Member) commented Jun 13, 2021

@SkafteNicki @maximsch2 mind reviewing?

@SkafteNicki (Member) left a comment

Great job, LGTM!
Please add an entry to the changelog noting which metrics are affected by this enhancement :]

@AnselmC (Contributor, Author) commented Jun 14, 2021

@SkafteNicki done!

@Borda enabled auto-merge (squash) June 14, 2021 12:15
@Borda merged commit 517a611 into Lightning-AI:master Jun 14, 2021
Labels: enhancement (New feature or request), Important milestonish, ready
Projects: None yet
5 participants