Improve calibration error speed by replacing `for` loop #769
Conversation
Nice, have you measured the performance? 🐰
Codecov Report

```
@@           Coverage Diff            @@
##           master    #769     +/-  ##
========================================
- Coverage      95%     71%     -24%
========================================
  Files         171     171
  Lines        6908    6926      +18
========================================
- Hits         6546    4904    -1642
- Misses        362    2022    +1660
```
@Borda Yes 👌 Only on CPU though, here's the script:

```python
import timeit

import torch


def method_a(confidences, accuracies, bin_boundaries):
    def _method_a():
        conf_bin = torch.zeros_like(bin_boundaries)
        acc_bin = torch.zeros_like(bin_boundaries)
        prop_bin = torch.zeros_like(bin_boundaries)
        for i, (bin_lower, bin_upper) in enumerate(
            zip(bin_boundaries[:-1], bin_boundaries[1:])
        ):
            # Calculate confidence and accuracy in each bin
            in_bin = confidences.gt(bin_lower.item()) * confidences.le(bin_upper.item())
            prop_in_bin = in_bin.float().mean()
            if prop_in_bin.item() > 0:
                acc_bin[i] = accuracies[in_bin].float().mean()
                conf_bin[i] = confidences[in_bin].mean()
                prop_bin[i] = prop_in_bin

    return _method_a


def method_b(confidences, accuracies, bin_boundaries):
    def _method_b():
        acc_bin = torch.zeros(len(bin_boundaries) - 1)
        conf_bin = torch.zeros(len(bin_boundaries) - 1)
        count_bin = torch.zeros(len(bin_boundaries) - 1)
        indices = torch.bucketize(confidences, bin_boundaries) - 1
        count_bin.scatter_add_(dim=0, index=indices, src=torch.ones_like(confidences))
        conf_bin.scatter_add_(dim=0, index=indices, src=confidences)
        conf_bin = torch.nan_to_num(conf_bin / count_bin)
        acc_bin.scatter_add_(dim=0, index=indices, src=accuracies)
        acc_bin = torch.nan_to_num(acc_bin / count_bin)
        prop_bin = count_bin / count_bin.sum()

    return _method_b


n_bins = 20
size = (10000000,)
confidences = torch.rand(size)
accuracies = torch.randint(low=0, high=2, size=size).float()
bin_boundaries = torch.linspace(0, 1, steps=n_bins + 1)

t = timeit.Timer(method_a(confidences, accuracies, bin_boundaries))
print(t.timeit(100))
t = timeit.Timer(method_b(confidences, accuracies, bin_boundaries))
print(t.timeit(100))
```

The time depends on the …
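The closure pattern in the script above (a factory returns `_method_a` so tensor setup is excluded from the timed region) works with plain `timeit` for any pair of callables. A torch-free sketch of the same harness, using hypothetical `make_*` helpers that stand in for the two binning methods:

```python
import timeit


def make_loop_sum(data):
    # manual Python loop, analogous to method_a's per-bin loop
    def _run():
        total = 0.0
        for x in data:
            total += x
        return total

    return _run


def make_builtin_sum(data):
    # single batched call, analogous to method_b's vectorized path
    def _run():
        return float(sum(data))

    return _run


data = [float(i) for i in range(10000)]
# setup (building `data`) is outside the timed closures, as in the script above
t_loop = timeit.Timer(make_loop_sum(data)).timeit(50)
t_builtin = timeit.Timer(make_builtin_sum(data)).timeit(50)
```

Both timers measure only the call body, so the comparison isolates the loop-vs-vectorized difference rather than allocation cost.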
Can confirm a 30-50x speedup on GPU with the proposed solution (after implementing the suggested changes) :)
Co-authored-by: Nicki Skafte Detlefsen <[email protected]>
* Remove deprecated functions and warnings
* Update links for docstring

Co-authored-by: Daniel Stancl <[email protected]>
Co-authored-by: Jirka Borovec <[email protected]>
Seems that older PyTorch versions do not have said function, so we can have a hard switch if needed: older versions will use the loop, newer versions this fancy solution :]
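One way to implement such a hard switch is to compare the installed torch version against the first release shipping the needed op. A sketch (the helper names are hypothetical; the 1.8 cutoff reflects that `torch.nan_to_num` was added in PyTorch 1.8):

```python
def _parse_version(version):
    # keep only the numeric "major.minor" part, dropping local tags like "+cu113"
    major, minor = version.split("+")[0].split(".")[:2]
    return int(major), int(minor)


def use_fast_binning(torch_version):
    # torch.nan_to_num landed in PyTorch 1.8, so gate the bucketize/scatter_add_
    # path on >= 1.8; older versions fall back to the for loop
    return _parse_version(torch_version) >= (1, 8)
```

In torchmetrics itself such gating is typically done once at import time with a module-level boolean rather than per call.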
Should something like this be used as a fallback?

```python
acc_bin = torch.zeros(len(bin_boundaries) - 1, device=confidences.device)
conf_bin = torch.zeros(len(bin_boundaries) - 1, device=confidences.device)
count_bin = torch.zeros(len(bin_boundaries) - 1, device=confidences.device)
prop_bin = torch.zeros(len(bin_boundaries) - 1, device=confidences.device)
try:
    indices = torch.bucketize(confidences, bin_boundaries) - 1
except AttributeError:
    for i, (bin_lower, bin_upper) in enumerate(zip(bin_boundaries[:-1], bin_boundaries[1:])):
        # Calculate confidence and accuracy in each bin
        in_bin = confidences.gt(bin_lower.item()) * confidences.le(bin_upper.item())
        prop_in_bin = in_bin.float().mean()
        if prop_in_bin.item() > 0:
            acc_bin[i] = accuracies[in_bin].float().mean()
            conf_bin[i] = confidences[in_bin].mean()
            prop_bin[i] = prop_in_bin
else:
    count_bin.scatter_add_(dim=0, index=indices, src=torch.ones_like(confidences))
    conf_bin.scatter_add_(dim=0, index=indices, src=confidences)
    conf_bin = torch.nan_to_num(conf_bin / count_bin)
    acc_bin.scatter_add_(dim=0, index=indices, src=accuracies)
    acc_bin = torch.nan_to_num(acc_bin / count_bin)
    prop_bin = count_bin / count_bin.sum()
```
> older versions will use the `for` loop

@Borda sure about this?
Good point, I did not dive into details, just saw that all tests below 1.8 were failing (could also be another reason lol)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Jirka Borovec <[email protected]>
@ramonemiliani93 do you think we can finish it so we can include it in the next bug-fix release? 🐰
for more information, see https://pre-commit.ci
@Borda should be taken care of now
@Borda Sorry for the late reply! I've had a lot of things on my side 😔 I just saw that @SkafteNicki already solved it. I hope to be able to contribute more next time!
Co-authored-by: Justus Schock <[email protected]>
It complains about …
* Improve speed by removing for loop and using bucketize + scatter_add.
* fast and slow binning
* Apply suggestions from code review
* cleaning & flake8
* increase to 1.8

Co-authored-by: Jirka Borovec <[email protected]>
Co-authored-by: Nicki Skafte Detlefsen <[email protected]>
Co-authored-by: mergify[bot] <37929162+mergify[bot]@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Jirka <[email protected]>

(cherry picked from commit 51d952d)
What does this PR do?
Improve calibration error speed by removing the `for` loop and using `bucketize` + `scatter_add`. Removes the `for` loop in the calibration error and uses `torch.bucketize` with `scatter_add_` for improved speed (~10x).
Fixes #767
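For intuition, the binning that `bucketize` + `scatter_add` perform can be sketched in plain Python (an illustration of the same math, not the torchmetrics implementation; the function name is hypothetical):

```python
def bin_stats(confidences, accuracies, n_bins):
    # per-bin accumulators, mirroring count_bin / conf_bin / acc_bin
    counts = [0.0] * n_bins
    conf_sum = [0.0] * n_bins
    acc_sum = [0.0] * n_bins
    for c, a in zip(confidences, accuracies):
        # bucket index for equal-width bins on [0, 1]; clamp 1.0 into the last bin
        i = min(int(c * n_bins), n_bins - 1)
        counts[i] += 1.0
        conf_sum[i] += c
        acc_sum[i] += a
    total = sum(counts)
    conf_bin = [s / n if n else 0.0 for s, n in zip(conf_sum, counts)]
    acc_bin = [s / n if n else 0.0 for s, n in zip(acc_sum, counts)]
    prop_bin = [n / total for n in counts]
    # expected calibration error: bin-weighted gap between accuracy and confidence
    ece = sum(p * abs(a - c) for p, a, c in zip(prop_bin, acc_bin, conf_bin))
    return conf_bin, acc_bin, prop_bin, ece
```

The vectorized version replaces the Python loop with one `bucketize` call to compute all bucket indices and three `scatter_add_` calls to fill the accumulators, which is where the speedup comes from.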
Before submitting
PR review
Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in Github issues there's a high chance it will not be merged.
Did you have fun?
Oh yes 😎
Make sure you had fun coding 🙃