New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Add improved goodness of fit implementation #190

Merged

MMathisLab merged 14 commits into main from stes/better-goodness-of-fit

Feb 2, 2025

+272 −0

Member

stes commented Oct 27, 2024 •

edited

Loading

This adds a better goodness of fit measure. Instead of the old variant which simply matched the InfoNCE and depends on the batch size, the proposed measure

is at 0 for "chance level" (vs. log batch size)
does not need an adjustment for single session vs. multi-session solvers
increases as the model gets better, which might be more intuitive

The conversion is quite simply done via

GoF(model) = log (batch_size_per_session * num_sessions) - InfoNCE(model)

This measure is also used in DeWolf et al., 2024, Eq. (43)

Application example (GoF improves from 0 to a larger value during training):

OLD example, for comparison

Close https://github.com/AdaptiveMotorControlLab/CEBRA-dev/pull/669

cla-bot bot added the CLA signed label

stes self-assigned this

stes mentioned this pull request

Release 0.5.0rc1 #189

Merged

16 tasks

Member Author

stes commented Oct 27, 2024

TODO: Fix case where batch_size is None

stes force-pushed the stes/better-goodness-of-fit branch from 5e21cdc to c826b68 Compare

October 27, 2024 21:31

stes mentioned this pull request

Add goodness of fit metric #202

Closed

stes force-pushed the stes/better-goodness-of-fit branch from c826b68 to f43971f Compare

November 29, 2024 13:31

Member

CeliaBenquet commented Nov 29, 2024

@stes about what I implemented in #202 that I do see here.

I think it would be good to have a really basic function where you provide the loss and the batch size, so that it is easily usable in the pytorch implementation as well.

Also, it would be nice to test for the default CEBRA.batch_size = None, not sure it is handled here.

stes commented

View reviewed changes

cebra/integrations/sklearn/metrics.py Outdated Show resolved Hide resolved

stes commented

View reviewed changes

cebra/integrations/sklearn/metrics.py Outdated Show resolved Hide resolved

stes commented

View reviewed changes

cebra/integrations/sklearn/metrics.py Outdated Show resolved Hide resolved

Member Author

stes commented Dec 16, 2024

Unrelated build issue due to upstream change in sklearn (#204 ), attempted fix in #205

stes requested a review from CeliaBenquet

December 16, 2024 18:18

Member Author

stes commented Dec 16, 2024

The build issue is fixed, and once #205 is merged tests should pass here as well.

stes force-pushed the stes/better-goodness-of-fit branch from 1d55ead to ad8ae60 Compare

December 16, 2024 19:33

stes changed the title ~~[WIP] Add improved goodness of fit implementation~~ Add improved goodness of fit implementation

stes added the enhancement label

stes commented

View reviewed changes

cebra/integrations/sklearn/metrics.py Outdated Show resolved Hide resolved

stes commented

View reviewed changes

cebra/integrations/sklearn/metrics.py Outdated Show resolved Hide resolved

CeliaBenquet suggested changes

View reviewed changes

Member

CeliaBenquet left a comment

Thank you @stes! This looks nice!! Some minor suggestions on the docstrings and maybe add some tests for the different corner cases based on the arguments provided in infonce_to_goodness_of_fit.

cebra/integrations/sklearn/metrics.py Outdated Show resolved Hide resolved

cebra/integrations/sklearn/metrics.py Outdated Show resolved Hide resolved

cebra/integrations/sklearn/metrics.py Outdated Show resolved Hide resolved

cebra/integrations/sklearn/metrics.py Outdated Show resolved Hide resolved

tests/test_sklearn_metrics.py Show resolved Hide resolved

stes force-pushed the stes/better-goodness-of-fit branch from a1b4cf3 to f5ec82c Compare

January 21, 2025 23:13

stes commented

View reviewed changes

tests/test_sklearn_metrics.py Outdated Show resolved Hide resolved

Member Author

stes commented Jan 25, 2025

one review comment to resolve (= add more relevant tests for one batchsize None), then ready!

stes force-pushed the stes/better-goodness-of-fit branch from 7c85975 to c94f5ae Compare

January 25, 2025 16:27

stes requested a review from CeliaBenquet

January 25, 2025 16:27

stes and others added 5 commits

February 2, 2025 16:51


          Started implementing improved goodness of fit implementation

f93613f


          add tests and improve implementation

d871535


          Fix examples

17d31e5


          Fix docstring error

4f155d8


          Handle batch size = None for goodness of fit computation

afe25e6

stes and others added 9 commits

February 2, 2025 16:51


          adapt GoF implementation

caba8c5


          Fix docstring tests

3d05a18


          Update docstring for goodness_of_fit_score

e577b5a

Co-authored-by: Célia Benquet <[email protected]>


          add annotations to goodness_of_fit_history

cab2b2b

Co-authored-by: Célia Benquet <[email protected]>


          fix typo

1d42769

Co-authored-by: Célia Benquet <[email protected]>


          improve err message

d6f70e4

Co-authored-by: Célia Benquet <[email protected]>


          make numerical test less conversative

bf86944


          Add tests for exception handling

fd8e7cd


          fix tests

stes force-pushed the stes/better-goodness-of-fit branch from 0598346 to 3771990 Compare

February 2, 2025 16:18

stes requested review from MMathisLab and removed request for CeliaBenquet

February 2, 2025 16:54

MMathisLab approved these changes

View reviewed changes

Member

MMathisLab left a comment

lgtm! thanks

MMathisLab merged commit 4e32661 into main

14 checks passed

MMathisLab deleted the stes/better-goodness-of-fit branch

February 2, 2025 16:59

MMathisLab mentioned this pull request

How to test for: overfitting in CEBRA Supervised Embeddings #226

Closed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA signed enhancement