More generic API for Rainbow variants #324

karpov-sv · 2024-02-25T22:48:40Z

After discussion with @erusseil we decided to unify the zoo of Rainbow variants.
So the PR replaces them all with a single RainbowFit feature that, on initialization, accepts keywords bolometric and temperature that may be either strings (for built-in variants) or custom sub-classes of corresponding base classes for bolometric and temperature terms. Default values should match the behaviour of previous version of RainbowFit.

The built-in variants for bolometric term are:

bazin - symmetric form of Bazin function, normalized to unity to de-correlate the amplitude from timescales
sigmoid - rising sigmoid

Temperature options are:

sigmoid - falling or rising sigmoid, with reference time equal to the one of bolometric term
constant - constant temperature
delayed_sigmoid - again sigmoid, but with the delay between temperature and bolometric reference times as a free parameter

The PR also slightly improves initial guesses for Bazin parameters, and fixes minor bugs from previous code versions.
Also, it changes the normalization of the underlying Planck term to be more straightforward - it both de-clutters the code a bit (basically, by moving the normalization directy to the term from extra scaling/unscaling methods), and returns the amplitude parameter in (approximately) the same units as input data, which should be more understandable by the end user.

codecov · 2024-02-25T22:56:00Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 84.94%. Comparing base (84f276f) to head (267cf92).

❗ Current head 267cf92 differs from pull request most recent head ee90dfb. Consider uploading reports for the commit ee90dfb to get more accurate results

Additional details and impacted files

@@           Coverage Diff           @@
##           master     #324   +/-   ##
=======================================
  Coverage   84.94%   84.94%           
=======================================
  Files           9        9           
  Lines        2664     2664           
=======================================
  Hits         2263     2263           
  Misses        401      401

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

codspeed-hq · 2024-02-25T23:12:32Z

CodSpeed Performance Report

Merging #324 will not alter performance

_{Comparing karpov-sv:rainbow-generic (ee90dfb) with master (84f276f)}

Summary

✅ 97 untouched benchmarks

hombit · 2024-02-26T13:32:40Z

Thank you, @karpov-sv, it looks really great. I'll go through it a little later, sorry in advance for the delay

hombit

It is a huge and great contribution! Thank you so much for it.

Please find some discussion topics bellow.

Interface of `RainbowFit`

I believe that bolometric and temperature arguments of RainbowFit should be objects, not classes. I would remove @staticmethod from abstract methods and allow users to create parametric subclasses of BaseBolometricTerm and BaseTemperatureTerm. This interface would make string literals to correspond to the "default" objects of the built-in classes (see the discussion bellow).

`bolometric.py` and `temperature.py`

I'm not sure how idiomatic the inheritance approach used there: we go from variable length of parameters *params to fixed number of params. It should be fine because we are the only users of the child classes, and we hopefully always provide the parameters in a right order.
As I mentioned above, I would appreciate if we make child classes parametric. We can do it with @dataclass(frozen=True). Limits and initial guesses could easily be parameters, having the same default values as you currently use in the code. In this case we can use trivial constructors to create "default" objects for bolometric_terms and temperature_terms.

Scaling literals

I like the idea of specifying scaling rules in bolometric/temperature classes. I would ask to make the way we specify these rules more clear: add all variants to the base classes docs, verify them more strictly on the runtime.

light-curve/light_curve/light_curve_py/features/rainbow/_base.py

light-curve/light_curve/light_curve_py/features/rainbow/temperature.py

light-curve/light_curve/light_curve_py/features/rainbow/generic.py

hombit · 2024-02-27T14:41:59Z

@karpov-sv @erusseil sorry, I've forgotten to write about the amplitude. I see your point about amplitude normalization, but I don't like that the output value doesn't have a clear physical meaning. What about a compromise solution: could we output the final "amplitude" to be a peak flux value in the first passband? We still may have the same "amplitude" definition inside the code, but replace it with some meaningful value in the end.

karpov-sv · 2024-02-27T21:33:45Z

So I implemented most of your suggestions. For the rest I suggest to keep it as it is now.

Parametric subclasses would not work for the current approach - at least if you wish to pass initial guesses as the parameters to them. These classes operate on scaled variables, and have no way to access the original scalers - thus, their initial parameters are not related to anything the end user may provide, and cannot be converted. (It would also break the concept of "batch processing", no?..).
I believe the proper place to introduce initial-guesses-from-outside is a non-batch interface like fit_and_get_errors() where you do actually have both scalers and scaling rules for the parameters. This way it will also keep the feature pure.

As for the amplitude in the parameters - now it is physically in the same (roughly) scale as original data. It all goes down to the normalization of Planck function, which is still an open question. E.g. one idea is to actually normalize it to the flux at some reference wavelength - it would break the meaning of the first term as bolometric, but would allow better fitting supernovae. However, it cannot be just one more extra parameter for the fit - the fitter is way too unstable as it is now already...
Let's keep it as it is now, and discuss it next week in Clermont.

hombit · 2024-02-27T21:37:06Z

@karpov-sv sounds great! I agree about the parametric subclasses, but I still want us to find a better solution for the amplitude.

karpov-sv · 2024-02-29T01:21:30Z

It still tells me that changes are requested, but I see no more open questions here. What should I do?..

hombit · 2024-02-29T01:28:06Z

@karpov-sv would you like me to merge it now, so we can work on the next PR in Clermont?

karpov-sv · 2024-02-29T01:30:15Z

Yes, let's merge it as it is now, and discuss what to do next in Clermont.

hombit · 2024-02-29T01:43:49Z

@karpov-sv I'll do it! Sorry for the mis-communication

… generic

…-normalized Planck function

… it works meaningfully with temperature evolution

… to better estimate rise/fall times

for more information, see https://pre-commit.ci

…roperly defined

* Initial import from Etienne PR, with some adjustments to make it more generic * Make parameter and error unscaling more generic * Implement peak time computation * Use new generic implementation instead of old RainbowFit * (Slightly better) decorrelate temperature and amplitude by using peak-normalized Planck function * Generalize baseline handling w.r.t. bolometric term initial parameters * Cont. of previous commit * Do not crash in baseline fitting when some bands do not have data * Revert the normalization of Planck term back to 'bolometric', so that it works meaningfully with temperature evolution * Decorrelate Bazin amplitude from rise/fall times by normalizing it to 1 * Pass measurement errors to initial parameter estimators, and use them to better estimate rise/fall times * Add tests for generic RainbowFit implementation * Remove now obsolete versions of RainbowFit * Improve docstrings * Do not raise an exception if the fitting fails but `get_initial=True` * Delayed sigmoid temperature term added * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Fix the tests * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Do not use 3.9+ features * Improve docstrings and stricter check for parameter unscaling * Minor fixes requested in the review * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Additional fixes requested in the review * Small docstring for the sigmoid mentioning that it has no peak time properly defined * Fix linting errors --------- Co-authored-by: erusseil <[email protected]> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

hombit requested changes Feb 27, 2024

View reviewed changes

karpov-sv and others added 19 commits February 28, 2024 20:43

Initial import from Etienne PR, with some adjustments to make it more…

4274a8d

… generic

Make parameter and error unscaling more generic

84bb64c

Implement peak time computation

bf58584

Use new generic implementation instead of old RainbowFit

fb74c9a

(Slightly better) decorrelate temperature and amplitude by using peak…

18f4625

…-normalized Planck function

Generalize baseline handling w.r.t. bolometric term initial parameters

2eb3efa

Cont. of previous commit

3f56804

Do not crash in baseline fitting when some bands do not have data

a05bacd

Revert the normalization of Planck term back to 'bolometric', so that…

e6810f8

… it works meaningfully with temperature evolution

Decorrelate Bazin amplitude from rise/fall times by normalizing it to 1

8e49967

Pass measurement errors to initial parameter estimators, and use them…

21efbdf

… to better estimate rise/fall times

Add tests for generic RainbowFit implementation

e9c2f9c

Remove now obsolete versions of RainbowFit

4e9654a

Improve docstrings

8016b69

Do not raise an exception if the fitting fails but get_initial=True

8c7d700

Delayed sigmoid temperature term added

f638773

[pre-commit.ci] auto fixes from pre-commit.com hooks

6cdb871

for more information, see https://pre-commit.ci

Fix the tests

c628110

[pre-commit.ci] auto fixes from pre-commit.com hooks

b7afaff

for more information, see https://pre-commit.ci

karpov-sv and others added 7 commits February 28, 2024 20:43

Do not use 3.9+ features

c4879bd

Improve docstrings and stricter check for parameter unscaling

ea4e34c

Minor fixes requested in the review

fdf2341

[pre-commit.ci] auto fixes from pre-commit.com hooks

13fd9d6

for more information, see https://pre-commit.ci

Additional fixes requested in the review

2aee2da

Small docstring for the sigmoid mentioning that it has no peak time p…

552374d

…roperly defined

Fix linting errors

ee90dfb

hombit force-pushed the rainbow-generic branch from 267cf92 to ee90dfb Compare February 29, 2024 01:43

hombit self-requested a review February 29, 2024 01:43

hombit approved these changes Feb 29, 2024

View reviewed changes

hombit merged commit 4128c04 into light-curve:master Feb 29, 2024
39 checks passed

JulienPeloton mentioned this pull request Apr 3, 2024

lightcurve version 0.8.2 breaks the pipeline astrolabsoftware/fink-science#380

Closed

3 tasks

hombit mentioned this pull request Jul 2, 2024

Noticeable loss of performances for Rainbow feature extraction after 0.8.1 #392

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More generic API for Rainbow variants #324

More generic API for Rainbow variants #324

karpov-sv commented Feb 25, 2024

codecov bot commented Feb 25, 2024 •

edited

Loading

codspeed-hq bot commented Feb 25, 2024 •

edited

Loading

hombit commented Feb 26, 2024

hombit left a comment

hombit commented Feb 27, 2024

karpov-sv commented Feb 27, 2024

hombit commented Feb 27, 2024

karpov-sv commented Feb 29, 2024

hombit commented Feb 29, 2024

karpov-sv commented Feb 29, 2024

hombit commented Feb 29, 2024

More generic API for Rainbow variants #324

More generic API for Rainbow variants #324

Conversation

karpov-sv commented Feb 25, 2024

codecov bot commented Feb 25, 2024 • edited Loading

Codecov Report

codspeed-hq bot commented Feb 25, 2024 • edited Loading

CodSpeed Performance Report

Merging #324 will not alter performance

Summary

hombit commented Feb 26, 2024

hombit left a comment

Choose a reason for hiding this comment

Interface of RainbowFit

bolometric.py and temperature.py

Scaling literals

hombit commented Feb 27, 2024

karpov-sv commented Feb 27, 2024

hombit commented Feb 27, 2024

karpov-sv commented Feb 29, 2024

hombit commented Feb 29, 2024

karpov-sv commented Feb 29, 2024

hombit commented Feb 29, 2024

codecov bot commented Feb 25, 2024 •

edited

Loading

codspeed-hq bot commented Feb 25, 2024 •

edited

Loading

Interface of `RainbowFit`

`bolometric.py` and `temperature.py`