Fix for logL_birth #324

williamjameshandley · 2023-08-01T20:25:25Z

Description

This PR fixes #310 by adding an explicit guard for logL_birth, dropping nans and infs before plotting

It's not the neastest solution, so any suggestions for improvements are welcome.

Checklist:

I have performed a self-review of my own code
My code is PEP8 compliant (flake8 anesthetic tests)
My code contains compliant docstrings (pydocstyle --convention=numpy anesthetic)
New and existing unit tests pass locally with my changes (python -m pytest)
I have added tests that prove my fix is effective or that my feature works
I have appropriately incremented the semantic version number in both README.rst and anesthetic/_version.py

codecov · 2023-08-01T20:33:28Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 100.00%. Comparing base (16083c4) to head (7eb1869).

Additional details and impacted files

@@            Coverage Diff            @@
##            master      #324   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           36        36           
  Lines         3041      3052   +11     
=========================================
+ Hits          3041      3052   +11

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

lukashergt

See comments inline.

lukashergt · 2023-08-01T23:55:06Z

anesthetic/samples.py

-                self[x].plot(ax=ax, xlabel=xlabel,
-                             *args, **kwargs)
+                if x == 'logL_birth':
+                    self[x].replace(-np.inf, np.nan
+                                    ).dropna().plot(ax=ax, xlabel=xlabel,
+                                                    *args, **kwargs)
+                else:
+                    self[x].plot(ax=ax, xlabel=xlabel, *args, **kwargs)


The .dropna() shouldn't be necessary.

Shall we separate inf replacement and plotting to reduce code duplication? We could have
if x == 'logL_birth': selfx = self[x].replace(...) else: selfx = self[x] selfx.plot(...)

I feel like changing infs to nans shouldn't happen silently. Shall we at least have a warning that we are setting -inf to nan for plotting purposes?

dropna is necessary for the 2D KDE, but not otherwiseo so I've removed those.
I've reduced code repetition as suggested.
With regard to infs to nans, I'm not that keen to have a warning for default behaviour (i.e. samples.plot_1d())

With regard to infs to nans, I'm not that keen to have a warning for default behaviour (i.e. samples.plot_1d())

Well, I'd argue here that samples.plot_1d() or samples.plot_2d() is not default behaviour for nested sampling data (nor for anesthetic dataframes in general, which typically come with very many columns), otherwise this would have come up earlier, too. By default the user should specify which columns are to be plotted.

So I think a warning message could help the new user (who is most likely to call the plotting functions without specifying columns) to learn faster how to use the plotting command. This is also how this PR came to be: new user simply trying samples.plot_1d() to see if it works...

I agree that too many warning messages are annoying and get in the way, but how often do we intentionally plot the posterior distribution of logL_birth? Also, for people that indeed want to intentionally plot logL_birth, they can easily avoid the warning by dropping infs first.

I am persuaded by this, so think we should have a warning message.

If we're going down a warning route, should we restrict this to logL_birth ? The code would be much neater if we just moved the relevant inf guards into kde_contour_plot_2d and kde_1d and kde_2d. This would then be more consistent with the other functions behaviour, e.g. samples.plot(), which just ignores nans and infs.

should we restrict this to logL_birth ?

We could do either logL_birth specifically, or anything with infs generally. Happy with both, but the warning should say something along the lines "there are infs in columns [logL_birth, ...]".

The infs will also cause problems for histograms, so I think catching them in plot_1d and plot_2d is probably the right place...?

My input from stumbling across this discussion via #332

Perhaps the default behavior is sensible (include nlive logl and loglbirth in the plot if nothing is specified), but it strikes me that the easiest thing to make the various kde errors less impactful is add a kind=scatter and make that the default for the specific case that the columns to plot are empty?

anesthetic/samples.py

tests/test_samples.py

…to logL_birth_inf

README.rst

williamjameshandley · 2023-08-20T22:29:02Z

Note that this has come up in the past in #96, which this PR obseletes, so that test is now removed.

lukashergt · 2023-08-22T06:45:30Z

The current question is whether we should change the default behaviour to ignoring the columns logL, logL_birth, and nlive by default?

I would vote against that default change. plot_1d and plot_2d are more general Samples methods which I don't think should drop any columns by default.

Those columns are more specific NestedSamples parameters. We could create NestedSamples.plot_1d and NestedSamples.plot_2d methods which would handle the treatment of those parameters (and then forward to Samples.plot_1d etc.), be it a warning or a change to default behaviour. I'd prefer a simple warning.

Perhaps the default behavior is sensible (include nlive logl and loglbirth in the plot if nothing is specified), but it strikes me that the easiest thing to make the various kde errors less impactful is add a kind=scatter and make that the default for the specific case that the columns to plot are empty?

I'd keep the kde default, but am open to be convinced otherwise.
Either way, it might be nice to have a simpler shortcut for scatter plots, so maybe we should create a kind='scatter' shortcut, which uses 'scatter_2d' in the lower triangle (and 'hist_1d' on the diagonal?)?

williamjameshandley · 2023-08-22T07:47:43Z

Either way, it might be nice to have a simpler shortcut for scatter plots, so maybe we should create a kind='scatter' shortcut, which uses 'scatter_2d' in the lower triangle (and 'hist_1d' on the diagonal?)?

Added scatter and scatter_2d to default kinds in b51b3ef

…to logL_birth_inf

williamjameshandley · 2023-08-22T08:38:49Z

I would also vote to keep things in Samples and avoid a NestedSamples override.

README.rst

anesthetic/samples.py

tests/test_samples.py

anesthetic/samples.py

…ns are actually ok here

…the newly added warnings and to keep our pytest output clean

lukashergt

@williamjameshandley, if you are happy with my tweaks, feel free to squash and merge.

* test that limits get accurately updated by successive plots with logscale axes, adjusting to new data limits, see issue #381 * fix typo from PR #324 * bump version to 2.8.10 * update logscale plot limits to datalimits at the end, making use of `ax.dataLim`

* test that limits get accurately updated by successive plots with logscale axes, adjusting to new data limits, see issue handley-lab#381 * fix typo from PR handley-lab#324 * bump version to 2.8.10 * update logscale plot limits to datalimits at the end, making use of `ax.dataLim`

* allow matplotlib 3.9 * bump version to 2.8.10 * Fix logscale limit updates (#383) * test that limits get accurately updated by successive plots with logscale axes, adjusting to new data limits, see issue #381 * fix typo from PR #324 * bump version to 2.8.10 * update logscale plot limits to datalimits at the end, making use of `ax.dataLim` * Fix macOS CI (#385) * attempt at fixing macOS CI by brew installing hdf5 * update from `miniconda@v2` to `miniconda@v3` * bump version to 2.8.11 * try newer `tables` version, which was previously restricted to 3.8.0 in #379 * Revert "attempt at fixing macOS CI by brew installing hdf5" This reverts commit 968bdb3. * Reapply "attempt at fixing macOS CI by brew installing hdf5" This reverts commit 204014a. Seems like this is needed after all, otherwise macOS is struggling to find a local HDF5. --------- Co-authored-by: Will Handley <[email protected]> * Fix to `color='C2'` plot_2d error post pandas 2 (#382) * Added failing test * bump version to 2.8.10 * Get color from self.color * Update README.rst * Update _version.py * Update README.rst * Update _version.py --------- Co-authored-by: Lukas Hergt <[email protected]> * bump version to 2.8.10 * bump version to 2.8.11 * bump version to 2.8.13 --------- Co-authored-by: Lukas Hergt <[email protected]> Co-authored-by: Will Handley <[email protected]>

Fix for logL_birth

f45e005

williamjameshandley requested a review from lukashergt August 1, 2023 20:25

bump version to 2.1.5

bdc97db

lukashergt reviewed Aug 1, 2023

View reviewed changes

williamjameshandley and others added 5 commits August 4, 2023 12:55

Merge branch 'master' into logL_birth_inf

f0020f8

bump version to 2.1.6

85618c3

now avoiding dropna where possible and reducing code repetition

343caff

Merge branch 'logL_birth_inf' of github.com:handley-lab/anesthetic in…

2a0d11d

…to logL_birth_inf

Merge branch 'master' into logL_birth_inf

c353f18

lukashergt reviewed Aug 15, 2023

View reviewed changes

README.rst Outdated Show resolved Hide resolved

lukashergt and others added 4 commits August 16, 2023 12:42

Merge branch 'master' into logL_birth_inf

a3c221b

version bump to 2.2.2

b185eb1

Merge branch 'master' into logL_birth_inf

aefbf14

Dropping all infs with warnings

318d1e0

williamjameshandley requested a review from lukashergt August 20, 2023 22:29

williamjameshandley mentioned this pull request Aug 21, 2023

Default plot_2d including run statistics #332

Open

williamjameshandley added 2 commits August 22, 2023 08:39

Corrected bump_version script

df622ef

Added a scatter kind

b51b3ef

Merge branch 'logL_birth_inf' of github.com:handley-lab/anesthetic in…

bdb7911

…to logL_birth_inf

lukashergt requested changes Aug 25, 2023

View reviewed changes

lukashergt added 4 commits September 29, 2023 17:17

Merge branch 'master' into logL_birth_inf

36f7ee4

fix mistakenly changed DOI in README

247c10c

check for inf directly instead of indirectly using isfinite, since na…

a498a0b

…ns are actually ok here

add pytest.warns to tests to explicitly check for the existance of …

d2915fb

…the newly added warnings and to keep our pytest output clean

lukashergt previously approved these changes Sep 30, 2023

View reviewed changes

williamjameshandley added 5 commits March 2, 2024 00:42

Merge branch 'master' into logL_birth_inf

00dcebc

bump version to 2.7.4

6dee428

Removed obselete test for #96

96a71bf

Merge branch 'master' into logL_birth_inf

4b9c66f

Merge branch 'master' into logL_birth_inf

233777b

lukashergt previously approved these changes Mar 22, 2024

View reviewed changes

lukashergt added 2 commits April 8, 2024 12:51

Merge branch 'master' into logL_birth_inf

6a05edf

Update README.rst version to 2.8.5

d3b2928

lukashergt dismissed their stale review via d3b2928 April 8, 2024 19:53

lukashergt and others added 5 commits April 8, 2024 12:54

Update _version.py to 2.8.5

e1526ea

Merge branch 'master' into logL_birth_inf

6f6de11

bump version to 2.8.6

b2136e3

Merge branch 'master' into logL_birth_inf

d5a2b87

bump version to 2.8.7

35b64b2

lukashergt previously approved these changes Apr 9, 2024

View reviewed changes

williamjameshandley added 2 commits April 9, 2024 17:25

Merge branch 'master' into logL_birth_inf

14f4709

bump version to 2.8.8

e40cbec

williamjameshandley dismissed lukashergt’s stale review via e40cbec April 9, 2024 16:25

lukashergt previously approved these changes Apr 9, 2024

View reviewed changes

bump version to 2.8.9

2242d50

lukashergt dismissed their stale review via 2242d50 April 9, 2024 18:12

Merge branch 'master' into logL_birth_inf

7eb1869

lukashergt approved these changes Apr 9, 2024

View reviewed changes

williamjameshandley merged commit 79c7bb6 into master Apr 9, 2024
22 checks passed

williamjameshandley deleted the logL_birth_inf branch April 9, 2024 18:43

lukashergt added a commit that referenced this pull request Apr 23, 2024

fix typo from PR #324

5585530

williamjameshandley mentioned this pull request Apr 24, 2024

Fix logscale limit updates #383

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix for logL_birth #324

Fix for logL_birth #324

williamjameshandley commented Aug 1, 2023

codecov bot commented Aug 1, 2023 •

edited

Loading

lukashergt left a comment

lukashergt Aug 1, 2023

williamjameshandley Aug 4, 2023

lukashergt Aug 4, 2023

williamjameshandley Aug 7, 2023

lukashergt Aug 7, 2023

yallup Aug 21, 2023

williamjameshandley commented Aug 20, 2023

lukashergt commented Aug 22, 2023

williamjameshandley commented Aug 22, 2023

williamjameshandley commented Aug 22, 2023

lukashergt left a comment

Fix for logL_birth #324

Fix for logL_birth #324

Conversation

williamjameshandley commented Aug 1, 2023

Description

Checklist:

codecov bot commented Aug 1, 2023 • edited Loading

Codecov Report

lukashergt left a comment

Choose a reason for hiding this comment

lukashergt Aug 1, 2023

Choose a reason for hiding this comment

williamjameshandley Aug 4, 2023

Choose a reason for hiding this comment

lukashergt Aug 4, 2023

Choose a reason for hiding this comment

williamjameshandley Aug 7, 2023

Choose a reason for hiding this comment

lukashergt Aug 7, 2023

Choose a reason for hiding this comment

yallup Aug 21, 2023

Choose a reason for hiding this comment

williamjameshandley commented Aug 20, 2023

lukashergt commented Aug 22, 2023

williamjameshandley commented Aug 22, 2023

williamjameshandley commented Aug 22, 2023

lukashergt left a comment

Choose a reason for hiding this comment

codecov bot commented Aug 1, 2023 •

edited

Loading