Boxcar extraction with non-finite pixels #245

hpparvi · 2025-01-29T12:22:07Z

This PR adds a new `exclude` mask treatment option to BoxcarExtract, which lets the extraction skip any non-finite pixels. With the `exclude` option, the extraction is carried out as a weighted sum: the average of the finite in-window pixels multiplied by the number of in-window pixels. The behaviour is unchanged for the other mask treatment options, and also when no non-finite values are present.
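As a rough illustration of the weighted-sum idea described above, the following NumPy sketch sums a fixed window of rows for each column. It is not the specreduce implementation: the function name `boxcar_sum_excluding_nonfinite`, the fixed integer row window, and the choice to return NaN for a column with no finite pixels are all assumptions made for the example.

```python
import numpy as np


def boxcar_sum_excluding_nonfinite(image, row_start, row_stop):
    """Sum rows [row_start, row_stop) per column, skipping non-finite pixels.

    Hypothetical helper, not part of specreduce. Each column's result is the
    mean of its finite in-window pixels times the number of in-window pixels.
    """
    window = np.asarray(image, dtype=float)[row_start:row_stop, :]
    n_window = window.shape[0]

    finite = np.isfinite(window)
    n_finite = finite.sum(axis=0)
    finite_sum = np.where(finite, window, 0.0).sum(axis=0)

    # Assumption: a column with no finite pixels yields NaN.
    mean_finite = np.full(window.shape[1], np.nan)
    np.divide(finite_sum, n_finite, out=mean_finite, where=n_finite > 0)
    return mean_finite * n_window


image = np.array([
    [1.0, 2.0, np.nan],
    [1.0, np.nan, np.nan],
    [1.0, 4.0, np.nan],
])
flux = boxcar_sum_excluding_nonfinite(image, 0, 3)
```

In this sketch, a column without non-finite pixels reduces to a plain sum over the window, which matches the statement that the result is unchanged when no non-finite values are present.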

codecov · 2025-01-29T12:25:18Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 85.46%. Comparing base (e0ac82a) to head (bf647cd).
Report is 5 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #245      +/-   ##
==========================================
+ Coverage   85.38%   85.46%   +0.07%     
==========================================
  Files          13       13              
  Lines        1177     1183       +6     
==========================================
+ Hits         1005     1011       +6     
  Misses        172      172

hpparvi · 2025-02-10T20:18:59Z

I improved the documentation slightly to describe how the non-finite values can be handled in boxcar extraction.

tepickering · 2025-02-13T00:48:54Z

specreduce/extract.py

-    mask_treatment: Literal['filter', 'omit', 'zero-fill'] = 'filter'
-    _valid_mask_treatment_methods = ('filter', 'omit', 'zero-fill')
+    # TODO: should the 'filter' option be changed to 'propagate'
+    mask_treatment: Literal['filter', 'omit', 'exclude'] = 'exclude'


this choice of words doesn't make it immediately clear what each option does. also, should we keep zero-fill as an option and implement it?

tepickering

i find that the names of the mask treatment options as implemented are not very clear and in fact somewhat misleading. it's worth having longer names that are more clearly descriptive of what they actually do. and if only two options are functionally supported, there should only be two options available. in my opinion, the options that make sense are:

* `ignore`: don't apply mask or mask non-finite values; extract from image as-is.
* `apply`: apply mask and mask non-finite values as you've implemented (should be default).
* `apply_mask_only`: apply mask, but ignore non-finite values
* `mask_nan_only`: ignore mask, but mask non-finite values
* `mask_extraction_bin`: grow mask so that if any masked or non-finite values are in the extraction bin, the whole bin gets masked in the output.

apply_mask_only and mask_nan_only are kind of niche cases so maybe we don't worry about those. zero_fill is another option that might make sense in some contexts, but i think masking properly is better.
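To make the proposed option names concrete, here is a minimal sketch of how each one could translate into a boolean "leave this pixel out" array. The function `effective_mask` is hypothetical and nothing here is specreduce API; it assumes rows run along the cross-dispersion axis, so each column is one extraction bin.

```python
import numpy as np


def effective_mask(image, mask, treatment):
    """Return a boolean array marking pixels to leave out of the extraction.

    Hypothetical helper: the option names follow the proposal above.
    Rows are the cross-dispersion axis, columns are extraction bins.
    """
    image = np.asarray(image, dtype=float)
    mask = np.asarray(mask, dtype=bool)
    nonfinite = ~np.isfinite(image)

    if treatment == "ignore":
        return np.zeros(image.shape, dtype=bool)
    if treatment == "apply":
        return mask | nonfinite
    if treatment == "apply_mask_only":
        return mask.copy()
    if treatment == "mask_nan_only":
        return nonfinite
    if treatment == "mask_extraction_bin":
        # Grow the mask: one bad pixel flags the whole column (bin).
        bad_column = (mask | nonfinite).any(axis=0)
        return np.broadcast_to(bad_column, image.shape).copy()
    raise ValueError(f"unknown mask treatment: {treatment!r}")


image = np.array([[1.0, np.nan, 1.0],
                  [1.0, 1.0, 1.0]])
mask = np.array([[False, False, False],
                 [False, False, True]])

applied = effective_mask(image, mask, "apply")
grown = effective_mask(image, mask, "mask_extraction_bin")
```

Here `applied` flags only the two bad pixels, while `grown` flags every pixel in the two columns that contain them.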

there's also the larger, more contentious issue that BoxcarExtract is a class that really should be a function. all of its functionality is contained within __call__() and making it a class adds a redundant layer of type declarations. it also requires users to either use two lines of code to perform an extraction or an unintuitive second pair of ()'s.

tepickering · 2025-02-13T00:49:27Z

specreduce/extract.py

+        width = width or self.width
+        disp_axis = disp_axis or self.disp_axis
+        cdisp_axis = crossdisp_axis or self.crossdisp_axis
+        mask_mapping = {'filter': 'filter', 'exclude': 'filter', 'omit': 'omit'}


and then the 3 unclear choices become actually 2...

hpparvi · 2025-02-13T16:17:24Z

Thanks for your comments @tepickering. I've aimed to retain the mask treatment option names chosen by @cshanahan1 in her mask treatment PR, while adding the spectrum-extraction-specific exclude option and removing the zero-fill option. That said, I agree that we should discuss these further before we release v1.5.

As I understand it, the main goal of @cshanahan1's mask treatment work is to introduce a consistent set of mask and non-finite value treatment options that are easy to document, understand, and remember. Also, since not all options are relevant to every use case, individual classes and functions don't need to support all options but only those that make sense within their context.

The current options are filter, omit, and zero-fill, where filter ignores the mask, omit masks any columns with masked or non-finite pixels (that is, it treats masked values as NaNs and propagates them), and zero-fill is self-explanatory.
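One way to read the `omit` and `zero-fill` descriptions above is sketched below. The helper `column_sums` is hypothetical and is not how specreduce implements these options; `filter` is not modelled here.

```python
import numpy as np


def column_sums(image, mask, treatment):
    """Sum each column under a given treatment (illustrative only)."""
    image = np.asarray(image, dtype=float)
    bad = np.asarray(mask, dtype=bool) | ~np.isfinite(image)

    if treatment == "omit":
        # Treat masked pixels as NaN; a plain sum then propagates the NaN,
        # so any column with a bad pixel ends up NaN in the output.
        filled = np.where(bad, np.nan, image)
    elif treatment == "zero-fill":
        # Replace bad pixels with zero before summing.
        filled = np.where(bad, 0.0, image)
    else:
        raise ValueError(f"unsupported treatment in this sketch: {treatment!r}")
    return filled.sum(axis=0)


image = np.array([[1.0, 2.0],
                  [3.0, np.nan]])
mask = np.zeros(image.shape, dtype=bool)

omitted = column_sums(image, mask, "omit")
zero_filled = column_sums(image, mask, "zero-fill")
```

The second column shows the difference: `omit` loses the whole column to NaN, whereas `zero-fill` returns a sum that silently undercounts the flux.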

I like the apply and ignore options proposed by @tepickering, and would keep zero-fill as it is. Personally, I find propagate to be a good alternative to omit, but I'm not sure if that's just my preference. mask_extraction_bin would also be clear, especially since this option is really only needed in the extraction classes and functions.

I can refactor the code once we're all happy with the option names, but I would do this in a separate PR. Before starting with this, I'd appreciate any input, so let me know if you have any opinions @cshanahan1, @tepickering, @camipacifici, @kecnry, @kbwestfall, and @eteq.

> i find that the names of the mask treatment options as implemented are not very clear and in fact somewhat misleading. it's worth having longer names that are more clearly descriptive of what they actually do. and if only two options are functionally supported, there should only be two options available. in my opinion, the options that make sense are:
>
> * `ignore`: don't apply mask or mask non-finite values; extract from image as-is.
> * `apply`: apply mask and mask non-finite values as you've implemented (should be default).
> * `apply_mask_only`: apply mask, but ignore non-finite values
> * `mask_nan_only`: ignore mask, but mask non-finite values
> * `mask_extraction_bin`: grow mask so that if any masked or non-finite values are in the extraction bin, the whole bin gets masked in the output.
>
> apply_mask_only and mask_nan_only are kind of niche cases so maybe we don't worry about those. zero_fill is another option that might make sense in some contexts, but i think masking properly is better.

> there's also the larger, more contentious issue that BoxcarExtract is a class that really should be a function. all of its functionality is contained within __call__() and making it a class adds a redundant layer of type declarations. it also requires users to either use two lines of code to perform an extraction or an unintuitive second pair of ()'s.

I agree with this. At the moment, boxcar and Horne extraction could be implemented as functions rather than classes. If there are good reasons to keep them as classes (anyone?), we could offer simple wrapper functions (e.g., boxcar_extract and horne_extract) for those who prefer not to use an object-oriented approach. Otherwise, we could convert them into functions, though I'd prefer not to do this before v2 to avoid breaking the API.
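The wrapper idea could look roughly like the sketch below. `BoxcarLike` is a stand-in class written for this example, not specreduce's `BoxcarExtract`, and the `boxcar_extract` signature is an assumption; the point is only that a thin function can hide the construct-then-call pattern.

```python
class BoxcarLike:
    """Stand-in for a callable extraction class (not specreduce's BoxcarExtract)."""

    def __init__(self, image, width=3):
        self.image = image
        self.width = width

    def __call__(self):
        # Placeholder "extraction": sum the first `width` rows of each column.
        rows = self.image[: self.width]
        return [sum(column) for column in zip(*rows)]


def boxcar_extract(image, width=3):
    """Hypothetical functional wrapper: build the object and call it once."""
    return BoxcarLike(image, width=width)()


image = [[1, 2], [3, 4], [5, 6]]

# Class style: two steps, or the "second pair of ()'s".
spectrum_from_class = BoxcarLike(image, width=2)()

# Function style: a single call.
spectrum = boxcar_extract(image, width=2)
```

Both calls give the same result, so a wrapper of this kind would not need to change the class-based API.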

hpparvi added 2 commits January 29, 2025 11:58

Updated mask handling in BoxcarExtract to extract the spectrum as a weighted sum and exclude non-finite pixels.

8d48457

Updated the change log to reflect new mask_treatment options and changes to BoxcarExtract handling.

9713654

hpparvi requested review from cshanahan1, kecnry and tepickering January 29, 2025 12:22

hpparvi added the enhancement label Jan 29, 2025

hpparvi self-assigned this Jan 29, 2025

hpparvi added this to the v1.5 milestone Jan 29, 2025

hpparvi changed the title ~~Weighted boxcar extraction~~ Boxcar extraction with non-finite pixels Jan 30, 2025

Updated documentation to explain how non-finite pixels are automatically handled in extraction methods. Detailed the default behavior for the `mask_treatment` option in BoxcarExtract. Additionally, fixed some minor typos.

bf647cd

hpparvi requested a review from camipacifici February 10, 2025 20:18

tepickering reviewed Feb 13, 2025

tepickering requested changes Feb 13, 2025

kecnry mentioned this pull request Feb 13, 2025

Replace broken specviz2d data in test and notebook spacetelescope/jdaviz#3431

Open

9 tasks
