
Allow SnowblindStep to run on _rate and _rateints #1

Merged (7 commits) on Dec 13, 2023

Conversation

gbrammer
Contributor

Add SnowblindStep handling for ImageModel and CubeModel inputs, i.e., _rate and _rateints products. The step decides what to do based on whether the datamodel has a groupdq attribute or only a dq attribute.
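
In sketch form (simplified, not the exact diff; the variable names here are illustrative), the check is along these lines:

    # Simplified sketch of the dispatch described above
    if hasattr(datamodel, "groupdq"):
        # RampModel input: flag snowballs per group in GROUPDQ, as before
        dq = datamodel.groupdq
    else:
        # ImageModel (_rate) or CubeModel (_rateints): work on the 2D/3D DQ array
        dq = datamodel.dq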

The update also includes a new parameter

new_jump_flag = integer(default={JUMP_DET}) # DQ flag to set for dilated jumps

to allow the user to set a custom integer flag on the dilated jumps.
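
A minimal usage sketch, assuming the standard stpipe Step.call() interface (the filename below is just an example):

    from snowblind import SnowblindStep

    # Flag dilated snowball jumps in a _rate product with a custom DQ bit (1024)
    # instead of the default JUMP_DET.
    result = SnowblindStep.call(
        "jw01234001001_01101_00001_nrcalong_rate.fits",  # example filename
        new_jump_flag=1024,
        save_results=True,
        suffix="snowblind",
    )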

Basic tests also included, based on the original tests for the RampModel inputs.

@jdavies-st
Collaborator

Thanks for the PR @gbrammer!

Is the idea here to run it after Detector1Pipeline? One major disadvantage of that is that it throws away a lot of data: in virtually all cases (except snowballs that hit in the first group), a rate can still be computed for the affected pixels if the snowball flagging is done within Detector1Pipeline.

Is running snowball flagging on the _rate file what is currently done in grizli?

@gbrammer
Contributor Author

Yes, grizli just flags the big snowball blobs in the Detector1Pipeline rate products. It doesn't make any effort to fix them; it just builds a conservative mask around them. I'll have a look at the PEP 8 problems.

@gbrammer
Contributor Author

OK, CI tests pass locally now.

jdavies-st changed the title from "Implement logical paths for running the SnowblindStep on other file types" to "Allow SnowblindStep to run on _rate and _rateints" on Nov 29, 2023
@gbrammer
Contributor Author

The last fix was for the option to do a contraction with a negative scale_factor. My idea for this was to flag saturated cores in rateints with two passes: the first with a positive scale_factor for integration i, and the second with something like scale_factor = -0.5 to flag just the cores in integrations i+1 : NINTS.
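
In pseudocode, the two-pass idea would look something like this (hypothetical sketch, not tested; the filename and the positive scale_factor value are placeholders):

    from snowblind import SnowblindStep

    # Pass 1: positive scale_factor dilates and flags the full snowball footprint
    # in the integration where it hit.
    pass1 = SnowblindStep.call("jw01234_rateints.fits", scale_factor=2.0)

    # Pass 2: a negative scale_factor contracts the mask to roughly the saturated
    # core, intended for flagging integrations i+1 : NINTS.
    pass2 = SnowblindStep.call(pass1, scale_factor=-0.5)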

The main reason I'm working on this is that rerunning Detector1Pipeline on exposures with, like

  'nframes': 5,
  'ngroups': 20,
  'nints': 4,

takes forever and uses a huge amount of memory. Doing the flagging directly on the MAST rateints seems to be pretty comparable in terms of data quality and is much faster and cheaper.

@gbrammer
Contributor Author

gbrammer commented Dec 6, 2023

Is there anything to consider before merging this and making a new pip release? It's been working very well for some updates to grizli, and it would be helpful to include a released version of snowblind as a requirement there.

@jdavies-st
Collaborator

Hi @gbrammer, sorry for the radio silence. Euclid is keeping me pretty busy! Yeah, let's get this merged.

I think it's almost there. One question: why have a custom new_jump_flag parameter? Certainly the default for flagging jumps at the group level is to set JUMP_DET (GROUPDQ = 4), but that doesn't work when flagging the _rate file, which needs both JUMP_DET and DO_NOT_USE set (so 5) so that drizzle later knows not to use those pixels. Is this the reasoning?
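
For reference, the bit arithmetic (the import path may differ between jwst versions):

    from jwst.datamodels import dqflags
    # in newer releases: from stdatamodels.jwst.datamodels import dqflags

    JUMP_DET = dqflags.pixel["JUMP_DET"]      # 4
    DO_NOT_USE = dqflags.pixel["DO_NOT_USE"]  # 1
    rate_flag = JUMP_DET | DO_NOT_USE         # 5, so drizzle skips these pixels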

@jdavies-st
Collaborator

jdavies-st commented Dec 11, 2023

Also, I'd like to try to convince you that you should be running SnowblindStep within Detector1Pipeline instead of afterwards. If you run it afterwards, you're actually throwing out lots of good data. For a typical deep field with 10 groups, 1 integration, and say 4 dithers per pointing, if you run snowblind on the _rate file, then for the pixels affected by a snowball your drizzled stack will have only 3/4 of the depth, since you masked all of those pixels in a single dither's _rate file.

But if you flag at the group level, you will only be flagging 1 or 2 groups out of the 10 that are eventually used to compute the _rate. In the most typical case where NFRAMES > 1, that's 2 groups flagged, so your final drizzled depth at those pixels is not 3/4 but 38/40 = 19/20 of the original depth. And for long exposures with lots of snowballs, this can have a significant effect on the final S/N for faint sources.
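
As a back-of-the-envelope check of those numbers:

    # 4 dithers x 10 groups per pixel; assume a snowball costs 2 groups in one dither
    groups_total = 4 * 10
    depth_rate_masking = (4 - 1) * 10 / groups_total         # 0.75 -> whole dither lost
    depth_group_masking = (groups_total - 2) / groups_total  # 0.95 -> only 2 groups lost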

Of course, at the center of the snowball, which is saturated (and in the ~2 pixel ring around it where you get charge migration), you will still lose all the signal in the ramp, but it's likely that there are some good groups beforehand and those will be useful. And the number of saturated pixels relative to the size of the expanded snowball mask is tiny.

The animated GIF below is an example illustrating this with some NIRCam LONG data in one of the CEERS fields: 10 groups, 1 integration, 4 dithers. The final mosaics were drizzled after SnowblindStep and 1/f noise mitigation were applied to both. The only difference is running snowblind on the _rate after Detector1Pipeline vs. running it on the _ramp within Detector1Pipeline. You can see that for the expanded masks where snowballs hit, masking at the group level retains S/N.

[Animated GIF: snowblind_rate_vs_ramp — final mosaic with snowblind run on _rate vs. on _ramp]

@gbrammer
Contributor Author

> Hi @gbrammer, sorry for the radio silence. Euclid is keeping me pretty busy! Yeah, let's get this merged.
>
> I think it's almost there. One question: why have a custom new_jump_flag parameter? Certainly the default for flagging jumps at the group level is to set JUMP_DET (GROUPDQ = 4), but that doesn't work when flagging the _rate file, which needs both JUMP_DET and DO_NOT_USE set (so 5) so that drizzle later knows not to use those pixels. Is this the reasoning?

I added this just to make the flagging more flexible. It defaults to the behavior you describe here, but when flagging the rate products in grizli, I want to set the snowball flags to something other than JUMP_DET, because the bulk of the JUMP_DET pixels are generally fine to use when I pass the images to the low-level drizzle function.

I use the 1024 bit as my own DO_NOT_USE flag for optional additional bad-pixel masks, along with the 4096 bit for pixels identified when I run groups of exposures through the legacy AstroDrizzle CR rejection steps. That is, I drizzle exposures into mosaics excluding pixels where (DQ & (1 + 1024 + 4096)) > 0.
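
As a sketch (the function name here is just illustrative):

    import numpy as np

    DO_NOT_USE_BITS = 1 | 1024 | 4096  # DO_NOT_USE + custom bad-pixel bit + AstroDrizzle CR bit

    def drizzle_exclude_mask(dq):
        """Boolean mask of pixels left out of the drizzled mosaic."""
        return (dq & DO_NOT_USE_BITS) > 0

    # A pixel flagged only with JUMP_DET (4) is still used:
    print(drizzle_exclude_mask(np.array([0, 4, 1024, 4100])))  # [False False  True  True]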

@gbrammer
Contributor Author

> Also, I'd like to try to convince you that you should be running SnowblindStep within Detector1Pipeline instead of afterwards. If you run it afterwards, you're actually throwing out lots of good data. For a typical deep field with 10 groups, 1 integration, and say 4 dithers per pointing, if you run snowblind on the _rate file, then for the pixels affected by a snowball your drizzled stack will have only 3/4 of the depth, since you masked all of those pixels in a single dither's _rate file.

I certainly accept your point, but the CPU/memory resources required by Detector1Pipeline are still a bit too expensive and unpredictable for me to use for general processing of tons of archival data.

@jdavies-st
Collaborator

Thanks for this, Gabe. Glad this expands the functionality. I note only the one problem below, of SATURATED not being propagated to the _rate files properly, which might make it less effective at recognizing snowballs.

else:
    self._has_groups = False
    bool_jump = (jump.dq & JUMP_DET) == JUMP_DET
    bool_sat = (jump.dq & SATURATED) == SATURATED
@jdavies-st
Collaborator

In the case of _rate files, currently only pixels that are saturated in all groups get the SATURATED flag. This is not good in my view. What it means in practice is that when a snowball saturates some pixels mid-exposure, the _rate file still has a slope computed for those pixels from the first part of the ramp, and the DQ only shows JUMP_DET in the area around the core. This may make it more difficult to detect snowballs.

Btw, there's an open issue to have this changed. See spacetelescope/jwst#8124

If the snowball lands in the 1st group, then the core may be saturated for all groups (it might not be if NFRAMES is large), and so one would get NaNs in the core of the _rate file. Not sure of the best way to handle this.

jdavies-st merged commit 4443b05 into mpi-astronomy:main on Dec 13, 2023
3 checks passed