Add option to accumulate observations #534

sbfnk · 2024-01-22T11:37:46Z

This PR adds an argument na to obs_opts with an "accumulate" option that adds any NA data points to the following data point in the inference. The idea is that this will allow fitting data at any time interval (e.g. weekly) including where spacing is irregular (e.g., monthly, or weekly where on some occasions data is reported a day later because of a holiday etc.).
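To make the intended behaviour concrete, here is a minimal sketch of one reading of "accumulate": modelled values on NA days are carried forward and summed into the next reported day, so each report is compared against the total for its whole window. It is written in Python purely for illustration; the function name `accumulate_expected` and the toy numbers are assumptions, not code from this PR (which implements the feature in R and Stan), and the treatment of the first observation is deliberately left aside.

```python
from typing import List, Optional, Tuple


def accumulate_expected(
    observed: List[Optional[int]], expected: List[float]
) -> List[Tuple[int, float]]:
    """Pair each reported count with the modelled values summed since the last report."""
    pairs = []
    running = 0.0
    for obs, exp in zip(observed, expected):
        running += exp  # every day's modelled value joins the running total
        if obs is not None:  # a report closes the current accumulation window
            pairs.append((obs, running))
            running = 0.0
    return pairs


# Hypothetical data: None stands in for NA; reports cover three and two days.
observed = [None, None, 30, None, 25]
expected = [10.0, 9.0, 11.0, 12.0, 13.0]
pairs = accumulate_expected(observed, expected)
```

Because a window closes whenever a report appears, the same logic covers regular weekly data and irregular spacing such as a report delayed by a day.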

Two points for potential discussion are:

When setting this to accumulate, the first data point is ignored as we don't know when we should start accumulating for that one. This means that for someone who would like that data point to be considered it would be advantageous to add a dummy data point to the beginning of the data set. We could add an option to do that, or document it, or leave it unmentioned, or find another solution.
Because we either skip or accumulate NAs, we can't combine the two, e.g. to have weekly data with missing values. Enabling this would, I think, require some sort of variable that marks dates explicitly as missing vs. NA.
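The first point can be illustrated with a small sketch of which days end up summed into each usable observation. This is an illustration in Python under assumed semantics, not the package's implementation; the helper name `accumulation_windows` and the example values are hypothetical.

```python
from typing import List, Optional, Tuple


def accumulation_windows(observed: List[Optional[int]]) -> List[Tuple[int, int]]:
    """Return inclusive (start, end) day-index windows for each usable report.

    The first report is skipped: no earlier report marks where its window starts.
    """
    windows = []
    previous = None  # index of the most recent report seen so far
    for day, value in enumerate(observed):
        if value is None:
            continue
        if previous is not None:
            windows.append((previous + 1, day))
        previous = day
    return windows


# Hypothetical weekly series: reports on days 6 and 13, NA (None) elsewhere.
weekly = [None] * 6 + [70] + [None] * 6 + [84]
without_dummy = accumulation_windows(weekly)

# Prepending a dummy report anchors the start of the first real window.
with_dummy = accumulation_windows([0] + weekly)
```

Without the dummy only the second report is usable; with it, both real reports get a full seven-day window and the dummy itself is the one that is dropped.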

~~I've updated estimate_secondary to still work with the examples/tests but it doesn't work with NA values yet. This is for another issue/PR.~~ It was necessary to update estimate_secondary in order to pass tests/checks. This now also works with NA values.

Closes #531

github-actions · 2024-01-22T13:47:30Z

This is how benchmark results would change (along with a 95% confidence interval in relative change) if 99c3559 is merged into main:

:ballot_box_with_check:default: 33.1s -> 32s [-21.89%, +15.59%]
:ballot_box_with_check:no_delays: 32.9s -> 35.1s [-10.48%, +24.14%]
:ballot_box_with_check:random_walk: 9.24s -> 14.5s [-76.54%, +191.13%]
:ballot_box_with_check:stationary: 20.1s -> 18s [-24.6%, +3.67%]
:ballot_box_with_check:uncertain: 51.2s -> 52s [-17.56%, +20.65%]
Further explanation regarding interpretation and methodology can be found in the documentation.

github-actions · 2024-01-22T15:55:43Z

This is how benchmark results would change (along with a 95% confidence interval in relative change) if 7ee4d94 is merged into main:

:ballot_box_with_check:default: 32.2s -> 33.1s [-14.22%, +20.2%]
:ballot_box_with_check:no_delays: 34.5s -> 35.6s [-4.78%, +10.62%]
:ballot_box_with_check:random_walk: 9.4s -> 9.55s [-10.87%, +13.95%]
:ballot_box_with_check:stationary: 18.9s -> 18.2s [-14.84%, +7.46%]
:ballot_box_with_check:uncertain: 49.4s -> 48s [-21.87%, +16.14%]
Further explanation regarding interpretation and methodology can be found in the documentation.

seabbs

Really nice.

Enabling this would, I think, require some sort of variable that marks dates explicitly as missing vs. NA.

I think this would be my preferred option as it would be more general but I also think it can be addressed in its own review as it would be a superset of this PR.

My thought on how that would work is to have a new variable (accumulate) that indicates which days should be summed.
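A rough sketch of how such a variable might separate the two cases, written in Python for illustration only. The semantics are an assumption: a day flagged for accumulation is carried into the next day, while an unflagged day with no value is treated as truly missing and simply not scored. The function name `likelihood_terms` and the data are hypothetical and do not come from this PR or any follow-up.

```python
from typing import List, Optional, Tuple


def likelihood_terms(
    observed: List[Optional[int]],
    expected: List[float],
    accumulate: List[bool],
) -> List[Tuple[int, float]]:
    """Pair reports with accumulated modelled values, honouring an explicit flag."""
    terms = []
    running = 0.0
    for obs, exp, acc in zip(observed, expected, accumulate):
        running += exp
        if acc:
            continue  # flagged day: carry its modelled value into the next day
        if obs is not None:
            terms.append((obs, running))  # reported day: score against the total
        # Unflagged day without a value is truly missing: skip it.
        running = 0.0  # either way, a new window starts after an unflagged day
    return terms


observed = [None, None, 30, None, None, None, 25]
expected = [10.0, 9.0, 11.0, 12.0, 13.0, 14.0, 15.0]
# Days 0-1 and 3 and 5 are accumulated; day 4 is unflagged with no value (missing).
accumulate = [True, True, False, True, False, True, False]
terms = likelihood_terms(observed, expected, accumulate)
```

Here the missing report on day 4 drops its window from the fit, while the final report is still scored against the two days that follow it.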

R/opts.R

seabbs

This all looks good and is a nice feature to have. I have some reservations about the precise implementation but, as flagged, these can be addressed in follow-up work.

This means that for someone who would like that data point to be considered it would be advantageous to add a dummy data point to the beginning of the data set. We could add an option to do that, or document it, or leave it unmentioned, or find another solution.

I think my preferred option here would be to add a message when this method is used? (i.e. that the first data point is dropped and that a dummy can be used if you wish).

R/opts.R

inst/stan/functions/observation_model.stan

NEWS.md

seabbs

LGTM to me. I think the outstanding points are all for new issues, so if you agree we can resolve and merge?

R/opts.R

inst/stan/functions/observation_model.stan

sbfnk · 2024-02-14T14:49:30Z

LGTM to me. I think the outstanding points are all for new issues, so if you agree we can resolve and merge?

Yes sounds good to me.

sbfnk · 2024-02-14T14:51:12Z

I think my preferred option here would be to add a message when this method is used? (i.e. that the first data point is dropped and that a dummy can be used if you wish).

See

EpiNow2/R/opts.R

Line 484 in 528185b

if (na == "accumulate") {

Might be annoying but will remind us to implement a better solution if so.

seabbs · 2024-02-14T14:53:19Z

Might be annoying but will remind us to implement a better solution if so.

If things move over to {cli} this could be made a once per session thing to tone it down a bit.
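The "once per session" idea amounts to remembering which messages have already been shown. A minimal, language-neutral sketch in Python follows; the helper `inform_once`, the message id and the message wording are all hypothetical, and this is not how {cli} itself implements it.

```python
import sys

# Ids of messages already shown in this session (module-level state).
_shown = set()


def inform_once(message_id: str, message: str) -> bool:
    """Print the message to stderr the first time an id is seen; return whether it was shown."""
    if message_id in _shown:
        return False
    _shown.add(message_id)
    print(message, file=sys.stderr)
    return True


# Hypothetical wording; the second call is silent.
text = "Accumulating NA values: the first data point is ignored."
first = inform_once("accumulate_first_obs", text)
second = inform_once("accumulate_first_obs", text)
```

Keying on an id rather than the message text means the wording can change without the reminder being repeated.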

github-actions · 2024-02-14T16:26:35Z

This is how benchmark results would change (along with a 95% confidence interval in relative change) if 2a32f95 is merged into main:

:ballot_box_with_check:default: 30.8s -> 32.1s [-12.54%, +20.89%]
:ballot_box_with_check:no_delays: 33.4s -> 39.7s [-23.84%, +61.15%]
:ballot_box_with_check:random_walk: 8.96s -> 10.6s [-14%, +50.41%]
:ballot_box_with_check:stationary: 17.5s -> 19.3s [-7.64%, +27.77%]
:ballot_box_with_check:uncertain: 51.7s -> 51.7s [-14.15%, +14.2%]
Further explanation regarding interpretation and methodology can be found in the documentation.

Co-authored-by: Sam Abbott <[email protected]>

github-actions · 2024-02-15T10:17:13Z

This is how benchmark results would change (along with a 95% confidence interval in relative change) if df1fdc8 is merged into main:

:ballot_box_with_check:default: 32.5s -> 47.8s [-52.33%, +147.07%]
:ballot_box_with_check:no_delays: 31.9s -> 38.9s [-26.42%, +70.59%]
:ballot_box_with_check:random_walk: 9.09s -> 9.24s [-4.27%, +7.66%]
:ballot_box_with_check:stationary: 18.7s -> 32.7s [-92.66%, +243.41%]
:ballot_box_with_check:uncertain: 51.4s -> 49.6s [-16.86%, +9.73%]
Further explanation regarding interpretation and methodology can be found in the documentation.

* add option to accumulate observations
* accumulate in estimate_secondary model
* add test for weekly accumulation
* check there's data to fit initial growth model
* ignore first observation when accumulating
* document "na" argument
* add news item
* update obs_opts tests
* make logical operator scalar
* make NA option work with estimate_secondary
* add tests
* Apply suggestions from code review

Co-authored-by: Sam Abbott <[email protected]>

---------

Co-authored-by: Sam Abbott <[email protected]>

sbfnk requested a review from seabbs January 22, 2024 15:16

sbfnk mentioned this pull request Jan 23, 2024

Explore options for weekly data epiverse-trace/cfr#117

Closed

seabbs reviewed Feb 14, 2024

R/opts.R (resolved)

seabbs requested changes Feb 14, 2024

R/opts.R (outdated, resolved)

inst/stan/functions/observation_model.stan (resolved)

inst/stan/functions/observation_model.stan (resolved)

NEWS.md (outdated, resolved)

sbfnk force-pushed the accumulate-na branch from 85acf2b to dc28fc2 Compare February 14, 2024 13:55

seabbs enabled auto-merge (squash) February 14, 2024 13:56

sbfnk force-pushed the accumulate-na branch from 17a7c8f to 528185b Compare February 14, 2024 14:45

seabbs previously approved these changes Feb 14, 2024

R/opts.R (outdated, resolved)

inst/stan/functions/observation_model.stan (resolved)

inst/stan/functions/observation_model.stan (resolved)

sbfnk mentioned this pull request Feb 14, 2024

Move to {cli} for messaging #546

Closed

sbfnk dismissed seabbs’s stale review via 8f76a2f February 14, 2024 14:55

This was referenced Feb 14, 2024

Distinguish NA (missing) from NA (accumulated) #547

Closed

Efficiency of accumulating NAs #548

Open

sbfnk force-pushed the accumulate-na branch from 8f76a2f to 177b1e8 Compare February 14, 2024 17:22

sbfnk force-pushed the main branch from bd08c68 to 38fe801 Compare February 14, 2024 17:50

sbfnk added 9 commits February 15, 2024 08:47

add option to accumulate observations

59ad588

accumulate in estimate_secondary model

0eab176

add test for weekly accumulation

7490532

check there's data to fit initial growth model

87281b2

ignore first observation when accumulating

92faf5f

document "na" argument

ed9d636

add news item

464fa20

update obs_opts tests

34c38ba

make logical operator scalar

21d3f2d

sbfnk and others added 9 commits February 15, 2024 08:47

make NA option work with estimate_secondary

b091be9

add tests

2c64fb5

Apply suggestions from code review

3f3d819

Co-authored-by: Sam Abbott <[email protected]>

make documentation consistent

f67cfca

no need to re-specify values

d6531ef

re-render obs_opts documentation

16e1122

add message when accumulating (and update doc)

3f7fdbb

fix typo

62433ed

fix test issues after rebase

09e2c3c

sbfnk force-pushed the accumulate-na branch from 6af15a9 to 09e2c3c Compare February 15, 2024 08:48

seabbs approved these changes Feb 15, 2024

seabbs merged commit 50fc3cf into main Feb 15, 2024
14 checks passed

seabbs deleted the accumulate-na branch February 15, 2024 12:21
