Change which epi_archive operations have reference semantics #67

brookslogan · 2022-04-13T23:16:23Z

Two potential approaches:

change epi_archive from an R6 class into an S3-classed-list containing a lazy_dt (from dtplyr)
rename current epi_archive R6 class to EpiArchive, make epi_archive an S3 list wrapper on top of EpiArchive that somehow prevents reference semantics from messing with users

Main complication:

We may eventually want disk-backed and/or updating epi_archives for delphi-epidata, WayBack Machine, GitHub, and/or other types of data sources; if these are also considered epi_archives rather than their own separate class, we want to ensure that they will have a sensible and compatible non-reference-semantics or side-effect-based interface

Other work:

Update docs and vignettes talking about R6 and reference semantics.

brookslogan · 2022-04-13T23:46:36Z

The priority of this issue might be upgraded depending on feelings on #64.

brookslogan · 2022-04-14T20:24:13Z

We discussed an alternative approach to this:

archive$function is potentially effectful (and may often have invisible output)
archive %>% function_potentially_with_another_name is not effectful (often may just clone first) (and is does not often produce invisible output)
merge should not have side effects on the first argument as it does now

ryantibs · 2022-04-15T13:54:19Z

@lcbrooks Yes I'm currently in favor of the alternative approach. And I added something to #64 that is also consistent with this.

So I think it currently only remains to change merge() and epix_merge() to make them abide by the new philosophy. To be clear, here's what I'm thinking:

x$merge(y) should merge x and y, and overwrite x with the result.
epix_merge(x, y) should merge x and y, and return new epi_archive with the result, whose data table is a NEW object. That is, x and y are completely unchanged (as are their underlying data tables).

Thoughts?

brookslogan · 2022-08-01T19:01:26Z

I believe the "alternative approach" was completed for the currently available operations in #101. Any further discussion / revisiting is moved to #181.

brookslogan added the P2 low priority label Apr 13, 2022

dshemetov added P0 high priority and removed P2 low priority labels Apr 18, 2022

ryantibs mentioned this issue May 1, 2022

group_by() for epi_archive objects #64

Closed

brookslogan changed the title ~~Change epi_archive to not have reference semantics~~ Change what epi_archive operations have reference semantics May 9, 2022

brookslogan self-assigned this May 9, 2022

brookslogan changed the title ~~Change what epi_archive operations have reference semantics~~ Change which epi_archive operations have reference semantics May 10, 2022

brookslogan added the op-semantics Operational semantics; many potentially breaking changes here label May 31, 2022

ryantibs mentioned this issue Jul 18, 2022

Clarify naming and fix defaults: epix_slide is over versions #146

Closed

7 tasks

brookslogan mentioned this issue Aug 1, 2022

Revisit #67: are we happy with current reference semantics scheme #181

Closed

brookslogan closed this as completed Aug 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change which epi_archive operations have reference semantics #67

Change which epi_archive operations have reference semantics #67

brookslogan commented Apr 13, 2022 •

edited

Loading

brookslogan commented Apr 13, 2022

brookslogan commented Apr 14, 2022

ryantibs commented Apr 15, 2022

brookslogan commented Aug 1, 2022 •

edited

Loading

Change which epi_archive operations have reference semantics #67

Change which epi_archive operations have reference semantics #67

Comments

brookslogan commented Apr 13, 2022 • edited Loading

brookslogan commented Apr 13, 2022

brookslogan commented Apr 14, 2022

ryantibs commented Apr 15, 2022

brookslogan commented Aug 1, 2022 • edited Loading

brookslogan commented Apr 13, 2022 •

edited

Loading

brookslogan commented Aug 1, 2022 •

edited

Loading