-
Notifications
You must be signed in to change notification settings - Fork 6
/
Copy pathREADME.Rmd
287 lines (217 loc) · 16.2 KB
/
README.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
---
output:
md_document:
variant: markdown_github
fig_width: 5
pandoc_args: ["--smart"]
---
```{r, echo = FALSE}
knitr::opts_chunk$set(
collapse = TRUE,
comment = "#>",
fig.path = "tools/",
fig.height = 3,
fig.width = 8
)
```
# RmarineHeatWaves
[![CRAN_Status_Badge](http://www.r-pkg.org/badges/version/RmarineHeatWaves)](https://cran.r-project.org/package=RmarineHeatWaves)
[![Travis-CI Build Status](https://travis-ci.org/ajsmit/RmarineHeatWaves.svg?branch=master)](https://travis-ci.org/ajsmit/RmarineHeatWaves)
![](https://cranlogs.r-pkg.org/badges/grand-total/RmarineHeatWaves)
The **RmarineHeatWaves** package is a translation of the original Python code written
by Eric C. J. Oliver that can be found on [GitHub](https://github.com/ecjoliver/marineHeatWaves).
This package is actively maintained, but on 2018-04-24 it was forked to a new package called [**heatwaveR**](https://robwschlegel.github.io/heatwaveR/index.html), which will supercede and eventually replace **RmarineHeatWaves** in the near future. For continuity, we are ensuring that the functionality in **heatwaveR** will be exactly the same in **RmarineHeatWaves**, and for a while the two packages will be maintained in sync, at least as far as developments around the existing code-base are concerned. New functions will only be added to **heatwaveR**.
The **RmarineHeatWaves** R package contains a number of functions which calculate
and display marine heat waves according to the definition of Hobday et al. (2016).
The marine cold spell option was implemented in version 0.13 (21 Nov 2015) of the
Python module as a result of the preparation of Schlegel et al. (2017), wherein
the cold events are introduced and briefly discussed.
This package may be found on [CRAN](https://cran.r-project.org/package=RmarineHeatWaves).
Alternatively, you can install it from GitHub by issuing the following command:
`devtools::install_github("ajsmit/RmarineHeatWaves")`
## The functions
|Function |Description|
|---------------------|-----------|
|`detect()` |The main function which detects the events as per the definition of Hobday et al. (2016).|
|`make_whole()` |Constructs a continuous, uninterrupted time series of temperatures.|
|`block_average()` |Calculates annual means for event metrics.|
|`event_line()` |Creates a line plot of marine heat waves or cold spells.|
|`lolli_plot()` |Creates a timeline of selected event metrics.|
|`exceedance()` |A function similar to `detect()` but that detects consecutive days above/below a given threshold.|
|`geom_flame()` |Creates flame polygons of marine heat waves or cold spells.|
|`geom_lolli()` |Creates a lolliplot timeline of selected event metric.|
The package also provides data of observed SST records for three historical MHWs:
the 2011 Western Australia event, the 2012 Northwest Atlantic event and the
2003 Mediterranean event.
## The heat wave metrics
The function will return a list of two tibbles (see the 'tidyverse'), `clim` and `event`, which are the climatology and MHW (or MCS) events, respectively. The climatology contains the full time series of daily temperatures, as well as the the seasonal climatology, the threshold and various aspects of the events that were detected. The software was designed for detecting extreme thermal events, and the units specified below reflect that intended purpose. However, the various other kinds of extreme events may be detected according to the 'marine heat wave' specifications, and if that is the case, the appropriate units need to be determined by the user.
| Climatology metric | Description |
|--------------------|-------------|
|`doy` | Julian day (day-of-year). For non-leap years it runs 1...59 and 61...366, while leap years run 1...366. This column will be named differently if another name was specified to the `doy` argument. |
|`t` | The date of the temperature measurement. This column will be named differently if another name was specified to the `x` argument. |
|`temp` | If the software was used for the purpose for which it was designed, seawater temperature [deg. C] on the specified date will be returned. This column will of course be named differently if another kind of measurement was specified to the `y` argument. |
|`seas_clim_year` | Climatological seasonal cycle [deg. C]. |
|`thresh_clim_year` | Seasonally varying threshold (e.g., 90th percentile) [deg. C]. |
|`var_clim_year` | Seasonally varying variance (standard deviation) [deg. C]. |
|`thresh_criterion` | Boolean indicating if `temp` exceeds `thresh_clim_year`. |
|`duration_criterion` | Boolean indicating whether periods of consecutive `thresh_criterion` are >= `min_duration`. |
|`event` | Boolean indicating if all criteria that define a MHW or MCS are met. |
|`event_no` | A sequential number indicating the ID and order of occurence of the MHWs or MCSs. |
The events are summarised using a range of event metrics:
| Event metric | Description |
|--------------|-------------|
|`index_start` | Start index of event. |
|`index_stop` | Stop index of event. |
|`event_no` | A sequential number indicating the ID and order of the events. |
|`duration` | Duration of event [days]. |
|`date_start` | Start date of event [date]. |
|`date_stop` | Stop date of event [date]. |
|`date_peak` | Date of event peak [date]. |
|`int_mean` | Mean intensity [deg. C]. |
|`int_max` | Maximum (peak) intensity [deg. C]. |
|`int_var` | Intensity variability (standard deviation) [deg. C]. |
|`int_cum` | Cumulative intensity [deg. C x days]. |
|`rate_onset` | Onset rate of event [deg. C / day]. |
|`rate_decline` | Decline rate of event [deg. C / day]. |
`int_max_rel_thresh`, `int_mean_rel_thresh`, `int_var_rel_thresh`, and `int_cum_rel_thresh` are as above except relative to the threshold (e.g., 90th percentile) rather than the seasonal climatology.
`int_max_abs`, `int_mean_abs`, `int_var_abs`, and `int_cum_abs` are as above except as absolute magnitudes rather than relative to the seasonal climatology or threshold.
`int_max_norm` and `int_mean_norm` are as above except units are in multiples of threshold exceedances, i.e., a value of 1.5 indicates the event intensity (relative to the climatology) was 1.5 times the value of the threshold (relative to climatology, i.e., threshold - climatology.)
Note that `rate_onset` and `rate_decline` will return `NA` when the event begins/ends on the first/last day of the time series. This may be particularly evident when the function is applied to large gridded data sets. Although the other metrics do not contain any errors and provide sensible values, please take this into account in its interpretation.
## Examples of detection and graphing
The `detect()` function is the package's core function. Here is the `detect()` function applied to the Western Australian test data,
which are also discussed by Hobday et al. (2016):
```{r detect-example1, message = FALSE, warning = FALSE}
library(RmarineHeatWaves); library(plyr); library(dplyr); library(ggplot2)
ts <- make_whole(sst_WA)
mhw <- detect(ts, climatology_start = "1983-01-01", climatology_end = "2012-12-31")
mhw$event %>%
dplyr::ungroup() %>%
dplyr::select(event_no, duration, date_start, date_peak, int_mean, int_max, int_cum) %>%
dplyr::arrange(-int_cum) %>%
head(5)
```
The corresponding `event_line()` and `lolli_plot()`, which represent the massive Western Australian heatwave of 2011, look like this:
```{r fig-example1, fig.height = 3, fig.width = 8, eval = FALSE}
event_line(mhw, spread = 200, metric = "int_cum",
start_date = "2010-10-01", end_date = "2011-08-30")
```
![](tools/fig-example1-1.png)
```{r fig-example2, fig.height = 3, fig.width = 8, eval = FALSE}
lolli_plot(mhw)
```
![](tools/fig-example2-1.png)
The `event_line()` and `lolli_plot()` functions were designed to work directly on one of the list returned by `detect()`. If more control over the figures is required, it may be useful to create them in **ggplot2** by stacking 'geoms'. We specifically created two new **ggplot2** geoms to reproduce the functionality of `event_line()` and `lolli_plot()`. These functions are more general in their functionality and can be used outside of the **RmarineHeatWave** package too. To apply them to MHWs and MCSs, they require that we access the `clim` or `event` data frames within the list that is produced by `detect()`. Here is how:
```{r fig-example3, eval = FALSE}
mhw2 <- mhw$clim # find the climatology dataframe
mhw2 <- mhw2[10580:10690,] # identify the region of the time series of interest
ggplot(mhw2, aes(x = t, y = temp, y2 = thresh_clim_year)) +
geom_flame() +
geom_text(aes(x = as.Date("2011-02-01"), y = 28, label = "The MHW that launched\na thousand papers."))
ggplot(mhw$event, aes(x = date_start, y = int_max)) +
geom_lolli(colour = "salmon", colour.n = "red", n = 3) +
geom_text(aes(x = as.Date("2006-10-01"), y = 5,
label = "The distribution of events\nis skewed towards the\nend of the time series."),
colour = "black")
```
The default output of these function may not be to your liking. If so, not to worry. As **ggplot2** geoms, they are highly maleable. For example, if we were to choose to reproduce the format of the MHWs as seen in Hobday et al. (2016), the code would look something like this:
```{r fig-example4, warning = FALSE, eval = FALSE}
# It is necessary to give geom_flame() at least one row on either side of the event in order to calculate the polygon corners smoothly
mhw_top <- mhw2[49:110,]
ggplot(data = mhw2, aes(x = t)) +
geom_flame(aes(y = temp, y2 = thresh_clim_year, fill = "all"), show.legend = T) +
geom_flame(data = mhw_top, aes(y = temp, y2 = thresh_clim_year, fill = "top"), show.legend = T) +
geom_line(aes(y = temp, colour = "temp")) +
geom_line(aes(y = thresh_clim_year, colour = "thresh"), size = 1.0) +
geom_line(aes(y = seas_clim_year, colour = "seas"), size = 1.2) +
scale_colour_manual(name = "Line Colour",
values = c("temp" = "black", "thresh" = "forestgreen", "seas" = "grey80")) +
scale_fill_manual(name = "Event Colour", values = c("all" = "salmon", "top" = "red")) +
guides(colour = guide_legend(override.aes = list(fill = NA))) +
xlab("Date") +
ylab(expression(paste("Temperature [", degree, "C]")))
```
![](tools/fig-example4-1.png)
Conversely, should we not wish to highlight any events with `geom_lolli()`, it would look like this:
```{r fig-example5, fig.height = 3, fig.width = 8, warning = FALSE, eval = FALSE}
# Note that this is accomplished by setting 'colour.n = NA', not by setting 'n = 0'.
ggplot(mhw$event, aes(x = date_start, y = int_cum)) +
geom_lolli(colour = "salmon", n = 3, colour.n = NA)
```
![](tools/fig-example5-1.png)
The calculation and visualisation of marine cold spells is also accommodated within this package. Here is a cold spell detected in the OISST data for Western Australia:
```{r detect-example2, message = FALSE, warning = FALSE}
mcs <- detect(ts, climatology_start = "1983-01-01", climatology_end = "2012-12-31", cold_spells = TRUE)
mcs$event %>%
dplyr::ungroup() %>%
dplyr::select(event_no, duration, date_start, date_peak, int_mean, int_max, int_cum) %>%
dplyr::arrange(int_cum) %>%
head(5)
```
The plots showing the marine cold spells look like this:
```{r fig-example6, fig.height = 3, fig.width = 8, eval = FALSE}
# this function needs to be updated as it currently only works when 't' is the column name for date
event_line(mcs, spread = 200, metric = "int_cum",
start_date = "1990-01-01", end_date = "1990-08-30")
lolli_plot(mcs)
```
![](tools/fig-example6-1.png)
![](tools/fig-example6-2.png)
Cold spell figures may be created as geoms in **ggplot2**, too:
```{r fig-example7, eval = FALSE}
mcs2 <- mcs$clim
mcs2 <- mcs2[2990:3190,]
# # Note that the plot centres on the polygons, so it may be necessary to manually zoom out a bit
ggplot(data = mcs2, aes(x = t)) +
geom_flame(aes(y = thresh_clim_year, y2 = temp), fill = "steelblue3", show.legend = F) +
geom_line(aes(y = temp, colour = "temp")) +
geom_line(aes(y = thresh_clim_year, colour = "thresh"), size = 1.0) +
geom_line(aes(y = seas_clim_year, colour = "seas"), size = 1.2) +
scale_colour_manual(name = "Line Colour",
values = c("temp" = "black", "thresh" = "forestgreen", "seas" = "grey80")) +
scale_y_continuous(limits = c(18, 23.5)) +
xlab("Date") +
ylab(expression(paste("Temperature [", degree, "C]")))
ggplot(mcs$event, aes(x = date_start, y = int_cum)) +
geom_lolli(colour = "steelblue3", colour.n = "navy", n = 7) +
xlab("Date") +
ylab(expression(paste("Cumulative intensity [days x ", degree, "C]")))
```
![](tools/fig-example7-1.png)
![](tools/fig-example7-2.png)
## The exceedance function
In addition to the calculation of extreme events, consecutive days over a given static threshold may be calculated with the `exceedance()` function.
```{r exceedance-example1, message = FALSE, warning = FALSE}
exc <- exceedance(ts, threshold = 25)
exc$exceedance %>%
ungroup() %>%
select(exceedance_no, duration, date_start, date_peak, int_mean, int_cum) %>%
dplyr::arrange(-int_cum) %>%
head(5)
```
The same function may be used to calculate consecutive days below a threshold, too.
```{r exceedance-example2, message = FALSE, warning = FALSE}
exc <- exceedance(ts, threshold = 19, below = TRUE)
exc$exceedance %>%
dplyr::ungroup() %>%
dplyr::select(exceedance_no, duration, date_start, date_peak, int_mean, int_cum) %>%
dplyr::arrange(int_cum) %>%
head(5)
```
## Working with gridded SST data
We can also load the gridded 0.25 degree Reynolds [OISST data](https://www.ncei.noaa.gov/thredds/blended-global/oisst-catalog.html) and apply the function pixel by pixel over all of the days of data. The example data used here have 93 longitude steps, 43 latitude steps, and cover 12797 days (1981 to 2016). We apply the `detect()` function to these data, fit a generalised linear model (GLM), and then plot the trend per decade of the marine heatwave count. In other words, have marine heatwaves become more or less frequent in recent years? Under climate change we can expect that extreme events would tend to occur more frequently and be of greater intensity. Indeed, we can clearly see in the figure below of the result of the GLM, how the Agulhas Current has been experiencing marine heat waves more frequently in recent decades. But there are two smaller areas, one along the western side of the Cape Peninsula in the Benguela Upwelling system and another around the Eastern Cape Province near Algoa Bay, where the frequency of marine heat waves seems to have actually been decreasing -- although the *P*-value of the decreasing trend is > 0.05, and therefore not significant.
![Count-trend](vignettes/README-grid-example1.png)
![P-of-trend](vignettes/README-grid-example2.png)
Please read the package [vignette](https://github.com/ajsmit/RmarineHeatWaves/blob/master/vignettes/gridded-event-detection.Rmd) to see how to load a netCDF file with the OISST data, apply the RmarineHeatWaves function to the whole 3D array of data, and then fit the GLM and plot the data.
# References
Hobday, A.J. et al. (2016). A hierarchical approach to defining marine heatwaves, Progress in Oceanography, 141, pp. 227-238.
Schlegel, R. W., Oliver, E. C. J., Wernberg, T. W., Smit, A. J. (2017). Coastal and offshore co-occurrences of marine heatwaves and cold-spells. Progress in Oceanography, 151, pp. 189-205.
Schlegel, R. W., Oliver, E. C., Perkins-Kirkpatrick, S., Kruger, A., Smit, A. J. (2017). Predominant atmospheric and oceanic patterns during coastal marine heatwaves. Frontiers in Marine Science, 4, 323.
# Acknowledgements
The Python code was written by Eric C. J. Oliver.
Contributors to the Marine Heatwaves definition and its numerical implementation include Alistair J. Hobday, Lisa V. Alexander, Sarah E. Perkins, Dan A. Smale, Sandra C. Straub, Jessica Benthuysen, Michael T. Burrows, Markus G. Donat, Ming Feng, Neil J. Holbrook, Pippa J. Moore, Hillary A. Scannell, Alex Sen Gupta, and Thomas Wernberg.
The translation from Python to R was done by A. J. Smit and the graphing functions were contributed to by Robert. W. Schlegel.
# Contact
A. J. Smit
Department for Biodiversity & Conservation Biology, University of the Western Cape,
Private Bag X17, Bellville 7535, South Africa, E-mail: [email protected],
Work tel.: +27 (0)21 959 3783