/heatwaveR

This GitHub repo contains all of the code for the heatwaveR package.

Primary LanguageROtherNOASSERTION

heatwaveR

CRAN_Status_Badge R-CMD-check Codecov test coverage JOSS DOI Downloads

The heatwaveR package is a project-wide update to the RmarineHeatWaves package, which is itself a translation of the original Python code written by Eric C. J. Oliver. The heatwaveR package also uses the same naming conventions for objects, columns, and arguments as the Python code.

The heatwaveR R package contains the original functions from the RmarineHeatWaves package that calculate and display marine heatwaves (MHWs) according to the definition of Hobday et al. (2016) as well as calculating and visualising marine cold-spells (MCSs) as first introduced in Schlegel et al. (2017a). It also contains the functionality to calculate the categories of MHWs as outlined in Hobday et al. (2018).

This package does what RmarineHeatWaves does, but faster. The entire package has been deconstructed and modularised, and we are continuing to implement slow portions of the code in C++. This has alleviated the bottlenecks that slowed down the climatology creation portions of the code as well as generally creating an overall increase in the speed of the calculations. Currently the R code runs about as fast as the original python functions, at least in as far as applying it to single time series of temperatures.

Readers familiar with both languages will know about the ongoing debate around the relative speed of the two languages. In our experience, R can be as fast as python, provided that attention is paid to finding ways to reduce the computational inefficiencies that stem from i) the liberal use of complex and inefficient non-atomic data structures, such as data frames; ii) the reliance on non-vectorised calculations such as loops; and iii) lazy (but convenient) coding that comes from drawing too heavily on the tidyverse suite of packages. We will continue to ensure that heatwaveR becomes more-and-more efficient so that it can be applied to large gridded data products with ease. To that end, the extension package heatwave3 has been developed. This helps the user to apply the code from heatwaveR directly onto their NetCDF and other 3D gridded data files.

heatwaveR was also developed and released in order to better accommodate the inclusion of the definitions of atmospheric heatwaves in addition to MHWs. Additionally, heatwaveR also provides the first implementation of a definition for a ‘compound heatwave’. There are currently multiple different definitions for this type of event and each of which has arguments provided for it within the ts2clm() and detect_event() functions.

This package may be installed from CRAN by typing the following command into the console:

install.packages("heatwaveR")

Or the development version may be installed from GitHub with:

devtools::install_github("robwschlegel/heatwaveR")

The functions

Function Description
ts2clm() Constructs seasonal and threshold climatologies as per the definition of Hobday et al. (2016).
ts2clm3() As ts2clm() but built entirely on data.table for an up to 200% speed improvement
detect_event() The main function which detects the events as per the definition of Hobday et al. (2016).
detect_event3() As detect_event() but with data.table internals (not yet for category classification–to follow)
block_average() Calculates annual means for event metrics.
category() Applies event categories to the output of detect_event() based on Hobday et al. (2018).
exceedance() A function similar to detect_event() but that detects consecutive days above/below a given static threshold.
event_line() Creates a time series line graph of the heatwave or cold-spell results from detect_event().
lolli_plot() Creates a lolliplot time series of a selected event metric from the results generated by detect_event().
geom_flame() Creates flame polygons of heatwaves or cold-spells from a time series.
geom_lolli() Creates lolliplots from a time series of a selected event metric.

The package also provides data of observed SST records for three historical MHWs: the 2011 Western Australia event, the 2012 Northwest Atlantic event, and the 2003 Mediterranean event.

The heatwave metrics

The detect_event() function will return a list of two tibbles (see the tidyverse), climatology and event, which are the time series climatology and MHW (or MCS) events, respectively. The climatology contains the full time series of daily temperatures, as well as the the seasonal climatology, the threshold and various aspects of the events that were detected. The software was designed for detecting extreme thermal events, and the units specified below reflect that intended purpose. However, various other kinds of extreme events (e.g. rainfall) may be detected according to the ‘heatwave’ specifications, and if that is the case, the appropriate minDuration etc. and units of measurement need to be determined by the user.

Climatology metric Description
doy Julian day (day-of-year). For non-leap years it runs 1…59 and 61…366, while leap years run 1…366. This column will be named differently if another name was specified to the doy argument.
t The date of the temperature measurement. This column will be named differently if another name was specified to the x argument.
temp If the software was used for the purpose for which it was designed, seawater temperature (deg. C) on the specified date will be returned. This column will of course be named differently if another kind of measurement was specified to the y argument.
seas Climatological seasonal cycle (deg. C).
thresh Seasonally varying threshold (e.g., 90th percentile) (deg. C).
var Variance (standard deviation) per doy of temp (deg. C). (not returned by default as of v0.3.5)
threshCriterion Boolean indicating if temp exceeds thresh.
durationCriterion Boolean indicating whether periods of consecutive threshCriterion are >= minDuration.
event Boolean indicating if all criteria that define a MHW or MCS are met.
event_no A sequential number indicating the ID and order of occurrence of the MHWs or MCSs.

The events are summarised using a range of event metrics:

Event metric Description
event_no A sequential number indicating the ID and order of the events. This allows one to match/join results between the climatology and event outputs.
index_start Row number from the given time series where the event starts.
index_peak Row number from the given time series where the event peaks.
index_end Row number from the given time series where the event ends.
duration Duration of event (days).
date_start Start date of event (date).
date_peak Date of event peak (date).
date_end End date of event (date).
intensity_mean Mean intensity (deg. C).
intensity_max Maximum (peak) intensity (deg. C).
intensity_var Intensity variability (standard deviation) (deg. C).
intensity_cumulative Cumulative intensity (deg. C x days).
rate_onset Onset rate of event (deg. C / day).
rate_decline Decline rate of event (deg. C / day).

intensity_max_relThresh, intensity_mean_relThresh, intensity_var_relThresh, and intensity_cumulative_relThresh are as above except relative to the threshold (e.g., 90th percentile) rather than the seasonal climatology.

intensity_max_abs, intensity_mean_abs, intensity_var_abs, and intensity_cumulative_abs are as above except as absolute magnitudes rather than relative to the seasonal climatology or threshold.

Note that rate_onset and rate_decline will return NA when the event begins/ends on the first/last day of the time series. This may be particularly evident when the function is applied to large gridded data sets. Although the other metrics do not contain any errors and provide sensible values, please take this into account in the interpretation of the output. It must also be noted that events whose date_peak occur on the same day as the date_start or date_end of the event will return small negative values. This tends to only occur in areas with persistent ice cover. The authors are currently thinking about how best to handle this exception.

The Vignettes

For detailed explanations and walkthroughs on the use of the heatwaveR package please click on the Vignettes tab in the toolbar above, or follow the links below:

The Marine Heatwave Tracker

To see the heatwaveR package in action, check out the Marine Heatwave Tracker website. This is a daily updating global analysis of where in the world marine heatwaves are occurring. It has near real-time information as well as historic data going back to January 1st, 1982 and uses the Hobday et al. (2018) colour scheme to show how intense the MHWs are.

Contributing to heatwaveR

To contribute to the package please follow the guidelines here.

Please use this link to report any bugs found.

Citing heatwaveR

Because heatwaveR is and always will be free to use open source software, its citation in scientific literature and other sources is the primary metric through which the continued development of this package is motivated for. Therefore, if the heatwaveR package is used in any analyses please acknowledge this through the following citation:

Robert W. Schlegel and Albertus J. Smit (2018). heatwaveR: A central algorithm for the detection of heatwaves and cold-spells. Journal of Open Source Software, 3(27), 821, https://doi.org/10.21105/joss.00821

The BibTeX citation may be accessed in R with:

citation("heatwaveR")

For a list of sources that have cited heatwaveR see the Citations tab in the toolbar at the top of this page. If you do not see your publication in the list of citations and would like it added please contact the developer (see below).

References

Hobday, A.J. et al. (2016). A hierarchical approach to defining marine heatwaves. Progress in Oceanography, 141, pp. 227-238.

Schlegel, R. W., Oliver, E. C. J., Wernberg, T. W., Smit, A. J. (2017a). Nearshore and offshore co-occurrences of marine heatwaves and cold-spells. Progress in Oceanography, 151, pp. 189-205.

Schlegel, R. W., Oliver, E. C., Perkins-Kirkpatrick, S., Kruger, A., Smit, A. J. (2017b). Predominant atmospheric and oceanic patterns during coastal marine heatwaves. Frontiers in Marine Science, 4, 323.

Hobday, A. J., Oliver, E. C. J., Sen Gupta, A., Benthuysen, J. A., Burrows, M. T., Donat, M. G., Holbrook, N. J., Moore, P. J., Thomsen, M. S., Wernberg, T., Smale, D. A. (2018). Categorizing and naming marine heatwaves. Oceanography 31(2).

Acknowledgements

The Python code was written by Eric C. J. Oliver.

Contributors to the Marine Heatwaves definition and its numerical implementation include Alistair J. Hobday, Lisa V. Alexander, Sarah E. Perkins, Dan A. Smale, Sandra C. Straub, Jessica Benthuysen, Michael T. Burrows, Markus G. Donat, Ming Feng, Neil J. Holbrook, Pippa J. Moore, Hillary A. Scannell, Alex Sen Gupta, and Thomas Wernberg.

The translation from Python to R was done by A. J. Smit and the graphing functions were contributed by Robert. W. Schlegel.

Contact

Robert W. Schlegel

Data Scientist

Laboratoire d’Océanographie de Villefranche-sur-Mer, LOV

Institut de la Mer de Villefranche, IMEV

robert.schlegel@imev-mer.fr