Modeling of indicator responses to single pressures with GAMMs

model_gamm accounts for temporal autocorrelation (TAC) in the time series by fitting Generalized Additive Mixed Models (GAMMs) that include AR or ARMA correlation structures using the gamm function. The GAMMs are applied to all IND~pressure combinations provided as input or only those with significant TAC in the GAM residuals (using filter argument).

model_gamm(
  init_tbl,
  k = 5,
  family = stats::gaussian(),
  excl_outlier = NULL,
  filter = NULL,
  lme_control = NULL
)

Arguments

init_tbl	The output tibble of the `ind_init` function.
k	Choice of knots (for the smoothing function `s`); the default is 5.
family	A description of the error distribution and link to be used in the GAM. This needs to be defined as a family function (see also `family`). All standard family functions can be used as well some of the distribution families in the mgcv package (see `family.mgcv`; e.g.`negbin`). Note that `nb`, which estimates `theta` parameter, cannot be used for `gamm`.
excl_outlier	A list of values identified as outliers in specific IND~pressure GAMMs, which should be excluded in this modeling step (the output tibble of this function includes the variable `pres_outlier`, which is a column-list containing all indices of values with cook`s distance > 1 (see below). The function can be re-run again, then excluding all these outliers provided in `$pres_outlier` from the the first run (see example)).
filter	logical; a filter used to select specific rows in init_tbl (row gets selected if value TRUE). That could be the `tac` column in the `model_gam` output tibble which indicates whether the model residuals show TAC.
lme_control	A specification list of the control values for the lme fit. The default is NULL so the control settings in `gamm` are used (list(niterEM=0,optimMethod="L-BFGS-B",returnObject=TRUE)). To overwrite these values use best the `lmeControl` function (see example).

Value

Returns a model output tibble that contains for each filtered IND~pressure pair 6 rows with the individual GAMM outputs. The structure remains the same as in model_gam except for the explained deviance, which is not computed by the gamm function. The selection of the final correlation structure for each IND~pressure model can be done manually on this tibble or with an automatized routine using select_model.

Details

Modeling first-differenced indicator time series can be an alternative solution to avoid temporal dependence between observations. However, this approach does often not help reducing the significant auto-correlation while GAMMs do as found in Otto et al. (2018). Such an extension implies that the single elements of the response variable are not independent anymore and that the correlation between the residuals at time t1 and t2 only depends on their time difference t1 – t2 (Wood, 2006).

In model_gamm six GAMMs are computed for each filtered IND~pressure pair, i.e.

no correlation structure (for AIC comparison)
auto-regressive error structure of order p=1 (AR1)
auto-regressive error structure of order p=2 (AR2)
auto-regressive moving average of order p=1 and q=1 (ARMA11)
auto-regressive moving average of order p=1 and q=2 (ARMA12)
auto-regressive moving average of order p=2 and q=1 (ARMA21)

References

Otto, S.A., Kadin, M., Casini, M., Torres, M.A., Blenckner, T. (2018) A quantitative framework for selecting and validating food web indicators. Ecological Indicators, 84: 619-631, doi: https://doi.org/10.1016/j.ecolind.2017.05.045

Wood, S.N. (2006) Generalized Additive Models: An Introduction with R. Chapman and Hall/CRC Press

Examples

# Using the Baltic Sea demo data in this package
dat_init <- ind_init(
  ind_tbl = data.frame(Cod = ind_ex$Sprat),
  press_tbl = press_ex[, c("Fsprat", "Fher")],
  time = ind_ex[ ,1])
gam_tbl <- model_gam(dat_init)
# Any temporal autocorrelation
gam_tbl$tac
#> [1] TRUE TRUE
# Applying model_gamm function and passing the $tac variable as filter
gamm_tbl <- model_gamm(dat_init, filter = gam_tbl$tac)
# Use different control values for lme fit:
 gamm_tbl <- model_gamm(
 dat_init, filter = gam_tbl$tac,
 lme_control = nlme::lmeControl(niterEM = 100, msMaxIter = 200)
 )
# \donttest{
 # run analysis will full demo dataset
 gam_tbl <- model_gam(ind_init_ex)
 gamm_tbl <- model_gamm(ind_init_ex, filter = gam_tbl$tac)
 # now run the same analysis but exclude all outliers in the GAMMs
 gamm_tbl2 <- model_gamm(ind_init_ex, filter = gam_tbl$tac,
   excl_outlier = gamm_tbl$pres_outlier)
# }