Additional text and code is highlighted using boxes like this.

We now want to plot the predictions for the 8 models that we have so far (4 negative binomial, 4 Tweedie; 4 spatial only, 4 with environmental covariats; 4 with bivariate smooths, 4 with additive spatial effects). Duplicating the above code is a bit tiresome and can be prone to errors, so let's nerd-out pretty heavily for this solution and show how we can use the `ldply()` function from `plyr` to do the same task many times. ```{r predsp-allplot, fig.width=15, fig.height=7} # make a function that makes the predictions, adds them to a column named Nhat # and adds a column called "model" that stores the model name, then returns the # data.frame. make_pred_dat <- function(model_name, predgrid){ # we use get() here to grab the object with the name of its argument predgrid[["Nhat"]] <- predict(get(model_name), predgrid) predgrid[["model"]] <- model_name return(predgrid) } # load plyr and apply to a list of the names of the models, make_pred_dat returns # a data.frame (hence this is an "ld" function: list->data.frame) that it then binds # together library(plyr) big_predgrid <- ldply(list("dsm_nb_xy", "dsm_nb_x_y", "dsm_nb_xy_ms", "dsm_nb_x_y_ms", "dsm_tw_xy", "dsm_tw_x_y", "dsm_tw_xy_ms", "dsm_tw_x_y_ms"), make_pred_dat, predgrid=predgrid_plot) # make the plot, facetting using the model column p <- ggplot(big_predgrid) + geom_tile(aes(x=x, y=y, fill=Nhat, width=10*1000, height=10*1000)) + coord_equal() + facet_wrap(~model, nrow=2)+ labs(fill="Density")+ scale_fill_viridis() print(p) ``` Note here that the `_ms` models have the environmental covariates, the others are spatial-only. We can also use `plyr` to help calculate overall abundance... ```{r Nhat-calc} # ddply will apply summarize (which in turn sums the Nhat column) to the subsets of # the data defined by model (i.e. each model) Nhat_results <- ddply(big_predgrid, .(model), summarize, Nhat=sum(Nhat)) ``` We can use our friend `kable` to make a nice table of this information:

Fitting a quasi-Poisson model (and doing a quick bit of term selection): ```{r qp-xy} # load data load("df-models.RData") load("sperm-data.RData") obs <- obs[obs$distance <= df_hn$ddf$meta.data$width,] ``` ```{r} dsm_qp_xy_ms <- dsm(count~s(x,y, bs="ts") + s(Depth, bs="ts") + #s(DistToCAS, bs="ts") + # 3 s(SST, bs="ts", k=18), # + # 1 increase basis complexity #s(EKE, bs="ts") + # 2 #s(NPP, bs="ts"), # 4 df_hn, segs, obs, family=quasipoisson()) summary(dsm_qp_xy_ms) ``` Now predicting abundance: ```{r qp-pred} pp_qp <- predict(dsm_qp_xy_ms, predgrid) sum(pp_qp, na.rm=TRUE) ``` And we can make a map: ```{r predsp-qp} predgrid$Nhat_qp_xy <- pp_qp predgrid_plot <- predgrid[!is.na(predgrid$Depth),] # plot! p <- ggplot(predgrid_plot) + geom_tile(aes(x=x, y=y, fill=Nhat_qp_xy, width=10*1000, height=10*1000)) + coord_equal() + labs(fill="Density")+ scale_fill_viridis() print(p) ``` The plot and prediction of abundance are very different from what we've seen above. Note again the distributional issues highlighted in the Q-Q plot for this model: ```{r qp-qqplot} qq.gam(dsm_qp_xy_ms) ```

See above!