Model checking

class: title-slide, inverse, center, middle

# Lecture 4: Model checking
<div style="position: absolute; bottom: 15px; vertical-align: center; left: 10px">
<img src="images/02-foundation-vertical-white.png" height="200">
</div>

---

class: inverse, middle, center

## *"perhaps the most important part of applied statistical modelling"*

### Simon Wood

---
# Model checking

- As with detection functions, checking is important
- Checking *doesn't* mean your model is **right**
- Want to know the model conforms to assumptions
- What assumptions should we check?

---
class: inverse, middle, center
# Convergence

---
# Convergence

- Fitting the GAM involves an optimization
- By default this is REstricted Maximum Likelihood (REML) score
- Sometimes this can go wrong
- R will warn you!

---
# A model that converges

```r
gam.check(dsm_tw_xy_depth)
```

```
## 
## Method: REML   Optimizer: outer newton
## full convergence after 7 iterations.
## Gradient range [-3.456333e-05,1.051004e-05]
## (score 374.7249 & scale 4.172176).
## Hessian positive definite, eigenvalue range [1.179219,301.267].
## Model rank =  39 / 39 
## 
## Basis dimension (k) checking results. Low p-value (k-index<1) may
## indicate that k is too low, especially if edf is close to k'.
## 
##             k'   edf k-index p-value    
## s(x,y)   29.00 11.11    0.65  <2e-16 ***
## s(Depth)  9.00  3.84    0.81    0.37    
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
```

---
# A bad model

```
Error in while (mean(ldxx/(ldxx + ldss)) > 0.4) { :
  missing value where TRUE/FALSE needed
In addition: Warning message:
In sqrt(w) : NaNs produced
Error in while (mean(ldxx/(ldxx + ldss)) > 0.4) { :
  missing value where TRUE/FALSE needed
```

This is **rare**

---
class: inverse, middle, center
# The Folk Theorem of Statistical Computing

## *"most statistical computational problems are due not to the algorithm being used but rather the model itself"*

### Andrew Gelman

---
# Folk Theorem anecdata

- Often if there are fitting problems, you're asking too much from your data
- Model is too complicated
- Too little data
- Try something simpler, see what happens

---
class: inverse, middle, center
# Basis size

---
# Basis size (k)

- Set `k` per term
- e.g. `s(x, k=10)` or `s(x, y, k=100)`
- Penalty removes "extra" wigglyness
  - *up to a point!*
- (But computation is slower with bigger `k`)

---
# Checking basis size

```r
gam.check(dsm_x_tw)
```

```
## 
## Method: REML   Optimizer: outer newton
## full convergence after 7 iterations.
## Gradient range [-3.196351e-06,4.485625e-07]
## (score 409.936 & scale 6.041307).
## Hessian positive definite, eigenvalue range [0.7645492,302.127].
## Model rank =  10 / 10 
## 
## Basis dimension (k) checking results. Low p-value (k-index<1) may
## indicate that k is too low, especially if edf is close to k'.
## 
##        k'  edf k-index p-value
## s(x) 9.00 4.96    0.76    0.38
```

---
# Increasing basis size

```r
dsm_x_tw_k <- dsm(count~s(x, k=20), ddf.obj=df,
                  segment.data=segs, observation.data=obs,
                  family=tw())
gam.check(dsm_x_tw_k)
```

```
## 
## Method: REML   Optimizer: outer newton
## full convergence after 7 iterations.
## Gradient range [-2.30124e-08,3.930703e-09]
## (score 409.9245 & scale 6.033913).
## Hessian positive definite, eigenvalue range [0.7678456,302.0336].
## Model rank =  20 / 20 
## 
## Basis dimension (k) checking results. Low p-value (k-index<1) may
## indicate that k is too low, especially if edf is close to k'.
## 
##         k'   edf k-index p-value
## s(x) 19.00  5.25    0.76    0.35
```

---
# Sometimes basis size isn't the issue...

- Generally, double `k` and see what happens
- Didn't increase the EDF much here
- Other things can cause low "`p-value`" and "`k-index`"
- Increasing `k` can cause problems

---
# k is a maximum

- Don't worry about things being too wiggly
- `k` gives the maximum complexity
- Penalty deals with the rest

![](dsm4-model_checking_files/figure-html/plotk-1.png)

---
class: inverse, middle, center
# Residuals

---
# What are residuals?

- Generally residuals = observed value - fitted value
- BUT hard to see patterns in these "raw" residuals
- Need to standardise `$\Rightarrow$` **deviance residuals**
- Expect these residuals `$\sim N(0,1)$`

---
# Why are residuals important?

- Structure in the residuals means your model didn't capture something
- Maybe a missing covariate
- Model doesn't describe the data well

---
# Residuals vs. covariates

![](dsm4-model_checking_files/figure-html/covar-resids-1.png)

---
# Residuals vs. covariates (boxplots)

![](dsm4-model_checking_files/figure-html/covar-resids-boxplot-1.png)

---
# Fitting to residuals

- Refit our model but with the residuals as response
- Response is normal (for deviance residuals)
- What pattern is left in the residuals?

---
# Example

- Example model with `NPP` and `Depth`

```r
# get data
refit_dat <- dsm_depth_npp$data
# make residuals column
refit_dat$resid <- residuals(dsm_depth_npp)
# fit a model (same model)
resid_fit <- gam(resid~s(Depth, bs="ts", k=20) +
                        s(NPP, bs="ts", k=20),
                 family=gaussian(), data=refit_dat, method="REML")
```

---
# `summary(resid_fit)`

```
## 
## Family: gaussian 
## Link function: identity 
## 
## Formula:
## resid ~ s(Depth, bs = "ts", k = 20) + s(NPP, bs = "ts", k = 20)
## 
## Parametric coefficients:
##             Estimate Std. Error t value Pr(>|t|)    
## (Intercept) -0.49454    0.03274   -15.1   <2e-16 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Approximate significance of smooth terms:
##              edf Ref.df     F p-value    
## s(Depth) 2.56621     19 1.230 4.9e-06 ***
## s(NPP)   0.03322     19 0.002   0.316    
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## R-sq.(adj) =  0.0241   Deviance explained = 2.67%
## -REML =   1362  Scale est. = 1.0174    n = 949
```

---
# What's going on there?

- Something unexplained going on?
- Maybe `Depth` + `NPP` is not enough?
  - Add other smooths (`s(x, y)`? )
- Increase `k`?

---
class: inverse, middle, center
# Other residual checking

---
# `gam.check`

![](dsm4-model_checking_files/figure-html/gamcheck-1.png)

---
# Shortcomings

- `gam.check` can be helpful
- "Resids vs. linear pred" is victim of artifacts
- Need an alternative
- "Randomised quanitle residuals"
  - `rqgam.check`
  - Exactly normal residuals

---
# Randomised quantile residuals

![](dsm4-model_checking_files/figure-html/rqgamcheck-1.png)

---
# Example of "bad" plots

![Bad residual check plot from Wood 2006](images/badgam.png)

---
# Example of "bad" plots

![Bad residual check plot from Wood 2006](images/badgam-annotate.png)

---
# Residual checks

- Looking for patterns (not artifacts)
- This can be tricky
- Need to use a mixture of techniques
- Cycle through checks, make changes recheck

---
class: inverse, middle, center
# Observed vs. expected

---
# Response vs. fitted values

- `gam.check` "response vs. fitted values"
- BUT smooths are "wrong" everywhere in particular

![](dsm4-model_checking_files/figure-html/gamcheck-response-fitted-1.png)

---
# Summarize over covariate chunks

- On average the smooth is right
- Check aggregations of count
- Here detection function has Beaufort as factor

```r
obs_exp(dsm_bad, "Beaufort_f")
```

```
##             [0,1]    (1,2]    (2,3]    (3,4]    (4,5]
## Observed  1.00000 95.45000 103.5500 34.70000 4.000000
## Expected 20.28781 54.57573 136.3581 53.98742 5.949304
```

```r
obs_exp(dsm_good, "Beaufort_f")
```

```
##           [0,1]    (1,2]    (2,3]    (3,4]    (4,5]
## Observed 1.0000 95.45000 103.5500 34.70000 4.000000
## Expected 6.8887 45.18587 118.5747 53.81458 4.909644
```

---
# Observed vs. expected for environmental covariates

- Just need to specify the cutpoints

```r
obs_exp(dsm_bad, "Depth", c(0, 1000, 2000, 3000, 4000, 6000))
```

```
##          (0,1e+03] (1e+03,2e+03] (2e+03,3e+03] (3e+03,4e+03] (4e+03,6e+03]
## Observed   4.00000      52.53333     139.16667      35.00000       8.00000
## Expected  85.65231      37.98341      63.40892      53.78726      30.32642
```

```r
obs_exp(dsm_good, "Depth", c(0, 1000, 2000, 3000, 4000, 6000))
```

```
##          (0,1e+03] (1e+03,2e+03] (2e+03,3e+03] (3e+03,4e+03] (4e+03,6e+03]
## Observed  4.000000      52.53333      139.1667      35.00000      8.000000
## Expected  5.308628      48.14915      128.7962      38.76013      8.359456
```

---
# Summary

- Convergence
  - Rarely an issue
- Basis size
  - `k` is a maximum
  - Double and see what happens
- Residuals
  - Deviance and randomised quantile
  - check for artifacts
- Observed vs. expected
  - Compare aggregate information