Practical advice Real survey data is messy Distance sampling in the - - PowerPoint PPT Presentation

practical advice real survey data is messy distance
SMART_READER_LITE
LIVE PREVIEW

Practical advice Real survey data is messy Distance sampling in the - - PowerPoint PPT Presentation

Practical advice Real survey data is messy Distance sampling in the Real World We've talked a lot about models We've also talked about assumptions Our example is relatively well-behaved What can we do about all the nasty real world stuff?


slide-1
SLIDE 1

Practical advice

slide-2
SLIDE 2

Real survey data is messy

slide-3
SLIDE 3

Distance sampling in the Real World

We've talked a lot about models We've also talked about assumptions Our example is relatively well-behaved What can we do about all the nasty real world stuff?

slide-4
SLIDE 4

Some days...

slide-5
SLIDE 5

Aims

Here we want to cover common questions Not definitive answers Some guidance on where to look for answers

slide-6
SLIDE 6

What should my sample size be?

slide-7
SLIDE 7

What do we mean by "sample size"?

Number of animal (groups) recorded detection function Number of segments spatial model Number of segments with observations spatial model

slide-8
SLIDE 8

Re-frame

slide-9
SLIDE 9

How would we know when we have enough samples?

We don't Heavily context-dependent Go back to assumptions

slide-10
SLIDE 10

"How many data?"

slide-11
SLIDE 11

Pilot studies and "you get what you pay for"

Designing surveys is hard Designing surveys is essential Better to fail one season than fail for 5, 10 years Get information early, get it cheap Inform design from a pilot study

slide-12
SLIDE 12

Avoiding rules of thumb

Think about assumptions Detection function Spatial model Think about design Spatial coverage Covariate coverage

slide-13
SLIDE 13

Spatial coverage (IWC POWER)

slide-14
SLIDE 14

Covariate coverage

slide-15
SLIDE 15

Sometimes things are complicated

Weather has a big effect on detectability Need to record during survey Disambiguate between distribution/detectability Potential confounding can be BAD

slide-16
SLIDE 16

Visibility during POWER 2014

Thanks to Hiroto Murase and co. for this data!

slide-17
SLIDE 17

Covariates can make a big difference!

slide-18
SLIDE 18

Disappointment

Sometimes you don't have enough data Or, enough coverage Or, the right covariates

Sometimes, you can't build a spatial model

slide-19
SLIDE 19

@kitabet

slide-20
SLIDE 20

"Which of options X, Y, Z is correct?"

slide-21
SLIDE 21

Alternatives problem

When faced with options, try them. Where does the sensitivity lie? What's really going on? What is your objective?

slide-22
SLIDE 22

"How big should our segments be?"

slide-23
SLIDE 23

Segment size

If you think it's an issue test it Resolution of covariates also important Maybe species-/domain-dependent? (Solutions on the horizon to avoid this)

slide-24
SLIDE 24

"Is our model right?"

slide-25
SLIDE 25

Model validation

Some variety of cross-validation Temporal replication Leave out 1 year, fit to others, predict, assess Spatial “pseudo-jackknife” Leave out every segment, refit, … (Maybe leave out 2, 3 etc…)

nth

slide-26
SLIDE 26

Modelling philosophy

slide-27
SLIDE 27

Which covariates should we include?

Dynamic vs static variables Spatial terms? Habitat models?

slide-28
SLIDE 28

Getting help

slide-29
SLIDE 29

Resources

Bibliography has pointers to these topics Distance sampling Google Group Friendly, helpful, low traffic see distancesampling.org/distancelist.html

slide-30
SLIDE 30

Advanced topics

slide-31
SLIDE 31

This is a whirlwind tour...

slide-32
SLIDE 32

...and some of this is experimental

slide-33
SLIDE 33

Smoother zoo

slide-34
SLIDE 34

Cyclic smooths

What if things “wrap around”? (Time, angles, …) Match value and derivative Use bs="cc" See ?smooth.construct.cs.smooth.spec

slide-35
SLIDE 35

Smoothing in complex regions

Edges are important Whales don't live on land Bad things happen when we don't account for this Include boundary info in smoother ?soap

slide-36
SLIDE 36

Multivariate smooths

Thin plate splines are isotropic 1 unit in any direction is equal Fine for space, not for other things

slide-37
SLIDE 37

Tensor products

As many covariates as you like! (But takes time) te() or ti() (instead of s())

(x,z) = (x) (z) sx,z ∑k1 ∑k2 βksx sz

slide-38
SLIDE 38

Black bears like to sunbathe

slide-39
SLIDE 39

Random effects

normal random effects exploits equivalence of random effects and splines ? gam.vcomp useful when you just have a “few” random effects ?random.effects

slide-40
SLIDE 40

Making things faster

slide-41
SLIDE 41

Parallel processing

Some models are very big/slow Run on multiple cores Use engine="bam"! Some constraints in what you can do Wood, Goude and Shaw (2015)

slide-42
SLIDE 42

Summary

Lots of complicated problems Lots of potential solutions (see also “other approaches” mini-lecture) Need to get simple things right first Trade assumptions for data