Nick Isaac Arco van Strien, Tom August, Gary Powney & David Roy - - PowerPoint PPT Presentation

nick isaac
SMART_READER_LITE
LIVE PREVIEW

Nick Isaac Arco van Strien, Tom August, Gary Powney & David Roy - - PowerPoint PPT Presentation

What can be done to remove biases in volunteer-gathered biological records? Nick Isaac Arco van Strien, Tom August, Gary Powney & David Roy Talk Outline The Problem 120 Solutions? 100 Testing the Solutions Number of sites 80


slide-1
SLIDE 1

What can be done to remove biases in volunteer-gathered biological records?

Nick Isaac

Arco van Strien, Tom August, Gary Powney & David Roy

slide-2
SLIDE 2

Talk Outline

  • The Problem
  • Solutions?
  • Testing the Solutions
  • The Way Ahead
  • Tools
  • Applications

20 40 60 80 100 120 1970 1980 1990 2000 2010

Number of sites

Year

slide-3
SLIDE 3

Problem: ad hoc recording is biased

slide-4
SLIDE 4

Problem: ad hoc recording is biased

  • in time
  • in space
  • detectability
  • effort per visit

1 10 100 1000 10000 100000 1000000

1970 1980 1990 2000 2010

Number of records

Butterflies Bryophyte Orthoptera Myriapod Isopods Coleoptera Moths Bees Wasps

Effort Number of Species

slide-5
SLIDE 5

Most lists are incomplete

For most groups, ~50% of ‘lists’ are single species For many groups, the prevalence of short lists varies systematically over time

0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% Proportion of all visits > 3 species 3 species 2 species Single species

slide-6
SLIDE 6

Solutions?

  • Aggregation
  • Data Selection methods
  • Correction for sampling effort
  • Modelling the data collection process
slide-7
SLIDE 7

Aggregation into Atlas periods

slide-8
SLIDE 8

Selection methods

  • Remove the bias, leave the signal
  • The ‘well-sampled set’
  • Threshold number of species
  • Threshold number of years
  • Untested assumption
  • Loss of power?

Well-sampled sites for Dragonflies

slide-9
SLIDE 9

Correction for sampling effort

  • by time period
  • Telfer’s Change index
  • per year
  • Ball’s ‘Reporting Rate’ method
  • per visit
  • Szabo’s ‘List Length’ method
  • in space (per grid cell or neighbourhood)
  • Hill’s ‘Frescalo’ method
slide-10
SLIDE 10

Correction: Hill’s Frescalo method

Hill, MO (2011). Local frequency as a key to interpreting species occurrence data when recording effort is not known. Methods in Ecology and Evolution, 3(1), 195–205.

Red = under-recorded White = well-recorded

Frescalo estimates the recording intensity of each grid cell

slide-11
SLIDE 11

Hill’s Frescalo method

Frescalo estimates which species ‘should’ be in each grid cell if well-sampled Trends can be modelled as changes in ‘relative recording rate’

slide-12
SLIDE 12

Occupancy: modelling data collection

Extant Extinct Occupancy (unobserved)

Separation of “state” and “data generation” processes into separate submodels permits (annual) estimation of

  • ccupancy and detection

Observations Data generation process Year 1 Year 2 Year 4 Year 3 Year 5

slide-13
SLIDE 13

Testing the solutions by simulation

  • Generate records resembling NBN-type datasets
  • 1000 sites, 25 species, 10 years
  • Realistic scenarios of recorder behaviour
  • Parameterized from UK and Dutch datasets
  • Formally compare methods for estimating trends
  • Type I error rate when no trend exists
  • Power to detect genuine trend
slide-14
SLIDE 14

Simulation results: Type I error rates

Isaac et al (in review) Methods in Ecology & Evolution

slide-15
SLIDE 15

Simulation results

  • Simple ‘correction’ models fail easily
  • Frescalo performs well but subjective to apply
  • Selection methods are robust but less powerful
  • Occupancy most promising overall
  • Least often wrong
  • Most powerful overall
  • … but a problem with spatially-biased sampling

Isaac et al (in review) Methods in Ecology & Evolution

slide-16
SLIDE 16

The Way Ahead

  • Occupancy + site-selection criterion?
  • Pdetect ≈ ฀List Length, Julian Date,

Previously Recorded, …….}

  • If we knew more about the bias, we could

model it

  • A little bit of meta data would go a long way
  • Visit-based records are crucial
slide-17
SLIDE 17

Tools

  • An easy way to record
  • Great potential for

harvesting meta-data

slide-18
SLIDE 18

https://github.com/BiologicalRecordsCentre http://bit.ly/18wTrrK

slide-19
SLIDE 19

Applications

  • Identifying drivers of change in native ladybirds
  • Overview of trends in UK biodiversity
  • Developing a biodiversity indicator
slide-20
SLIDE 20

Identifying drivers of change

Declines in native ladybirds are attributable to the arrival of the invasive Harlequin ladybird Similar patterns across 8 native species in both GB & Belgium

Roy et al (2012) Diversity & Distributions, 18: 717–725

Mike Majerus davidkennardphotography.com

slide-21
SLIDE 21

Trends in British Biodiversity 1990-2000

  • Good news: Median change +2.4%
  • Bad news: >1000 species would qualify as VU or worse
slide-22
SLIDE 22

The Priority Species Indicator

Source: Biodiversity in Your Pocket 2013

20 40 60 80 100 120 1970 1975 1980 1985 1990 1995 2000 2005 2010 Index (1970 = 100)

95% Confidence interval max

95% Confidence interval min United Kingdom 10 20 30 40 50 60 70 80 90 100 Long term Percentage of species Decline Increase

20 40 60 80 100 120 1970 1975 1980 1985 1990 1995 2000 2005 2010 Index (1970 = 100)

95% Confidence interval max

95% Confidence interval min United Kingdom 10 20 30 40 50 60 70 80 90 100 Long term Short term Percentage of species Decline Increase

slide-23
SLIDE 23

Conclusions

We shouldn’t remove the bias but model it Occupancy models are especially promising A little bit of meta-data would go a long way = a vast untapped resource

slide-24
SLIDE 24
slide-25
SLIDE 25

Simulated patterns of recording

  • 1. Even recording: random sampling
  • 2. Doubling intensity: number of visits doubles
  • 3. Doubling with biased sampling wrt focal sites
  • 4. Incomplete recording (growth in short lists)
  • 5. Detection increasing: focal species becomes more

detectable

  • 6. Non-focal declines
slide-26
SLIDE 26

Robust to:

  • Changes in effort over time
  • Change in spatial pattern of recording
  • Changes in community composition
  • Temporally & spatially precise
  • Can easily add covariates

The ‘well-sampled sites’ model

Assumptions/Caveats:

  • Groups are recorded collectively, as an assemblage
  • Effort per visit has not changed over time
  • Detectability per visit is constant over time
  • Well-sampled sites are representative

Well-sampled sites for Dragonflies