Disease Prevention vs Data Privacy: using landcover maps to inform - - PowerPoint PPT Presentation

disease prevention vs data privacy using landcover maps
SMART_READER_LITE
LIVE PREVIEW

Disease Prevention vs Data Privacy: using landcover maps to inform - - PowerPoint PPT Presentation

Disease Prevention vs Data Privacy: using landcover maps to inform spatial epidemic models Mike Tildesley & Sadie Ryan Centre for Complexity Science University of Warwick/ SUNY ESF Foot-and-Mouth Disease Foot-and-Mouth Disease - How


slide-1
SLIDE 1

Disease Prevention vs Data Privacy: using landcover maps to inform spatial epidemic models Mike Tildesley & Sadie Ryan

Centre for Complexity Science University of Warwick/ SUNY ESF

slide-2
SLIDE 2

Foot-and-Mouth Disease

slide-3
SLIDE 3

Foot-and-Mouth Disease - How to deal with diminishing data.

For the UK we have demographic information

  • n:
  • the location and size of all livestock farms
  • the movement of all animals

For the UK we have epidemic information

  • n:
  • the location of all infected & culled farms
  • epidemiological tracing of possible infection

routes

slide-4
SLIDE 4

Ring vaccination –

  • ptimal ring size to

control outbreak?

20km 15km 10km 5km

Cumbria – lots of cattle, high vaccination radius. West Lothian – moderate vaccination. The South – no vaccination.

slide-5
SLIDE 5

How to deal with diminishing data.

For USA we have limited demographic information:

  • the number of farms in each county
  • the number of animals in each county
  • the movement of some animals (very few!)

For USA there is no epidemic information.

slide-6
SLIDE 6

US Data

Region United States Alabama Alaska Arizona Arkansas California Colorado INVENTORY (2002) Cattle and calves farms 1018359 27094 113 2838 29925 17379 13311 number 95497994 1437795 12609 841277 1842273 5234177 2656351 Farms by inventory: 1 t

  • 9

farms 211030 5031 43 915 4278 5348 3252 number 1104588 28642 172 4100 24631 25038 15353 10 t

  • 19

farms 189193 6244 15 455 5845 3014 1926 number 2607827 87623 217 5905 81634 40285 26249 20 t

  • 49

farms 271202 8541 23 535 9702 3209 2997 number 8480012 265152 748 16920 304473 96961 94112 50 t

  • 99

farms 153618 4036 10 278 5412 1426 1741 number 10648669 274879 703 18556 370887 97596 121413 100 t

  • 199

farms 103513 2040 12 253 3023 1092 1479 number 14078261 270680 1679 34796 400159 149773 203188 200 t

  • 499

farms 62774 915 5 190 1367 1215 1120 number 18563597 262901 1177 57884 389844 386061 338206 500 or more farms 27029 287 5 212 298 2075 796 number 40015040 247918 7913 703116 270645 4438463 1857830

Number of farms and cattle in Alabama Number of farms in Colorado with between 10 and 19 cattle

Data

No XY locations or animals numbers at the individual farm level are given.

slide-7
SLIDE 7

Can we use geographic data to infer farm locations? Would epidemic models utilising such data be useful as predictive tools?

slide-8
SLIDE 8

THE MODEL

Model developed during 2001 to investigate optimal control strategies for FMD in the UK.

Infected j

Probability of infection for every susceptible farm for each day of the epidemic is calculated using the formula:

Prob = 1 - exp - [ScNc,i

pc+ SsNs,i ps]

[TcNc,j

qc + TsNs,j qs]K(dij)

Σ [ ]

i

slide-9
SLIDE 9

Prob = 1 - exp - [ScNc,i

pc+ SsNs,i ps]

[TcNc,j

qc + TsNs,j qs]K(dij)

Σ

Infected j

[ ]

i

Sc

  • susceptibility of cows.

N c,i

  • number of cows on farm i.

Tc - transmissibility of cows (or how much a cow “transmits” infection). K(dij) - the distance kernel. A parameter which weights the probability of infection based on the distance between farm i and farm j.

Transmission Risk Distance from source

Kernel Tc, Ts and Sc (and associated powers) are estimated using the 2001 UK epidemic data. Wales Mixture of long and short range transmission

slide-10
SLIDE 10

In many countries, precise locations of farms are not available. It may be possible to capture farm demography using other data in the public domain. We use land cover in the UK to determine whether we can accurately predict farm locations.

slide-11
SLIDE 11

Farm Locations

We use Land Cover Map 2000 to obtain surrogate farm locations for UK livestock farms (to compare with the 2000 UK Agricultural Census). Land Cover Map 2000 defines land use in one of 10 aggregate classes: 1-2. Woodland.

  • 3. Arable and Horticulture.

4-5. Grassland.

  • 6. Mountain/Heath/Bog.
  • 7. Urban.

8-10. Water/Coastal. Our aim is to determine land cover classes that will potentially correlate with livestock farm locations. YES! SHEEP FARMS? Land use data is available in parcels

  • f 25 square metres.
slide-12
SLIDE 12
  • 15. Neutral Grass
  • 16. Setaside Grass
  • 17. Bracken
  • 18. Calcareous Grass
  • 19. Acid grassland
  • 20. Fen, Marsh, Swamp
  • 5. Semi-Natural Grass

Farm Locations

This gives us two sets of surrogate farm databases: LC1 – farms are only located within land cover classes 4 and 5. Aggregate Class Sub class

  • 4. Improved Grassland.
  • 14. Improved Grassland

LC2 – farms are only located within land cover classes 4, 5 and 6.

slide-13
SLIDE 13

Farm Locations

This gives us two sets of surrogate farm databases: LC1 – farms are only located within land cover classes 4 and 5.

  • 4. Improved Grassland.
  • 5. Semi-Natural Grass

Aggregate Class Sub class

  • 14. Improved Grassland
  • 15. Neutral Grass

LC2 – farms are only located within land cover classes 4, 5 and 6.

  • 6. Mountain/Heath/Bog.
  • 8. Bog
  • 9. Dense heath
  • 10. Open heath
  • 11. Mountain
slide-14
SLIDE 14

Farm Locations

This gives us four sets of surrogate farm databases: LC1 – farms are only located within land cover classes 4 and 5.

  • 4. Improved Grassland.
  • 5. Semi-Natural Grass

Aggregate Class Sub class

  • 14. Improved Grassland
  • 15. Neutral Grass

LC2 – farms are only located within land cover classes 4, 5 and 6.

  • 6. Mountain/Heath/Bog.
  • 8. Bog
  • 9. Dense heath
  • 10. Open heath
  • 11. Mountain

LC3 – farms are only located within sub classes 14 and 15. LC4 – farms are only located within sub classes 9, 10, 11, 14 and 15.

slide-15
SLIDE 15

Disease Simulations

We generate surrogate farm databases assuming the same number of farms in a county as in the recorded data. Farms are only located in accepted land cover types. We generate surrogate data for three counties in the UK. Cumbria Clwyd Devon 100 surrogate farm databases are generated for each land cover type (LC1-LC4) and epidemics are simulated. Epidemic sizes and optimal control policies are determined and compared with that predicted by the “truth data”.

slide-16
SLIDE 16

Model Results

Mean Epidemic Size (95% CIs) Cumbria Devon Clwyd

slide-17
SLIDE 17

Model Results

Mean Epidemic Size (95% CIs) Cumbria Devon Clwyd True Data

2320 (100-2908) 480 (15-1891) 700 (12-1181)

slide-18
SLIDE 18

Model Results

Mean Epidemic Size (95% CIs) Cumbria Devon Clwyd True Data

2320 (100-2908) 480 (15-1891) 700 (12-1181)

Random

196 (11-841) 168 (15-649) 63 (10-209)

slide-19
SLIDE 19

Model Results

Mean Epidemic Size (95% CIs) Cumbria Devon Clwyd True Data

2320 (100-2908) 480 (15-1891) 700 (12-1181)

Random

196 (11-841) 168 (15-649) 63 (10-209)

LC1

608 (15-1544) 566 (19-1889) 106 (11-383)

slide-20
SLIDE 20

Model Results

Mean Epidemic Size (95% CIs) Cumbria Devon Clwyd True Data

2320 (100-2908) 480 (15-1891) 700 (12-1181)

Random

196 (11-841) 168 (15-649) 63 (10-209)

LC1

608 (15-1544) 566 (19-1889) 106 (11-383)

LC2

481 (16-1489) 479 (17-1584) 90 (10-323)

slide-21
SLIDE 21

Model Results

Mean Epidemic Size (95% CIs) Cumbria Devon Clwyd True Data

2320 (100-2908) 480 (15-1891) 700 (12-1181)

Random

196 (11-841) 168 (15-649) 63 (10-209)

LC1

608 (15-1544) 566 (19-1889) 106 (11-383)

LC2

481 (16-1489) 479 (17-1584) 90 (10-323)

LC3

1880 (17-2908) 509 (18-1882) 481 (13-1164)

slide-22
SLIDE 22

Model Results

Mean Epidemic Size (95% CIs) Cumbria Devon Clwyd True Data

2320 (100-2908) 480 (15-1891) 700 (12-1181)

Random

196 (11-841) 168 (15-649) 63 (10-209)

LC1

608 (15-1544) 566 (19-1889) 106 (11-383)

LC2

481 (16-1489) 479 (17-1584) 90 (10-323)

LC3

1880 (17-2908) 509 (18-1882) 481 (13-1164)

LC4

1563 (15-2669) 419 (17-1565) 246 (11-799)

slide-23
SLIDE 23

Optimal Control – Ring Culling

Optimal Ring Cull Radius (km) [ΔEmax] Cumbria Devon Clwyd

ΔEmax is the maximum increase in epidemic size if we applied the optimal ring cull as predicted by the surrogate database.

slide-24
SLIDE 24

Optimal Control – Ring Culling

Optimal Ring Cull Radius (km) [ΔEmax] Cumbria Devon Clwyd True Data

3.6 2.8 3.6

ΔEmax is the maximum increase in epidemic size if we applied the optimal ring cull as predicted by the surrogate database.

slide-25
SLIDE 25

Optimal Control – Ring Culling

Optimal Ring Cull Radius (km) [ΔEmax] Cumbria Devon Clwyd True Data

3.6 2.8 3.6

Random

1.6 [1358] 1.2 [595] 0.8 [291]

ΔEmax is the maximum increase in epidemic size if we applied the optimal ring cull as predicted by the surrogate database.

slide-26
SLIDE 26

Optimal Control – Ring Culling

Optimal Ring Cull Radius (km) [ΔEmax] Cumbria Devon Clwyd True Data

3.6 2.8 3.6

Random

1.6 [1358] 1.2 [595] 0.8 [291]

LC1

2.8 [199] 2.0 [181] 1.3 [232]

ΔEmax is the maximum increase in epidemic size if we applied the optimal ring cull as predicted by the surrogate database.

slide-27
SLIDE 27

Optimal Control – Ring Culling

Optimal Ring Cull Radius (km) [ΔEmax] Cumbria Devon Clwyd True Data

3.6 2.8 3.6

Random

1.6 [1358] 1.2 [595] 0.8 [291]

LC1

2.8 [199] 2.0 [181] 1.3 [232]

LC2

2.8 [199] 1.8 [227] 1.4 [221]

ΔEmax is the maximum increase in epidemic size if we applied the optimal ring cull as predicted by the surrogate database.

slide-28
SLIDE 28

Optimal Control – Ring Culling

Optimal Ring Cull Radius (km) [ΔEmax] Cumbria Devon Clwyd True Data

3.6 2.8 3.6

Random

1.6 [1358] 1.2 [595] 0.8 [291]

LC1

2.8 [199] 2.0 [181] 1.3 [232]

LC2

2.8 [199] 1.8 [227] 1.4 [221]

LC3

3.5 [17] 2.8 [0] 3.5 [12]

ΔEmax is the maximum increase in epidemic size if we applied the optimal ring cull as predicted by the surrogate database.

slide-29
SLIDE 29

Optimal Control – Ring Culling

Optimal Ring Cull Radius (km) [ΔEmax] Cumbria Devon Clwyd True Data

3.6 2.8 3.6

Random

1.6 [1358] 1.2 [595] 0.8 [291]

LC1

2.8 [199] 2.0 [181] 1.3 [232]

LC2

2.8 [199] 1.8 [227] 1.4 [221]

LC3

3.5 [17] 2.8 [0] 3.5 [12]

LC4

3.5 [17] 2.9 [6] 3.1 [69]

ΔEmax is the maximum increase in epidemic size if we applied the optimal ring cull as predicted by the surrogate database.

slide-30
SLIDE 30

Land Cover

LC1 – Aggregate Classes 4 (Improved Grassland) and 5 (Semi- Natural Grass). LC2 – Aggregate Classes 4, 5 and 6 (Mountain/Heath/Bog). LC3 – Subclasses 14 (Improved Grassland) and 15 (Neutral Grass). LC4 – Subclasses 14, 15, 9 (Dense Heath), 10 (Open Heath) and 11 (Mountain). Aggregate class data are not sufficiently resolved to predict epidemic sizes and optimal control policies.

slide-31
SLIDE 31

Farm Locations

Truth Data Random LC3

slide-32
SLIDE 32

Farm Locations

So what are we capturing in the sub-class data that we are missing in less well-resolved data sets? Lake District

  • 14. Improved Grassland
  • 15. Neutral grass
slide-33
SLIDE 33
slide-34
SLIDE 34

Devon

True Data Random LC4 Dartmoor National Park

slide-35
SLIDE 35

Farm Distribution in Aggregate Classes

AC Aberdeen Clwyd Cumbria Devon All 1 0.01 0.02 0.02 0.04 0.02 2 0.10 0.05 0.03 0.01 0.05 3 0.42 0.02 0.04 0.21 0.20 4 0.15 0.41 0.48 0.60 0.41 5 0.04 0.38 0.33 0.07 0.18 6 0.25 0.06 0.06 0.03 0.11 7 0.02 0.04 0.01 0.03 0.02 8 0.00 0.00 0.01 0.00 0.00 9 0.00 0.01 0.01 0.00 0.01 10 0.01 0.00 0.00 0.01 0.00

  • 3. Arable and Horticulture.
slide-36
SLIDE 36

Highly resolved land use data could potentially act as a proxy for true farm locations. In some regions land cover parcels may be misleading when trying to correlate with farm locations.

slide-37
SLIDE 37

Acknowledgements

DHS NIMBIOS University of Warwick Matt Keeling Marleen Werkman Colorado State/USDA Colleen Webb Michael Buhnerkempe Ryan Miller Linkøpings University Uno Wennergren Tom Lindström SUNY ESF Sadie Ryan