BaySTDetect: Detecting unusual temporal patterns in small area - PowerPoint PPT Presentation

BaySTDetect: Detecting unusual temporal patterns in small area disease rates using Bayesian posterior model probabilities Guangquan Li, Sylvia Richardson, Lea Fortunato, Isma¨ ıl Ahmed, Anna Hansell, Mireille Toledano and Nicky Best Department of Epidemiology and Biostatistics Imperial College London ISBA2012 Kyoto, June 29, 2012 1 / 25

Outline Motivation BaySTDetect: Bayesian model choice for detecting unusual temporal patterns in small area data Simulation study Application1: Policy assessment Application2: Data mining cancer incidence Conclusions 2 / 25

Motivation ◮ For many areas of application such as small area estimates of income, unemployment, crime rates and rates of chronic diseases, there is typically a general time trend that affects most areas similarly. ◮ However, abrupt changes may occur in a particular area due to, for example, ◮ emergence of localized risk factor(s); ◮ local policy implementation (e.g., health awareness or screening campaigns); ◮ changes to health care provision or social structure of the local population; ◮ local variations in diagnostic or coding practice; ◮ · · · ◮ Detection of areas with unusual temporal patterns is therefore important as a screening tool for further investigation. 4 / 25

Motivation: Two applications 1. COPD: Policy assessment 2. TCR: Retrospective surveillance on cancer incidence 5 / 25

Motivation: Two applications 1. COPD: Policy assessment ◮ Industrial Injuries Disablement Benefit was made available for miners developing COPD from 1992 onwards in the UK. ◮ There was a debate on whether this policy may have differentially increased the likelihood of a COPD diagnosis in mining areas As miners with other respiratory problems with similar symptoms (e.g., asthma) could potentially have benefited from this scheme. 2. TCR: Retrospective surveillance on cancer incidence 5 / 25

Motivation: Two applications 1. COPD: Policy assessment ◮ Industrial Injuries Disablement Benefit was made available for miners developing COPD from 1992 onwards in the UK. ◮ There was a debate on whether this policy may have differentially increased the likelihood of a COPD diagnosis in mining areas As miners with other respiratory problems with similar symptoms (e.g., asthma) could potentially have benefited from this scheme. 2. TCR: Retrospective surveillance on cancer incidence ◮ to highlight areas with a potential need for further investigation and/or intervention 5 / 25

Problems in small area detection 1. Sparse data (small number of cases) ◮ BaySTDetect employs the Bayesian multilevel modelling framework to allow appropriate information borrowing. 6 / 25

Problems in small area detection 1. Sparse data (small number of cases) ◮ BaySTDetect employs the Bayesian multilevel modelling framework to allow appropriate information borrowing. 2. Multiple comparisons are made ◮ A Bayesian procedure is used in BaySTDetect to derive decision rules which enable the control of the false discovery rate (FDR). 6 / 25

BaySTDetect: Model specification Data level y it ∼ Poisson ( µ it · E it ) Modelling underlying risks Model ¡1 : ¡Time ¡ Common ¡ Common ¡ trend ¡pa-ern ¡is ¡the ¡ )me ¡trend ¡ spa)al ¡pa6ern ¡ same ¡for ¡all ¡areas ¡ ¡ log( μ it ) ¡ Model ¡2 : ¡Time ¡ trends ¡are ¡es0mated ¡ Area-‑specific ¡)me ¡trends ¡ independently ¡for ¡ each ¡area ¡ 8 / 25

BaySTDetect: Model specification Data level y it ∼ Poisson ( µ it · E it ) Modelling underlying risks Model ¡1 : ¡Time ¡ Common ¡ Common ¡ trend ¡pa-ern ¡is ¡the ¡ )me ¡trend ¡ spa)al ¡pa6ern ¡ same ¡for ¡all ¡areas ¡ ¡ log( μ it ) ¡ Model ¡2 : ¡Time ¡ trends ¡are ¡es0mated ¡ Area-‑specific ¡)me ¡trends ¡ independently ¡for ¡ each ¡area ¡ Selection A model indicator z i indicates for each area whether Model 1 ( z i = 1) or Model 2 ( z i = 0) is supported by the data. µ it = z i · µ ( M 1) + (1 − z i ) · µ ( M 2) it it 8 / 25

BaySTDetect: Model specification y it ∼ Poisson( E it · µ it ) � α 0 + η i + γ t Model 1 for all i , t log ( µ it ) = u i + ξ it Model 2 for all i , t . 9 / 25

BaySTDetect: Model specification y it ∼ Poisson( E it · µ it ) � α 0 + η i + γ t Model 1 for all i , t log ( µ it ) = u i + ξ it Model 2 for all i , t . Model 1 ∼ spatial BYM model Common spatial pattern η i random walk [RW( σ 2 ∼ γ )] Common temporal pattern γ t 9 / 25

BaySTDetect: Model specification y it ∼ Poisson( E it · µ it ) � α 0 + η i + γ t Model 1 for all i , t log ( µ it ) = u i + ξ it Model 2 for all i , t . Model 1 ∼ spatial BYM model Common spatial pattern η i random walk [RW( σ 2 ∼ γ )] Common temporal pattern γ t Model 2 u i ∼ N (0 , 1000) random walk [RW( σ 2 ∼ ξ, i )] Area-specific temporal pattern ξ i , t 9 / 25

BaySTDetect: Model specification y it ∼ Poisson( E it · µ it ) � α 0 + η i + γ t Model 1 for all i , t log ( µ it ) = u i + ξ it Model 2 for all i , t . Model 1 ∼ spatial BYM model Common spatial pattern η i random walk [RW( σ 2 ∼ γ )] Common temporal pattern γ t Model 2 u i ∼ N (0 , 1000) random walk [RW( σ 2 ∼ ξ, i )] Area-specific temporal pattern ξ i , t Selection z i ∼ Bern (0 . 95) 9 / 25

A detection rule based on FDR ◮ Define f i = P ( z i = 1 | data) which is the probability that area i belongs to the common trend model (Model 1) ◮ A small f i suggests that area i is unlikely to follow the common trend. 10 / 25

A detection rule based on FDR ◮ Define f i = P ( z i = 1 | data) which is the probability that area i belongs to the common trend model (Model 1) ◮ A small f i suggests that area i is unlikely to follow the common trend. ◮ We need to set a suitable cut-off value, C , such that areas with f i < C are declared to be unusual. 10 / 25

A detection rule based on FDR ◮ Define f i = P ( z i = 1 | data) which is the probability that area i belongs to the common trend model (Model 1) ◮ A small f i suggests that area i is unlikely to follow the common trend. ◮ We need to set a suitable cut-off value, C , such that areas with f i < C are declared to be unusual. ◮ Put another way, if we declare area i to be unusual, then f i can be thought of as the probability of false detection for that area. ◮ We chose C in such a way that we ensure that the average probability of false detection (i.e. the average value of f i ) amongst areas declared to be unusual is less than some pre-set level α . 10 / 25

A detection rule based on FDR ◮ Define f i = P ( z i = 1 | data) which is the probability that area i belongs to the common trend model (Model 1) ◮ A small f i suggests that area i is unlikely to follow the common trend. ◮ We need to set a suitable cut-off value, C , such that areas with f i < C are declared to be unusual. ◮ Put another way, if we declare area i to be unusual, then f i can be thought of as the probability of false detection for that area. ◮ We chose C in such a way that we ensure that the average probability of false detection (i.e. the average value of f i ) amongst areas declared to be unusual is less than some pre-set level α . ◮ This procedure ensures that, on average, the number of false positives is no more than ( k × α ), where k is the number of declared unusual areas. 10 / 25

Simulation: Setup ◮ Simulated data were based on the observed COPD mortality data (see Li et al. 2012). ◮ Three departure patterns were considered. ◮ When simulating the data, either the original set of expected counts from the COPD data or a reduced set (multiplying the original by 1/5) were used. ◮ 15 areas (approx. 4%) were chosen to have the unusual trend patterns. ◮ areas were chosen to cover a wide range expected count values and overall spatial risks. ◮ Results were compared to those from the popular SaTScan space-time scan statistic. 12 / 25

Simulation: Unusual patterns Figure: Illustration of the three departure patterns (red) with the common trend (black). Pattern 1 Pattern 2 Pattern 3 Two departure magnitudes, q =1.5 and 2, were considered. 13 / 25

Simulation: Sensitivity Figure: Sensitivity of detecting the 15 truly unusual areas 14 / 25

COPD application: Detected areas (FDR=0.05) 16 / 25

BaySTDetect: Detecting unusual temporal patterns in small area - PowerPoint PPT Presentation

BaySTDetect: Detecting unusual temporal patterns in small area disease rates using Bayesian posterior model probabilities Guangquan Li, Sylvia Richardson, Lea Fortunato, Isma l Ahmed, Anna Hansell, Mireille Toledano and Nicky Best

Detecting Spammers and Content Detecting Spammers and Content Detecting Spammers and Content

12/6/2013 Detecting Fakes Image Forensics: Detecting Forged Photos 1.Detecting photorealistic

Factory Patterns: Factory Method and Abstract Factory Design Patterns In Java Bob Tarr

Spatio-Temporal Statistics with R Chapter Two: Exploring Spatio-Temporal Data Spatio-Temporal

Principles and Patterns 26 February, 2020 Recap Principles Patterns Inheritance Anti-patterns

Temporal, Spatial, and Spatio-temporal Granularities Gabriele Pozzani Department of Computer

Temporal Code Temporal Code Temporal Code (Acoustic Front-end) Human Recognition Machine

Temporal Privacy in Wireless Sensor Networks Temporal Privacy in Wireless Sensor Networks

Temporal Planning Planning with Temporal and Concurrent Actions 1 Literature Malik

Temporal Distortion Temporal Distortion Perspective) Perspective) t t Blue view Blue view y

Detecting Events and Patterns in the Social Web with Statistical Learning Vasileios Lampos

Detecting Events and Patterns in the Social Web with Statistical Learning Vasileios Lampos

NetFlow Analysis: Detecting covert channels on the network Detecting malicious traffic by using

Introduction Detecting Errors in Effects of Annotation Errors Detecting Errors in Corpus

Design Patterns Applications Programming What is design patterns? The design patterns are

Design Patterns 1 What are Design Patterns? Design patterns describe common (and successful)

POLAR GP Welcome March 2017 3/30/2017 Insert Footer Here 1 Explanatory Note The following

Risk Assessment and Genomics Risk Assessment and Genomics Science and Policy: EPAs

ICBO 2010 Patricia S. Lemer, M. Ed., NCC devdelay@mindspring.com Statistics Is There an Autism

Neam County County overview Location An old region of the Moldavia province, with settlements

Explo loration a n and nd M Mini ning ng o of Web R Repositories WSDM DM 2 2014 T

roup Focus on Health ingleton hire ealthy nvironment A community-based group looking to

Engaging in Qualitative Research Methods: Opportunities for Prevention and Health Promotion

Balancing Wellness and Academic Success in College Settings Anisha Patel December 4, 2019 The

Sambuz

Useful Links

Newsletter

Mail Us