what I am after Statistical modelling and analysis do not from - PDF document

Why graphical models in R? ‘what I am after’ • Statistical modelling and analysis do not from gR2002 respect boundaries of model classes • Software should encourage and support good practice - and graphical models are good practice! • Data analysis - model-based • R for ‘reference implementation’ of new Peter Green, methodology University of Bristol, UK • Open software Contingency Markov Spatial Questions tables chains statistics • Scope? – Digram, MIM, CoCo, TETRAD, Hugin, Genetics BUGS? – Determined by classes of model, or Regression Graphical models classes of algorithm? • Market? AI – Statistics researcher, statistics MSc, arbitrary Excel user? • Delivery? Statistical Sufficiency Covariance – R package(s), with C code? physics selection Bayesian Hierarchical models Contents • Hierarchical models • Variable-length parameters • Models with undirected edges • Hidden Markov models • Inference on structure • Discrete graphical models/PES • Grappa properly integrating out all sources of variation

Repeated measures on children's Repeated measures on children's weights weights, continued ( α i β , ) • Children i=1,2,…,k have their weights • Suppose that vary across the i measured on n i occasions, t ij ,j=1,2,…n i population according to obtaining weights y ij . 2 2 α ~ N ( µ , σ ) β ~ N ( µ , σ ) i α α i β β • Suppose that, for each child, we have a linear growth equation, with • A Bayesian completes the model by independent normal errors 2 2 2 ( µ , σ , µ , σ , σ ) specifying priors on α α β β + 2 y ~ N ( α β t , σ ) ij i i ij Measurement error Graph for children’s weights Explanatory variables µ σ µ σ α α β β X subject to error - we σ { ij t } α β { } { } only observe i i U on most cases { y } ij Contents Mixture modelling k DAG for a • Hierarchical models mixture model • Variable-length parameters w • Models with undirected edges k θ ∑ θ y ~ w f ( y | ) • Hidden Markov models j j j = 1 • Inference on structure • Discrete graphical models/PES • Grappa y

Measurement error using mixture Mixture modelling model for population k DAG for a length= k mixture model w k θ ∑ θ y ~ w f ( y | ) z j j = j 1 = θ ( y | z j ) ~ f ( y | ) y j = ) = p ( z j w j value set ={1,2,…, k } Modelling with undirected Contents graphs • Hierarchical models Directed acyclic graphs are a natural representation of the way we usually • Variable-length parameters specify a statistical model - directionally: • Models with undirected edges • Hidden Markov models • disease → symptom • Inference on structure • past → future • Discrete graphical models/PES • parameters → data ….. • Grappa However, sometimes (e.g. spatial models) there is no natural direction Scottish lip cancer data Scottish lip cancer data (2) The data include The rates of lip cancer in 56 counties in • the observed and expected cases (expected Scotland have been analysed by numbers based on the population and its age Clayton and Kaldor (1987) and Breslow and sex distribution in the county), and Clayton (1993) • a covariate measuring the percentage of the population engaged in agriculture, fishing, or forestry, and (the analysis here is based on the example in the • the "position'' of each county expressed as WinBugs manual) a list of adjacent counties.

Scottish lip cancer data (3) Model for lip cancer data (1) Graph regression coefficient County Obs Exp x SMR Adjacent cases cases (% in counties covariate agric.) random spatial effects 1 9 1.4 16 652.2 5,9,11,19 2 39 8.7 16 450.3 7,10 expected counts ... ... ... ... ... ... 56 0 1.8 10 0.0 18,24,30,33,45,55 observed counts Model for lip cancer data Bugs code for lip cancer data (2) Distributions µ O ~ Poisson ( ) • Data: model i i { • Link function: b[1:regions] ~ car.normal(adj[], weights[], num[], tau) µ = + α + α + b.mean <- mean(b[]) log log E x / 10 b i i 0 1 i i for (i in 1 : regions) { O[i] ~ dpois(mu[i]) • Random spatial effects: log(mu[i]) <- log(E[i]) + alpha0 + alpha1 * x[i] / 10 + b[i] ∑ τ ∝ τ n / 2 − τ − 2 SMRhat[i] <- 100 * mu[i] / E[i] p ( b ,..., b | ) exp( ( b b ) / 4 ) 1 n i j } Note: declarative, i ~ j alpha1 ~ dnorm(0.0, 1.0E-5) rather than alpha0 ~ dflat() • Priors: tau ~ dgamma(r, d) procedural α 0 α τ Γ , ~ Uniform ~ ( r , d ) sigma <- 1 / sqrt(tau) language 1 } Bugs code for lip cancer data Bugs code for lip cancer data model model { { b[1:regions] ~ car.normal(adj[], weights[], num[], tau) b[1:regions] ~ car.normal(adj[], weights[], num[], tau) b.mean <- mean(b[]) b.mean <- mean(b[]) µ O ~ Poisson ( ) for (i in 1 : regions) { for (i in 1 : regions) { µ = + α + α + log log E x / 10 b i i O[i] ~ dpois(mu[i]) O[i] ~ dpois(mu[i]) i i 0 1 i i log(mu[i]) <- log(E[i]) + alpha0 + alpha1 * x[i] / 10 + b[i] log(mu[i]) <- log(E[i]) + alpha0 + alpha1 * x[i] / 10 + b[i] SMRhat[i] <- 100 * mu[i] / E[i] SMRhat[i] <- 100 * mu[i] / E[i] } } alpha1 ~ dnorm(0.0, 1.0E-5) alpha1 ~ dnorm(0.0, 1.0E-5) alpha0 ~ dflat() alpha0 ~ dflat() tau ~ dgamma(r, d) tau ~ dgamma(r, d) sigma <- 1 / sqrt(tau) sigma <- 1 / sqrt(tau) } }

Bugs code for lip cancer data Bugs code for lip cancer data ∑ p ( b ,..., b | τ ) ∝ τ n / 2 exp( − τ ( b − b ) 2 / 4 ) model model 1 n i j { i ~ j { b[1:regions] ~ car.normal(adj[], weights[], num[], tau) b[1:regions] ~ car.normal(adj[], weights[], num[], tau) b.mean <- mean(b[]) b.mean <- mean(b[]) for (i in 1 : regions) { for (i in 1 : regions) { O[i] ~ dpois(mu[i]) O[i] ~ dpois(mu[i]) log(mu[i]) <- log(E[i]) + alpha0 + alpha1 * x[i] / 10 + b[i] log(mu[i]) <- log(E[i]) + alpha0 + alpha1 * x[i] / 10 + b[i] SMRhat[i] <- 100 * mu[i] / E[i] SMRhat[i] <- 100 * mu[i] / E[i] } } alpha1 ~ dnorm(0.0, 1.0E-5) alpha1 ~ dnorm(0.0, 1.0E-5) alpha0 ~ dflat() alpha0 ~ dflat() τ Γ ~ ( r , d ) tau ~ dgamma(r, d) tau ~ dgamma(r, d) sigma <- 1 / sqrt(tau) sigma <- 1 / sqrt(tau) } } WinBugs for lip cancer data WinBugs for lip cancer data alpha1 sample: 7000 Dynamic traces for some parameters: 4.0 Posterior densities 3.0 alpha1 for some parameters: 2.0 0.75 0.5 1.0 0.25 0.0 0.0 -0.25 -0.5 0.0 0.5 1.0 16600 16650 16700 16750 16800 16850 16900 16950 iteration mu[1] sample: 7000 tau sample: 7000 tau mu[1] 0.3 0.8 6.0 15.0 0.6 4.0 10.0 0.2 0.4 5.0 2.0 0.1 0.2 0.0 0.0 0.0 16600 16650 16700 16750 16800 16850 16900 16950 0.0 16600 16650 16700 16750 16800 16850 16900 16950 iteration iteration 0.0 5.0 10.0 15.0 0.0 2.0 4.0 Contents Hidden Markov models e.g. Hidden Markov chain • Hierarchical models (DLM, state space model) • Variable-length parameters • Models with undirected edges z 0 z 1 z 2 z 3 z 4 hidden • Hidden Markov models • Inference on structure • Discrete graphical models/PES y 1 y 2 y 3 y 4 observed • Grappa

what I am after Statistical modelling and analysis do not from - PDF document

Why graphical models in R? what I am after Statistical modelling and analysis do not from gR2002 respect boundaries of model classes Software should encourage and support good practice - and graphical models are good practice!

GLO Science Professional Before & After Images Before GLO After GLO Before GLO After GLO

Floor Cleaning By Vacuum After vacuum Cleaning After vacuum Cleaning After vacuum Cleaning

Watup, could you pick up this thing for me NEARBY? HOME NEARBY After 20 metres After 10 metres

MALE INFERTILITY CASE-I Before Treatment: After Treatment: After Treatment: CASE 2 BEFORE

Worcestershire Voices Worcestershire Voices Civil Society after Civil Society after

Looked After Children GP PLT training sessions July 2020 Kent and Medway CCG Looked After

Orange County Citizens Com m ission for Children ( CCC) After School/ Sum m er Program s

VIP turbo / After burner Developing on Afterburner VIP turbo and After burner HIB HIB Modem

Rebuilding after Hurricane Katrina: Rebuilding after Hurricane Katrina: Neighborhood Planning

The Ramifications After Article 50: of After Article 50: The Ramifications of Brexit Contents /

MONTANA NO KID HUNGRY BREAKFAST AFTER THE BELL 1 in 5 Breakfast after the Bell On average,

Trade Show Logistics Before After Before After Topic Overview Budgeting Show Venue

Acute and late toxicity after Acute and late toxicity after fractionated total body fractionated

CDE After School Division Stakeholder Survey and Interview Results Stakeholder Survey and

Enhance Recover After Surgery and Colorectal Bundles Nichole Schmidt, MSN, RN, CNOR, CNL

A private, after-hours program at What is Dusk at the Park? A private, after-hours program

Welcome to the 2015 Cyber Risk Insights Conference! @Advisen #CyberRisk Welcoming Remarks Bill

Secure Genome me Analysis The privacy workshop is jointly

Cybersecurity for Future Presidents Lecture 8: How can individuals be associated with actions in

Nanopore Sequencing Technology and Tools for Genome Assembly: Computational Analysis of the

Public Genomics Enabling Responsible Permissions You are free to : Copy, share, adapt, or

Graduate education activities to foster awareness of social implications of genetics John

The Math & Science Exploratory School Arin M. Rus usch, , Principa incipal The Basics

Temet Nosce: Know Thy Endpoint Through and Through Thomas V. Fischer I am Threat

what I am after Statistical modelling and analysis do not from - PDF document

Why graphical models in R? what I am after Statistical modelling and analysis do not from gR2002 respect boundaries of model classes Software should encourage and support good practice - and graphical models are good practice!

GLO Science Professional Before &amp; After Images Before GLO After GLO Before GLO After GLO

Floor Cleaning By Vacuum After vacuum Cleaning After vacuum Cleaning After vacuum Cleaning

Watup, could you pick up this thing for me NEARBY? HOME NEARBY After 20 metres After 10 metres

MALE INFERTILITY CASE-I Before Treatment: After Treatment: After Treatment: CASE 2 BEFORE

Worcestershire Voices Worcestershire Voices Civil Society after Civil Society after

Looked After Children GP PLT training sessions July 2020 Kent and Medway CCG Looked After

Orange County Citizens Com m ission for Children ( CCC) After School/ Sum m er Program s

VIP turbo / After burner Developing on Afterburner VIP turbo and After burner HIB HIB Modem

Rebuilding after Hurricane Katrina: Rebuilding after Hurricane Katrina: Neighborhood Planning

The Ramifications After Article 50: of After Article 50: The Ramifications of Brexit Contents /

MONTANA NO KID HUNGRY BREAKFAST AFTER THE BELL 1 in 5 Breakfast after the Bell On average,

Trade Show Logistics Before After Before After Topic Overview Budgeting Show Venue

Acute and late toxicity after Acute and late toxicity after fractionated total body fractionated

CDE After School Division Stakeholder Survey and Interview Results Stakeholder Survey and

Enhance Recover After Surgery and Colorectal Bundles Nichole Schmidt, MSN, RN, CNOR, CNL

A private, after-hours program at What is Dusk at the Park? A private, after-hours program

Welcome to the 2015 Cyber Risk Insights Conference! @Advisen #CyberRisk Welcoming Remarks Bill

Secure Genome me Analysis The privacy workshop is jointly

Cybersecurity for Future Presidents Lecture 8: How can individuals be associated with actions in

Nanopore Sequencing Technology and Tools for Genome Assembly: Computational Analysis of the

Public Genomics Enabling Responsible Permissions You are free to : Copy, share, adapt, or

Graduate education activities to foster awareness of social implications of genetics John

The Math &amp; Science Exploratory School Arin M. Rus usch, , Principa incipal The Basics

Temet Nosce: Know Thy Endpoint Through and Through Thomas V. Fischer I am Threat

GLO Science Professional Before & After Images Before GLO After GLO Before GLO After GLO

The Math & Science Exploratory School Arin M. Rus usch, , Principa incipal The Basics