I know what you ate last summer! with some uncertainty, of course - PowerPoint PPT Presentation

I know what you ate last summer! • … with some uncertainty, of course … … or not. • Outline:  Practical use of Bayesian statistics for simple problems. Example.  Bayes for evidence synthesis.  Bayes for source attribution.  Bayes for acute food consumption risk and prediction.

Bayes for risk assessment in food safety • Food safety depends on lots of things from farm-to-fork Farm Processing Restaurants Consumer Retail • Not enough to ’ know what you typically eat ’ , but also: • how much/often you eat, • how you made/kept it, • where did you buy it, • and how it was produced ! • Some data from these steps. • Bayesian methods exploited to quantify probabilities.

• More Bayesian Food Safety Risk Assessment applications 3 2.12.2013. Bio-Bayes, jukka.ranta@evira.fi

• Biofilm production by 10 strains of Practical usefull llness: S.Enteritidis on cutting boards doin ing sim imple statistics Material None Weak Moderate Strong Often small sample analyses are done using various statistical tests. Wood 4 5 1 0 Worries: Plastic 6 4 0 0  What test to use? Glass 9 1 0 0  Is the sample size large enough?  • What are we really asking? Is the number of blocks/groups large enough? • Which material is safest?  Interpretation of results: reject H 0 or not, • How does it translate to a statistical question? and what to say then?  Multiple testing problems … • Q1: do the materials differ?  Testing just because of the habit? • Q2: which material has the highest P(None)? [Foodborne pathogens and disease 15, (2), 2018. 81-85.]

Bayesian formula lation Density Density 0.0 2.0 0.0 2.0 of the proble lem 0.0 0.5 1.0 0.0 0.5 1.0 Model: Multinomial probabilities P(None) on Wood P(None) on Plastic p 1 ,p 2 ,p 3 ,p 4 = Density 0.0 4.0 P(None), P(Weak), P(Moderate), P(Strong) Compute P( p 1 ,p 2 ,p 3 ,p 4 | data) for each 0.0 0.5 1.0 1.5 material P(None) on Glass  P( p1 highes Typical prior: Dirichlet (1/4,…,1/4)   Posterior: Dirichlet(x 1 +1/4,…,x 4 +1/4) 0.0 0.5 1.0  All conclusions produced from this!  For example:  P( P(None) is highest on Glass ) 0 1 2 3 4 Material

Take home recipe: simple-to-run code for OpenBUGS/WinBUGS model{ for(i in 1:materials){ pnone[i] <- p[i,1] p[i,1:k] ~ ddirch(a[i,1:k]) for(j in 1:k){a[i,j] <- x[i,j]+1/k} } largest.value <- ranked(pnone[],materials) for(i in 1:materials){ which[i] <- equals(pnone[i],largest.value)*i } pnonelargest <- sum(which[]) } # data: list(materials=3,k=4, x=structure(.Data=c( Simple Bayesian models for simple problems can also be useful, 4,5,1,0, and not too hard to implement. 6,4,0,0, 9,1,0,0),.Dim=c(3,4)))

Simple evidence synthesis: N( m , s 2 ) m , s 2 N( m , s 2 ) DATA 1 Reported log-concentrations: data often modeled with parametric distributions, DATA 1: measurements e.g. normal. DATA1: this goes in easily!

Simple evidence synthesis: N( m , s 2 ) m , s 2 N( m , s 2 ) Some data could be reported only as averages DATA 1 Include also DATA2 DATA 1: measurements N( m , s 2 /10) DATA 2 DATA 2: averages of 10 measurements

Simple evidence synthesis: N( m , s 2 ) Or reported differences N( 0 ,2 s 2 ) m , s 2 N( m , s 2 ) DATA3 goes in too! DATA 3 DATA 1 DATA 3: differences of two measurements DATA 1: measurements N( m , s 2 /10) DATA 2 DATA 2: averages of 10 measurements

Simple evidence synthesis: N( m , s 2 ) N( 0 ,2 s 2 ) m , s 2 N( m , s 2 ) DATA 3 F (c, m , s 2 ) DATA 1 DATA 3: differences of two measurements DATA 1: measurements N( m , s 2 /10) Reported DATA 4 DATA 2 values below c DATA4 DATA 4: censored measurements DATA 2: averages of 10 measurements

If there is a model, there’s a way Maximum likelihood estimation Bayesian inference • Construct full likelihood of all • Construct full likelihood of all datasets. datasets. • Maximise to get ML-estimates • Define prior distributions. • Higher dimensions can become • Simulate the posterior difficult. distribution using MCMC (BUGS,JAGS,STAN,own sampler). • Multiple maxima? • Aiming to get the uncertainty • Aiming to get the single distribution of all parameters. estimate .

Is there Campylobacter in the broiler you get? • Your broilers are ’ sampled ’ from production batches. • There is variability between batches and within batches.  consumers ’ risk

Do we have enough evidence for an estimate? • There were two (Swedish) data sets: • A: representing only one broiler from each batch, 10 slaughterhouses, 705 batches, sampled in a representative way. Result: positive/negative, & concentration if positive.  88 pos, 617 neg, hence 88 conc. values. • B: representing the mean and SD of log-concentrations, from 5 to 25 positive broilers per batch, from 20 positive batches, and the # posit/negat broilers in each batch.

Complementing evidence from both • A: information about mean and total variance of concentrations in positive broilers, but nothing about within-batch prevalence*, or variance components. (*) if we assume within-batch prevalence 100%, can estimate batch prevalence. • B: information on within-batch parameters for positive batches, but nothing on overall batch prevalence. • Make a synthesis of A & B with a Bayesian model.

Just like in the example before: models connected with common parameters m s b m j ’’ N j ’’ q N’ m j' a p j’’ x j ’’ J’ s w y , SD( ) y y 1 j '   ' ' ' ' j j N j’’ /batch + data 1/batch data

Posterior distributions for the two variance components

Estimation from a synthesis is interesting , but there’s more than that … • A M icrobiological C riterion ( MC ) can be placed for the acceptance of a batch. • This would be based on sampling results, batch by batch. • When bad batches are rejected, consumers ’ risk is reduced. • But producers ’ costs are increased if too many batches are rejected! consumers ’ risk VS producers ’ risk

What does the outcome from such test sample represent? - Additional evidence. • Can use Bayesian model to revise the estimates for PREDICTED ACCEPTED batches. • This determines the new consumer risk, under such criterion. • Can also calculate the probability of rejection for batches  predicted percentage of lost batches. • A criterion could be : ”n/c/m” = ”at most c samples out of n are allowed to have concentration >m”. • HOW TO CHOOSE n/c/m ? • Uncertainty analysis involves 2D Monte Carlo (MC within MCMC).

Finding an optimal criterion, accounting for uncertainties. RR = risk ratio = risk when MC is met / risk if no MC was applied. P(MC not met) = percentage of rejected batches.

Classification problems : ’ source attribution ’ 5 5 B 15 A 10 20 20 ? 75 ? 50 15 5 C 15 15 ? 10 D ? 70 30 30 13.10.2016 23

• Bacteria types sampled from a few broad food categories, denoted as ’the sources ’. 10 5 5 5 15 15 15 20 15 20 10 70 30 30 75 50 • E.g. broilers (samples from meat and/or animals), • Likewise turkey , cattle, pigs, etc. • Possibly also other exposures: swimming waters, environment ,… • Bacteria types from human isolates taken as a mixture sample of sources . • Problem: assuming human isolates (somehow) originated from those sources, • classify each isolate into sources. • estimate what fraction of cases are generally from which source (mixture proportions).

Proportion (q 1 ) of types 1,…,J in source 1 Number of types 1,...,J among human cases. q 11 , … , q 1J Y 1 , … , Y J p 1 X 11 , … , X 1J Number of types 1,...,J in sample. p 1 q 1 + p 2 q 2 Proportion (q 2 ) of types 1,...,J in source 2 p 2 q 21 , … , q 2J X 21 , … , X 2J Number of types 1,...,J in sample.

Bayesian classification methods • Naive Bayes classifier with sources i = 1,…,I, and types j=1,…,J • P(source i | type j) = P(type j | source i) P(source i) / const • P(source i) = 1/I, prior probability , i=1,…,I sources. • P(type j| source i) = Multinomial( q i *,1) with estimated type frequencies q i * directly from data: q ij * = x ij / n i or smoothed: (x ij +1/J)/ (n i +1). • If P(source i) = p i with prior P(p i ), we obtain posterior distribution: P(I 1 ,…,I N ,p 1 ,…,p I | x,y ) for the population fractions p (mixture proportions), and source labels I n for each human case, based on source samples x and human samples y.

Bayesian classification methods • Posterior predictive classifier • For a single new isolate in a source i , predictive probability: P(type j |x i ) = P ij   a   a ( ) ( 1 )  ij ij from the integral (predictiv e distributi on) : P    a  a ij ( 1 ) ( ) ij ij  a a Multin( ,..., , 1 ) Dir( ,..., | ,..., ) q q q q Dq 1 1 1 i iJ i iJ i iJ • a j are parameters of the posterior distribution of the type frequencies q in that source. • These predictive probabilities can be used to evaluate P(source i | type j, x 1, …, x I ) = P(type j | x i ) P(source i) / const.

I know what you ate last summer! with some uncertainty, of course - PowerPoint PPT Presentation

I know what you ate last summer! with some uncertainty, of course or not. Outline: Practical use of Bayesian statistics for simple problems. Example. Bayes for evidence synthesis. Bayes

What You Dont Know What You Dont Know What You Dont Know What You Dont Know That

(11-14) How much do you know about the internet? Make sure you stay SAFE AND SECURE ONLINE YOU

Grammars & Parsing Lecture 12 CS 2112 Fall 2018 Motivation The cat ate the rat. Not

SDHSA S TADIUM I MPROVEMENTS P ROJECT P HASE 1 B S TART D ATE : 3/17/14 E ST . C OMPLETION D

www.evalu-ate.org www.evalu-ate.org 1 Strategies for Effective Evaluation Reporting ATE PI

OCMS F IELD , P ATH OF T RAVEL , A RT , C-S MART L AB U PGRADES S TART D ATE 5/16/14 C

Know how. Know now. Know how. Know now. Please Thank our sponsor! The Nebraska Soybean Board

WELCOME! You need to know what you know, and know what you dont know. Then work on your areas

We Know It ! We Know It ! WeKnowIt WeKnowIt Emerging, Collective Intelligence for personal,

I know who you clicked last summer Svenja Schr oder December 30, 2007 Svenja Schr oder I

SUMMER BRAIN GAIN: REIMAGINING SUMMER LEARNING What is the problem? Why Summer Matters There is

The Art The Art when you don't know! Define what you want when you do know! of of Know

Str ate gic L ow De cline , L ight Oil Acquisition Cr e ate s Sustainable Yie ld +

Clim ate change and UK Clim ate change and UK airport expansion airport expansion Cait Hew itt

Mor e than wor ds? Mapping the Cor por ate Soc ial Re sponsibility (CSR) - Str ate gy

Summer Salary MARCH 20, 2019 Todays Agenda What is Summer Salary? Key Considerations

HL7 middleware framework for laboratory notification for notifiable diseases Presented by Donald

Modelling and monitoring of epidemic phenomena using the surveillance package in R Sebastian Meyer

PulseNet/OutbreakNet Regional Meeting High-Burden State Perspective Hope Dishman, MPH and Stacy

Microbiology and Food Safety B. A. Reiling Mmm, Is this food really safe for my family? As a

BIOSENSING FOR HEALTH APPLICATIONS Department of Informatics Intelligent Robotics WS 2015/16 Hwei

Re-Imagining Cancer Care: Leading with Capability, Comfort, and Calm Our Time Together

Cancer in the Workplace Larita Howard, PHR Manager, Human Resources Arizona Division Laurie

http://www.masoniccanceralliance.org/cancer-survivorship-echo-series.html K. Allen Greiner ,MD

I know what you ate last summer! with some uncertainty, of course - PowerPoint PPT Presentation

I know what you ate last summer! with some uncertainty, of course or not. Outline: Practical use of Bayesian statistics for simple problems. Example. Bayes for evidence synthesis. Bayes

What You Dont Know What You Dont Know What You Dont Know What You Dont Know That

(11-14) How much do you know about the internet? Make sure you stay SAFE AND SECURE ONLINE YOU

Grammars &amp; Parsing Lecture 12 CS 2112 Fall 2018 Motivation The cat ate the rat. Not

SDHSA S TADIUM I MPROVEMENTS P ROJECT P HASE 1 B S TART D ATE : 3/17/14 E ST . C OMPLETION D

www.evalu-ate.org www.evalu-ate.org 1 Strategies for Effective Evaluation Reporting ATE PI

OCMS F IELD , P ATH OF T RAVEL , A RT , C-S MART L AB U PGRADES S TART D ATE 5/16/14 C

Know how. Know now. Know how. Know now. Please Thank our sponsor! The Nebraska Soybean Board

WELCOME! You need to know what you know, and know what you dont know. Then work on your areas

We Know It ! We Know It ! WeKnowIt WeKnowIt Emerging, Collective Intelligence for personal,

I know who you clicked last summer Svenja Schr oder December 30, 2007 Svenja Schr oder I

SUMMER BRAIN GAIN: REIMAGINING SUMMER LEARNING What is the problem? Why Summer Matters There is

The Art The Art when you don't know! Define what you want when you do know! of of Know

Str ate gic L ow De cline , L ight Oil Acquisition Cr e ate s Sustainable Yie ld +

Clim ate change and UK Clim ate change and UK airport expansion airport expansion Cait Hew itt

Mor e than wor ds? Mapping the Cor por ate Soc ial Re sponsibility (CSR) - Str ate gy

Summer Salary MARCH 20, 2019 Todays Agenda What is Summer Salary? Key Considerations

HL7 middleware framework for laboratory notification for notifiable diseases Presented by Donald

Modelling and monitoring of epidemic phenomena using the surveillance package in R Sebastian Meyer

PulseNet/OutbreakNet Regional Meeting High-Burden State Perspective Hope Dishman, MPH and Stacy

Microbiology and Food Safety B. A. Reiling Mmm, Is this food really safe for my family? As a

BIOSENSING FOR HEALTH APPLICATIONS Department of Informatics Intelligent Robotics WS 2015/16 Hwei

Re-Imagining Cancer Care: Leading with Capability, Comfort, and Calm Our Time Together

Cancer in the Workplace Larita Howard, PHR Manager, Human Resources Arizona Division Laurie

http://www.masoniccanceralliance.org/cancer-survivorship-echo-series.html K. Allen Greiner ,MD

Grammars & Parsing Lecture 12 CS 2112 Fall 2018 Motivation The cat ate the rat. Not