Bayesian Updating: Continuous Priors 18.05 Spring 2014 Jeremy Orloff - PowerPoint PPT Presentation

Bayesian Updating: Continuous Priors 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom

Beta distribution Beta ( a , b ) has density ( a + b − 1)! θ a − 1 (1 − θ ) b − 1 f ( θ ) = ( a − 1)!( b − 1)! http://ocw.mit.edu/ans7870/18/18.05/s14/applets/beta-jmo.html Observation: The coefficient is a normalizing factor, so if we have a pdf f ( θ ) = c θ a − 1 (1 − θ ) b − 1 then θ ∼ beta( a , b ) and ( a + b − 1)! c = ( a − 1)!( b − 1)! May 30, 2014 2 / 18

Board question preamble: beta priors Suppose you have a coin with unknown probability of heads θ . You don’t know that it’s fair, but your prior belief is that it’s probably not too unfair. You capture this intuition in with a beta(5,5) prior on θ . Beta(5,5) for θ 2.0 1.0 0.0 0.0 0.2 0.4 0.6 0.8 1.0 In order to sharpen this distribution you take data and update the prior. Question on next slide. May 30, 2014 3 / 18

Board question: beta priors ( a + b − 1)! θ a − 1 (1 − θ ) b − 1 Beta ( a , b ): f ( θ ) = ( a − 1)!( b − 1)! Coin has prior f ( θ ) ∼ beta(5 , 5) 1. Suppose you flip 10 times and get 6 heads. Find the posterior distribution on θ . Identify the type of the posterior distribution. 2. Suppose you recorded the order of the flips and got H H H T T H H H T T. Find the posterior based on this data. 3. Using your answer to (2) give an integral for the posterior predictive probability of heads on the next toss. 4. Use what you know about pdf’s to evaluate the integral without computing it directly May 30, 2014 4 / 18

Predictive probabilities Continuous hypotheses θ , discrete data x 1 , x 2 , . . . (Assume trials are independent.) Prior predictive probability � p ( x 1 ) = p ( x 1 | θ ) f ( θ ) d θ Posterior predictive probability � p ( x 2 | x 1 ) = p ( x 2 | θ ) f ( θ | x 1 ) d θ Analogous to discrete hypotheses: H 1 , H 2 , . . . . 1 1 n n p ( x 1 ) = p ( x 1 | H i ) P ( H i ) p ( x 2 | x 1 ) = p ( x 2 | H i ) p ( H i | x 1 ) . i =1 i =1 May 30, 2014 5 / 18

Concept Question Suppose your prior f ( θ ) in the bent coin example is Beta(6 , 8). You flip the coin 7 times, getting 2 heads and 5 tails. What is the posterior pdf f ( θ | x )? 1. Beta(2,5) 2. Beta(3,6) 3. Beta(6,8) 4. Beta(8,13) May 30, 2014 6 / 18

Continuous priors, continuous data Bayesian update tables with and without infinitesimals unnormalized hypoth. prior likeli. posterior posterior f ( θ | x ) = f ( x | θ ) f ( θ ) θ f ( θ ) f ( x | θ ) f ( x | θ ) f ( θ ) f ( x ) total 1 f ( x ) 1 unnormalized hypoth. prior likeli. posterior posterior f ( θ | x ) d θ = f ( x | θ ) f ( θ ) d θ dx θ ± d θ f ( θ ) d θ f ( x | θ ) dx f ( x | θ ) f ( θ ) d θ dx 2 f ( x ) dx total 1 f ( x ) dx 1 � � f ( x ) = f ( x | θ ) f ( θ ) d θ May 30, 2014 7 / 18

Normal prior, normal data N( µ, σ 2 ) has density 1 − ( y − µ ) 2 / 2 σ 2 √ f ( y ) = e . σ 2 π Observation: The coefficient is a normalizing factor, so if we have a pdf − ( y − µ ) 2 / 2 σ 2 f ( y ) = c e then y ∼ N( µ, σ 2 ) and 1 c = √ σ 2 π May 30, 2014 8 / 18

Board question: normal prior, normal data 1 − ( y − µ ) 2 / 2 σ 2 N( µ, σ 2 ) has pdf: f ( y ) = √ e . σ 2 π Suppose our data follows a N( θ, 4) distribution with unknown mean θ and variance 4. That is f ( x | θ ) = pdf of N( θ, 4) Suppose our prior on θ is N(3 , 1). Suppose we obtain data x 1 = 5. 1. Use the data to find the posterior pdf for θ . Write out your tables clearly. Use (and understand) infinitesimals. May 30, 2014 9 / 18

Solution graphs prior = blue; posterior = purple; data = red Data: x 1 = 5 Prior: µ prior = 3; σ prior = 1 Posterior is normal µ posterior = 3 . 4; σ posterior = 0 . 894 May 30, 2014 10 / 18

Board question: Romeo and Juliet Romeo is always late. How late follows a uniform distribution uniform(0 , θ ) with unknown parameter θ in hours. Juliet knows that θ ≤ 1 hour and she assumes a flat prior for θ on [0 , 1]. On their first date Romeo is 15 minutes late. (a) find and graph the prior and posterior pdf’s for θ (b) find and graph the prior predictive and posterior predictive pdf’s of how late Romeo will be on the second data (if he gets one!). May 30, 2014 11 / 18

Solution continued Prior and posterior pdf’s for θ . May 30, 2014 12 / 18

Solution continued Prior (red) and posterior (blue) predictive pdf’s for x 2 May 30, 2014 13 / 18

From discrete to continuous Bayesian updating Bent coin with unknown probability of heads θ . Data x 1 : heads on one toss. Start with a flat prior and update: unnormalized hyp. prior likelihood posterior posterior θ d θ θ θ d θ 2 θ d θ J 1 Total 1 0 θ d θ = 1 / 2 1 Posterior pdf: f ( θ | x 1 ) = 2 θ . May 30, 2014 14 / 18

Approximate continuous by discrete approximate the continuous range of hypotheses by a finite number of hypotheses. create the discrete updating table for the finite number of hypotheses. consider how the table changes as the number of hypotheses goes to infinity. May 30, 2014 15 / 18

Chop [0 , 1] into 4 intervals hypothesis prior un. posterior posterior likelihood θ = 1 / 8 1/4 1 / 8 (1 / 4) × (1 / 8) 1 / 16 θ = 3 / 8 1/4 3 / 8 (1 / 4) × (3 / 8) 3 / 16 θ = 5 / 8 1/4 5 / 8 (1 / 4) × (5 / 8) 5 / 16 θ = 7 / 8 1/4 7 / 8 (1 / 4) × (7 / 8) 7 / 16 n 1 – θ i ∆ θ Total 1 1 i =1 May 30, 2014 16 / 18

Chop [0 , 1] into 12 intervals hypothesis prior un. posterior posterior likelihood θ = 1 / 24 1/12 1 / 24 (1 / 12) × (1 / 24) 1 / 144 θ = 3 / 24 1/12 3 / 24 (1 / 12) × (3 / 24) 3 / 144 θ = 5 / 24 1/12 5 / 24 (1 / 12) × (5 / 24) 5 / 144 θ = 7 / 24 1/12 7 / 24 (1 / 12) × (7 / 24) 7 / 144 θ = 9 / 24 1/12 9 / 24 (1 / 12) × (9 / 24) 9 / 144 θ = 11 / 24 1/12 11 / 24 (1 / 12) × (11 / 24) 11 / 144 θ = 13 / 24 1/12 13 / 24 (1 / 12) × (13 / 24) 13 / 144 θ = 15 / 24 1/12 15 / 24 (1 / 12) × (15 / 24) 15 / 144 θ = 17 / 24 1/12 17 / 24 (1 / 12) × (17 / 24) 17 / 144 θ = 19 / 24 1/12 19 / 24 (1 / 12) × (19 / 24) 19 / 144 θ = 21 / 24 1/12 21 / 24 (1 / 12) × (21 / 24) 21 / 144 θ = 23 / 24 1/12 23 / 24 (1 / 12) × (23 / 24) 23 / 144 n 1 – θ i ∆ θ Total 1 1 i =1 May 30, 2014 17 / 18

Density historgram Density historgram for posterior pmf with 4 and 20 slices. density 2 density 2 1.5 1.5 1 1 .5 .5 x x 1/8 3/8 5/8 7/8 The original posterior pdf is shown in red. May 30, 2014 18 / 18

MIT OpenCourseWare http://ocw.mit.edu 18.05 Introduction to Probability and Statistics Spring 2014 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

Bayesian Updating: Continuous Priors 18.05 Spring 2014 Jeremy Orloff - PowerPoint PPT Presentation

Bayesian Updating: Continuous Priors 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Beta distribution Beta ( a , b ) has density ( a + b 1)! a 1 (1 ) b 1 f ( ) = ( a 1)!( b 1)!

Bayesian Updating: Continuous Priors 18.05 Spring 2014 7 6 5 4 3 2 1 0 0 . 1 0 . 3 0 . 5 0

Bayesian Updating: Continuous Priors 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Problem 6a

Bayesian Updating: Continuous Priors 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Problem 6a

Conjugate Priors: Beta and Normal; Choosing Priors 18.05 Spring 2014 Jeremy Orloff and Jonathan

Conjugate Priors: Beta and Normal; Choosing Priors 18.05 Spring 2014 Jeremy Orloff and Jonathan

Mixture of g Priors for Bayesian Variable Selection Feng Liang, Rui Paulo et al. Sheng Zhang

Conjugate Priors: Beta and Normal 18.05 Spring 2018 Review: Continuous priors, discrete data

Choosing Priors Probability Intervals 18.05 Spring 2014 Conjugate priors A prior is conjugate

Bayesian Updating: Discrete Priors: 18.05 Spring 2014 http://xkcd.com/1236/ January 1, 2017

Bayesian Updating: Discrete Priors: 18.05 Spring 2014 http://xkcd.com/1236/ January 1, 2017

Bayesian Updating: Discrete Priors: 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Courtesy of

Function Space Priors in Bayesian Deep Learning Roger Grosse Motivation Today Bayesian deep

Beyond Uniform Priors in Bayesian Network Structure Learning (for Discrete Bayesian Networks)

Being Bayesian About Being Bayesian About Net work St ruct ure Net work St ruct ure A Bayesian

Outline Intro to RL and Bayesian Learning History of Bayesian RL Model-based Bayesian

Updating Autonomous Start to an Updating Autonomous Start to an RTK Field Survey (Part II) RTK

On 2 n 2D M Mate terials & s & Thei heir Defects ects under electron irradiatjon

Non-perturbative determination of improvement coefficients using coordinate space methods in N f =

A variational finite volume scheme for Wasserstein gradient flows es 1 , T. O. Gallou et 2 , G.

HAMILTONIAN STRUCTURE OF THE BFCG THEORY Marko Vojinovi c Institute of Physics, University of

Confidence Intervals for Normal Data 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Agenda

The Medical Resident App a UX perspective Bernard Schokman Design Moshpit @HISA_HIC #HIC18

Feb 13 Primary Productivity: Comparison among Biomes Central Surinam Reserve, Humid Tropical

New directions in attosecond physics Katalin Varj ELI-ALPS, Hungary Winter College on Extreme

Sambuz

Useful Links

Newsletter

Mail Us

Bayesian Updating: Continuous Priors 18.05 Spring 2014 Jeremy Orloff - PowerPoint PPT Presentation

Bayesian Updating: Continuous Priors 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Beta distribution Beta ( a , b ) has density ( a + b 1)! a 1 (1 ) b 1 f ( ) = ( a 1)!( b 1)!

Bayesian Updating: Continuous Priors 18.05 Spring 2014 7 6 5 4 3 2 1 0 0 . 1 0 . 3 0 . 5 0

Bayesian Updating: Continuous Priors 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Problem 6a

Bayesian Updating: Continuous Priors 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Problem 6a

Conjugate Priors: Beta and Normal; Choosing Priors 18.05 Spring 2014 Jeremy Orloff and Jonathan

Conjugate Priors: Beta and Normal; Choosing Priors 18.05 Spring 2014 Jeremy Orloff and Jonathan

Mixture of g Priors for Bayesian Variable Selection Feng Liang, Rui Paulo et al. Sheng Zhang

Conjugate Priors: Beta and Normal 18.05 Spring 2018 Review: Continuous priors, discrete data

Choosing Priors Probability Intervals 18.05 Spring 2014 Conjugate priors A prior is conjugate

Bayesian Updating: Discrete Priors: 18.05 Spring 2014 http://xkcd.com/1236/ January 1, 2017

Bayesian Updating: Discrete Priors: 18.05 Spring 2014 http://xkcd.com/1236/ January 1, 2017

Bayesian Updating: Discrete Priors: 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Courtesy of

Function Space Priors in Bayesian Deep Learning Roger Grosse Motivation Today Bayesian deep

Beyond Uniform Priors in Bayesian Network Structure Learning (for Discrete Bayesian Networks)

Being Bayesian About Being Bayesian About Net work St ruct ure Net work St ruct ure A Bayesian

Outline Intro to RL and Bayesian Learning History of Bayesian RL Model-based Bayesian

Updating Autonomous Start to an Updating Autonomous Start to an RTK Field Survey (Part II) RTK

On 2 n 2D M Mate terials &amp; s &amp; Thei heir Defects ects under electron irradiatjon

Non-perturbative determination of improvement coefficients using coordinate space methods in N f =

A variational finite volume scheme for Wasserstein gradient flows es 1 , T. O. Gallou et 2 , G.

HAMILTONIAN STRUCTURE OF THE BFCG THEORY Marko Vojinovi c Institute of Physics, University of

Confidence Intervals for Normal Data 18.05 Spring 2014 Jeremy Orloff and Jonathan Bloom Agenda

The Medical Resident App a UX perspective Bernard Schokman Design Moshpit @HISA_HIC #HIC18

Feb 13 Primary Productivity: Comparison among Biomes Central Surinam Reserve, Humid Tropical

New directions in attosecond physics Katalin Varj ELI-ALPS, Hungary Winter College on Extreme

Sambuz

Useful Links

Newsletter

Mail Us

On 2 n 2D M Mate terials & s & Thei heir Defects ects under electron irradiatjon