Adaptive rejection Metropolis sampling Dr. Jarad Niemi STAT 615 - - PowerPoint PPT Presentation

Adaptive rejection Metropolis sampling Dr. Jarad Niemi STAT 615 - Iowa State University November 14, 2017 Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 1 / 20

(Logarithmically) Concave Univariate Function A function p ( θ ) is concave if p ((1 − t ) x + t y ) ≥ (1 − t ) p ( x ) + t p ( y ) for any 0 ≤ t ≤ 1 . p(y) p(x) x y If p ( x ) is twice differentiabe, then p ( x ) is concave if and only if p ′′ ( x ) ≤ 0 . A function p ( x ) is log-concave if log p ( x ) is concave. Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 2 / 20

Examples X ∼ N (0 , 1) has a log-concave density since dx 2 log e − x 2 / 2 = d 2 d 2 dx 2 − x 2 / 2 = d dx − x = − 1 . X ∼ Ca (0 , 1) has a non-log-concave density since 1 + x 2 = 2( x 2 − 1) d 2 1 + x 2 = d 1 − 2 x dx 2 log (1 + x 2 ) 2 . dx 1e−01 distribution density cauchy 1e−03 exponential normal 1e−05 0 1 2 3 4 5 x Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 3 / 20

Log-concave distributions Log-concave distributions normal exponential Uniform Laplace Gamma (shape parameter is ≥ 1 ) Wishart ( n ≥ p + 1 ) Dirichlet (all parameters ≥ 1 ) Non-log-concave distributions Log-normal Student t F -distribution Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 4 / 20

Exponential distribution An exponential distribution has pdf p ( θ ; b ) = be − bθ and thus has log-density log p ( θ ; b ) = log( b ) − bθ which is trivially log-concave since d 2 dθ 2 log( b ) − bθ = d dθ − b = 0 ≤ 0 . The exponential distribution, or exponential function, is unique in that it matches the bound for the definition of log-concavity. Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 5 / 20

Prior-posterior example The product of log-concave functions is also log-concave since � n n � � � log p i ( x ) = log p i ( x ) . i =1 i =1 Assume ind Y i ∼ N ( θ, 1) and θ ∼ La (0 , 1) then the posterior � n � � p ( θ | y ) ∝ N ( y i ; θ, 1) La ( θ ; 0 , 1) i =1 is log-concave since - N ( y i ; θ, 1) is a log-concave function for θ for each y i and - La ( θ ; 0 , 1) is a log-concave distribution. Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 6 / 20

Rejection sampling Suppose we are interested in sampling from a target distribution p ( θ | y ) using a proposal q ( θ ) . To use this algorithm, we must find M ≥ p ( θ | y ) q ( θ ) ∀ θ where the optimal M is sup θ p ( θ | y ) /q ( θ ) . Rejection sampling performs the following 1. Sample θ ∼ q ( θ ) . 2. Accept θ as a draw from p ( θ | y ) with probability 1 p ( θ | y ) M q ( θ ) otherwise return to step 1. Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 7 / 20

Rejection sampling envelope Target N + (0 , 1) and proposal Exp (1) . Then 2 /πe − θ 2 / 2 � ≤ 1 . 315489 = M e − θ 1.00 1.0 density log density 0.10 0.5 0.0 0 1 2 3 0.01 x 0 1 2 3 distribution Exp(1) M*Exp(1) N^+(0,1) x Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 8 / 20

Rejection sampling example 0.9 M*Exp(x;1) accept 0.6 FALSE TRUE 0.3 0.0 0 1 2 3 4 5 x Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 9 / 20

Adaptive rejection sampling Idea: build a piece-wise linear envelope to the log-density as a proposal distribution log density density Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 10 / 20

Pseudo-algorithm for adaptive rejection sampling 1. Choose starting locations θ , call the set Θ 2. Construct piece-wise linear envelope log q ( θ ) to the log-density a. Calculate log f ( θ | y ) and (log f ( θ | y )) ′ . b. Find line intersections 3. Sample a proposed value θ ∗ from the envelope q ( θ ) a. Sample an interval b. Sample a truncated (and possibly negative of an) exponential r.v. 4. Perform rejection sampling a. Sample u ∼ Unif (0 , 1) b. Accept θ ∗ if u ≤ f ( θ ∗ | y ) /q ( θ ∗ ) . 5. If rejected, add θ ∗ to Θ and return to 2. Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 11 / 20

Adaptive rejection sampling (ARS) in R library(ars) f = function(x) -x^2/2 # log of standard normal density fp = function(x) -x # derivative of log of standard normal density x = ars(1e4, f, fp) ARS samples 0.4 0.3 Density 0.2 0.1 0.0 −2 0 2 4 x Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 12 / 20

ARS in R - non-log-concave density f = function(x) log(1/(1+x^2)) # log of standard cauchy density fp = function(x) -2*x/(1+x^2) # derivative of log of cauchy density x = ars(1e4, f, fp) ## ## Error in sobroutine initial_... ## ifault= 5 Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 13 / 20

ARS in R - prior-posterior example ind Y i ∼ N ( θ, 1) and θ ∼ La (0 , 1) y = rnorm(10) f = Vectorize(function(theta) sum(-(y-theta)^2/2) - abs(theta)) fp = Vectorize(function(theta) sum((y-theta)) - (theta>0) + (theta<0)) x = ars(1e4, f, fp) Posterior for Normal data with Laplace prior on mean 1.5 1.0 Density 0.5 0.0 −1.0 −0.5 0.0 0.5 1.0 x Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 14 / 20

Comments on ARS Derivative free ARS Checking for log-concavity Decreasing derivatives Initial points for unbounded support: initial derivative must be positive final derivative must be negative Lower bound for multiple samples Connect points Probability of acceptance increases at subsequent steps Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 15 / 20

Adaptive rejection Metropolis sampling (ARMS) Adaptive rejection sampling is only suitable for log-concave densities. For non-log-concave densities adaptive rejection Metropolis sampling can be used Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 16 / 20

ARMS algorithm 1. Choose starting locations for θ , call the set Θ . 2. Construct piece-wise linear pseudo-envelope log q ( θ ) to log p ( θ | y ) . 3. Sample θ ∗ ∼ q ( θ ) and U ∼ Unif (0 , 1) . a. If U ≤ p ( θ ∗ | y ) /q ( θ ∗ ) , proceed to Step 4. b. Otherwise, add θ ∗ to Θ and return to 3. 4. Perform Metropolis step: Set θ ( i ) = θ ∗ with probability � � min { p ( θ ( i − 1) | y ) , q ( θ ( i − 1) ) } 1 , p ( θ ∗ | y ) min p ( θ ( i ) | y ) min { p ( θ ∗ | y ) , q ( θ ∗ ) } otherwise set θ ( i ) = θ ( i − 1) . Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 17 / 20

ARMS pseudo-envelope t_5 0 −1 −2 f(x, v) −3 −4 −5 −4 −2 0 2 4 x Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 18 / 20

ARMS in R f = function(x,mean) -(x-mean)^2/2 x = dlm::arms(runif(1,3,17), f, function(x,mean) ((x-mean)>-7)*((x-mean)<7), 1e4, mean=10) hist(x,101,prob=TRUE,main="Gaussian(10,1)") curve(dnorm(x,10), add=TRUE, lwd=2, col='red') Gaussian(10,1) 0.4 0.3 Density 0.2 0.1 0.0 6 8 10 12 14 Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 19 / 20 x

Theoretical consideration of ARMS ARMS is an independent Metropolis-Hastings algorithm Proposal changes, due to updating q , i.e. adding more points in to Θ , thus inhomogenous. We need to stop updating q at some point to enforce homogeneity. Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 20 / 20

Adaptive rejection Metropolis sampling Dr. Jarad Niemi STAT 615 - - PowerPoint PPT Presentation

Adaptive rejection Metropolis sampling Dr. Jarad Niemi STAT 615 - Iowa State University November 14, 2017 Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 1 / 20 (Logarithmically) Concave Univariate Function

Sampling Methods Oliver Schulte - CMPT 419/726 Bishop PRML Ch. 11 Sampling Rejection Sampling

Metropolis Sampling Ars` ene P erard-Gayot May 23, 2016 Introduction Background Metropolis

Adaptive Control Chapter 14: Adaptive regulation Rejection of unknown disturbances 1

Sampling Methods CMSC 678 UMBC Outline Recap Monte Carlo methods Sampling Techniques Uniform

Chapter 7. Sampling Chapter 7. Sampling methods? methods? Two types of sampling methods Two

Multiple importance sampling Slides for CS6630 lecture 6 sampling the BRDF sampling the

What is the strengths and weakness of these sampling methods? Sampling Strengths /

7 STEPS TO CRUSHING YOUR #REJECTION & GETTING BACK ON TRACK - FAST!

Acceptance-Rejection method The acceptance-rejection method is usually used when the inverse

Multi-parameter models - Metropolis sampling Applied Bayesian Statistics Dr. Earvin Balderama

Neural Nets for Adaptive Filter and Adaptive Neural Nets as Adaptive Filters Pattern Recognition

Adaptive Control Chapter 1: Introduction to Adaptive Control Adaptive Control Landau, Lozano,

Adaptive Control Chapter 11: Direct Adaptive Control 1 Adaptive Control Landau, Lozano,

Metropolis Of Boston Philoptochos Officers Workshop Saturday, November 23, 2013 Greek Orthodox

Metropolis-Hastings algorithm Dr. Jarad Niemi STAT 544 - Iowa State University April 2, 2019

Projet METROPOLIS METROlogie Pour LInternet et les Services Metropolis Project

Rejection Sampling Schemes for Extracting Uniform Distribution from Biased PUFs Rei Ueno , Kohei

CSE 473: Artificial Intelligence Bayesian Networks: Inference Hanna Hajishirzi Many slides over

Introduction to Machine Learning CMU-10701 Markov Chain Monte Carlo Methods Barnabs Pczos

Strategic Partnerships to Create Inclusive Career Pathways Tuesday, March 10th, 2020 3:00 PM -

Sampling from PDFs II CS295, Spring 2017 Shuang Zhao Computer Science Department University of

CS 4100: Artificial Intelligence Bayes Nets: Sampling Jan-Willem van de Meent, Northeastern

CS786 Lecture 13: May 14, 2012 Sampling techniques [KF Chapter 12] CS786 P. Poupart 2012 1

Inference Connecting models to data The problem with infection data Often only observe a

Adaptive rejection Metropolis sampling Dr. Jarad Niemi STAT 615 - - PowerPoint PPT Presentation

Adaptive rejection Metropolis sampling Dr. Jarad Niemi STAT 615 - Iowa State University November 14, 2017 Jarad Niemi (STAT615@ISU) Adaptive rejection Metropolis sampling November 14, 2017 1 / 20 (Logarithmically) Concave Univariate Function

Sampling Methods Oliver Schulte - CMPT 419/726 Bishop PRML Ch. 11 Sampling Rejection Sampling

Metropolis Sampling Ars` ene P erard-Gayot May 23, 2016 Introduction Background Metropolis

Adaptive Control Chapter 14: Adaptive regulation Rejection of unknown disturbances 1

Sampling Methods CMSC 678 UMBC Outline Recap Monte Carlo methods Sampling Techniques Uniform

Chapter 7. Sampling Chapter 7. Sampling methods? methods? Two types of sampling methods Two

Multiple importance sampling Slides for CS6630 lecture 6 sampling the BRDF sampling the

What is the strengths and weakness of these sampling methods? Sampling Strengths /

7 STEPS TO CRUSHING YOUR #REJECTION &amp; GETTING BACK ON TRACK - FAST!

Acceptance-Rejection method The acceptance-rejection method is usually used when the inverse

Multi-parameter models - Metropolis sampling Applied Bayesian Statistics Dr. Earvin Balderama

Neural Nets for Adaptive Filter and Adaptive Neural Nets as Adaptive Filters Pattern Recognition

Adaptive Control Chapter 1: Introduction to Adaptive Control Adaptive Control Landau, Lozano,

Adaptive Control Chapter 11: Direct Adaptive Control 1 Adaptive Control Landau, Lozano,

Metropolis Of Boston Philoptochos Officers Workshop Saturday, November 23, 2013 Greek Orthodox

Metropolis-Hastings algorithm Dr. Jarad Niemi STAT 544 - Iowa State University April 2, 2019

Projet METROPOLIS METROlogie Pour LInternet et les Services Metropolis Project

Rejection Sampling Schemes for Extracting Uniform Distribution from Biased PUFs Rei Ueno , Kohei

CSE 473: Artificial Intelligence Bayesian Networks: Inference Hanna Hajishirzi Many slides over

Introduction to Machine Learning CMU-10701 Markov Chain Monte Carlo Methods Barnabs Pczos

Strategic Partnerships to Create Inclusive Career Pathways Tuesday, March 10th, 2020 3:00 PM -

Sampling from PDFs II CS295, Spring 2017 Shuang Zhao Computer Science Department University of

CS 4100: Artificial Intelligence Bayes Nets: Sampling Jan-Willem van de Meent, Northeastern

CS786 Lecture 13: May 14, 2012 Sampling techniques [KF Chapter 12] CS786 P. Poupart 2012 1

Inference Connecting models to data The problem with infection data Often only observe a

7 STEPS TO CRUSHING YOUR #REJECTION & GETTING BACK ON TRACK - FAST!