CS786 Lecture 13: May 14, 2012

Sampling techniques [KF Chapter 12]

CS786 P. Poupart 2012 1

Sampling Techniques

  • Direct sampling
  • Rejection sampling
  • Likelihood weighting
  • Importance sampling
  • Markov chain Monte Carlo (MCMC)
    – Gibbs sampling
    – Metropolis‐Hastings

  • Sequential Monte Carlo sampling (a.k.a. particle filtering)


Approximate Inference by Sampling

  • Expectation: E_Pr[f(X)] = ∫ f(x) Pr(x) dx
    – Approximate the integral by sampling:
      E_Pr[f(X)] ≈ (1/M) ∑_{m=1}^M f(x^m)  where x^m ~ Pr(X)
  • Inference query: Pr(X | e) = ∑_y Pr(y | e) Pr(X | y, e)
    – Approximate the exponentially large sum by sampling:
      Pr(X | e) ≈ (1/M) ∑_{m=1}^M Pr(X | y^m, e)  where y^m ~ Pr(Y | e)
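As a concrete illustration of approximating an expectation by sampling, here is a minimal Python sketch (the distribution and function are chosen for the example, not taken from the slides):

```python
import random

def mc_expectation(f, sampler, n=100_000):
    """Estimate E[f(X)] by averaging f over n samples drawn from sampler."""
    return sum(f(sampler()) for _ in range(n)) / n

random.seed(0)
# For X ~ Uniform(0, 1), the exact value of E[X^2] is 1/3.
est = mc_expectation(lambda x: x * x, random.random)
print(est)
```

With 100,000 samples the estimate is typically within a few thousandths of 1/3, matching the 1/√M error behaviour discussed below.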

Direct Sampling (a.k.a. forward sampling)

  • Unconditional inference queries (i.e., Pr(X))
  • Bayesian networks only
    – Idea: sample each variable given the values of its parents, following the topological order of the graph.


Direct Sampling Algorithm

Sort the variables X_1, …, X_n by topological order
For m = 1 to M do (sample M particles)
  For each variable X_j do
    Sample x_j^m ~ Pr(X_j | pa_j^m)   (pa_j^m = values already sampled for X_j's parents)

  • Approximation: Pr(X = x) ≈ (1/M) ∑_m 1(x^m = x)
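The loop above can be sketched in Python on a toy two-variable Bayes net (the CPT numbers here are invented for illustration; they are not from the slides):

```python
import random

# Toy Bayes net in topological order: Rain, then WetGrass with parent Rain.
# Hypothetical CPTs: Pr(Rain)=0.2, Pr(Wet|Rain)=0.9, Pr(Wet|~Rain)=0.1
def forward_sample():
    """Sample each variable given its parents, in topological order."""
    rain = random.random() < 0.2
    wet = random.random() < (0.9 if rain else 0.1)
    return rain, wet

random.seed(0)
M = 50_000
p_wet = sum(forward_sample()[1] for _ in range(M)) / M
print(p_wet)  # estimates Pr(Wet) = 0.2*0.9 + 0.8*0.1 = 0.26
```

Counting the fraction of particles where an event holds is exactly the indicator-average approximation above.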

Example

[worked example shown on slide; figure not reproduced]


Analysis

  • Complexity: O(nM) where n = #variables and M = #samples
  • Accuracy
    – Absolute error ε (Hoeffding bound):
      Pr(P̂(x) ∉ [P(x) − ε, P(x) + ε]) ≤ 2e^{−2Mε²}
      Sample size: M ≥ ln(2/δ) / (2ε²) guarantees error at most ε with probability at least 1 − δ
    – Relative error ε (Chernoff bound):
      Pr(P̂(x) ∉ [P(x)(1 − ε), P(x)(1 + ε)]) ≤ 2e^{−M P(x) ε²/3}
      Sample size: M ≥ 3 ln(2/δ) / (P(x) ε²)

Rejection Sampling

  • Conditional inference queries (i.e., Pr(X | e))
  • Bayesian networks only
    – Idea: sample each variable given the values of its parents, following the topological order of the graph; however, reject samples that do not agree with the evidence.


Rejection Sampling Algorithm

Sort the variables X_1, …, X_n by topological order
For m = 1 to M do (sample M particles)
  For each variable X_j do
    Sample x_j^m ~ Pr(X_j | pa_j^m)
  Reject x^m if it is inconsistent with the evidence e (i.e., x_E^m ≠ e)

  • Approximation: Pr(X = x | e) ≈ ∑_m 1(x^m = x) / (# accepted samples)
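A minimal Python sketch of rejection sampling on the same toy net used earlier (hypothetical CPTs, evidence Wet = true):

```python
import random

# Toy net, made-up CPTs: Pr(Rain)=0.2, Pr(Wet|Rain)=0.9, Pr(Wet|~Rain)=0.1
def forward_sample():
    rain = random.random() < 0.2
    wet = random.random() < (0.9 if rain else 0.1)
    return rain, wet

random.seed(0)
# Keep only the particles consistent with the evidence Wet = true.
accepted = [rain for rain, wet in (forward_sample() for _ in range(100_000)) if wet]
p_rain = sum(accepted) / len(accepted)
print(p_rain, len(accepted))  # exact Pr(Rain | Wet) = 0.18/0.26 ≈ 0.692
```

Note that only about 26% of the particles survive, matching the expected acceptance rate Pr(e) discussed in the analysis below.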

Example

[worked example shown on slide; figure not reproduced]


Analysis

  • Complexity: O(nM) where n = #variables and M = #samples
  • Expected # of samples that are accepted: M Pr(e)
    – Since Pr(e) often decreases exponentially with the number of evidence variables, the number of accepted samples also decreases exponentially.
    – For good accuracy: an exponential # of samples is often needed in practice.

Likelihood Weighting

  • Conditional inference queries (i.e., Pr(X | e))
  • Bayesian networks only
    – Idea: sample each non‐evidence variable given the values of its parents in topological order. Assign weights to samples based on the probability of the evidence.


Likelihood Weighting Algorithm

Sort the variables X_1, …, X_n by topological order
For m = 1 to M do (sample M particles)
  w^m ← 1
  For each variable X_j do
    If X_j is not an evidence variable do
      Sample x_j^m ~ Pr(X_j | pa_j^m)
    else
      x_j^m ← e_j;  w^m ← w^m · Pr(e_j | pa_j^m)

  • Approximation: Pr(X = x | e) ≈ ∑_m w^m 1(x^m = x) / ∑_m w^m
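The same toy query can be answered by likelihood weighting: the evidence variable is never sampled, only weighted (CPT numbers are again hypothetical):

```python
import random

# Toy net: Pr(Rain)=0.2, Pr(Wet|Rain)=0.9, Pr(Wet|~Rain)=0.1; evidence Wet=true.
random.seed(0)
num = den = 0.0
for _ in range(50_000):
    rain = random.random() < 0.2    # sample the non-evidence variable Rain
    w = 0.9 if rain else 0.1        # weight w = Pr(Wet = true | Rain)
    num += w * rain                 # weighted indicator of Rain = true
    den += w
p_rain = num / den
print(p_rain)  # estimates Pr(Rain | Wet) = 0.18/0.26 ≈ 0.692
```

Every particle is kept, but its influence is scaled by the likelihood of the evidence, which is exactly the weighted-indicator ratio above.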

Example

[worked example shown on slide; figure not reproduced]


Analysis

  • Complexity: O(nM) where n = #variables and M = #samples
  • Effective sample size: M Pr(e)
    – Even though all samples are accepted, their total weight corresponds to a fraction of samples roughly equal to Pr(e).
    – For good accuracy: the # of samples needed is about the same as for rejection sampling (hence exponential in the number of evidence variables).

Importance Sampling

  • Likelihood weighting is a special case of importance sampling
  • General approach to estimate E_P[f(X)] by sampling from Q instead of P
    – Works for Bayes nets and probability densities
  • Idea: generate samples x^m from Q and assign weights w^m = P(x^m)/Q(x^m)


Importance Sampling Algorithm

For m = 1 to M do (sample M particles)
  Sample x^m from Q
  Assign weight: w^m ← P(x^m)/Q(x^m)

  • Approximation: E_P[f(X)] ≈ (1/M) ∑_m w^m f(x^m)
    – Unbiased estimator
    – Variance of the estimator decreases as 1/M (inversely with the sample size)
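A minimal Python sketch of importance sampling with known P and Q (the target distribution over {0, 1, 2} is invented for illustration; we pretend it is hard to sample from directly):

```python
import random

P = {0: 0.5, 1: 0.3, 2: 0.2}   # target distribution (pretend hard to sample)
Q = 1 / 3                      # proposal: uniform over the three values

random.seed(0)
M = 100_000
total = 0.0
for _ in range(M):
    x = random.randrange(3)    # sample from Q
    w = P[x] / Q               # importance weight P(x)/Q(x)
    total += w * x             # here f(x) = x
est = total / M
print(est)  # estimates E_P[X] = 0*0.5 + 1*0.3 + 2*0.2 = 0.7
```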

Normalized Importance Sampling

  • Often the reason why we sample from Q instead of P is that we don't know P.
  • But we may know P̃, an unnormalized version of P:
    – Markov nets: P̃(x) = ∏_j φ_j(x) while P(x) = P̃(x)/Z
    – Bayes nets: P̃(x) = Pr(x, e) while P(x) = Pr(x | e) = Pr(x, e)/Pr(e)
  • Idea: generate samples x^m from Q and assign weights w^m = P̃(x^m)/Q(x^m). Normalize the estimator.


Normalized Importance Sampling Algorithm

For m = 1 to M do (sample M particles)
  Sample x^m from Q
  Assign weight: w^m ← P̃(x^m)/Q(x^m)

  • Approximation: E_P[f(X)] ≈ ∑_m w^m f(x^m) / ∑_m w^m
    – Biased estimator for finite M (unbiased as M → ∞)
    – Variance of the estimator decreases as 1/M
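The same toy example, now with an unnormalized target: the weights are all off by the unknown normalizer Z, but the ratio estimator cancels it (numbers are again hypothetical):

```python
import random

# Unnormalized target P~ over {0, 1, 2}; the normalizer Z = 10 is "unknown".
P_tilde = {0: 5.0, 1: 3.0, 2: 2.0}
Q = 1 / 3                      # proposal: uniform

random.seed(0)
num = den = 0.0
for _ in range(100_000):
    x = random.randrange(3)    # sample from Q
    w = P_tilde[x] / Q         # weight off by the constant factor Z
    num += w * x
    den += w
est = num / den                # normalization cancels Z
print(est)  # ≈ E_P[X] = 0.7, same as with the normalized weights
```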

Markov Chain Monte Carlo

  • Iterative sampling technique that converges to the desired distribution in the limit
  • Idea: set up a Markov chain such that its stationary distribution is the desired distribution


Markov Chain

  • Definition: A Markov chain is a linear-chain Bayesian network with a stationary (time-invariant) conditional distribution known as the transition function
  • Initial distribution: Pr(X_0)
  • Transition distribution: Pr(X_{t+1} | X_t)


Asymptotic Behaviour

  • Let Pr_t(X_t) be the distribution over states at time step t:
    Pr_t(x_t) = ∑_{x_0 .. x_{t−1}} Pr(x_0, …, x_t) = ∑_{x_{t−1}} Pr_{t−1}(x_{t−1}) Pr(x_t | x_{t−1})
  • In the limit (i.e., when t → ∞), the Markov chain may converge to a stationary distribution Pr_∞ satisfying:
    Pr_∞(x') = ∑_x Pr(x' | x) Pr_∞(x)

Stationary distribution

  • Let T(x' | x) = Pr(x' | x) be the matrix that represents the transition function
  • If we think of Pr_∞ as a column vector π, then π is an eigenvector of T with eigenvalue 1: Tπ = π
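The eigenvector view can be checked numerically by power iteration on a small hypothetical chain (the transition probabilities are made up for the example):

```python
import numpy as np

# Column x of T holds Pr(x' | x), so T is column-stochastic.
T = np.array([[0.9, 0.5],
              [0.1, 0.5]])
pi = np.array([0.5, 0.5])      # any initial distribution
for _ in range(100):           # power iteration: repeatedly apply T
    pi = T @ pi
print(pi)                      # converges to the stationary distribution
print(T @ pi)                  # T @ pi == pi: eigenvector with eigenvalue 1
```

Repeatedly applying T is exactly the update Pr_t = T Pr_{t−1} from the previous slide; convergence to the eigenvalue-1 eigenvector is what "stationary" means.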


Ergodic Markov Chain

  • Definition: A Markov chain is ergodic when there is a non‐zero probability of reaching any state from any state in a finite number of steps
  • When the Markov chain is ergodic, there is a unique stationary distribution
  • Sufficient condition: detailed balance
    Pr_∞(x) Pr(x' | x) = Pr_∞(x') Pr(x | x')
    Detailed balance + ergodicity ⇒ unique stationary distribution
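Detailed balance can be verified directly for the small two-state chain used above (hypothetical numbers): the probability flow between each pair of states must be equal in both directions.

```python
# Detailed balance: pi(x) Pr(x'|x) = pi(x') Pr(x|x') for every pair x, x'.
T = {(1, 0): 0.1, (0, 1): 0.5}   # off-diagonal transition probs Pr(x'|x)
pi = {0: 5 / 6, 1: 1 / 6}        # stationary distribution of this chain
flow_01 = pi[0] * T[(1, 0)]      # probability flow from state 0 to 1
flow_10 = pi[1] * T[(0, 1)]      # probability flow from state 1 to 0
print(flow_01, flow_10)          # the two flows match: detailed balance holds
```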

Markov Chain Monte Carlo

  • Idea: set up an ergodic Markov chain such that its unique stationary distribution is the desired distribution
  • Since the Markov chain is a linear-chain Bayes net, we can use direct sampling (forward sampling) to obtain a sample of the stationary distribution


Generic MCMC Algorithm

Sample x_0 ~ Pr_0(X_0)
For t = 1 to M do (sample M particles)
  Sample x_t ~ Pr(X_t | x_{t−1})

  • Approximation: E[f(X)] ≈ (1/M) ∑_t f(x_t)
  • In practice, ignore the first T samples for a better estimate (burn‐in period):
    E[f(X)] ≈ 1/(M − T) ∑_{t=T+1}^M f(x_t)
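The generic scheme can be sketched on the same hypothetical two-state chain: forward-simulate the chain, discard the burn-in, and average.

```python
import random

def step(x):
    """One transition of a hypothetical 2-state chain: Pr(0|0)=0.9, Pr(0|1)=0.5."""
    return 0 if random.random() < (0.9 if x == 0 else 0.5) else 1

random.seed(0)
x, burn, M = 0, 1_000, 100_000
in0 = 0
for t in range(burn + M):
    x = step(x)
    if t >= burn:                  # ignore the first `burn` samples
        in0 += (x == 0)
est = in0 / M
print(est)  # long-run fraction in state 0 -> stationary Pr(X=0) = 5/6
```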

Choosing a Markov Chain

  • Different Markov chains lead to different algorithms
    – Gibbs sampling
    – Metropolis‐Hastings


Gibbs Sampling

  • Suppose Pr(X) is defined by a graphical model (Bayes net or Markov net)
  • Inference query: Pr(Y | e)? where Y ⊆ X
  • Idea: randomly assign values to all non‐evidence variables, then repeatedly sample each non‐evidence variable given the assigned values of all other variables

Gibbs Sampling Algorithm

Randomly assign

to all non‐evidence variables

  • For 1 to do (sample particles)

For each non‐evidence variable

do

Sample

~ Pr ~ ,

  • Approximation: Pr

|

  • CS786 P. Poupart 2012

30
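A minimal Gibbs sampler for a two-variable model, using full conditionals derived from an unnormalized joint (the potential values are invented for the example):

```python
import random

# Hypothetical unnormalized joint over two binary variables A, B.
phi = {(0, 0): 1.0, (0, 1): 2.0, (1, 0): 3.0, (1, 1): 4.0}

def sample_a(b):
    """Sample A from its full conditional Pr(A | B=b)."""
    p1 = phi[(1, b)] / (phi[(0, b)] + phi[(1, b)])
    return int(random.random() < p1)

def sample_b(a):
    """Sample B from its full conditional Pr(B | A=a)."""
    p1 = phi[(a, 1)] / (phi[(a, 0)] + phi[(a, 1)])
    return int(random.random() < p1)

random.seed(0)
a, b = 0, 0                        # arbitrary initial assignment
burn, M, hits = 1_000, 50_000, 0
for t in range(burn + M):
    a = sample_a(b)                # resample each variable in turn
    b = sample_b(a)
    if t >= burn:
        hits += a
p_a1 = hits / M
print(p_a1)  # estimates the exact marginal Pr(A=1) = (3+4)/10 = 0.7
```

Note that only the ratios of potentials are needed, so the normalizer of the joint never has to be computed.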


Example

[worked example shown on slide; figure not reproduced]

Practical Consideration

  • Burn‐in period: ignore the first T samples:
    Pr(Y = y | e) ≈ 1/(M − T) ∑_{t=T+1}^M 1(y^t = y)
  • Use the most recent values to sample each variable:
    x_j^t ~ Pr(X_j | x_1^t, …, x_{j−1}^t, x_{j+1}^{t−1}, …, x_n^{t−1})
  • Use conditional independence to restrict the conditioning variables to the Markov blanket:
    x_j^t ~ Pr(X_j | x_k ∀ X_k ∈ MB(X_j))


Convergence

  • Let T(x' | x) = Pr(X_j = x'_j | x_{~j}) be the transition function of the Markov chain associated with Gibbs sampling
  • Theorem: Gibbs sampling converges to Pr(X | e) when all potentials are strictly positive.
  • Proof idea: T(x' | x) satisfies detailed balance, i.e.,
    Pr(x) Pr(x'_j | x_{~j}) = Pr(x') Pr(x_j | x'_{~j})

Metropolis‐Hastings

  • Suppose we can compute P(x) for a given x, but we can't sample from P easily.
  • Idea: use an arbitrary transition (proposal) distribution Q(x' | x) and use a rejection step to correct for the choice of Q.
  • Advantage: since Q can be anything, we can always obtain an MCMC algorithm via Metropolis‐Hastings
    – It is particularly useful to approximate continuous distributions


Metropolis‐Hastings Algorithm

Randomly select x_0
For t = 1 to M do (sample M particles)
  Sample x' ~ Q(X' | x_{t−1})
  Accept (x_t ← x') with probability min(1, [P(x') Q(x_{t−1} | x')] / [P(x_{t−1}) Q(x' | x_{t−1})])
  Otherwise reject (i.e., x_t ← x_{t−1})

  • Approximation: E[f(X)] ≈ (1/M) ∑_t f(x_t)
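A minimal Metropolis‐Hastings sketch for a continuous target: an unnormalized standard normal density with a Gaussian random-walk proposal (the proposal scale and sample counts are arbitrary choices for the example). Because this proposal is symmetric, the Q-ratio in the acceptance probability cancels:

```python
import math
import random

def p_tilde(x):
    """Unnormalized target density: standard normal up to a constant."""
    return math.exp(-0.5 * x * x)

random.seed(0)
x, samples = 0.0, []
for _ in range(200_000):
    x_prop = x + random.gauss(0, 1.0)   # symmetric random-walk proposal Q
    # Symmetric Q => acceptance probability reduces to min(1, P(x')/P(x)).
    if random.random() < min(1.0, p_tilde(x_prop) / p_tilde(x)):
        x = x_prop                      # accept
    samples.append(x)                   # on reject, the old x is repeated
samples = samples[10_000:]              # discard burn-in
mean = sum(samples) / len(samples)
var = sum(s * s for s in samples) / len(samples) - mean ** 2
print(mean, var)  # ≈ 0 and ≈ 1 for N(0, 1)
```

Only the unnormalized density p_tilde is evaluated, which is exactly the setting motivating Metropolis‐Hastings above.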

Convergence

  • The transition distribution in Metropolis‐Hastings is
    Pr(x → x') = Q(x' | x) A(x → x') + 1(x' = x) ∑_{x''} Q(x'' | x) (1 − A(x → x''))
    where A(x → x') = min(1, [P(x') Q(x | x')] / [P(x) Q(x' | x)])
  • Theorem: Metropolis‐Hastings converges to P.
  • Proof idea: Pr(x → x') satisfies detailed balance

CS786 P. Poupart 2012 36