A New Method for Tackling Limited Monte Carlo
Carlos Argüelles, Austin Schneider, Tianlu Yuan

Analysis Scenario: Binned data, Poisson likelihood, and Simulation

Analysis Scenario
Three requirements:
1. Binned data counts
2. Independent rare processes
3. Modelled by simulation
Applies to much of particle physics and astrophysics
PhysRevLett.121.221801
“It is well known that the count of independent, rare natural processes can be described by the Poisson distribution.”
Diagram labels: Generated event properties; Detector / analysis response

Reweighting
Simulating every physical hypothesis theta is too expensive
Reweighting modifies the physical hypothesis with the same simulation set
Diagram labels: Generated event properties; Detector / analysis response; Event properties; Physical hypothesis
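
A minimal sketch of reweighting, under assumptions that are not on the slide: events are generated once from a hypothetical spectrum p_gen(E) proportional to 1/E on [E_MIN, E_MAX], the physical hypothesis theta is a power-law flux norm * E**-gamma, and the detector response is left out. The same generated set is reused for two hypotheses by changing only the weights.

```python
import math
import random

E_MIN, E_MAX = 1.0, 100.0  # hypothetical energy range of the simulation


def generate_energies(n_gen, rng):
    """Draw energies from the generation pdf p_gen(E) = 1 / (E * ln(E_MAX / E_MIN))."""
    log_ratio = math.log(E_MAX / E_MIN)
    return [E_MIN * math.exp(rng.random() * log_ratio) for _ in range(n_gen)]


def reweight(energies, norm, gamma):
    """Per-event weights for the hypothesis theta = (norm, gamma).

    Each weight is flux(E) / (n_gen * p_gen(E)), so the sum of the weights
    estimates the expected number of events under that hypothesis.
    """
    n_gen = len(energies)
    log_ratio = math.log(E_MAX / E_MIN)
    weights = []
    for e in energies:
        p_gen = 1.0 / (e * log_ratio)
        weights.append(norm * e ** (-gamma) / (n_gen * p_gen))
    return weights


rng = random.Random(1)
energies = generate_energies(50_000, rng)          # simulated once
weights_soft = reweight(energies, norm=100.0, gamma=3.0)
weights_hard = reweight(energies, norm=100.0, gamma=2.0)
# The sums approach the analytic integrals of the flux (49.995 and 99.0)
# as the number of generated events grows.
print(sum(weights_soft), sum(weights_hard))
```

Only the weights depend on theta here; the generated energies never change, which is what makes a scan over hypotheses affordable.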

Approximate Expectations
Sum weights to obtain expected number of events
Using this approximation we construct the AdHoc likelihood from the Poisson likelihood
The error on this approximation vanishes as we approach large simulation size
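
A short sketch of the two steps above, assuming each simulated event already carries a bin index and a weight (the function names are illustrative, not from the talk). The expected count in a bin is the sum of the weights that land in it, and the AdHoc likelihood is the Poisson likelihood evaluated at that sum.

```python
import math


def expected_counts(bin_indices, weights, n_bins):
    """Sum the event weights in each bin to approximate the expectation."""
    mu = [0.0] * n_bins
    for b, w in zip(bin_indices, weights):
        mu[b] += w
    return mu


def adhoc_log_likelihood(observed, mu):
    """Poisson log-likelihood with the summed weights used as the expectation.

    Raises ValueError for a bin with zero summed weight, since log(0) is
    undefined: every bin needs Monte Carlo.
    """
    total = 0.0
    for k, m in zip(observed, mu):
        total += k * math.log(m) - m - math.lgamma(k + 1)
    return total


bin_indices = [0, 0, 0, 1, 1, 2]
weights = [1.0, 1.0, 1.0, 0.5, 0.5, 0.25]
mu = expected_counts(bin_indices, weights, n_bins=3)
log_l = adhoc_log_likelihood([3, 1, 0], mu)
print(mu, log_l)
```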

The Curse of Rare Processes and Small Signals
The signals we look for are small
Our virtual detector works similarly to the real thing.
Sometimes we can work around this.
Sometimes we can’t, or MC is too expensive.

Accounting for errors

Incorporating Errors
Exact knowledge of lambda → treat lambda probabilistically using Bayes’ theorem
The likelihood is informed by the MC
This generalizes the likelihood
Note that we recover the AdHoc likelihood when the distribution of lambda is a delta function at the sum of the weights
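
Written out, the generalization could look as follows. The notation is an assumption, since the slide's equations are not part of this transcript: k is the observed count in a bin, \vec{w}(\theta) are the Monte Carlo weights in that bin, and P(\lambda \mid \vec{w}(\theta)) is the distribution of lambda informed by the MC.

```latex
\begin{align}
\mathcal{L}_{\mathrm{general}}(\theta \mid k)
  &= \int_0^\infty \frac{\lambda^{k} e^{-\lambda}}{k!}\,
     P\big(\lambda \mid \vec{w}(\theta)\big)\, d\lambda, \\
P\big(\lambda \mid \vec{w}(\theta)\big) = \delta\Big(\lambda - \sum_i w_i\Big)
  \;&\Rightarrow\;
  \mathcal{L}_{\mathrm{general}}(\theta \mid k)
  = \frac{\big(\sum_i w_i\big)^{k} e^{-\sum_i w_i}}{k!}
  = \mathcal{L}_{\mathrm{AdHoc}}(\theta \mid k).
\end{align}
```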

Obtaining Monte Carlo can also be modelled by a Poisson process
Number of MC events m is Poisson distributed
For simplicity, consider the case of equal weights
The data expectation mu is related to the weight
So the likelihood of lambda follows from the Poisson probability of m
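
One way to write this step, as a sketch under stated assumptions rather than a copy of the slide: \bar{m} denotes the expected number of MC events, w the common weight, and the true expectation is taken to be \lambda = w \bar{m}, so that mu = m w is its estimate from the m events actually simulated.

```latex
\begin{align}
P(m \mid \bar{m}) &= \frac{\bar{m}^{m} e^{-\bar{m}}}{m!}, \\
\mathcal{L}(\lambda \mid m) &= \frac{(\lambda / w)^{m}\, e^{-\lambda / w}}{m!}.
\end{align}
```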

Extension to Arbitrary Weights
For arbitrary weights we can consider mu and sigma in terms of the “effective” weights and counts
Proceed exactly as before, but now with the factorial replaced by a gamma function*.
*Note: this is fine because this factor does not depend on lambda and cancels in the normalization step, although for an un-normalized likelihood this might present a problem.
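
A sketch of the effective quantities, assuming mu is the sum of the weights and sigma^2 the sum of the squared weights in a bin, so that the effective count is mu^2 / sigma^2 and the effective weight is sigma^2 / mu. The function names are illustrative. For equal weights these reduce to the plain event count and the common weight.

```python
import math


def effective_parameters(weights):
    """Return (m_eff, w_eff) with m_eff * w_eff = mu and m_eff * w_eff**2 = sigma^2."""
    mu = sum(weights)
    sigma2 = sum(w * w for w in weights)
    m_eff = mu * mu / sigma2
    w_eff = sigma2 / mu
    return m_eff, w_eff


def log_likelihood_of_lambda(lam, weights):
    """Poisson-like log-likelihood of lambda with the factorial replaced by a gamma function."""
    m_eff, w_eff = effective_parameters(weights)
    x = lam / w_eff
    return m_eff * math.log(x) - x - math.lgamma(m_eff + 1.0)


print(effective_parameters([0.5] * 8))        # equal weights: count 8, weight 0.5
print(effective_parameters([1.0, 1.0, 2.0]))  # unequal weights: non-integer count
```

The non-integer effective count is the reason the factorial has to become a gamma function.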

Equal → Arbitrary (An implicit assumption)
                          Equal Weights     Arbitrary Weights
Distribution              Scaled Poisson    Compound Poisson
Used in the likelihood    Scaled Poisson    Scaled Poisson
Bohm and Zech (2012) showed that a scaled Poisson distribution is a good approximation to the compound Poisson distribution when the first and second moments are matched
The effective treatment uses the scaled Poisson distribution
More details on this can be found in our paper, DOI:10.1007/JHEP06(2019)030

With the likelihood of lambda, we use Bayes’ theorem to compute the probability of lambda, assuming a uniform prior
The result is a gamma distribution G in lambda
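
The normalization step can be sketched as below, assuming \mu = \sum_i w_i and \sigma^2 = \sum_i w_i^2 for the weights in a bin, and writing the gamma distribution with shape \alpha and rate \beta. These symbols are assumptions of this sketch; the slide's own definitions are not in the transcript.

```latex
\begin{align}
\mathcal{L}(\lambda) &\propto \lambda^{\mu^2/\sigma^2}\, e^{-\lambda \mu/\sigma^2}, \\
P(\lambda \mid \vec{w})
  &= \frac{\mathcal{L}(\lambda)}{\int_0^\infty \mathcal{L}(\lambda')\, d\lambda'}
   = G(\lambda; \alpha, \beta)
   = \frac{\beta^{\alpha} \lambda^{\alpha - 1} e^{-\beta \lambda}}{\Gamma(\alpha)}, \\
\alpha &= \frac{\mu^2}{\sigma^2} + 1, \qquad \beta = \frac{\mu}{\sigma^2}.
\end{align}
```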

The Effective Likelihood
Integrating over the true expectation, we now have the effective likelihood
This accounts for the uncertainty from finite Monte Carlo sample size
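
A minimal single-bin sketch of that integral, assuming the gamma posterior has shape alpha = mu^2 / sigma^2 + 1 and rate beta = mu / sigma^2, with mu the sum of the weights and sigma^2 the sum of their squares. Integrating a Poisson probability over a gamma distribution has the closed form beta^alpha * Gamma(k + alpha) / (k! * Gamma(alpha) * (1 + beta)^(k + alpha)), which the code evaluates in log space. The function names are illustrative; the maintained implementations are linked from https://austinschneider.github.io/MCLLH/.

```python
import math


def effective_log_likelihood(k, weights):
    """Log of the Poisson probability of k, marginalized over a gamma-distributed lambda."""
    mu = sum(weights)
    sigma2 = sum(w * w for w in weights)
    alpha = mu * mu / sigma2 + 1.0
    beta = mu / sigma2
    return (alpha * math.log(beta) + math.lgamma(k + alpha)
            - math.lgamma(k + 1.0) - math.lgamma(alpha)
            - (k + alpha) * math.log1p(beta))


def adhoc_log_likelihood(k, weights):
    """Poisson log-likelihood with the summed weights as the expectation."""
    mu = sum(weights)
    return k * math.log(mu) - mu - math.lgamma(k + 1.0)


few = [1.0] * 10            # small MC sample, expectation 10
many = [1.0e-4] * 100_000   # large MC sample, same expectation
k = 25                      # observation far from the expectation

# With few MC events the effective likelihood is broader than the AdHoc one;
# with many MC events the two nearly coincide.
print(effective_log_likelihood(k, few), adhoc_log_likelihood(k, few))
print(effective_log_likelihood(k, many), adhoc_log_likelihood(k, many))
```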

Performance

A Toy Experiment
● Measure a resonance component on top of a steeply falling background.
● Simulate comparable amounts of signal and background
● Generated according to power-law distributions
● Smeared with different uncertainties

Point Estimation
The effective likelihood produces similar results to the Poisson description
The maximum likelihood is an unbiased estimator for large MC sample size

Coverage
We produce 500 independent Monte Carlo sets and 500 data sets to test the coverage
True coverage compared to Wilks’ asymptotically approximated coverage
Effective likelihood provides a good estimate of the coverage
AdHoc likelihood vastly underestimates the coverage
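
A much reduced coverage check in the same spirit, offered as a sketch and not as the study from the talk. Assumptions: a single bin, a hypothetical normalization parameter s that scales all weights (true value s = 1), one fresh MC set paired with one fresh data set per trial, and a Wilks threshold of 1 on twice the log-likelihood ratio for a nominal 68.27% interval. For weights s*w the effective parameters are alpha = m + 1 and beta = 1 / (s*w), so both maxima are available in closed form.

```python
import math
import random


def poisson_sample(lam, rng):
    """Knuth's multiplication method; adequate for small lam."""
    threshold = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= threshold:
            return k
        k += 1


def adhoc_ts(k, m, w):
    """2 * (max_s log L - log L(s=1)) for the AdHoc likelihood, mu(s) = s * m * w."""
    mu0 = m * w
    if k == 0:
        return 2.0 * mu0
    return 2.0 * (k * math.log(k / mu0) - k + mu0)


def effective_ts(k, m, w):
    """Same statistic for the effective likelihood, maximized at beta = alpha / k."""
    alpha = m + 1.0
    beta0 = 1.0 / w
    if k == 0:
        return 2.0 * alpha * math.log((1.0 + beta0) / beta0)
    beta_hat = alpha / k
    return 2.0 * (alpha * math.log(beta_hat / beta0)
                  - (k + alpha) * math.log((1.0 + beta_hat) / (1.0 + beta0)))


def coverage(n_trials, lam_true, m_true, rng, threshold=1.0):
    """Fraction of trials in which the true value s = 1 lies inside the interval."""
    w = lam_true / m_true
    hits_adhoc = hits_eff = used = 0
    for _ in range(n_trials):
        m = poisson_sample(m_true, rng)   # size of this MC set
        k = poisson_sample(lam_true, rng)  # observed data count
        if m == 0:
            continue  # Monte Carlo is needed in the bin
        used += 1
        hits_adhoc += adhoc_ts(k, m, w) <= threshold
        hits_eff += effective_ts(k, m, w) <= threshold
    return hits_adhoc / used, hits_eff / used


cov_adhoc, cov_eff = coverage(5000, lam_true=10.0, m_true=10.0, rng=random.Random(7))
print(cov_adhoc, cov_eff)
```

With the MC sample as small as the data sample, the AdHoc interval ignores half of the fluctuation and covers the true value less often than the effective one.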

A 2D Bayesian Example
The effective likelihood is also suitable for Bayesian analyses
The effective likelihood broadens in the case of low MC sample size, providing robust error regions
The AdHoc likelihood is liable to underestimate the width
Figure label: Increasing MC Size

Performance Comparison
Comparing the runtime of the effective likelihood to other treatments

Caveats
● Bin-to-bin correlations are not directly built into the likelihood.
○ Assume that correlated shape uncertainties will be handled implicitly by the reweighting.
● Estimate of variance in bin expectation relies on Monte Carlo events in the bin.
○ If a population of events with large possible contribution to the variance is not included, then the estimate of the variance may be incorrect.
● Monte Carlo is needed in every bin.

Summary
● The exact expectation in a bin is usually unknown
● It is important to account for the uncertainty inherent with limited Monte Carlo samples
● The effective likelihood
○ Provides a robust treatment of these errors, provided MC is available
○ Converges to the AdHoc likelihood for large MC
○ Has improved coverage properties
○ Can be substituted directly for the AdHoc likelihood
https://austinschneider.github.io/MCLLH/

Likelihood Summary
Implementations and paper links can be found here: https://austinschneider.github.io/MCLLH/
