Statistical Methods for Particle Physics Day 2: Statistical Tests - PowerPoint PPT Presentation

Statistical Methods for Particle Physics Day 2: Statistical Tests and Limits https://indico.desy.de/indico/event/19085/ Terascale Statistics School DESY, 19-23 February, 2018 Glen Cowan Physics Department Royal Holloway, University of London g.cowan@rhul.ac.uk www.pp.rhul.ac.uk/~cowan DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 1 G. Cowan

Outline Day 1: Introduction and parameter estimation Probability, random variables, pdfs Parameter estimation maximum likelihood least squares Bayesian parameter estimation Introduction to unfolding Day 2: Discovery and Limits Comments on multivariate methods (brief) p -values Testing the background-only hypothesis: discovery Testing signal hypotheses: setting limits Experimental sensitivity DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 2 G. Cowan

Frequentist statistical tests Consider a hypothesis H 0 and alternative H 1 . A test of H 0 is defined by specifying a critical region w of the data space such that there is no more than some (small) probability α , assuming H 0 is correct, to observe the data there, i.e., P ( x ∈ w | H 0 ) ≤ α data space Ω Need inequality if data are discrete. α is called the size or significance level of the test. If x is observed in the critical region, reject H 0 . critical region w DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 3 G. Cowan

Definition of a test (2) But in general there are an infinite number of possible critical regions that give the same significance level α . So the choice of the critical region for a test of H 0 needs to take into account the alternative hypothesis H 1 . Roughly speaking, place the critical region where there is a low probability to be found if H 0 is true, but high if H 1 is true: DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 4 G. Cowan

Type-I, Type-II errors Rejecting the hypothesis H 0 when it is true is a Type-I error. The maximum probability for this is the size of the test: P ( x ∈ W | H 0 ) ≤ α But we might also accept H 0 when it is false, and an alternative H 1 is true. This is called a Type-II error, and occurs with probability P ( x ∈ S - W | H 1 ) = β One minus this is called the power of the test with respect to the alternative H 1 : Power = 1 - β DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 5 G. Cowan

A simulated SUSY event high p T jets high p T of hadrons muons p p missing transverse energy DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 6 G. Cowan

Background events This event from Standard Model ttbar production also has high p T jets and muons, and some missing transverse energy. → can easily mimic a SUSY event. DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 7 G. Cowan

Physics context of a statistical test Event Selection: the event types in question are both known to exist. Example: separation of different particle types (electron vs muon) or known event types (ttbar vs QCD multijet). E.g. test H 0 : event is background vs. H 1 : event is signal. Use selected events for further study. Search for New Physics: the null hypothesis is H 0 : all events correspond to Standard Model (background only), and the alternative is H 1 : events include a type whose existence is not yet established (signal plus background) Many subtle issues here, mainly related to the high standard of proof required to establish presence of a new phenomenon. The optimal statistical test for a search is closely related to that used for event selection. DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 8 G. Cowan

Statistical tests for event selection Suppose the result of a measurement for an individual event is a collection of numbers x 1 = number of muons, x 2 = mean p T of jets, x 3 = missing energy, ... follows some n -dimensional joint pdf, which depends on the type of event produced, i.e., was it For each reaction we consider we will have a hypothesis for the pdf of , e.g., etc. E.g. call H 0 the background hypothesis (the event type we want to reject); H 1 is signal hypothesis (the type we want). DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 9 G. Cowan

Selecting events Suppose we have a data sample with two kinds of events, corresponding to hypotheses H 0 and H 1 and we want to select those of type H 1 . Each event is a point in space. What ‘decision boundary’ should we use to accept/reject events as belonging to event types H 0 or H 1 ? H 0 Perhaps select events with ‘cuts’: H 1 accept DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 10 G. Cowan

Other ways to select events Or maybe use some other sort of decision boundary: linear or nonlinear H 0 H 0 H 1 H 1 accept accept How can we do this in an ‘optimal’ way? DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 11 G. Cowan

Test statistics The boundary of the critical region for an n -dimensional data space x = ( x 1 ,..., x n ) can be defined by an equation of the form where t ( x 1 ,…, x n ) is a scalar test statistic. We can work out the pdfs Decision boundary is now a single ‘cut’ on t , defining the critical region. So for an n -dimensional problem we have a corresponding 1-d problem. DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 12 G. Cowan

Test statistic based on likelihood ratio How can we choose a test’s critical region in an ‘optimal way’? Neyman-Pearson lemma states: To get the highest power for a given significance level in a test of H 0 , (background) versus H 1 , (signal) the critical region should have inside the region, and ≤ c outside, where c is a constant chosen to give a test of the desired size. Equivalently, optimal scalar test statistic is N.B. any monotonic function of this is leads to the same test. DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 13 G. Cowan

Classification viewed as a statistical test Probability to reject H 0 if true (type I error): α = size of test, significance level, false discovery rate Probability to accept H 0 if H 1 true (type II error): 1 - β = power of test with respect to H 1 Equivalently if e.g. H 0 = background, H 1 = signal, use efficiencies: DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 14 G. Cowan

Purity / misclassification rate Consider the probability that an event of signal (s) type classified correctly (i.e., the event selection purity), Use Bayes’ theorem: prior probability Here W is signal region posterior probability = signal purity = 1 – signal misclassification rate Note purity depends on the prior probability for an event to be signal or background as well as on s/b efficiencies. DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 15 G. Cowan

Neyman-Pearson doesn’t usually help We usually don’t have explicit formulae for the pdfs f ( x |s), f ( x |b), so for a given x we can’t evaluate the likelihood ratio Instead we may have Monte Carlo models for signal and background processes, so we can produce simulated data: generate x ~ f ( x |s) → x 1 ,..., x N generate x ~ f ( x |b) → x 1 ,..., x N This gives samples of “training data” with events of known type. Can be expensive (1 fully simulated LHC event ~ 1 CPU minute). DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 16 G. Cowan

Approximate LR from histograms Want t ( x ) = f (x| s )/ f(x| b ) for x here N( x |s) One possibility is to generate N ( x |s) ≈ f ( x |s) MC data and construct histograms for both signal and background. Use (normalized) histogram x values to approximate LR: N( x |b) N ( x |b) ≈ f ( x |b) Can work well for single variable. x DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 17 G. Cowan

Approximate LR from 2D-histograms Suppose problem has 2 variables. Try using 2-D histograms: signal background Approximate pdfs using N ( x,y| s), N ( x,y| b) in corresponding cells. But if we want M bins for each variable, then in n -dimensions we have M n cells; can’t generate enough training data to populate. → Histogram method usually not usable for n > 1 dimension. DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 18 G. Cowan

Strategies for multivariate analysis Neyman-Pearson lemma gives optimal answer, but cannot be used directly, because we usually don’t have f ( x |s), f ( x |b). Histogram method with M bins for n variables requires that we estimate M n parameters (the values of the pdfs in each cell), so this is rarely practical. A compromise solution is to assume a certain functional form for the test statistic t ( x ) with fewer parameters; determine them (using MC) to give best separation between signal and background. Alternatively, try to estimate the probability densities f ( x |s) and f ( x |b) (with something better than histograms) and use the estimated pdfs to construct an approximate likelihood ratio. DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 19 G. Cowan

Multivariate methods Many new (and some old) methods: Fisher discriminant (Deep) neural networks Kernel density methods Support Vector Machines Decision trees Boosting Bagging DESY Terascale School of Statistics / 19-23 Feb 2018 / Day 2 20 G. Cowan

Statistical Methods for Particle Physics Day 2: Statistical Tests - PowerPoint PPT Presentation

Statistical Methods for Particle Physics Day 2: Statistical Tests and Limits https://indico.desy.de/indico/event/19085/ Terascale Statistics School DESY, 19-23 February, 2018 Glen Cowan Physics Department Royal Holloway, University of London

Elementary Particle Physics in a Nutshell Elementary Particle Physics in a Nutshell

THEORETICAL PARTICLE PHYSICS IN KARLSRUHE I. The Team II. Research in Theoretical Particle

Some Statistical Tools for Particle Physics Particle Physics Colloquium MPI fr Physik u.

Subatomic (Particle) Physics in Canada The Canadian particle physics community Our

Project 2: Basic particle system Constrained Particle System Tinkertoys Requirements for

UK Particle Physics Outreach Very Selective Highlights Peter Watkins Head of Particle Physics

Particle Physics: The Standard Model Dirk Zerwas LAL zerwas@lal.in2p3.fr March 14, 2013 Dirk

Particle Physics: The Standard Model Dirk Zerwas LAL zerwas@lal.in2p3.fr March 8, 2012 Dirk

THE NORWEGIAN HIGH ENERGY HIGH ENERGY PARTICLE PARTICLE PHYSICS PHYSICS PROJECT 2006-11

Particle Fever M e l b o u r n e - A u g 2 1 - 2 0 1 5 1 Fundamental Particle Physics The

The Particle Physics Odyssey [ Where are we? Where are we going? ] G. Isidori The Particle

! Importance of Particle Adhesion ! Importance of Particle Adhesion ! History of Particle

20 Particle Systems Steve Marschner Eston Schweickart CS4620 Spring 2017 Examples of Particle

Particle dynamics Particle overview Particle system Forces Constraints

Particle dynamics Particle overview Particle system Forces Constraints

STFC Science Roadmap Challenges A) How did the Universe begin and how is it evolving? 1) What is

eIDAS Regulation (EU) 910/2014 Boosting trust in the Digital Single Market: the role of eIDAS

CIS 520: Machine Learning Introduc7on & Overview

in the high school B. Leitner Work place: St. Emeric Catholic High School, Primary School,

Trusted Architecture for Secure Shared Services (with Privacy) and Personal Data Store Sampo

Trusted Architecture for Secure Shared Services (with Privacy), Future of Internet PPP, and

Modeling Networks from Partially-Observed Network Data Mark S. Handcock University of Washington

me do this? A qualitative study into staff perceptions of their personal tutor role and the

Artificial Intelligence. Decision Tasks. Learning. Petr Pok Czech Technical University in

Statistical Methods for Particle Physics Day 2: Statistical Tests - PowerPoint PPT Presentation

Statistical Methods for Particle Physics Day 2: Statistical Tests and Limits https://indico.desy.de/indico/event/19085/ Terascale Statistics School DESY, 19-23 February, 2018 Glen Cowan Physics Department Royal Holloway, University of London

Elementary Particle Physics in a Nutshell Elementary Particle Physics in a Nutshell

THEORETICAL PARTICLE PHYSICS IN KARLSRUHE I. The Team II. Research in Theoretical Particle

Some Statistical Tools for Particle Physics Particle Physics Colloquium MPI fr Physik u.

Subatomic (Particle) Physics in Canada The Canadian particle physics community Our

Project 2: Basic particle system Constrained Particle System Tinkertoys Requirements for

UK Particle Physics Outreach Very Selective Highlights Peter Watkins Head of Particle Physics

Particle Physics: The Standard Model Dirk Zerwas LAL zerwas@lal.in2p3.fr March 14, 2013 Dirk

Particle Physics: The Standard Model Dirk Zerwas LAL zerwas@lal.in2p3.fr March 8, 2012 Dirk

THE NORWEGIAN HIGH ENERGY HIGH ENERGY PARTICLE PARTICLE PHYSICS PHYSICS PROJECT 2006-11

Particle Fever M e l b o u r n e - A u g 2 1 - 2 0 1 5 1 Fundamental Particle Physics The

The Particle Physics Odyssey [ Where are we? Where are we going? ] G. Isidori The Particle

! Importance of Particle Adhesion ! Importance of Particle Adhesion ! History of Particle

20 Particle Systems Steve Marschner Eston Schweickart CS4620 Spring 2017 Examples of Particle

Particle dynamics Particle overview Particle system Forces Constraints

Particle dynamics Particle overview Particle system Forces Constraints

STFC Science Roadmap Challenges A) How did the Universe begin and how is it evolving? 1) What is

eIDAS Regulation (EU) 910/2014 Boosting trust in the Digital Single Market: the role of eIDAS

CIS 520: Machine Learning Introduc7on &amp; Overview

in the high school B. Leitner Work place: St. Emeric Catholic High School, Primary School,

Trusted Architecture for Secure Shared Services (with Privacy) and Personal Data Store Sampo

Trusted Architecture for Secure Shared Services (with Privacy), Future of Internet PPP, and

Modeling Networks from Partially-Observed Network Data Mark S. Handcock University of Washington

me do this? A qualitative study into staff perceptions of their personal tutor role and the

Artificial Intelligence. Decision Tasks. Learning. Petr Pok Czech Technical University in

CIS 520: Machine Learning Introduc7on & Overview