Reflections on Statistical Data Analysis in Neutrino Experiments - PowerPoint PPT Presentation

Reflections on Statistical Data Analysis in Neutrino Experiments since NOMAD and F-C Bob Cousins Univ. of California, Los Angeles (UCLA) PHYSTAT- ν Workshop on Statistical Issues in Experimental Neutrino Physics, Fermilab September 21, 2016 Notes added after talk: mistake fixed on slide 18. Work partially supported by U.S. Dept. of Energy Award DE-SC000993 1 Bob Cousins, PhyStat-nu Fermilab 2016

Neutrino Mass Hierarchy Choosing between two simple hypotheses is the prototype problem in classic Neyman-Pearson theory of hypothesis testing (“simple” = no fit parameters). But rare in HEP. We do have (almost) simple cases, e.g., – Number of light ν flavors (e.g., 3 vs 4 in late 1980’s) – Spin 1 vs spin 2 for new resonance – Higgs spin-parity (assuming spin 0) either 0 + or 0 - Bob Cousins, PhyStat-nu Fermilab 2016 2

Neutrino Mass Hierarchy For MH, interesting complication of non-trivial nuisance parameters: phase δ CP , angle θ 23 Backhouse talk I concentrate on the simplest (but still rich!) case of simple vs simple testing. Although the ν community seems to have its confusion about that sorted out now, I thought it might be worth a “tutorial” on χ 2 , Iikelihood ratios. Bob Cousins, PhyStat-nu Fermilab 2016 3

Likelihood ratios, Central Limit Thm, χ 2 , and all that N quantities to measure. i=1,N Simple H A : true values are { f A ,i } Simple H B : true values are { f B ,i } Measurements { d i }, i=1,N with Gaussian rms σ i . 𝑂 1 L (H A ) = � 2 𝑗=1

𝑂 𝑒 𝑗 − 𝑔 A, i − 𝑒 𝑗 − 𝑔 B, i 2 2 − 2ln λ AB = − 2ln L H A + 2ln L (H B ) = � σ 𝑗 σ 𝑗 2 2 𝑗=1 By Central Limit Theorem , − 2ln λ AB → Gaussian , independent of true H . (Can let the σ i depend on H as well!) N.B: No mention yet of Wilks, χ 2 , DOF. Just CLT, if we are in asymptopia or dominated by the certain term when re-expressed in certain way. (A more high-powered discussion can invoke non-central chisquare; see Blennow et al, JHEP 1403 (2014) 028 ) The ν community calls the above “ ∆χ 2 ”, where, individually under H A and H B : 𝑂 𝑂 𝑒 𝑗 − 𝑔 A, i 𝑒 𝑗 − 𝑔 B, i 2 2 χ 2 (A) = � χ 2 (B) = � σ 𝑗 σ 𝑗 2 2 𝑗=1 𝑗=1 Since this is a bit of a long way around in my opinion, it is instructive to take a closer look, viewing these also as likelihood ratios. Bob Cousins, PhyStat-nu Fermilab 2016

𝑂 1 L (H sat ) = � 2 𝑗=1

Repeat the above with binned Poisson data Observed bin contents { n i }, i=1,N . Simple H A : true Poisson means are { f A ,i } Simple H B : true Poisson means are { f B ,i } H sat : f sat ,i ≡ n i . 𝑂 f A ,i 𝑜 𝑗 𝑓−𝑔 A, i L H A = � , similarly for L (H B ). 𝑜 𝑗 ! 𝑗=1 𝑂 n i 𝑜 𝑗 𝑓−𝑜 i L H sat = � 𝑜 𝑗 ! 𝑗=1 − 2ln λ A,B = − 2ln L H A → Gaussian by CLT. Once again, L H B − 2ln λ A,sat = − 2ln L H A → χ 2 , N DOF, similarly for − 2ln λ B,sat L H sat

GOF test based on Poisson LR − 2ln λ A,sat with saturated model was subject of my first foray into statistics literature... Bob Cousins, PhyStat-nu Fermilab 2016

A recent worked MC example for binned Poisson data is in “Should unfolded histograms be used to test hypotheses?” Cousins, May, Sun http://arxiv.org/abs/1607.07038 -2ln λ A,sat -2ln λ A,B -2ln λ A,B -2ln λ A,sat → Gaussian − 2ln λ A,B − 2ln λ A,sat → χ 2 , in this case 10 DOF − 2ln λ B,sat very similar Bob Cousins, PhyStat-nu Fermilab 2016

What about binned Poisson data? Now N is number of events (not bins). Let θ be vector of observable (energy, angles, etc.) with pdf 𝑞 ( θ | 𝐼 A ) . 𝑂 𝑂 ; similarly for L (H B ). L H A = � 𝑞 ( θ 𝑗 | 𝐼 A ) − 2 ln H A = � − 2ln( 𝑞 θ 𝑗 H A ) 𝑗=1 𝑗=1 − 2ln λ A,B = − 2ln L H A → Gaussian by CLT. Once again, L H B ; However, there is no natural analog to the saturated model and hence the individual GOF tests are ~arbitrary, and − 2ln λ A,B is not equivalent to a " ∆χ 2 " . ⇒ A reason why I prefer direct − 2ln λ A,B approach to " ∆χ 2 " approach. Bob Cousins, PhyStat-nu Fermilab 2016

Preparing for LHC, we imagined new dilepton resonance. H A : spin-1 Z ′ , or H B : spin-2 graviton G* Discriminating variable: quark-muon angle θ CS in Collins-Soper frame. spin-1 Z ′ mass 1.5 TeV spin-2 G* mass 1.5 TeV Bob Cousins, PhyStat-nu Fermilab 2016

− 2ln λ A,B = − 2ln L H A Histograms of for individual MC events L H B Mean = 0.087 Mean = -0.19 RMS = 0.72 RMS = 0.81 Event -2ln λ Event -2ln λ Each MC experiment is 50 samples from above. Add -2ln λ ’s from events: Gaussian by CLT Mean = 9.5 Mean = 4.4 mean = event mean × 50 RMS = 5.7 RMS = 5.1 RMS = event RMS × √ 50 (An earlier paper had erroneously assumed that -2ln λ was χ 2 .) Expt -2ln λ (Peak separation) / RMS scales beautifully with √ N.

Above is all “pre-data” characterization of the test How to characterize post-data ? In N-P theory, α is specified in advance . Suppose after obtaining data, you notice that with α =0.05 previously specified, you reject H 0 , but with α =0.01 previously specified, you accept H 0 . In fact, you determine that with the data set in hand, H 0 would be rejected for α ≥ 0.023. This interesting value has a name: After data are obtained, the p-value is the smallest value of α for which H 0 would be rejected, had it been specified in advance . Numerically (if not philosophically) the same as usual “value obtained or more extreme” due to Fisher. Large literature bashing p-values. I defend HEP: http://arxiv.org/abs/1310.3791 Bob Cousins, PhyStat-nu Fermilab 2016 13

Interpreting p-values and Z-values It is crucial to realize that that value of α was typically not specified in advance, so p-values do not correspond to Type I error rates of the experiments which report them. Interpretation of p-values is a long, contentious story – beware! In HEP, typically converted to Z-value, equivalent number of Gaussian sigma. At LHC, we had recent case that forced us to think about post-data interpretation of (nearly) simple vs simple test. Bob Cousins, PhyStat-nu Fermilab 2016 14

Early CMS Higgs spin-parity test of 0 + vs. 0 - Paper reported (fixing typo here): 1) -2ln( L 0- / L 0+ ) = 5.5 favoring 0 + 2) p-value = 0.72% for 0 - 3) p-value = 0.7 for 0+ 4) CL s = (0.72%) / (1–0.7) = 2.4%, “a more conservative value for judging whether the observed data are compatible with 0 - ”

Luc Demortier and Louis Lyons, http://arxiv.org/abs/1408.6123 “Testing Hypotheses in Particle Physics: Plots of p 0 versus p 1 ” Test of point null vs point alternative, two Gaussians with same σ , peak separation ∆µ . At a glance can see that contours of constant λ 01 are completely different topology from contours of e.g. p 0 . (For rest of plot, you will have to read their paper or stare at it for a long time.) Bob Cousins, PhyStat-nu Fermilab 2016

Number of light ν flavors in 1989: 3 light ν s known Crucial to test ν =3 vs (or more?) in Z decay. ν Mark II collab at SLAC SLC, facing imminent competition from LEP. ν 4 has “point hypotheses”, they treated N ν Rather than treating ν 3 and as a continuous parameter estimated with standard techniques, obtaining N ν = 2.8 ± 0.6 from resonance parameters of Z. “The 95%-C.L. limit, N ν <3.9, excludes to this level the presence of a fourth massless neutrino species within the standard-model framework.” ( Several interesting discussion points, Including benefit of downward fluctuation!) PRL 63, 2173 (1989) Bob Cousins, PhyStat-nu Fermilab 2016

Continuous Mass Hierarchy variable? The +1 and -1 for MH appear in the equations as simply that: arithmetic signs. Various authors (e.g., Capozzi, Lisi, and Marrone, PRD 89 013001) have suggested replacing ± 1 with (unbounded) continuous variable α . Reminiscent of continuous “number of light neutrino species” (which recall had BSM physics interpretation). In frequentist treatment, I think it is mostly a matter of presentation, since results from discrete way map to continuous way, and vice versa (particularly if F-C construction is used for confidence interval for α , with relevant set of C.L.’s). I encourage continuous α approach as part of toolkit. But…Eligio Lisi has explained to me that α is highly correlated with ∆ m 2 , and contributes to increase its overall uncertainty. This leads to the undesired result that power is lost due to consideration of unphysical (or at least non- SM) values of MH. Ugh. NOTE added after talk : I mis-stated Eligio’s point above at the time of the talk; I believe that it is now repaired. -BC Bob Cousins, PhyStat-nu Fermilab 2016 18

Addition of Nuisance Parameter δ to MH Test Small variation of nuisance parameters seems not to upset the formalism, and some relevant examples with toys still give nicely Gaussian distribution of LR test statistic. However the situation can become harder – see talk by Sara Algeri at Tokyo. If the CP phase δ is treated as a nuisance parameter in the MH determination, then great care is needed. Providing the MH results as a function of δ (same δ in numerator and denominator of LR) would seem to be mandatory, before attempting to “eliminate” δ by profiling or marginalizing.. Bob Cousins, PhyStat-nu Fermilab 2016 19

Reflections on Statistical Data Analysis in Neutrino Experiments - PowerPoint PPT Presentation

Reflections on Statistical Data Analysis in Neutrino Experiments since NOMAD and F-C Bob Cousins Univ. of California, Los Angeles (UCLA) PHYSTAT- Workshop on Statistical Issues in Experimental Neutrino Physics, Fermilab September 21, 2016

Reflections for quantum query algorithms Reflections Ben Reichardt University of Waterloo

Solar Neutrino and Solar Neutrino and Neutrino Physics in Brazil Neutrino Physics in Brazil

Neutrino Neutrino Properties Properties Boris Kayser Neutrino 2008 May 28, 2008 1 What Is

2018 RCMP Youth Academy Why the RCMP Youth Academy? Reflections Reflections Reflections The

Neutrino measurement with MACRO: Neutrino measurement with MACRO: neutrino oscillation, dark

Neutrino Mass Models Neutrino Mass Models Why BSM? Neutrino mass models roadmap Survey of

Aspects of neutrino masses Jessica Turner UCL 13 December 2019 Outline Neutrino masses and

Neutrino Coherent Scattering, neutrino dipole moments, and connection to cosmology A.B.

Composed, Distributed Reflections on Semantics and Statistical Machine Translation Timothy

Data and Analysis Part V Statistical Analysis of Data Alex Simpson Part V: Statistical Analysis

Leptogenesis Origin of the Matter-Antimatter Asymmetry in the Universe T.Yanagida Neutrino Mass

Neutrino Fishing Neutrino Fishing in the Bahamas in the Bahamas At different times work

Sterile Neutrinos in Cosmology Mikhail Shaposhnikov NEUTRINO 2008 Neutrino 2008, 30 May 2008

The Short-Baseline Neutrino Oscillation Program in the Fermilab Booster Neutrino Beam Neutrino

Neutrino Oscillation Tomography (and Neutrino Absorption Tomography) (and Neutrino

NEUTRINO DETECTORS 1 ART AND SCIENCE OF NEUTRINO DETECTORS Or rather Stories about neutrino

(AGSDest) An R-package for estimation in classical and adaptive group sequential trials Niklas

ADAPT Floating-Point Precision Tuning Ignacio Laguna, Harshitha Menon, Tristan Vanderbruggen

Hessenberg Reduction with Transient Error Resilience on GPU-Based Hybrid Architectures Yulu

Hypothesis testing Edwin Leuven Introduction Statistical inference until now looked as follows

Week 2: Inference for SLR Inference: sampling distributions, testing confidence intervals, and

Class Imbalance Multiclass Problems General Idea Original D Training data .... Step 1:

Microarrays False Discovery Rate Prof. Tesler Math 186 Winter 2019 Prof. Tesler

s trts r

Sambuz

Useful Links

Newsletter

Mail Us