on combining significances trivial examples
play

On Combining Significances. Trivial examples. S.I. Bityukov and N.V. - PDF document

PhyStat 2011 CERN, Geneva, January 18, 2011 On Combining Significances. Trivial examples. S.I. Bityukov and N.V. Krasnikov Institute for nuclear research RAS, Moscow, Russia Suppose one experiment sees a 3-sigma effect and another sees a


  1. PhyStat 2011 CERN, Geneva, January 18, 2011 On Combining Significances. Trivial examples. S.I. Bityukov and N.V. Krasnikov Institute for nuclear research RAS, Moscow, Russia “Suppose one experiment sees a 3-sigma effect and another sees a 4-sigma effect. What is combined sig- nificance?” (R. Cousins (2008). His conclusion - the question is not well-posed.) In this talk we discuss the problem of significances combining. Plan • Introduction • Combining methods • Examples • Weighted combination • The case of small statistics • Systematics • Conclusion 1

  2. PhyStat 2011 CERN, Geneva, January 18, 2011 Introduction Significance S is related with the probability by the relation (one-sided tail probability) S = Φ − 1 (1 − p ) = − Φ − 1 ( p ) , 1 + erf ( S 2 ) √ 1 � S − t 2 2 dt = √ Φ( S ) = −∞ e , 2 2 π √ 2 erf − 1 (1 − 2 p ) . S = For example, S = 5 corresponds to a p − value of 2 . 9 · 10 − 7 . ⇒ p = 3 . 2 · 10 − 5 . S = 4 = S = 3 = ⇒ p = 0 . 0013 . S = 2 = ⇒ p = 0 . 023 . S = 1 = ⇒ p = 0 . 16 . So the problem – how to combine a set of a p − values or a set of of S − significances ? 2

  3. PhyStat 2011 CERN, Geneva, January 18, 2011 Combining methods I There are several methods for significances combin- ing: 1. Fisher’s (R.A. Fisher, 1932) method based on the N choice of P = i =1 p i . A simple way to combine p i is � the use of relation i ln p i = χ 2 2 N,p , where χ 2 − 2 ν,p denotes the upper p � point of the probability integral of a central chi- squared of ν degrees of freedom. For two p 1 and p 2 combining (for example, F. James (2006)) p ( p 1 , p 2 ) = p 1 p 2 (1 − ln ( p 1 p 2 )) . Plus a lot of generalizations: I.J. Good (1955), H. Lancaster (1961) et al. 2. Tippett’s (L. Tippett, 1931) method using the small- est p i p = 1 − (1 − ( min p i )) N ≈ N · min p i . Plus generalizations: B. Wilkinson (1991), . . . . 3

  4. PhyStat 2011 CERN, Geneva, January 18, 2011 Combining methods II 3. Stouffer’s (S. Stouffer et al., 1949) method adding the inverse normal of the p i ’s √ � Φ − 1 ( p i ) = N Φ − 1 ( p ) , equivalently � S i √ S = N . Plus generalizations: F. Mosteller and R. Bush (1954), T. Liptak (1958), . . . , S. Bityukov et al. (2006). Here we shall compare the Fisher and Stouffer ap- proaches. Our conclusion: For high energy physics with Pois- son distribution in many cases the Stouffer approach is more natural. 4

  5. PhyStat 2011 CERN, Geneva, January 18, 2011 Example I Suppose the CMS experiment for some signature measures in july (2010) 10300 events and in august it detects 9700 events with the theoretical expectation λ july = λ august = 10000 in Poisson distribution Pois ( n, λ ) = λ n n ! e − λ . For λ ≫ 1 , n obs ≫ 1 Poisson distribution is approx- imated by normal distribution with mean µ = λ and variance σ 2 = λ and we find that S july = | 10300 − 10000 | √ = 3 , 10000 S august = | 9700 − 10000 | √ = 3 . 10000 If we analyze data for july plus august we find (using the fact that the sum of two Poisson processes with λ 1 and λ 2 is a Poisson process with λ = λ 1 + λ 2 ) that S july + august = | n obs,july + n obs,august − λ july − λ august | = 0 � λ july + λ august in perfect agreement with a theory. 5

  6. PhyStat 2011 CERN, Geneva, January 18, 2011 Example I If we combine significances using formula p ( p 1 , p 2 ) = p 1 p 2 (1 − ln ( p 1 p 2 )) , we find that (remember that S = 3 = ⇒ p i = 0 . 0013 , i = 1 , 2 ) p = 0 . 000026 ; S Fisher july + august = 4 . 05 Example II For other example with n july = 10300 , n august = 10400 λ july = λ august = 10000 we find that S july = 3 , S august = 4 , and S july + august = 3 + 4 √ 2 ≈ 4 . 95 . Fisher’s method gives: p july + august = 0 . 00000077 ; S Fisher july + august = 4 . 80 6

  7. PhyStat 2011 CERN, Geneva, January 18, 2011 Weighted combination For Poisson distribution with n obs ≫ 1 and λ ≫ 1 (when we can approximate this distribution by Gaus- sian) we can define significance as S 1 = n obs 1 − n 0 1 , S 2 = n obs 2 − n 0 2 . σ 1 σ 2 Here S i > 0 corresponds for excess of events, S i < 0 corresponds for shortage of events and σ i = √ λ i , i = 1 , 2 . Usualy people use S = | n obs − n 0 | . σ The rule for significances combining is S ( S 1 , S 2 ) = S 1 σ 1 + S 2 σ 2 . � σ 2 1 + σ 2 2 For general case S ( S 1 , S 2 , . . . , S n ) = S 1 σ 1 + . . . + S n σ 2 . σ 2 1 + . . . + σ 2 � 2 7

  8. PhyStat 2011 CERN, Geneva, January 18, 2011 The case of small statistics Consider now the case of small statistics. Namely, as an example, consider CMS experiment with n obs,july = n obs,august = 1 and λ = λ august = λ ≪ 1 . Then the proba- bility to observe n ≥ 1 events is determined by ∞ Pois ( n ≥ 1 , λ ) = i =1 Pois ( i, λ ) ≈ λ . � Correspondingly, i =2 Pois ( i, λ ) ≈ λ 2 ∞ Pois ( n ≥ 2 , λ ) = � 2 and we find that P july = λ july , P august = λ august , ( λ july + λ august ) 2 ( P july + P august ) 2 P july + august = = . 2 2 A. For P july = P august = 0 . 023 (2 σ : one − side ) P july + august = 0 . 001 (3 . 07 σ ) . Compare with Fisher’s formula P Fisher july + august = P july · P august (1 − ln ( P july · P august )) . Fisher’s method gives the value P Fisher july + august = 0 . 0044 (2 . 62 σ ) . 8

  9. PhyStat 2011 CERN, Geneva, January 18, 2011 The case of small statistics B. For the case S july = 3 σ , S august = 4 σ , n july = 1 and n august = 1 P july + august = 0 . 95 · 10 − 6 (4 . 76 σ ) july + august = 0 . 77 · 10 − 6 (4 . 81 σ ) P Fisher C. For the case S july = 3 σ , S august = 3 σ , n july = 1 and n august = 1 P july + august = 0 . 36 · 10 − 5 (4 . 49 σ ) july + august = 0 . 26 · 10 − 4 (4 . 05 σ ) P Fisher In general for n obs ≫ λ n = n 1 P ( n, λ 1 ) ≈ λ n 1 ∞ 1 n 1 ! e − λ 1 ; � n = n 2 P ( n, λ 2 ) ≈ λ n 2 ∞ 2 n 2 ! e − λ 2 � and P ( n, λ 1 + λ 2 ) ≈ ( λ 1 + λ 2 ) n 1 + n 2 ∞ e − ( λ 1 + λ 2 ) . � n 1 + n 2 ! n = n 1 + n 2 9

  10. PhyStat 2011 CERN, Geneva, January 18, 2011 Systematics Let us consider the influence of systematic effects related with nonexact knowledge of parameter λ in Poisson formula. Suppose n obs,july = n obs,august = 600 , λ july = λ august = 300 , and ǫ = 1 3 (uncertainty in the parameter λ determina- tion). For such case the significance is determined by ap- proximate formula n obs − λ S = λ + ( ǫλ ) 2 . � According to this formula 300 S july = S august = 300 + (100) 2 ≈ 3 . � For july+august 600 S july + august = 600 + (2 · 100) 2 ≈ 3 . � So we find that july+august combining does not help to increase significance, since the systematic er- ror ∼ ( ǫλ ) 2 dominates. 10

  11. PhyStat 2011 CERN, Geneva, January 18, 2011 Conclusions To conclude we think that in many cases for Poisson statistics the most natural rules of significances com- bining is the use of the fact that the sum of Poisson processes is the Poisson process.  Pois ( n 1 , λ 1 )      Pois ( n 2 , λ 2 )     � n i , � λ i ) = ⇒ Pois ( . . .       Pois ( n n , λ n )    In fact it is natural generalization of the original Stouffer method. 11

  12. PhyStat 2011 CERN, Geneva, January 18, 2011 References [1] R.D. Cousins, Annotated Bibliography of Some Papers on Combining Significances or p -values , arXiv:0705.2209 [physics.data-an] . [2] R.A. Fisher, Statistical Methods for Research Workers , Hafner, Darien, Connecticut, 14th edition, 1970. The method of combining significances seems to have appeared in the 4th edition of 1932. [3] F. James, Statistical Methods in Experimental Physics , 2nd edition, World Scientific, 2006. [4] I.J. Good, On the weighted combination of significance tests , Journal of the Royal Statistical Society. Series B (Methodological) 17(2):264-265. [5] H. Lancaster, The combination of probabilities: an applica- tion of orthinormal functions . Australian Journal of Statis- tics, 3 , 20-33, 1961. [6] L. Tippett, The Methods of Statistics , Williams and Nor- gate, Ltd., London, 1st edition. Sec. 3.5, 53-6, 1931, as cited by Birnbaum and by Westberg. [7] B. Wilkinson, A statistical consideration in psychological research , Psychological Bulletin , 48 , 156-158, 1951. [8] S. Stouffer, E. Suchman, L. DeVinnery, S. Star, and R.W. Jr, The American Soldier , volume I: Adjustment during Army Life. Princeton University Press, 1949. [9] F. Mosteller and R. Bush, Selected quantitative techniques , in ed. G. Lindzey, Handbookof Social Psychology vol. I , 289-334, Addison-Wesley, Cambridge, Mass., 1954. 12

  13. PhyStat 2011 CERN, Geneva, January 18, 2011 [10] T. Liptak, On the combination of independent tests , Maguar Tud. Akad. Mat. Kutato Int. Kozl. , 3 , 171-197, 1958. [11] S.Bityukov, N.Krasnikov and A.Nikitenko, On the combin- ing significances, arXiv:physics/0612178(2006); S.Bityukov, N.Krasnikov, A.Nikitenko and V.Smirnova, Two approaches to combining significances, PoS ACAT08:118(2008). 13

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend