Introduction to Unfolding in High Energy Physics
Mikael Kuusela
Institute of Mathematics, EPFL
Advanced Scientific Computing Workshop, ETH Zurich July 15, 2014
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 1 / 66
Introduction to Unfolding in High Energy Physics Mikael Kuusela - - PowerPoint PPT Presentation
Introduction to Unfolding in High Energy Physics Mikael Kuusela Institute of Mathematics, EPFL Advanced Scientific Computing Workshop, ETH Zurich July 15, 2014 Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 1 / 66 Outline
Institute of Mathematics, EPFL
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 1 / 66
1
2
3
4
5
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 2 / 66
1
2
3
4
5
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 3 / 66
Figure : Smeared density
Folding
Unfolding
Figure : True density
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 4 / 66
1 Comparison of the measurement with future theories 2 Comparison of experiments with different responses 3 Input to a subsequent analysis 4 Exploratory data analysis
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 5 / 66
(GeV)
T
Jet p
200 300 1000 2000
d|y| (pb/GeV)
T
/dp σ
2
d
10
10
3
10
7
10
11
10
13
10
)
410 × |y| < 0.5 ( )
310 × 0.5 < |y| < 1.0 ( )
210 × 1.0 < |y| < 1.5 ( )
110 × 1.5 < |y| < 2.0 ( ) 10 × 2.0 < |y| < 2.5 (
CMS
= 7 TeV s
L = 5.0 fb R = 0.7
T
anti-k
T= p
Fµ =
Rµ NP Corr. ⊗ NNPDF2.1
,C
τ ln
,C
τ dN/dln N 1
0.00 0.05 0.10 0.15 0.20
,C
τ ln
,C
τ dN/dln N 1
0.00 0.05 0.10 0.15 0.20 > 30 GeV/c
T
, R=0.5, p
T
anti-k < 125 GeV/c
T,1
90 GeV/c < p Pythia6 Pythia8 Herwig++ MadGraph+Pythia6 Alpgen+Pythia6 Data
= 7 TeV, L = 3.2 pb s CMS
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 6 / 66
λ ∈ Rp
+ bin means of the true histogram
x ∈ Np
0 bin counts of the true histogram
µ ∈ Rn
+ bin means of the smeared histogram
y ∈ Nn
0 bin counts of the smeared histogram
1
The true counts are independent and Poisson distributed x|λ ∼ Poisson(λ), ⊥ ⊥ xi|λ
2
The propagation of events to neighboring bins is multinomial conditional on xi and independent for each true bin
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 7 / 66
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 8 / 66
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 9 / 66
−6 −4 −2 2 4 6 100 200 300 400 500 Smeared histogram −6 −4 −2 2 4 6 100 200 300 400 500 True histogram
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 10 / 66
−6 −4 −2 2 4 6 −2.5 −2 −1.5 −1 −0.5 0.5 1 1.5 2 x 10
13
Pseudoinverse True
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 11 / 66
1
2
3
4
5
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 12 / 66
1
2
3
4
5
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 13 / 66
L(λ) = p(y|λ) =
n
p(yi|λ) =
n
p
j=1 Kijλj
yi yi! e− p
j=1 Kijλj,
λ ∈ Rp
+
The likelihood function plays a key role in all sensible unfolding methods
In ill-posed problems, this is usually not the case, but the maximum likelihood solution still provides a good starting point
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 14 / 66
λ∈Rp
+
n
p
p
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 15 / 66
1 The log-likelihood has a maximum. 2 The log-likelihood is concave and hence all the maxima are global
3 The maximum is unique if and only if the columns of K are linearly
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 16 / 66
K is often non-square Even if K was square, it is often not invertible And even if K was invertible, K−1y often contains negative values
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 17 / 66
This is a widely used iterative algorithm for finding maximum likelihood solutions in problems that can be seen as containing incomplete
j
j
i=1 Kij n
l=1 Kilλ(k) l
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 18 / 66
Optics: Richardson (1972) Astronomy: Lucy (1974) Tomography: Shepp and Vardi (1982); Lange and Carson (1984); Vardi et al. (1985) HEP: Kondor (1983); M¨ ulthei and Schorr (1987); M¨ ulthei et al. (1987, 1989); D’Agostini (1995)
D’Agostini iteration is a fully frequentist technique for finding the MLE There is nothing Bayesian about it!
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 19 / 66
−10 −5 5 10 100 200 300 400 500 µ y Kλ(k)
Figure : Smeared histogram
−10 −5 5 10 100 200 300 400 500 λ λ(k)
Figure : True histogram
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 20 / 66
−10 −5 5 10 100 200 300 400 500 µ y Kλ(k)
Figure : Smeared histogram
−10 −5 5 10 100 200 300 400 500 λ λ(k)
Figure : True histogram
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 21 / 66
−10 −5 5 10 100 200 300 400 500 µ y Kλ(k)
Figure : Smeared histogram
−10 −5 5 10 200 400 600 800 λ λ(k)
Figure : True histogram
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 22 / 66
−10 −5 5 10 100 200 300 400 500 µ y Kλ(k)
Figure : Smeared histogram
−10 −5 5 10 500 1000 1500 λ λ(k)
Figure : True histogram
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 23 / 66
1
2
3
4
5
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 24 / 66
Due to the ill-posedness of the problem, it exhibits large, unphysical fluctuations In other words, the likelihood function alone does not contain enough information to constrain the solution
This is because the algorithm will start overfitting to the Poisson fluctuations in y
The number of iterations k now becomes a regularization parameter that controls the trade-off between fitting the data and taming unphysical features
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 25 / 66
−10 −5 5 10 100 200 300 400 500 µ y Kλ(k)
Figure : Smeared histogram
−10 −5 5 10 100 200 300 400 500 λ λ(k)
Figure : True histogram
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 26 / 66
Is there a more principled way of finding good solutions?
λ∈Rp
+
P(λ) is a penalty function which obtains large values for physically implausible solutions δ > 0 is a regularization parameter which controls the balance between maximizing the likelihood and minimizing the penalty
I.e., it penalizes for large oscillations
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 27 / 66
n
p
p
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 28 / 66
λ∈Rp
λ∈Rp
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 29 / 66
Norm of the solution: P(λ) = λ2 Curvature of the solution: P(λ) = Lλ2, where L is a discretized 2nd derivative operator “SVD” unfolding (H¨
P(λ) =
λ1/λMC
1
λ2/λMC
2
. . . λp/λMC
p
, where λMC is a MC prediction for λ TUnfold1 (Schmitt, 2012): P(λ) = L(λ − λMC)2
1Also more general penalty terms are allowed in TUnfold Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 30 / 66
x∈Rp Ax − y2,
In this special case, the pseudoinverse is given by A† = (ATA)−1AT Hence, the least squares solution is: ˆ xLS = (ATA)−1ATy
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 31 / 66
λ∈Rp
−1 = diag
λ∈Rp
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 32 / 66
Unfolding in HEP July 15, 2014 33 / 66
T √
T √
T ˜
T˜
−1K + δLTL
−1y
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 34 / 66
−6 −4 −2 2 4 6 −100 100 200 300 400 500 600 Tikhonov regularization True
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 35 / 66
1
2
3
4
5
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 36 / 66
+p(y|λ′)p(λ′) dλ′ ,
+
The posterior mean: ˆ λ = E[λ|y] =
+λp(λ|y) dλ
The maximum a posteriori (MAP) estimator: ˆ λ = arg max
λ∈Rp
+
p(λ|y)
But note that the interpretation of the resulting Bayesian credible intervals is different from frequentist confidence intervals
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 37 / 66
It concentrates the probability mass of the posterior on physically plausible solutions
+,
+
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 38 / 66
λ∈Rp
+
λ∈Rp
+
λ∈Rp
+
λ∈Rp
+
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 39 / 66
+p(y|λ′)p(λ′) dλ′
The sample mean and sample quantiles can then be used to compute the posterior mean and the credible intervals
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 40 / 66
1
2
3
4
5
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 41 / 66
1
2
3
4
5
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 42 / 66
The parameter δ in Tikhonov regularization and Bayesian unfolding, the number of iterations in D’Agostini
But its value usually has a major impact on the unfolded spectrum
But this may create an undesired MC bias
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 43 / 66
Goodness-of-fit test in the smeared space (Veklerov and Llacer, 1987) Empirical Bayes estimation (Kuusela and Panaretos, 2014) L-curve (Hansen, 1992) (Generalized) cross validation (Wahba, 1990) ...
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 44 / 66
a
n, where n is the
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 45 / 66
1
2
3
4
5
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 46 / 66
λ∈Rp
+
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 47 / 66
But this is only true when ˆ λi is unbiased and has a symmetric sampling distribution
Regularization reduces variance by increasing the bias (bias-variance trade-off) Hence the SE confidence intervals may have lousy coverage
SE[ˆ λi ] SE[ˆ λi ]
p(ˆ λi|λ)
λi = E[ˆ λi ] Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 48 / 66
But this is only true when ˆ λi is unbiased and has a symmetric sampling distribution
Regularization reduces variance by increasing the bias (bias-variance trade-off) Hence the SE confidence intervals may have lousy coverage
SE[ˆ λi ] SE[ˆ λi ]
p(ˆ λi|λ)
λi E[ˆ λi ] Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 48 / 66
−5 5 −2000 −1000 1000 2000 δ = 1e−05
λ ˆ λ ± SE[ˆ λ]
−5 5 −100 100 200 300 400 500 600 δ = 0.001
λ ˆ λ ± SE[ˆ λ]
−5 5 −100 100 200 300 400 500 600 δ = 0.01
λ ˆ λ ± SE[ˆ λ]
−5 5 −100 100 200 300 400 500 600 δ = 1
λ ˆ λ ± SE[ˆ λ]
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 49 / 66
Hence these uncertainties should be understood as estimates of the spread of the sampling distribution of ˆ λ These should only be understood as approximate confidence intervals if it can be shown that the bias is negligible
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 50 / 66
1
2
3
4
5
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 51 / 66
1
The matrix depends on the shape of the spectrum within each true bin Kij =
, i = 1, . . . , n, j = 1, . . . , p
2
The smearing of the variable of interest may depend on the MC distribution of some auxiliary variables
For example, the energy resolution of jets depends on the pseudorapidity distribution of the jets
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 52 / 66
1
2
3
4
5
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 53 / 66
1
Matrix inversion
2
D’Agostini iteration
3
The SVD flavor of Tikhonov regularization
4
The TUnfold flavor of Tikhonov regularization
This is an obsolete method that replaces the full response matrix K by a diagonal approximation and while doing so introduces a huge MC bias This method should not be used!
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 54 / 66
Figure from Adye (2011)
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 55 / 66
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 56 / 66
For small niter, the solution is biased towards λMC; for large niter, we get a solution close to the MLE Note that the default niter = 4 is completely arbitrary and with no
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 57 / 66
λ∈Rp
1
2
p
Small kreg corresponds to a large δ and a large kreg to a small δ
Also includes a contribution from the uncertainty of K
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 58 / 66
λ∈Rp
TUnfold actually provides a lot of extra functionality which cannot be accessed through RooUnfold
The supported choices are identity, 1st derivative and 2nd derivative
If δ is not chosen manually, it is found automatically using the L-curve technique, but this only seems to work when n ≫ p
2In the case of the TUnfold wrapper, the RooUnfold documentation is not explicit
about the choice of λMC (it does not seem to come from res in this case)
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 59 / 66
These slides: www.cern.ch/mkuusela/ETH_workshop_July_2014/ slides.pdf RooUnfold website: http://hepunx.rl.ac.uk/~adye/software/unfold/ RooUnfold.html RooUnfold class documentation: http://hepunx.rl.ac.uk/~adye/software/unfold/ htmldoc/RooUnfold.html
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 60 / 66
1
2
3
4
5
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 61 / 66
It is crucial to understand the ingredients that go into an unfolding procedure Unfolding algorithms should never be used as black boxes!
There is a MC dependence in both the smearing matrix and the regularization The uncertainties should be understood as standard errors and do not necessarily provide good coverage properties The regularization parameter has a major impact on the solution and should be chosen in a data-dependent way
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 62 / 66
Issues Related to Discovery Claims in Search Experiments and Unfolding, CERN-2011-006, pages 313–318, CERN, Geneva, Switzerland, 17–20 January 2011.
Nuclear Instruments and Methods A, 362:487–498, 1995.
incomplete data via the EM algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 39(1):1–38, 1977.
SIAM Review, 34(4):561–580, 1992.
Instruments and Methods in Physics Research A, 372:469–481, 1996.
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 63 / 66
Fredholm’s integral equations of the first kind. Nuclear Instruments and Methods, 216:177–181, 1983.
particle spectra at the Large Hadron Collider. arXiv:1401.8274 [stat.AP], 2014.
transmission tomography. Journal of Computer Assisted Tomography, 8(2): 306–316, 1984.
Astronomical Journal, 79(6):745–754, 1974.
ulthei and B. Schorr. On an iterative method for the unfolding of spectra. Nuclear Instruments and Methods in Physics Research A, 257:371–377, 1987.
ulthei, B. Schorr, and W. T¨
integral equations of the first kind. Mathematical Methods in the Applied Sciences, 9:137–168, 1987.
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 64 / 66
ulthei, B. Schorr, and W. T¨
maximum likelihood reconstruction method. Mathematical Methods in the Applied Sciences, 11:331–342, 1989.
random fields with applications to Bayesian tomography. IEEE Transactions on Image Processing, 7(7):1029–1044, 1998.
1985.
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 65 / 66
statistical hypothesis testing. IEEE Transactions on Medical Imaging, 6(4): 313–319, 1987.
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 66 / 66
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 67 / 66
1
Unfold y to obtain ˆ λ
2
Fold ˆ λ to obtain ˆ µ = Kˆ λ
3
Obtain a resampled observation y∗ ∼ Poisson(ˆ µ)
4
Unfold y∗ to obtain ˆ λ∗
5
Repeat R times from 3
r=1 follows the sampling distribution of
I.e., it is our best understanding of the sampling distribution of ˆ λ for the data at hand
This is very difficult to do using competing methods
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 68 / 66
i,1−α/2, 2ˆ
i,α/2],
i,α denotes the α-quantile of the bootstrap sample {ˆ
i
r=1
α/2 α/2 p(ˆ θ|θ = ˆ θ0) p(ˆ θ|θ = ˆ θU) p(ˆ θ|θ = ˆ θL) ˆ θL ˆ θ0 ˆ θU
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 69 / 66
i,1−α/2, 2ˆ
i,α/2],
i,α denotes the α-quantile of the bootstrap sample {ˆ
i
r=1
α/2 α/2 p(ˆ θ|θ = ˆ θ0) = g(ˆ θ) g(ˆ θ + ˆ θ0 − ˆ θU) g(ˆ θ + ˆ θ0 − ˆ θL) ˆ θL ˆ θ0 ˆ θU
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 69 / 66
−6 −4 −2 2 4 6 −50 50 100 150 200 250 300 350 400 450
λ ˆ λ SE BS
Figure : Tikhonov regularization with 95% bin-wise confidence intervals. The SE intervals cover in 23 bins out of 40, while the bootstrap intervals cover in 32 bins.
Mikael Kuusela (EPFL) Unfolding in HEP July 15, 2014 70 / 66