Neural Network Fragmentation Functions
Valerio Bertone
NIKHEF and VU Amsterdam
REF 2017
November 15, 2017, Madrid
[V. Bertone et al., Eur. Phys. J. C 77 (2017) no.8, 516]
Neural Network Fragmentation Functions [V. Bertone et al. , Eur. - - PowerPoint PPT Presentation
Neural Network Fragmentation Functions [V. Bertone et al. , Eur. Phys. J. C 77 (2017) no.8, 516] Valerio Bertone NIKHEF and VU Amsterdam REF 2017 November 15, 2017, Madrid Motivation Why bother about fragmentation functions Comparison
Valerio Bertone
NIKHEF and VU Amsterdam
November 15, 2017, Madrid
[V. Bertone et al., Eur. Phys. J. C 77 (2017) no.8, 516]
pT
|η| < 1.0 |η| < 0.8 √s = 7.0 TeV
pT
|η| < 1.0 √s = 1960.0 GeV
pT
|η| < 1.0 |η| < 0.8 √s = 2760 GeV
pT
|η| < 1.0 |η| < 0.8 |η| < 2.5 √s = 900 GeV
d’Enterria et al. [ArXiv:1311.1415]
Comparison data/theory for inclusive charged-hadron pT spectra: Large energy data tend to be overshot by predictions obtained with most of the current FF sets ⇒ gluon FF at large z?
construct a set of data replicas which reproduces the statistical features of the
clear statistical interpretation.
flexible (redundant) functional form parametrised by a large set of parameters.
suitable exploration of the parameter space to avoid to fall into local minima of the figure of merit.
exploit the random distribution of the statistical fluctuations in a given MC replica to avoid over-learning.
The NNi(1) term ensures that fi(x) −
→
x→1 0
g(x) = sign(x) ln(|x| + 1) Activation function:
ξ(j)
i
= g
(j-1)th layer
X
k
ξ(j−1)
k
ω(j)
ki − θ(j) i
!
Vogt [Phys.Rev. D48 (1993)]
µ2 ∂ ∂µ2 Dh
NS
= P +
NS ⊗ Dh NS
µ2 ∂ ∂µ2 ✓Dh
Σ
Dh
g
◆ = ✓Pqq Pqg Pgq Pgg ◆ ⊗ ✓Dh
Σ
Dh
g
◆ Numerical implementation up to NNLO in the APFEL code:
careful benchmark against in the the N-space MELA code, perfect agreement with QCDNUM (after a correction of a bug in the latter).
L + Fh T ≡ Fh 2
= ˆ σh ⇥ C2,q ⊗ Dh
Σ + C2,g ⊗ Dh g + C2,NS ⊗ Dh NS
⇤ Identified hadron (π±, K±, p/p, …)
L + Fh T ≡ Fh 2
= ˆ σh ⇥ C2,q ⊗ Dh
Σ + C2,g ⊗ Dh g + C2,NS ⊗ Dh NS
⇤ Perturbative Wilson coefficients known up to NNLO in the massless scheme
L + Fh T ≡ Fh 2
= ˆ σh ⇥ C2,q ⊗ Dh
Σ + C2,g ⊗ Dh g + C2,NS ⊗ Dh NS
⇤ Only a limited set of combinations of FFs can be constrained by SIA cross sections
L + Fh T ≡ Fh 2
= ˆ σh ⇥ C2,q ⊗ Dh
Σ + C2,g ⊗ Dh g + C2,NS ⊗ Dh NS
⇤ Numerical implementation up to NNLO in the APFEL code:
partial benchmark against the DSS implementation.
. Anderle, F. Ringer, M. Stratmann [arXiv:1510.05854]
inclusive SIA data only constrains three FF combinations, heavy-quark FFs constrained directly by tagged SIA data.
substantial heavy-quark intrinsic component, heavy-quark FFs parametrised on the same footing as the light FFs.
αs(MZ) = 0.118, αem(MZ) = 1/127, mc = 1.51 GeV, mb = 4.92 GeV
zmin ≤ z ≤ zmax, zmin = ⇢ 0.02 for √s = MZ 0.075
, zmax = 0.9
u+, Dh s++d+, Dh c+, Dh b+, Dh g
inclusive SIA data only constrains three FF combinations, heavy-quark FFs constrained directly by tagged SIA data.
substantial heavy-quark intrinsic component, heavy-quark FFs parametrised on the same footing as the light FFs.
αs(MZ) = 0.118, αem(MZ) = 1/127, mc = 1.51 GeV, mb = 4.92 GeV
zmin ≤ z ≤ zmax, zmin = ⇢ 0.02 for √s = MZ 0.075
, zmax = 0.9
u+, Dh s++d+, Dh c+, Dh b+, Dh g
contributions ∝ Mh/ sz2 and ln(z)
inclusive SIA data only constrains three FF combinations, heavy-quark FFs constrained directly by tagged SIA data.
substantial heavy-quark intrinsic component, heavy-quark FFs parametrised on the same footing as the light FFs.
αs(MZ) = 0.118, αem(MZ) = 1/127, mc = 1.51 GeV, mb = 4.92 GeV
zmin ≤ z ≤ zmax, zmin = ⇢ 0.02 for √s = MZ 0.075
, zmax = 0.9
u+, Dh s++d+, Dh c+, Dh b+, Dh g
contributions ∝ ln(1 - z)
10.5 12 14 22 29 34 44 58 91.2 0.01 0.1 1
√s [GeV] z π±
TASSO
π±
BELLE TOPAZ
π±
BABAR TPC SLD
π±
ALEPH DELPHI OPAL
0.01 0.1
z K±
10.5 12 14 22 29 34 44 58 91.2 0.01 0.1 1
√s [GeV] z π±
TASSO
π±
BELLE TOPAZ
π±
BABAR TPC SLD
π±
ALEPH DELPHI OPAL
0.01 0.1
z K±
10.5 12 14 22 29 34 44 58 91.2 0.01 0.1 1
√s [GeV] z π±
TASSO
π±
BELLE TOPAZ
π±
BABAR TPC SLD
π±
ALEPH DELPHI OPAL
0.01 0.1
z K±
10.5 12 14 22 29 34 44 58 91.2 0.01 0.1 1
√s [GeV] z π±
TASSO
π±
BELLE TOPAZ
π±
BABAR TPC SLD
π±
ALEPH DELPHI OPAL
0.01 0.1
z K±
Z.Phys. C17 (1983) 5-15 Z.Phys. C42 (1989) 189
10.5 12 14 22 29 34 44 58 91.2 0.01 0.1 1
√s [GeV] z π±
TASSO
π±
BELLE TOPAZ
π±
BABAR TPC SLD
π±
ALEPH DELPHI OPAL
0.01 0.1
z K±
10.5 12 14 22 29 34 44 58 91.2 0.01 0.1 1
√s [GeV] z π±
TASSO
π±
BELLE TOPAZ
π±
BABAR TPC SLD
π±
ALEPH DELPHI OPAL
0.01 0.1
z K±
Z.Phys. C17 (1983) 5-15 Z.Phys. C42 (1989) 189
10.5 12 14 22 29 34 44 58 91.2 0.01 0.1 1
√s [GeV] z π±
TASSO
π±
BELLE TOPAZ
π±
BABAR TPC SLD
π±
ALEPH DELPHI OPAL
0.01 0.1
z K±
10.5 12 14 22 29 34 44 58 91.2 0.01 0.1 1
√s [GeV] z π±
TASSO
π±
BELLE TOPAZ
π±
BABAR TPC SLD
π±
ALEPH DELPHI OPAL
0.01 0.1
z K±
Z.Phys. C17 (1983) 5-15 Z.Phys. C42 (1989) 189
X.Q. Lu, Ph.D. thesis, Johns Hopkins U., 1986
10.5 12 14 22 29 34 44 58 91.2 0.01 0.1 1
√s [GeV] z π±
TASSO
π±
BELLE TOPAZ
π±
BABAR TPC SLD
π±
ALEPH DELPHI OPAL
0.01 0.1
z K±
Z.Phys. C17 (1983) 5-15 Z.Phys. C42 (1989) 189
X.Q. Lu, Ph.D. thesis, Johns Hopkins U., 1986
1 1 1 1 0.01 0.1 1
z K±
0.01 0.1 1
z p/p
20 30 40 50 60 70 80 90 100
√s [GeV]
10 20 30 40 50 60 70 80 90 100
√s [GeV]
10 20 30 40 50 60 70 80 90 100
√s [GeV]
10 20 30 40 50 60 70 80 90 100
√s [GeV]
1
π± π± π± π±
ALEPH DELPHI OPAL
0.01 0.1 1
z K±
0.01 0.1 1
z p/p
Fit quality increasingly better going from LO to NNLO: substantial from LO to NLO, more moderate from NLO to NNLO. NNLO corrections are anyway beneficial (particularly for pions).
Fit quality increasingly better going from LO to NNLO: substantial from LO to NLO, more moderate from NLO to NNLO. NNLO corrections are anyway beneficial (particularly for pions). Tension between BELLE and BABAR for kaons and protons:
higher-order corrections.
Fit quality increasingly better going from LO to NNLO: substantial from LO to NLO, more moderate from NLO to NNLO. NNLO corrections are anyway beneficial (particularly for pions). Tension between BELLE and BABAR for kaons and protons:
higher-order corrections. Anomalously small χ2 for BELLE: possible underestimate of the uncorrelated systematic uncertainty.
Possible tension also between DELPHI inclusive and the other experiments at MZ:
Fit quality increasingly better going from LO to NNLO: substantial from LO to NLO, more moderate from NLO to NNLO. NNLO corrections are anyway beneficial (particularly for pions). Tension between BELLE and BABAR for kaons and protons:
higher-order corrections. Anomalously small χ2 for BELLE: possible underestimate of the uncorrelated systematic uncertainty.
10-4 10-3 10-2 10-1 100 101 0.2 0.4 0.6 0.8
z
dσ/dz [GeV] BELLE
NNLO theory π± K± p/p
0.4 0.6 0.8
z
(1/σtot) dσ/dz BABAR
0.8 1 1.2 BELLE data/theory (NNLO) 0.8 1 1.2 BELLE 0.8 1 1.2 BELLE 0.8 1 1.2 BABAR (prompt) 0.8 1 1.2 BABAR (conventional) 0.2 0.4 0.6 0.8 1 0.8 1 1.2
z
BABAR (prompt)
Data/Theory comparison for BELLE and BABAR using NNFF1.0 at NNLO: the bands indicate the 1-σ uncertainty. Very good description in the region not excluded by the kinematic cuts (shaded areas). Different trend of the data at low z for kaons and particularly for protons: possible reason of the worsening of the χ2.
Data/Theory comparison for the experiments at MZ using NNFF1.0 at NNLO. Very good description in the region allowed by the kinematic cuts. Often also the data excluded by the cuts are well described. The predictions for pions for DELPHI overshoot the data:
compared to the other experiments at MZ.
10-2 10-1 100 101 102
(1/σtot) dσ/dz ALEPH
NNLO theory π± K± p/p
DELPHI
10-2 10-1 100 101 102 0.01 0.1
z
(1/σtot) dσ/dz OPAL
0.01 0.1
z
(1/σtot) dσ/dz SLD
0.6 1 1.4 data/theory (NNLO) ALEPH (inclusive) 0.6 1 1.4 ALEPH (inclusive) 0.6 1 1.4 ALEPH (inclusive) 0.6 1 1.4 DELPHI (inclusive) 0.6 1 1.4 DELPHI (inclusive) 0.6 1 1.4 DELPHI (inclusive) 0.6 1 1.4 OPAL (inclusive) 0.6 1 1.4 OPAL (inclusive) 0.6 1 1.4 SLD (inclusive) 0.6 1 1.4 SLD (inclusive) 0.01 0.1 1 0.6 1 1.4
z
SLD (inclusive)
3 6 9 12
zDu+
π±
(z,Q)
2 4 6 8
zDc+
π±
(z,Q) zDd++s+
π± (z,Q)
Q = 10 GeV
LO NLO NNLO
zDg
π±
(z,Q) zDb+
π±
(z,Q)
1 2
Ratio to LO
1 2 1 2
Ratio to NLO
1 2 0.1 1
z
0.1 1
z
0.1 1
z
0.5 1 1.5 2
zDu+
K±
(z,Q)
0.5 1 1.5 2
zDc+
K±
(z,Q) zDd++s+
K± (z,Q)
Q = 10 GeV
LO NLO NNLO
zDg
K±
(z,Q) zDb+
K±
(z,Q)
1 2
Ratio to LO
1 2 1 2
Ratio to NLO
1 2 0.1 1
z
0.1 1
z
0.1 1
z
0.5 1 1.5 2
zDu+
p/p
0.25 0.5 0.75 1
zDc+
p/p
zDd++s+
p/p
Q = 10 GeV
LO NLO NNLO
zDg
p/p
zDb+
p/p
1 2
Ratio to LO
1 2 1 2
Ratio to NLO
1 2 0.1 1
z
0.1 1
z
0.1 1
z
More sizeable differences between NNFF1.0 and JAM16: the d+ + s+ and gluon distributions well beyond 1-σ, generally smaller uncertainties of the JAM16 distributions (despite similar dataset).
3 6 9 12
zDu+
π±
(z,Q)
2 4 6 8
zDc+
π±
(z,Q) zDd++s+
π± (z,Q)
Q = 10 GeV, NLO
NNFF1.0 1σ DEHSS 68% CL JAM 1σ
zDg
π±
(z,Q) zDb+
π±
(z,Q)
1 2
Ratio to NNFF1.0
1 2 1 2
Ratio to NNFF1.0
1 2 0.1 1
z
0.1 1
z
0.1 1
z
Compare our results against the most recent FF sets: DEHSS
JAM16
Only available for pions and kaons at NLO. Pions: fair agreement between NNFF1.0 and DEHSS: differences u+ and gluon distributions at the 1-σ level.
0.5 1 1.5 2
zDu+
K±
(z,Q)
0.5 1 1.5 2
zDc+
K±
(z,Q) zDd++s+
K± (z,Q)
Q = 10 GeV, NLO
NNFF1.0 1σ DEHSS 68% CL JAM 1σ
zDg
K±
(z,Q) zDb+
K±
(z,Q)
1 2
Ratio to NNFF1.0
1 2 1 2
Ratio to NNFF1.0
1 2 0.1 1
z
0.1 1
z
0.1 1
z
larger differences in the uncertainties, particularly for the gluon distributions. Compare our results against the most recent FF sets: DEHSS
JAM16
Only available for pions and kaons at NLO. Kaons: more substantial differences: fair agreement only for b+.
this data includes, not only pions, kaons and protons, but also heavier (and less abundant) charged hadrons.
TASSO, TPC, ALEPH, DELPHI, OPAL, SLD.
ALEPH, DELPHI, OPAL.
this data provides a strong handle on the gluon distribution. As a consequence it is not possible to go beyond NLO (i.e. O(αs2)) yet.
dramatic reduction of the uncertainty, better agreement with CMS data.
1 2 3 4 5 6 7 1 10 100 E d3σ/d3p [Theory / Data] pT [GeV] CMS charged particle differential cross section at 2.76 TeV for |η|<1 Data NNFF1.0 NNFF1.0 (no FL) DSS07
based on SIA data, provided at LO, NLO, and NNLO, for π±, K± and p/p, faithful uncertainty estimate (validated by closure tests), remarkable perturbative convergence, acceptable agreement with other determinations.
Laeder et al. [ArXiv:1103.5979]
Extraction of the longitudinally polarised parton distribution functions:
In the presence of semi-inclusive DIS data the strange quark distribution is very sensitive to the choice of the FF set used in the analysis. Even fitting PDFs and FFs simultaneously does no lead to a definitive answer.
5 10 15 20 25 30
zDh±
Σ (z,Q2)
Q2=M2
Z
NNPDF DSS
0.4 1 1.6 0.01 0.1 z
NNPDF/DSS
(5×) zDh±
g (z,Q2)
0.01 0.1 z 2 4
zDh±
c+ (z,Q2)
0.4 1 1.6
NNPDF/DSS
2 4
zDh±
b+ (z,Q2)
0.01 0.1 1 0.4 1 1.6 z
NNPDF/DSS
helps implement physical constraints (e.g. fi(1) = 0 and integrability), determines the behaviour in the extrapolation regions, facilitates the task of the neural network making the fit easier.
0.5 1 1.5 2 2.5
5 10 x g0(x) g1(x)
ξ(j)
i
= g
(j-1)th layer
X
k
ξ(j−1)
k
ω(j)
ki − θ(j) i
!
Saturating function g0(x) = 1 1 + e−x Non-saturating function g1(x) = sign(x) ln(|x| + 1) Removing the preprocessing functions requires a proper choice of the activation function of the neural networks:
θ(1)
1
θ(1)
2
θ(1)
3
θ(2)
1
θ(2)
2
θ(3)
1
ω(1)
11
ω(1)
12
ω(1)
13
ω(2)
11
ω(2)
12
ω(2)
21
ω(2)
22 ω(2) 31
ω(2)
32
ω(3)
11
ω(3)
21
Input Hidden Hidden Output
Saturating function g0(x) = 1 1 + e−x Non-saturating function g1(x) = sign(x) ln(|x| + 1) Removing the preprocessing functions requires a proper choice of the activation function of the neural networks:
θ(1)
1
θ(1)
2
θ(1)
3
θ(2)
1
θ(2)
2
θ(3)
1
ω(1)
11
ω(1)
12
ω(1)
13
ω(2)
11
ω(2)
12
ω(2)
21
ω(2)
22 ω(2) 31
ω(2)
32
ω(3)
11
ω(3)
21
Input Hidden Hidden Output
ξ(j)
i
= g
(j-1)th layer
X
k
ξ(j−1)
k
ω(j)
ki − θ(j) i
!
xDg(x,Q2)
Data region
Saturating function g0(x) = 1 1 + e−x Non-saturating function g1(x) = sign(x) ln(|x| + 1)
ξ(j)
i
= g
(j-1)th layer
X
k
ξ(j−1)
k
ω(j)
ki − θ(j) i
!
xDg(x,Q2)
Data region
θ(1)
1
θ(1)
2
θ(1)
3
θ(2)
1
θ(2)
2
θ(3)
1
ω(1)
11
ω(1)
12
ω(1)
13
ω(2)
11
ω(2)
12
ω(2)
21
ω(2)
22 ω(2) 31
ω(2)
32
ω(3)
11
ω(3)
21
Input Hidden Hidden Output
Saturating function g0(x) = 1 1 + e−x Non-saturating function g1(x) = sign(x) ln(|x| + 1)
ξ(j)
i
= g
(j-1)th layer
X
k
ξ(j−1)
k
ω(j)
ki − θ(j) i
!
xDg(x,Q2)
Data region
θ(1)
1
θ(1)
2
θ(1)
3
θ(2)
1
θ(2)
2
θ(3)
1
ω(1)
11
ω(1)
12
ω(1)
13
ω(2)
11
ω(2)
12
ω(2)
21
ω(2)
22 ω(2) 31
ω(2)
32
ω(3)
11
ω(3)
21
Input Hidden Hidden Output
How do we know whether our fitting strategy is reliable?
1) Assume underlying FFs are known (e.g. HKNS07). 2) Generate pseudo-data with given statistical and correlated systematics. 3) Perform a fit and compare to the “truth”.
If needed, use the closure tests to tune the fitting algorithm. Levels of closure tests: NNPDF Collaboration [arXiv:1410.8849]
level 0:
data point central values equal to the HKNS07 “true” values, uncertainties assumed equal to the experimental ones, we must find χ2 ~ 0 and that uncertainty on predictions tends to zero.
level 2:
data points obtained as random fluctuations with exp. covariance matrix about the “truth”, generate Monte Carlo replicas of this data, fit a PDF set to each Monte Carlo replica, we must find χ2 ~ 1 and that HKNS07 “true” FFs are within the 1-σ band.
Predictions coincide with the data central values. Prediction uncertainties shrink to zero. FFs in the data region very close to the “truth”. Uncertainties blow up in the extrapolation region.
kinematic cut
xDg(x,Q2)
Predictions coincide with the data central values. Prediction uncertainties shrink to zero. FFs in the data region very close to the “truth”. Uncertainties blow up in the extrapolation region.
kinematic cut
xDg(x,Q2)
xDu(x,Q2) xDg(x,Q2)
kinematic cut kinematic cut
χ2 to data and PDF uncertainty on predictions tend to zero as the training length increases. LEVEL 0 closure test works!
50
Fits produced with increasing (fixed) training length. All fits on the same dataset (NNPDF2.3 dataset).
χ2 tends to χ2MSTW (0.96) and the MSTW “true PDFs” are 68% of times within the 1-σ error band. LEVEL 2 closure test works!
51
Fits produced with increasing (fixed) training length. All fits on the same dataset (NNPDF2.3 dataset).
Both LEVEL 0 and LEVEL 2 gave a positive result.
PDF errors in LEVEL 0 closure test reflect the functional uncertainty. PDF errors in LEVEL 2 closure test reflect the data uncertainty.
In the data region: data uncertainty ≫ functional uncertainty In the extrapolation region: data uncertainty ~ functional uncertainty. Prove of the reliability of the NNPDF methodology.
0.6 1 1.4 TASSO12 data/theory (NNLO) π± TASSO14 TASSO22 0.6 1 1.4 TASSO34 TASSO44 TPC (inclusive) 0.2 1 1.8 TPC (uds tagged) TPC (c tagged) TPC (b tagged) 0.6 1 1.4 TOPAZ DELPHI (uds tagged) DELPHI (b tagged) 0.6 1 1.4 0.01 0.1
z
SLD (uds tagged) 0.01 0.1
z
SLD (c tagged) 0.01 0.1 1
z
SLD (b tagged)
0.2 1 1.8 TASSO12 data/theory (NNLO) p/p
TASSO22 0.2 1 1.8 TASSO30 TASSO34 TPC (inclusive) 0.2 1 1.8 TOPAZ DELPHI (uds tagged) DELPHI (b tagged) 0.2 1 1.8 0.01 0.1
z
SLD (uds tagged) 0.01 0.1
z
SLD (c tagged) 0.01 0.1 1
z
SLD (b tagged)
0.2 1 1.8 TASSO12 data/theory (NNLO) K± TASSO14 0.2 1 1.8 TASSO22 TASSO34 TPC (inclusive) 0.2 1 1.8 TOPAZ DELPHI (uds tagged) DELPHI (b tagged) 0.2 1 1.8 0.01 0.1
z
SLD (uds tagged) 0.01 0.1
z
SLD (c tagged) 0.01 0.1 1
z
SLD (b tagged)