BERKELEY LAB
Office of Science 1
GNNs for HL-LHC Tracking
ExaTrkX @ Berkeley Lab
Daniel Murnane
GNNs for HL-LHC Tracking ExaTrkX @ Berkeley Lab Daniel Murnane - - PowerPoint PPT Presentation
GNNs for HL-LHC Tracking ExaTrkX @ Berkeley Lab Daniel Murnane Office of BERKELEY LAB 1 Science Goal Sub-second processing of HL-LHC hit data into: Seeds (i.e. triplets) for further processing with traditional techniques, AND/OR
BERKELEY LAB
Office of Science 1
ExaTrkX @ Berkeley Lab
Daniel Murnane
BERKELEY LAB
Office of Science 2
Goal
Sub-second processing of HL-LHC hit data into:
techniques, AND/OR
BERKELEY LAB
Office of Science 3
The Current Pipeline
Raw hit data embedded Filter likely, adjacent doublets Train/classify doublets in GNN Filter, convert to triplets Train/classify tripets in GNN Apply cut for seeds DBSCAN for track labels
BERKELEY LAB
Office of Science 4
Dataset
Competition” dataset
100,000 hits of around 10,000 particles
BERKELEY LAB
Office of Science 5
Dataset
“TrackML score” 𝑇 ∈ 0,1
track labelled with same unique label ⇒ 𝑇 = 1
test case, and ignore noise
BERKELEY LAB
Office of Science 6
Embedding + MLP Construction
1.
For each hit in event, embed features (co-ordinates, cell direction data, etc.) into N- dimensional space
2.
Associate hits from same tracks as close in N-dimensional distance
3.
Score each hit within embedding neighbourhood against the “seed” hit at centre
4.
Filter by score, to create a set of doublets for the neighbourhood
5.
All doublets in event generate a graph, converted to a directed graph (by ordering layers)
BERKELEY LAB
Office of Science 7
Segmentation
Hard cut One-directional soft cut Bi-directional soft cut
A full graph from the embedding does not fit on a single GPU. Therefore the event graphs are segmented, according to how large the GNN model is expected to be.
BERKELEY LAB
Office of Science 8
Previous ML Approaches
8
BERKELEY LAB
Office of Science 9
Graph Neural Network for Edge Classification
BERKELEY LAB
Office of Science 10
Passing information around the graph gives it learning power
“aware” of its neighbours by concatenating the neighbouring hidden features
neighbourhood learning passes information around the graph
generalisation of a flat CNN convolution
BERKELEY LAB
Office of Science 11
11 11
with Recursion
GNN Edge prediction architecture
with Residuals + +
BERKELEY LAB
Office of Science 12
12 12
with Recursion
GNN Edge prediction architecture
with Residuals + +
Have found best efficiency & purity performance.
BERKELEY LAB
Office of Science 13
Edge attention architecture
13 13 x n iterations (hyperparameter) 𝑦 𝑧 𝑨
BERKELEY LAB
Office of Science 14
14 14 𝑦 𝑧 𝑨
ℎ1 … ℎ𝑜
x n iterations
Edge attention architecture
BERKELEY LAB
Office of Science 15
15 15 x n iterations
ℎ1 … ℎ𝑜 ℎ1 … ℎ𝑜
1
ℎ1 … ℎ𝑜
0,1
Edge attention architecture
BERKELEY LAB
Office of Science 16
16 16 x n iterations 0.6
ℎ1 … ℎ𝑜
0,1
Edge attention architecture
BERKELEY LAB
Office of Science 17
17 17 x n iterations 0.6 0.4 0.1 0.4 0.9 0.1 0.8 0.8
Edge attention architecture
BERKELEY LAB
Office of Science 18
18 18 x n iterations 0.6 0.4 0.4 0.1 0.8
ℎ1 ℎ2 ℎ3 ℎ4 ℎ5
Edge attention architecture
BERKELEY LAB
Office of Science 19
19 19 x n iterations +
ℎ1
ℎ2 ℎ3
ℎ5
ℎ4ℎ1 … ℎ𝑜
0.6 0.4 0.4 0.1 0.8
Edge prediction architecture
BERKELEY LAB
Office of Science 20
20 20 x n iterations
ℎ1 … ℎ𝑜 ℎ1 … ℎ𝑜
1
ℎ1 … ℎ𝑜
0,1
0.6
Edge attention architecture
BERKELEY LAB
Office of Science 21
21 21 x n iterations 0.9 0.2 0.1 0.2 0.9 0.3 0.9 0.2
ℎ1 … ℎ𝑜
0,1
x n iterations (hyperparameter)
Edge attention architecture
BERKELEY LAB
Office of Science 22
Doublet GNN Performance
Threshold 0.5 0.8 Accuracy 0.9761 0.9784 Purity 0.9133 0.9694 Efficiency 0.9542 0.9052 Two points to keep in mind
that had much lower efficiency than the learned embedding. This GNN is classifying a ∼ 96% efficient doublet dataset
doublets to create triplets without losing efficiency
BERKELEY LAB
Office of Science 23
Why not simply join together our doublet predictions?
23 23
0.99
x1 x2 x3 x4
0.01 0.99 Distance from detector centre
BERKELEY LAB
Office of Science 24
Doublet choice can be ambiguous
24 24
0.99
x1 x2 x3 x4
0.87 0.84 Distance from detector centre
Not so easy… so teach the network how to combine
BERKELEY LAB
Office of Science 25
But a GNN doesn’t know about “triplets”
25 25
?
x1 x2 x3 x4
Distance from detector centre
A GNN only knows about nodes and edge
?
BERKELEY LAB
Office of Science 26
Moving to a “doublet graph” gives us back GNN power
26 26
0.99
x1 x2 x3 x4
0.87 0.84
Now… nodes represent doublets, edges represent triplets
x2 x2
BERKELEY LAB
Office of Science 27
Moving to a “doublet graph” gives us back GNN power
27 27
0.99
x1 x2 x3 x4
0.87 0.84
Now… nodes represent doublets, edges represent triplets
x2 x2
BERKELEY LAB
Office of Science 28
Triplet Propaganda
Threshold 0.5 0.8 Accuracy 0.9761 0.9784 Purity 0.9133 0.9694 Efficiency * relative 0.9542 0.9052
Doublet GNN Triplet GNN
Threshold 0.5 0.8 Accuracy 0.9960 0.9957 Purity 0.9854 0.9923 Efficiency * relative 0.9939 0.9850
BERKELEY LAB
Office of Science 29
Triplet propaganda
Gold: Unambiguously correct triplet or quadruplet Other colours: False positive/negative Key:
Silver: Ambiguously correct triplet or quadruplet (i.e. edge shared by correct triplet and false positive triplet) Bronze dashed: Correct triplet, but missed quadruplet (i.e. edge shared by correct triplet and false negative triplet) Red: Completely false positive triplet Blue dashed: Completely false negative triplet
BERKELEY LAB
Office of Science 30
Gold: Unambiguously correct triplet or quadruplet Other colours: False positive/negative Key:
Silver: Ambiguously correct triplet or quadruplet (i.e. edge shared by correct triplet and false positive triplet) Bronze dashed: Correct triplet, but missed quadruplet (i.e. edge shared by correct triplet and false negative triplet) Red: Completely false positive triplet Blue dashed: Completely false negative triplet
Triplet propaganda
BERKELEY LAB
Office of Science 31
Black: Triplet classifier correctly labelled, doublet classifier mislabelled Red: Doublet classifier correctly labelled, triplet classifier mislabelled In this graph, triplet classifier Fixes 389 edges Worsens 10 edges
Triplet GNN improves doublet GNN results
BERKELEY LAB
Office of Science 32
Seeding: Final Performance
Purity: 99.1% ± 0.07% Efficiency: 88.6% ± 0.19% - This is objective Inference time: ∼ 5 seconds per event per GPU, split between:
BERKELEY LAB
Office of Science 33
Seeding: Next Steps
increasing graph size e.g. For TrackML data:
construction
99% efficiency
𝑃 6,000
to N-plet GNN
BERKELEY LAB
Office of Science 34
Track Labelling
GOAL Given a classified doublet and/or triplet graph, use edge scores to group likely nodes into tracks and label with unique identifier.
BERKELEY LAB
Office of Science 35
DBSCAN on a Graph
neighbourhood density
with each distance element given by 𝑒𝑗𝑘 = 1 − 𝑓𝑗𝑘
directed graph does not perform well with DBSCAN.
BERKELEY LAB
Office of Science 36
DBSCAN Performance
from TrackML data, where every hit is connected to hits of a shared track in adjacent layers, with a high score (e.g. 0.99), and randomly connected to other hits with a low score (e.g. 0.01)
edges to reduce efficiency, or mislabel fake edges to reduce purity
TrackML score against efficiency
score against purity
BERKELEY LAB
Office of Science 37
GNN TrackML Score Performances
0.989
BERKELEY LAB
Office of Science 38
GNN TrackML Score Performances
0.989
truth graph 0.957
BERKELEY LAB
Office of Science 39
GNN TrackML Score Performances
0.989
truth graph 0.957
doublet hits 0.935
Loss from embedding construction 96% efficiency
BERKELEY LAB
Office of Science 40
GNN TrackML Score Performances
0.989
truth graph 0.957
doublet graph using truth 0.935
classification 0.815
Loss from embedding construction 96% efficiency
BERKELEY LAB
Office of Science 41
GNN TrackML Score Performances
doublet graph (truth) 0.846
Loss from embedding construction 96% efficiency Lost doublets
BERKELEY LAB
Office of Science 42
GNN TrackML Score Performances
doublet graph (truth) 0.846
triplet GNN classification 0.815
Loss from embedding construction 96% efficiency Lost doublets
BERKELEY LAB
Office of Science 43
Missing Doublets
All Hits Missing Doublet Hits
𝜃 𝜃 𝜚 𝜚
BERKELEY LAB
Office of Science 44
Missing Doublets
All Hits Missing Doublet Hits
𝜃 𝜃 𝜚 𝜚 Doublets on end of barrel
BERKELEY LAB
Office of Science 45
Missing Doublets
All Hits Missing Doublet Hits
𝜃 𝜃 𝜚 𝜚 Doublets on edge of segments
BERKELEY LAB
Office of Science 46
Stitching
𝜃 𝜚
Pre-clean-up Post-clean-up
BERKELEY LAB
Office of Science 47
Ignoring Fragmented Tracks
E.g. Although most of this track is outside the barrel, we keep the track to challenge the GNN
𝑨 𝑧 𝑦 𝑦
BERKELEY LAB
Office of Science 48
Track Labelling: Final-ish Performance
range (-2.1, 2.1) 0.912
BERKELEY LAB
Office of Science 49
Track Labelling: Final-ish Performance
range (-2.1, 2.1) 0.912
classification in eta (-2.1, 2.1) 0.876
BERKELEY LAB
Office of Science 50
Track Labelling: Final-ish Performance
range (-2.1, 2.1) 0.912
classification in eta (-2.1, 2.1) 0.876
2.1) & no fragments 0.925
BERKELEY LAB
Office of Science 51
Track Labelling: Final-ish Performance
range (-2.1, 2.1) 0.912
classification in eta (-2.1, 2.1) 0.876
2.1) & no fragments 0.925
eta (-2.1, 2.1) & no fragments 0.888
BERKELEY LAB
Office of Science 52
Track Labelling: Final-ish Performance
emulating whole detector (no punishment for tracks crossing detector volumes) recovers almost all missing doublets
improvement areas are now seen: 1. Doublet-to-triplet efficiency, and 2. Embedding construction efficiency
+ 0.015 TrackML score
BERKELEY LAB
Office of Science 53
Summary