Event t Reconst r t ruct i t ion
- n Modern and Fut u
Ivan Kisel GSI, Darmstadt MPI, Munich, 16 November 2010 1000 - - PowerPoint PPT Presentation
Event t Reconst r t ruct i t ion on Modern and Fut u t ure Com put e t er Archit e t ect u t ures Ivan Kisel GSI, Darmstadt MPI, Munich, 16 November 2010 1000 charged particles/collision Rekonstruktionsherausforderung im
16 November 2010, MPI Munich Ivan Kisel, GSI 2/17
Vocabulary: Collision = Event Trajectory = Track Measurement = Hit
Beam Target Silicon Detector
CBM experiment at FAIR/GSI
16 November 2010, MPI Munich Ivan Kisel, GSI 3/28
Research Center Accelerator (GeV) Experiment Physics
SLAC, USA PEP-II, e- x e+ (9 x 3.1) BaBar B-Physics Fermilab, USA Tevatron, p x p (1000 x 1000) D0 Universal CDF Universal BNL, USA RHIC, Heavy Ions PHENIX Quark-Gluon-Plasma STAR Quark-Gluon-Plasma KEK, Japan KEK-B, e- x e+ (8 x 3.5) BELLE B-Physics CERN, Switzerland LHC, p x p (7000 x 7000) ATLAS Universal CMS Universal ALICE Quark-Gluon-Plasma LHCb B-Physics DESY, Germany HERA, e+/- x p (27.5 x 920) ZEUS Proton-Physics H1 Proton-Physics HERMES Spin-Physics HERA-B B-Physics FAIR/GSI, Germany SIS 100/300, p, Heavy Ions PANDA Quark-Physics CBM Quark-Gluon-Plasma
3/28
ALI CE (CERN)
16 November 2010, MPI Munich Ivan Kisel, GSI 4/28
16 November 2010, MPI Munich Ivan Kisel, GSI 5/28
16 November 2010, MPI Munich Ivan Kisel, GSI 6/28
Magnet Muon Chambers Silicon Detector Electromagnetic Calorimeter Hadron Calorimeter
16 November 2010, MPI Munich Ivan Kisel, GSI 7/28
Time consuming!!! Kalman Filter Kalman Filter Combinatorics
16 November 2010, MPI Munich Ivan Kisel, GSI 8/28
Track finding: Wich hits in detector belong to the same track? – Cellular Automaton (CA)
1 2 3 4
Cellular Automaton:
Perfect for many-core CPU/GPU !
Detector layers Hits
1000 Hits 1000 Tracks Cellular Automaton:
estimate a possible position on a track.
collect segments into track candidates.
16 November 2010, MPI Munich Ivan Kisel, GSI 9/28
Track fit: Estimation of the track parameters at one or more hits along the track – Kalman-Filter (KF) Detector layers Hits
r – Track parameters C – Precision Initialising Prediction Correction Precision
1 2 3
Position, direction and momentum
Nowadays the Kalman-Filter is used in almost all HEP experiments
Kalman Filter:
KF as a recursive least squares method KF Block-diagram
1 2 3
16 November 2010, MPI Munich Ivan Kisel, GSI 10/28
Hough Transformation Kalman Filter Cellular Automaton
Extremely low resolution and efficiency
16 November 2010, MPI Munich Ivan Kisel, GSI 11/28
The reconstruction package CATS based on the Cellular Automaton for track finding and the Kalman Filter for track fitting
(SUSi, HOLMES, L2Sili, OSCAR, RANGER, TEMA) based on traditional methods in efficiency, accuracy and speed
Tracking quality Time consumption
Ninel x 50 tracks Time/event (sec)
Tracking efficiency
Ninel x 50 tracks Efficiency
16 November 2010, MPI Munich 12/18 Ivan Kisel, GSI
16 November 2010, MPI Munich Ivan Kisel, GSI 13/28
2000
Cores and Threads realize the task level of parallelism
2010 2015
Thread1 Thread2 … … exe r/w r/w exe exe r/w ... ...
Vectors (SIMD) = data level of parallelism
D S S S S
SIMD = Single Instruction, Multiple Data
Fundamental redesign
is necessary HEP: cope with high data rates !
16 November 2010, MPI Munich Ivan Kisel, GSI 14/28
63% of the maximal GPU utilization (ALICE)
70% of the maximal Cell performance (CBM) Cooperation with Intel (ALICE/CBM)
6.5 ms/event (CBM)
Future systems are heterogeneous
16 November 2010, MPI Munich Ivan Kisel, GSI 15/28
Vector classes: Cooperation with the Intel Ct group
16 November 2010, MPI Munich Ivan Kisel, GSI 16/28
Vector classes enable easy vectorization of complex algorithms
Vector classes overload scalar C operators with SIMD/SIMT extensions
16 November 2010, MPI Munich Ivan Kisel, GSI 17/28
(85% combinatorial space points) Track reconstruction in STS/MVD and displaced vertex search are required in the first trigger level. Reconstruction packages:
Cellular Automaton (CA)
Kalman Filter (KF)
KF Particle
16 November 2010, MPI Munich Ivan Kisel, GSI 18/28
Parameterization
December 21, 1968. The Apollo 8 spacecraft has just been sent on its way to the Moon.
003:46:31 Collins: Roger. At your convenience, would you please go P00 and Accept? We're going to update to your W-matrix.
Optimization
1 2 3
KF was considerably reworked
16 November 2010, MPI Munich Ivan Kisel, GSI 19/28
Motivated by, but not restricted to Cell !
blade11bc4 @IBM, Böblingen: 2 Cell Broadband Engines with 256 kB Local Store at 2.4 GHz
10000x faster
The KF speed was increased by 5 orders of magnitude
16 November 2010, MPI Munich Ivan Kisel, GSI 20/28
scalar double single -> 2 4 8 16 32 1.00 10.00 0.10 0.01
Scalability on different CPU architectures – speed-up 100 Data Stream Parallelism (10x) Task Level Parallelism (100x)
2xCell SPE (16 ) Woodcrest ( 2 ) Clovertown ( 4 ) Dunnington ( 6 )
SIMD Cores and Threads
Time/Track, µs Threads Cores Threads SIMD Real-time performance on different Intel CPU platforms Real-time performance on NVIDIA GPU graphic cards
The Kalman Filter Algorithm performs at ns level
CBM Progr. Rep. 2008
16 November 2010, MPI Munich Ivan Kisel, GSI 21/28
Top view Front view Efficiency Scalability
Highly efficient reconstruction of 150 central collisions per second
Intel X5550, 2x4 cores at 2.67 GHz
16 November 2010, MPI Munich Ivan Kisel, GSI 22/28
60 000 Cores Sverre Jarp
Farm PC CPU Farm Sub-Farm PC CPU/GPU Socket Core Thread Vector
monitoring the farm
high-level parallelism
low-level parallelism
Big Bang
16 November 2010, MPI Munich Ivan Kisel, GSI 23/28
45 participants from Austria, China, Germany, India, Italy, Norway, Russia, Switzerland, UK and USA
16 November 2010, MPI Munich Ivan Kisel, GSI 24/28
t 1990 2000 2010 t 1990 2000 2010
Consolidate efforts of:
Software redesign can be synchronized between the experiments
16 November 2010, MPI Munich Ivan Kisel, GSI 25/28
Different experiments have similar reconstruction problems
CBM (FAI R/ GSI ) ALI CE (CERN)
Track reconstruction is the most time consuming part of the event reconstruction, therefore many-core CPU/GPU platforms. Track finding is based in both cases on the Cellular Automaton method, track fitting – on the Kalman Filter method.
NVIDIA GPU 240 cores (ALICE HLT Group) Intel CPU 8 cores (CBM Reco Group)
107 collisions/s Collider Fixed-Target Forward geometry Cylindrical geometry 104 collisions/s
16 November 2010, MPI Munich Ivan Kisel, GSI 26/28
Track finding Track fitting Vertex finding/fitting Ring finding (PID)
Time consuming!!! Kalman Filter Kalman Filter Combinatorics
Detector/geometry independent RICH specific Track model dependent Detector dependent
16 November 2010, MPI Munich Ivan Kisel, GSI 27/28
Uni-Frankfurt/FIAS: Vector classes GPU implementation GSI: Algorithms development Many-core optimization HEPHY (Vienna)/Uni-Gjovik: Kalman Filter track fit Kalman Filter vertex fit OpenLab (CERN): Many-core optimization Benchmarking Intel: Ct implementation Many-core optimization Benchmarking
16 November 2010, MPI Munich Ivan Kisel, GSI 28/28
Follow-up Workshop: 10-11 March 2011 at CERN contact Sverre Jarp