Markus Frank (CERN) & Albert Puig (UB)
Outline:
- An opportunity (Motivation)
- Adopted approach
- Implementation specifics
- Status
- Conclusions
[Diagram: Readout Network feeding the online cluster (event selection, CPU farm), with the data logging facility and storage]
~16000 CPU cores foreseen (~1000 boxes).
Environmental constraints:
- Space limit: 2000 1U boxes.
- Cooling/power limit: 50 x 11 kW.
Computing power equivalent to that provided by all Tier-1s to LHCb.
Storage system:
- 40 TB installed.
- 400-500 MB/s.
Significant idle time of the farm:
- During the LHC winter shutdown (~months).
- During beam periods, experiment and machine downtime (~hours).
Could we use it for reconstruction?
+ The farm is fully LHCb controlled.
+ Good internal network connectivity.
- Slow disk access (only fast for a very few nodes, via a Fibre Channel interface).
Background information:
+ 1 file (2 GB) contains 60,000 events.
+ It takes 1-2 s to reconstruct an event.
Cannot reprocess à la Tier-1 (1 file per core):
- Cannot perform reconstruction in short idle periods: each file takes 1-2 s/evt * 60k evt ~ 1 day.
- Insufficient storage, or CPUs not used efficiently:
  Input: 32 TB (16000 files * 2 GB/file)
  Output: ~44 TB (16000 files * 60k evt * 50 kB/evt)
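A quick back-of-the-envelope check of these figures (a Python sketch; only the numbers quoted above are assumed):

    # Sizing check for Tier-1-style reprocessing on the online farm
    N_FILES = 16000
    EVENTS_PER_FILE = 60000
    SEC_PER_EVENT = (1.0, 2.0)       # reconstruction time per event
    FILE_SIZE_GB = 2.0
    OUT_EVENT_KB = 50.0              # reconstructed event size

    # One core working through one whole file (the Tier-1 model):
    hours = [s * EVENTS_PER_FILE / 3600.0 for s in SEC_PER_EVENT]
    print("time per file: %.0f-%.0f h" % (hours[0], hours[1]))     # ~17-33 h, i.e. ~1 day

    input_tb = N_FILES * FILE_SIZE_GB / 1000.0                     # 32 TB
    output_tb = N_FILES * EVENTS_PER_FILE * OUT_EVENT_KB / 1e9     # ~48 TB (~44 TiB, the "~44 TB" above)
    print("input: %.0f TB, output: %.0f TB" % (input_tb, output_tb))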
A different approach is needed: a distributed reconstruction architecture.
Files are split into events and distributed to many cores, which perform the reconstruction:
- First idea: full parallelization (1 file / 16k cores).
  Reconstruction time: 4-8 s per file, but full speed is not reachable (only one file open!).
- Practical approach: split the farm into slices of subfarms (1 file / n subfarms), as sketched below.
  Example: 4 concurrent open files yield a reconstruction time of ~30 s/file.
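A minimal sketch of the slicing arithmetic (core count, events per file and per-event time are the figures quoted earlier; the function name is illustrative):

    # Per-file reconstruction time when the farm is sliced into groups of
    # subfarms, each group working on one open file at a time.
    TOTAL_CORES = 16000
    EVENTS_PER_FILE = 60000
    SEC_PER_EVENT = 2.0          # pessimistic end of the 1-2 s/event range

    def time_per_file(concurrent_files):
        cores_per_file = TOTAL_CORES // concurrent_files
        return EVENTS_PER_FILE * SEC_PER_EVENT / cores_per_file

    for n in (1, 4, 16):
        print("%2d open file(s): ~%.0f s/file" % (n, time_per_file(n)))
    # 1 open file  -> ~8 s/file (but a single input stream cannot feed 16k cores)
    # 4 open files -> ~30 s/file, the example quoted above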
[Architecture diagram]
- ECS: control and allocation of resources.
- Reco Manager + database: job steering.
- DIRAC: connection to the LHCb production system.
- Storage switch, storage nodes and subfarms (the farm).
See A. Tsaregorodtsev's talk, "DIRAC3 - the new generation of the LHCb grid software".
Control using the standard LHCb ECS software:
- Reuse of existing components for storage and subfarms.
- New components for reconstruction tree management.
- See Clara Gaspar's talk (LHCb Run Control System).
Allocate, configure, start/stop resources (storage and subfarms).
Task initialization is slow, so tasks don't restart on file change:
- Idea: tasks sleep during data-taking and are only restarted on configuration change.
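A toy illustration of keeping tasks alive between files instead of paying the initialization cost each time (pure sketch; the real control is done with the LHCb ECS/PVSS framework, not with this code):

    import threading

    class RecoTask:
        """Worker task that is expensive to initialize, so it sleeps
        between files instead of being restarted."""
        def __init__(self):
            self._awake = threading.Event()   # cleared -> task sleeps
            self._initialize()                # slow setup, paid only once

        def _initialize(self):
            pass                              # e.g. load geometry, conditions, ...

        def pause(self):                      # while data-taking is active
            self._awake.clear()

        def resume(self):                     # when idle time becomes available
            self._awake.set()

        def process(self, event):
            self._awake.wait()                # block while paused, no re-initialization needed
            return event                      # placeholder for the real reconstruction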
[Subfarm diagram: PVSS control, x 50 subfarms, 1 control PC each, 4 PCs each, 8 cores/PC; 1 Reco task per core plus data management tasks (Input, Output); event (data) processing on the cores, event (data) management towards the target node]
Data processing block (Processing Node):
- Producers put events into a buffer manager (MBM).
- Consumers receive events from the MBM.
Data transfer block (Source Node -> Target Node):
- Senders access events from the MBM.
- Receivers get the data and declare it to the MBM.
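A minimal sketch of this producer/consumer pattern (a Python queue stands in for the MBM, which is the LHCb online buffer manager, not this code):

    import queue, threading

    mbm = queue.Queue(maxsize=100)          # stand-in for the buffer manager (MBM)

    def producer(events):
        for ev in events:
            mbm.put(ev)                     # producers put events into the MBM

    def consumer():
        while True:
            ev = mbm.get()                  # consumers receive events from the MBM
            if ev is None:                  # sentinel: no more events
                break
            reconstruct(ev)

    def reconstruct(ev):
        pass                                # placeholder for the real work (e.g. Brunel)

    t = threading.Thread(target=consumer)
    t.start()
    producer(range(10))
    mbm.put(None)
    t.join()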
[Data flow diagram: on the storage nodes, a Storage Reader (Input) and a Storage Writer (Output); on each worker node, a Receiver, one Brunel reconstruction task per core, and a Sender; events flow Storage Reader -> Sender -> Receiver -> Brunel -> Sender -> Receiver -> Storage Writer]
Granularity is at the file level:
- Individual event flow is handled automatically by the allocated resource slices.
Reconstruction requires specific actions in a specific order:
- Each file is treated as a Finite State Machine (FSM).
- States: TODO, PREPARING, PREPARED, PROCESSING, DONE, ERROR.
Reconstruction information is stored in a database:
- System status.
- Protection against crashes.
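A minimal sketch of such a per-file state machine (state names from the slide; the transitions and method names are assumptions for illustration):

    class FileFSM:
        """One finite state machine per file to be reconstructed."""
        STATES = ("TODO", "PREPARING", "PREPARED", "PROCESSING", "DONE", "ERROR")
        # assumed happy-path transitions; any state may instead fall into ERROR
        NEXT = {"TODO": "PREPARING", "PREPARING": "PREPARED",
                "PREPARED": "PROCESSING", "PROCESSING": "DONE"}

        def __init__(self, lfn):
            self.lfn = lfn                # logical file name
            self.state = "TODO"

        def advance(self):
            self.state = self.NEXT.get(self.state, self.state)
            return self.state

        def fail(self):
            self.state = "ERROR"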
Job steering is done by a Reco Manager:
- Holds each FSM instance and moves it through the states based on feedback from the static resources.
- Sends commands to the readers and writers: which files to read and which filenames to write to.
- Interacts with the database.
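A sketch of what this steering could look like, reusing the FileFSM above (the db/reader/writer interfaces and method names are purely illustrative, not the actual implementation):

    class RecoManager:
        """Steers one FileFSM per file, driven by feedback from the resources."""
        def __init__(self, db, reader, writer):
            self.db, self.reader, self.writer = db, reader, writer
            self.files = {}                          # lfn -> FileFSM

        def add_file(self, lfn, output_name):
            self.files[lfn] = FileFSM(lfn)
            self.reader.open(lfn)                    # tell the reader which file to read
            self.writer.create(output_name)          # and the writer which filename to write to
            self.db.save(lfn, "TODO")                # persist the state: protection against crashes

        def on_feedback(self, lfn, ok):
            fsm = self.files[lfn]
            fsm.advance() if ok else fsm.fail()      # TODO -> ... -> DONE, or ERROR
            self.db.save(lfn, fsm.state)             # keep the database in sync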
The Online Farm will be treated like a CE connected to the LHCb Production system:
- Reconstruction is formulated as DIRAC jobs and managed by DIRAC WMS Agents.
- DIRAC interacts with the Reco Manager through a thin client, not directly with the DB.
- Data transfer in and out of the Online Farm is managed by DIRAC DMS Agents.
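One way such a thin client could look (entirely hypothetical sketch: the transport, URL and method names are placeholders, since the interface was not yet implemented at the time of this talk):

    import xmlrpc.client     # hypothetical transport choice

    class RecoManagerClient:
        """Thin client for DIRAC agents: talks to the Reco Manager,
        never to its database directly."""
        def __init__(self, url="http://reco-manager.example:8080"):   # placeholder URL
            self._proxy = xmlrpc.client.ServerProxy(url)

        def submit_file(self, lfn):
            return self._proxy.submit(lfn)     # ask the Reco Manager to queue a file

        def status(self, lfn):
            return self._proxy.status(lfn)     # TODO / PREPARING / ... / DONE / ERROR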
Current performance is constrained by hardware:
- Reading from disk: ~130 MB/s; the Fibre Channel link is saturated with 3 readers (one reader saturates a CPU at 45 MB/s).
- Test with a dummy reconstruction (just copying data from input to output): stable throughput of 105 MB/s, constrained by the Gigabit network; an upgrade to 10 Gigabit is planned.
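A quick consistency check of these throughput figures (numbers from the measurements above; the interpretation is ours):

    fc_read_limit  = 130.0   # MB/s, reading from disk over Fibre Channel
    reader_cpu_max = 45.0    # MB/s, one reader process saturates a CPU
    gigabit_limit  = 105.0   # MB/s, stable end-to-end throughput observed

    print("~%.1f readers saturate the FC link" % (fc_read_limit / reader_cpu_max))  # ~2.9 -> 3 readers
    # With ~105 MB/s end to end, the network (not the disk) is the bottleneck,
    # hence the planned 10 Gigabit upgrade of the storage nodes.
    print("network-bound" if gigabit_limit < fc_read_limit else "disk-bound")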
Resource handling and the Reco Manager are implemented.
Integration into the LHCb Production system was recently decided, but is not implemented yet.
Software:
- Pending implementation of the thin client for interfacing with DIRAC.
Hardware:
- Network upgrade to 10 Gbit in the storage nodes before summer.
- More subfarms and PCs to be installed: from the current ~4800 cores to the planned 16000.