The ATLAS EventIndex architecture, design choices, deployment and - PowerPoint PPT Presentation

CHEP 2015 Okinawa – 13-17 April 2015 The ATLAS EventIndex architecture, design choices, deployment and first operation experience Dario Barberis Genoa University/INFN On behalf of the ATLAS Collaboration D. Barberis (1), S.E. Cárdenas Zárate (2), J. Cranshaw (3), A. Favareto (1), Á. Fernández Casaní (4), E. Gallas (5), C. Glasman (6), S. González de la Hoz (4), J. H ř ivná č (7), D. Malon (3), F. Prokoshin (2), J. Salt Cairols (4), J. Sánchez (4), R. Többicke (8), R. Yuan (7) (1) Università di Genova and INFN, Genova, Italy — (2) Universidad Técnica Federico Santa Maria, Valparaíso, Chile — (3) Argonne National Laboratory, Argonne, IL, United States — (4) Instituto de Física Corpuscular (IFIC), University of Valencia and CSIC, Valencia, Spain — (5) University of Oxford, Oxford, UK — (6) Universidad Autónoma de Madrid, Madrid, Spain — (7) LAL, Université Paris-Sud and CNRS/IN2P3, Orsay, France — (8) CERN, Geneva, Switzerland 1 Dario Barberis: ATLAS EventIndex

CHEP 2015 Okinawa – 13-17 April 2015 What is the ATLAS EventIndex? A system designed to be a complete catalogue of ATLAS events ● All events, real and simulated data ■ All processing stages ■ Contents ● Event identifiers (run and event numbers, trigger stream, luminosity block etc.) ■ Trigger patterns ■ References (pointers) to the events at each processing stage (RAW, ESD, (x)AOD, ■ NTUP) in all permanent files on storage generated by the ATLAS Production System (central productions) Size and constraints ● ATLAS collects a few billion real events each year of data taking and generates more ■ than twice that number of simulated events ~350 B/event � 2 TB of raw information (6 TB after internal replication in Hadoop) ■ in the EventIndex only for LHC Run 1 The trigger rate for Run 2 is more than twice that for Run 1 � Simulated data not counted yet � Needs clever storage structure with smart search and retrieve tools ■ 2 Dario Barberis: ATLAS EventIndex

CHEP 2015 Okinawa – 13-17 April 2015 Use cases Event picking Event ¡list ¡ ● Run ¡ Give me the reference (pointer) to Download ¡ ■ number ¡ events ¡ Event ¡ "this" event in "that" format for a number ¡ with ¡DDM ¡ Event ¡list ¡ Trigger ¡ given processing cycle (Rucio) ¡ stream ¡ tools ¡ Data ¡format ¡ Event service GUIDs ¡ ● Processing ¡ (references) ¡ Give me the references for this list cycle ¡ ■ of ¡logical ¡ of events (to be distributed to HPC or files ¡ Event ¡ Process ¡ cloud clusters for processing) selec*on ¡ Pointers ¡to ¡ Central ¡ events ¡on ¡ Run ¡ events ¡ ¡ Technically the same as event picking ■ Event ¡ numbers ¡ the ¡Grid ¡ ¡ within ¡files ¡ Triggers ¡ Index ¡ More info in the talk by T. Wenaus with ¡WM ¡ ■ Condi*ons ¡ Server ¡ system ¡ Data ¡format ¡ (contribution #183) (PanDA) ¡ Processing ¡ Trigger checks and event skimming cycle ¡ ● Count, or give me the list of, events ■ Event ¡ passing "this" selection and their count ¡ Event ¡ request ¡ references sta*s*cs ¡ Run ¡ Display ¡ Production consistency checks number ¡ Matches ¡ ● results ¡ Trigger ¡ Duplicates ¡ Technical checks that processing ■ Data ¡format ¡ Missing ¡ Processing ¡ cycles are complete cycle ¡ 3 Dario Barberis: ATLAS EventIndex

CHEP 2015 Okinawa – 13-17 April 2015 EventIndex Project Breakdown We defined 4 major work areas (or tasks): ● 1) Core architecture 2) Data collection and storage 3) Query services 4) Functional testing and operation; system monitoring Tier-‑0 ¡ Interac*ve ¡ processing ¡ Web ¡ send ¡ query ¡ job ¡ Server ¡ Hadoop ¡ Hadoop ¡ retrieve ¡ send ¡ Interface ¡ Storage ¡ query ¡ Server ¡ Cluster ¡ send ¡ Grid ¡ PanDA ¡ retrieve ¡ processing ¡ Server ¡ job ¡ 4 Dario Barberis: ATLAS EventIndex

CHEP 2015 Okinawa – 13-17 April 2015 Data Collection EventIndex Producer: event processing task which can More info in the talk by J. Sánchez ● run at Tier-0 (initial reconstruction at CERN) or on in this session (contribution #222) Grid sites (downstream processing) Sends event metadata via ActiveMQ message broker to the Hadoop store at CERN ■ EventIndex Consumer: reads the messages from the message broker ● Organizes data into Hadoop MapFile objects ■ Does validation tasks assessing, for example, dataset completeness ■ Flags aborted, obsoleted, invalid data for further action ■ 5 Dario Barberis: ATLAS EventIndex

CHEP 2015 Okinawa – 13-17 April 2015 Storage and Query Services (1) Hadoop was chosen as the storage technology: ● Platform is provided and supported by CERN-IT ■ DDM (Distributed Data Management) project also uses Hadoop ■ Plenty of tools to organise the data, index them internally and search them ■ Showed satisfactory performance in prototype populated with a year of ■ ATLAS data (1 TB in the previous TAGDB in Oracle for 2011 data) Storage Structure: ● Data are stored as mapfiles in HDFS (Hadoop File System) ■ Data is catalogued Hadoop HBase: metadata about HDFS files. ■ Search performance enhanced using keyed indexes based on use cases: ● Searches based on a key give immediate results (seconds) ■ Complex searches use MapReduce (MR) and require 1-2 minutes for typical ■ event collections 6 Dario Barberis: ATLAS EventIndex

CHEP 2015 Okinawa – 13-17 April 2015 Storage and Query Services (2) Typical performance figures for search/ Query Search Base Retrieved Time (s) ● count/retrieve operations on Run1 data: Get Run/ 123492895 1 30 Event Total time depends mainly on the amount of ■ Retrieve all 123492895 123492895 3400 retrieved information (time to write the Count all 123492895 0 290 output file with the search results) Retrieve "count" is always much faster than � with trigger 123492895 939220 142 "retrieve" stream & sw version Count with trigger 123492895 0 130 stream & sw version Retrieve 123492895 41284 204 with GUID Count with 123492895 0 192 GUID Timings measured on the CERN ● More info in the poster by J. H ř ivná č Hadoop cluster with 18 nodes (contribution #221) Search services via CLI and Web ● Service GUI 7 Dario Barberis: ATLAS EventIndex

CHEP 2015 Okinawa – 13-17 April 2015 Trigger Decoding Event-wise trigger decisions ● �� are stored natively in bit �� masks �� Trigger bit to name mapping ● is imported into HBase from �� the Conditions Metadata �� (COMA) database �� This makes decoded trigger �� ● �� decisions available to Event Index users by trigger name �� for event counting or selection To facilitate searches ● �� the names of fired �� triggers per event are stored in Hadoop �� In addition, the data �� ● may be indexed by �� trigger to improve �� performance �� More info in the poster by �� F. Prokoshin (contrib. #220) 8 Dario Barberis: ATLAS EventIndex

CHEP 2015 Okinawa – 13-17 April 2015 System Monitoring Monitors the health of all servers and processes ● involved in the chain: ActiveMQ brokers and Consumers ■ Hadoop cluster and Web servers ■ Contents monitoring under development ● 9 Dario Barberis: ATLAS EventIndex

The ATLAS EventIndex architecture, design choices, deployment and - PowerPoint PPT Presentation

CHEP 2015 Okinawa 13-17 April 2015 The ATLAS EventIndex architecture, design choices, deployment and first operation experience Dario Barberis Genoa University/INFN On behalf of the ATLAS Collaboration D. Barberis (1), S.E. Crdenas

Measuring DNSSEC using RIPE Atlas Kaveh Ranjbar RIPE NCC RIPE Atlas Coverage RIPE Atlas 2

ATLAS Searches for SUSY Chris Young, CERN ATLAS Group What have we not looked for? 1 / 37 ATLAS

ATLAS ROOT I/O pt 2 Atlas Hot Topics (with reference to CHEP presentations) Big data

ATLAS I/O Overview Peter van Gemmeren (ANL) gemmeren@anl.gov for many in ATLAS 8/23/2018 Peter

Top Properties from ATLAS Chris Young (CERN), on behalf of ATLAS 27th May 2020 1 / 19 Top

Atlas Summit 2016 C ALL FOR P RESENTA TION P ROPOSALS The Atlas Society is currently planning the

Atlas Arteria Investor Presentation July 2018 Important notice and disclaimer Disclaimer Atlas

ATLAS Shrugged ATLAS Shrugged Pat O Toole Toole Pat O (with apologies to Ayn Rand and

Macquarie Atlas Roads Limited Macquarie Atlas Roads International Limited 2016 Annual General

World Wide Computing and the ATLAS World Wide Computing and the ATLAS Experiment Experiment th

Highlights and Searches in ATLAS Dave Charlton University of Birmingham on behalf of the ATLAS

Data Management in ATLAS Angelos Molfetas on behalf of the ATLAS DQ2 team 1 ATLAS DDM

H result from ATLAS Lydia Brenner Introduction ATLAS I will try to compare some

Project ATLAS Michelle Warf NCDOT EAU Caitlyn Meyer ATLAS GIS Consultant February 25

Atlas Arteria 2018 Full Year Results Presentation 28 February 2019 Important notice and

Atlas Analysis Infrastructure in Atlas Analysis Infrastructure in Japan Japan Hiroshi Sakamoto

Dia iabetes care and dementia among old lder adults: a nati tionwide 3-year lo longitudinal

Dark Matter Tim M.P . Tait University of California, Irvine Grenoble January 21-22, 2016

A performance evaluation of Maurice Troute(Intel Corporation) LATTICE 2016

Dark matter: astrophysical evidence Uros Seljak Slides from Risa Wechsler dark matter: do we

A 3.5 keV Photon Line from a 3.5 keV ALP Line Markus Rummel, University of Oxford Seminar,

Dark matter annihilation and non-thermal processes in Galaxy clusters Lidia Pieri GGI workshop

Chapter 3 Interfacing to a microprocessor ECE 3120 Dr. Mohamed Mahmoud

Hyper Natural Deduction for Gdel Logic a natural deduction system for parallel reasoning 1