LSST Data Management Overview
DM Project Manager LSST Rio 25th September 2018
WilliamO’Mullane•Rio Brazil•September 2018
LSST Data Management Overview DM Project Manager LSST Rio 25 th - - PowerPoint PPT Presentation
LSST Data Management Overview DM Project Manager LSST Rio 25 th September 2018 WilliamOMullane Rio Brazil September 2018 Outline Introduction LSST status Data Management Overview Data Management Recent Achievements Conclusion
WilliamO’Mullane•Rio Brazil•September 2018
WilliamO’Mullane•Rio Brazil•September 2018 2
Jos de Bruijne
Simulation frame: Amina Helmi Image credit: R. Jay GaBany
WilliamO’Mullane•Rio Brazil•September 2018 4
˘ Zeljko Ivezi´ c
WilliamO’Mullane•Rio Brazil•September 2018 6
WilliamO’Mullane•Rio Brazil•September 2018 7
see also http://www.lsst.org and Ivezic et al. (2008)-arXiv:0805.2366
10-year simulation of LSST survey: number of visits in u,g,r band (Aitoff projection of eq. coordinates) WilliamO’Mullane•Rio Brazil•September 2018 8
WilliamO’Mullane•Rio Brazil•September 2018 9
WilliamO’Mullane•Rio Brazil•September 2018 10
http://ls.st/8p0 WilliamO’Mullane•Rio Brazil•September 2018 11
WilliamO’Mullane•Rio Brazil•September 2018 12
WilliamO’Mullane•Rio Brazil•September 2018 13
WilliamO’Mullane•Rio Brazil•September 2018 14
WilliamO’Mullane•Rio Brazil•September 2018 15
WilliamO’Mullane•Rio Brazil•September 2018 16
WilliamO’Mullane•Rio Brazil•September 2018 17
WilliamO’Mullane•Rio Brazil•September 2018 18
Data Management Leadership Team (DMLT) DM Project Manager William O’Mullane DM Deputy Manager John Swinbank Project Manager Victor Krabbendam DM Subsystem Scientist Leanne Guy Deputy DM Sub- system Scientist Colin Slater Project Scientist ˇ Zeljko Ivezi´ c Systems Engineering / DMCCB Manager William O’Mullane Deputy Manager John Swinbank DM Subsystem Scientist Leanne Guy Pipelines Scientist Robert Lupton Systems Engineer Tim Jenness (DMCCB Chair) Software Architect Kian Tat Lim Platform & Interfaces Gregory Dubois-Felsmann Config/Release Manager Gabriele Comoretto DM Documentalist Jonathan Sick Project Control/Scheduler Kevin Long DM Admin Libby Petrick LSST Documentalist Robert McKercher SUIT T/CAM Xiuqin Wu Product Owner Gregory Dubois- Felsmann Data Access Services T/CAM Fritz Mueller Product Owner Colin Slater Science Pipelines T/CAM John Swinbank Deputy Yusra AlSayyad DRP Owner Jim Bosch Alerts Owner Eric Bellm SQuaRE T/CAM Frossie Economou Product Owner Simon Krughoff Data Facility T/CAM Margaret Gelman Infrastructure Owner Michelle Butler Proc Systems Owner Robert Gruendl LHN & Base Site T/CAM Jeff Kantor Product Owner Michelle Butler DM Subsystem Science Team DMLT Legend Science or Product Owner Role Technical Role Managment or Admin Role 1
Leanne Guy replaces Mario Juri´ c (who stays with the project) as Subsystem Scientist. Michelle Butler replaces Don Petravick as Infrastructure Product Owner. New Release Manager Gabriele Comoretto. Deputies John Swinbank (PM), Colin Slater (PS), Yusra AlSayyad (Pipelines) and Vaikunth Thukral (DAX). Toughest thing in any project is communication.
WilliamO’Mullane•Rio Brazil•September 2018 19
Upper diagram courtesy K-T Lim, LDM-148. Lower diagram by Tim Jenness; covers only the Science Pipelines codebase. WilliamO’Mullane•Rio Brazil•September 2018 20
Data rights and access details being worked on LPM-261 US 2400 cores, 3PB DB, 4PB disk , Chile 10% of that
International agencies interested in establishing international DACs (intDACs) - potential added value to LSST science that
infrastructure investments
planning
Data access fees are a critical component of provisioning and funding LSST operations Goal: need to structure an approach to intDACs in a way that is practical, seamless and beneficial to the full LSST science community while not undermining the operations funding
WilliamO’Mullane•Rio Brazil•September 2018 21
WilliamO’Mullane•Rio Brazil•September 2018 22
locally
>source /opt/lsst/software/stack/loadLSST.bash
WilliamO’Mullane•Rio Brazil•September 2018 23
Figure from Eric Bellm WilliamO’Mullane•Rio Brazil•September 2018 24
a subset of LSST alerts based only on data in the alert packet can use lightcurve, variability parameters, colors, etc., no crossmatch to external catalogs Runs in the LSST Data Access Center(-> users must have data rights)
Provide public access to alerts Classification and Crossmatch to other catalogs or data streams Provide filtering, visualization, and search Coordinate scientific activity and/or followup observations Aggregate alert annotations (community classifications, etc.)
see LDM-612 Working out selection process for 2019
WilliamO’Mullane•Rio Brazil•September 2018 25
Observatory System Spec LSE-30 (OSS) DM Data Acq ICD LSE-68 DM Camera ICD LSE-69 DM Telescope Control Sys ICD LSE-75 DM Summit Infra ICD LSE-76 DM Base Infra ICD LSE-77 DM EPO ICD LSE-131 DM Telescope Aux ICD LSE-140 LSST System Requirements LSE-29 (LSR) LSST Science Requirements LPM-17 (SRD) Specification and Design Planning Test and Validation June 24, 2018 Coming in 2018 Interface Control Document (ICD) Needs Update In CCB/Released DM System Requirements LSE-61 (DMSR) LSST Data Quality Assurance Plan LSE-63 (DQAP) LSST Data Products LSE-163 (DPDD) DM Validation & Test Plan LDM-503 (SVTP) DM Science Accep- tance Test Speci- fication LDM-639 DM PMP LDM-294 DM Releases for Ver- ificaiton/Integration LDM-564 DM Verification Control (VCD) Component Archi- tecture LDM-148 Science Platform Requirements LDM-554 Data BackBone Requirements LDM-635 L1 (prompt) Requirements LDM-602 L2 (DRP) Require- ments LDM-562 Middleware Requirements LDM-556 Database Require- ments LDM-555 Science Platform Design LDM-542 Middleware Design LDM-152 Database De- sign LDM-135 Services & Infras- tructure LDM-129 Pipeline De- sign LDM-151 Network De- sign LSE-78 User Documentation NCSA Enclave Test Spec LDM-532 DM Raw Image Archiving Service Test Spec LDM-538 Comm Cluster Test Spec LDM-541 Qserv test spec LDM-552 Data Services Test Spec LDM-536 Data BackBone Test Spec LDM-535 Science Platform Test Spec LDM-540 L1 Test Spec LDM-533 L2 Test Spec LDM-534 L2 KPMs LDM-502 L2 Test Reports NCSA Enclave Test Reports Base Enclave Test Reports Comm Cluster Test Reports DBB Test Reports Qserv Test Reports Data Services Test Reports Science Platform Test Reports L1 Test Reports Infrastructure Test Reports
WilliamO’Mullane•Rio Brazil•September 2018 26
www.lsst.org
community.lsst.org www.lsst.io dr1.lsst.io pipelines.lsst.io Firefly contextual help (in app) LSST Science Platform Landng Page Data Product Reference Guides alerts.lsst.io L1 Data Reference Installation Guide Release Notes Frameworks Contribution Guide Getting Started Processing Data Modules firefly-client.lsst.io lsst-texmf.lsst.io firefly.lsst.io developer.lsst.io qserv.lsst.io ltd-keeper.lsst.io ltd-mason.lsst.io ltd-conveyor.lsst.io Brokers Website or application Documentation site (Sphinx)
Interior documentation page or section
Legend
Notebook Aspect nb.lsst.io API Aspect documentation Portal Aspect documentation
WilliamO’Mullane•Rio Brazil•September 2018 27
2018 2019 2020 2021 2022 Operations Science Platform with WISE data in PDAC LDM-503-01 HSC reprocessing LDM-503-02 Alert generation validation LDM-503-03 Interface Verifjcation: Single Visit LSST-1200 Aux Tel DAQ integration functionality test LDM-503-04 AuxTel DAQ Interface Verifjcation and Spectrograph Ops Rehearsal LDM-503-04b Alert distribution validation LDM-503-05 COMP: Partial Camera Image data for Data Management* CAMM6995 Small Scale CCOB Data Access LDM-503-08b EFD ETL Under DM Control LSST-1220 DM ComCam interface verifjcation readiness LDM-503-06 Camera data processing LDM-503-07 Ops rehearsal for commissioning #1 LDM-503-09 Pipelines Release Fall 2018 LDM-503-09a Spectrograph data acquisition LDM-503-08 Atmospheric Telescope Completion T&SC-1150-0600 DAQ validation LDM-503-10 Large Scale CCOB Data Access LDM-503-10b Ops rehearsal for commissioning #2 LDM-503-11 ComCam Ops Readiness LDM-503-11a Pipelines Release Fall 2019 LDM-503-11b Ops rehearsal for commissioning #3 LDM-503-12 Start Early System I&T LSST-1510 LSSTCam Ops Readiness LDM-503-12a Engineering First Light w/ComCam LSST-1513 DMS Archive Center Complete COMC-1664 Pipelines Release Fall 2020 LDM-503-13a Ops Rehearsal for DRP #1 LDM-503-13 DM Readiness for Science Verifjcation LDM-503-14 System First Light LSST-1520 Start of Science Verifjcation LSST-1540 Pipelines Release Fall 2021 LDM-503-15a Ops Rehearsal for DRP #2 LDM-503-15 Science Verifjcation Complete LSST-1560 Ops Rehearsal for DRP #3 LDM-503-16 Final Pipelines Delivery LDM-503-17a Final operations rehearsal LDM-503-17 Start of Full Science Operations LSST-1620 1
WilliamO’Mullane•Rio Brazil•September 2018 28
WilliamO’Mullane•Rio Brazil•September 2018 29
WilliamO’Mullane•Rio Brazil•September 2018 30
Aug: Mountain base network up Oct: “Generation 3” pipeline execution middleware Nov: Ready for spectrograph data acquisition Dec: Prototype QA/Commissioning Environment
LDM-532; LDM-533; LDM-534; LDM-535; LDM-536 LDM-537; LDM-538; LDM-539; LDM-540; LDM-541
WilliamO’Mullane•Rio Brazil•September 2018 31
WilliamO’Mullane•Rio Brazil•September 2018 32
Official operations late 2019 — milestone LDM-503-08 in preparation for that.
WilliamO’Mullane•Rio Brazil•September 2018 33
WilliamO’Mullane•Rio Brazil•September 2018 34
WilliamO’Mullane•Rio Brazil•September 2018 35
Figure: AlSayyad. WilliamO’Mullane•Rio Brazil•September 2018 35
First(equal) post-replan milestone hit by the DM project DMTR-53! Demonstrating a end-to-end alert production pipeline.
Simultaneous astro- and photometric fitting to source lists derived from multiple images. The all new, much improved, more generic replacement for the HSC-specific meas_mosaic.
Figure shows the variation in photometric calibration not captured by single frame processing, normalized to 1. This demonstrates fine structure in photometry which Jointcal picks up but per-CCD processing doesn’t catch. Figure: Parejko. WilliamO’Mullane•Rio Brazil•September 2018 36
DTN 1
NIC1 NIC2 NIC3
DTN 2
NIC1 NIC2 NIC3
DTN 3
NIC1 NIC2 NIC3 LSST DWDM Transponder 10x10G 1 2 3 4 5 6
La Serena
Management Router
REUNA
LSST DWDM Transponder 10x10G 1 2 3 4 5 6 AmLight AndesLight1 Brocade MLXe 1/1 1/2 1/3 1/4 1/5 1/6 3/2 3/1 10G SR+ 1G UTP DWDM
AmLight
Pacific Atlantic 100G Leased AmLight AMPATH01 Brocade MLXe 7/2 AmLight AMPATH02 Brocade MLXe 7/2 6/2
FLR + Internet2
Internet2 Rtsw-chic 0/0/5 100G VID 3890 VID 3891 VID 3892 VID 3893 VID 3894 VID 3895 VID 3890 VID 3891 VID 3892 VID 3893 VID 3894 VID 3895 VID 3890 VID 3891 VID 3892 VID 3893 VID 3894 VID 3895
Level3 SCL AMPATH 6000W Chicago ICCN NCSA/NPCF
VID 3890 VID 3891 VID 3892 VID 3893 VID 3894 VID 3895 7/1 7/1
DTN 3
NCSA EX9214 NIC NCSA MX960
DTN 1
NIC
DTN 2
NIC
DTN 4
NIC
DTN 5
NIC
DTN 6
NIC
NCSAnet
NIC 9/1/0 VID 3890 VID 3891 VID 3892 VID 3893 VID 3894 VID 3895 VID 3890 VID 3891 VID 3892 VID 3893 VID 3894 VID 3895 NIC NIC NIC NIC NIC
WilliamO’Mullane•Rio Brazil•September 2018 37
Three 30-node clusters operating:
AllWISE + NEOWISE)
30% DR1 KPM measurements DMTR-17 Deployment under Kubernetes
IN2P3 Qserv Cluster - Fritz Muller WilliamO’Mullane•Rio Brazil•September 2018 38
Working within the LSST Systems Engineering Early Pathfinder group, developing and testing integration of T&S, Camera, and DM service software via a series of early integration activities. Initial header service developed and configured for Camera subsystem and AuxTel use cases, ability to acquire pixel data and write FITS files, all commandable by OCS. Demonstrated
with Spectrograph in Tucson
WilliamO’Mullane•Rio Brazil•September 2018 39
Images Krughoff(←) and Wu (↑) WilliamO’Mullane•Rio Brazil•September 2018 40
WilliamO’Mullane•Rio Brazil•September 2018 41
WilliamO’Mullane•Rio Brazil•September 2018 42
g,r(1.5 hrs) ,i(3 hrs) PSF matched co-add (≈ 27.5) http://www.lsst.org http://community.lsst.org Images:Lupton and HSC colaboration see also Lupton et al. (2004) WilliamO’Mullane•Rio Brazil•September 2018 43
WilliamO’Mullane•Rio Brazil•September 2018 44
WilliamO’Mullane•Rio Brazil•September 2018 45
Acronym Description AI Action Item AP Alerts Production API Application Programming Interface AURA Association of Universities for Research in Astronomy AVRO Apache data serialization system BBC German shipping company C Specific programming language (also called ANSI-C) CCD Charge-Coupled Device D Deutschland (Germany) D Specific project phase (production; concluded by QR and FAR) DAX Data Access Services DB DataBase DM Data Management DMTN DM Technical Note DMTR Data Management Test Report DRP Data Release Production DTN Data Transfer Node EFD Engineering Facilities Database EIA Early Integration Activity EIE European Industrial Engineering - Italian engineering company (Dome) FITS Flexible Image Transport System WilliamO’Mullane•Rio Brazil•September 2018 46
HSC Hyper Suprime-Cam IPAC Infrared Processing and Analysis Center IR Infra Red ISR Instrument Signal Removal K Kelvin; SI unit of temperature KPM Key Performance Metric L1 Level 1 (ambiguous could mean milestone or processing) L2 Level 2 (ambiguous could mean milestone or processing) LDM LSST Data Management (handle for controlled documents) LPM LSST Project Management (Document Handle) LSE LSST Systems Engineering (Document Handle) LSST Large Synoptic Survey Telescope M2 Second mirror MIA Missing In Action MN Meeting Minutes MOPS Moving Object Pipeline System N Newton; SI unit of force NASA National Aeronautics and Space Administration (USA) NCSA National Center for Supercomputing Applications NEO Near-Earth Object NSF National Science Foundation OCS Observatory Control System PB PetaByte PDAC Prototype Data Access Center WilliamO’Mullane•Rio Brazil•September 2018 47
PM Project Manager PS Project Scientist PSF Point Spread Function QA Quality Assurance Qserv Query Service, Proprietary LSST Database system S Strip (CCD chip along-scan coordinate identifier in focal plane) SDSS Sloan Digital Sky Survey SPIE the international society for optics and photonics SUIT Science User Interface and Tools T&S Telescope and Site TB TeraByte US United States USA United States of America arcmin arcminute, minute of arc (unit of angle) kg kilogram; SI unit of mass s second; SI unit of time WilliamO’Mullane•Rio Brazil•September 2018 48
[LDM-612], Bellm, E., co authors, 2018, Plans and Policies for LSST Alert Distribution, LDM-612, URL https://ls.st/LDM-612 [DMTR-53], Bellm, E., Swinbank, J., 2018, LDM-503-03 (Alert Generation) Test Report, DMTR-53, URL https://ls.st/DMTR-53 [LDM-533], Bellm, E.C., 2017, Level 1 System Software Test Specification, LDM-533, URL https://ls.st/LDM-533 [DMTR-51], Bosch, J., Chiang, H.F ., Gower, M., et al., 2017, LDM-503-02 (HSC Reprocessing) Test Report, DMTR-51, URL https://ls.st/DMTR-51 [DMTR-61], Butler, M., Parsons, J., 2018, LDM-503-04 and LDM-503-04b (Raw Image Archiving Service) Test Report, DMTR-61, URL https://ls.st/DMTR-61 [LDM-538], Butler, M., Parsons, J., Gower, M., 2018, Raw Image Archiving Service Test Specification, LDM-538, URL https://ls.st/LDM-538 [LSE-79], Claver, C., The LSST Commissioning Planning Team, 2017, System AI&T and Commissioning Plan, LSE-79, URL https://ls.st/LSE-79 [LDM-540], Dubois-Felsmann, G., 2018, LSST Science Platform Test Specification, LDM-540, URL https://ls.st/LDM-540 [LDM-542], Dubois-Felsmann, G., Lim, K.T., Wu, X., et al., 2017, LSST Science Platform Design, LDM-542, URL https://ls.st/LDM-542 [DMTR-52], Dubois-Felsmann, G.P ., 2018, LDM-503-01 (WISE Data Loaded in PDAC) Test Report, DMTR-52, URL https://ls.st/DMTR-52 Ivezic, Z., et al., 2008, ArXiv e-prints (arXiv:0805.2366), ADS Link Jenness, T., Economou, F ., Findeisen, K., et al., 2018, In: Software and Cyberinfrastructure for Astronomy V, vol. 10707 of Proc. SPIE, 1070709, doi:10.1117/12.2312157, ADS Link [DMTN-087], Juric, M., Jones, L., 2018, Proposed Modifications to Solar System Processing and Data Products, DMTN-087, URL https://dmtn-087.lsst.io, LSST Data Management Technical Note [LSE-319], Juri´ c, M., Ciardi, D., Dubois-Felsmann, G., 2017, LSST Science Platform Vision Document, LSE-319, URL https://ls.st/LSE-319 [LSE-163], Juri´ c, M., et al., 2017, LSST Data Products Definition Document, LSE-163, URL https://ls.st/LSE-163 [Document-28547], Kantor, J., 2018, LSST Network Bandwidth Tests between Chile and the United States , Document-28547, URL https://ls.st/Document-28547 [LDM-148], Lim, K.T., Bosch, J., Dubois-Felsmann, G., et al., 2018, Data Management System Design, LDM-148, URL https://ls.st/LDM-148 WilliamO’Mullane•Rio Brazil•September 2018 49
Lupton, R., Blanton, M.R., Fekete, G., et al., 2004, PASP, 116, 133 (arXiv:astro-ph/0312483), doi:10.1086/382245, ADS Link [LDM-572], O’Mullane, W., Petravick, D., 2017, Chilean Data Access Center, LDM-572, URL https://ls.st/LDM-572 [LDM-564], O’Mullane, W., Economou, F ., Jenness, T., Loftus, A., 2018, Data Management Software Releases for Verification/Integration, LDM-564, URL https://ls.st/LDM-564 [LDM-294], O’Mullane, W., Swinbank, J., Juri´ c, M., DMLT, 2018, Data Management Organization and Management, LDM-294, URL https://ls.st/LDM-294 [LDM-503], O’Mullane, W., Swinbank, J., Juri´ c, M., Economou, F ., 2018, Data Management Test Plan, LDM-503, URL https://ls.st/LDM-503 [DMTN-028], Patterson, M.T., 2018, Benchmarking a distribution system for LSST alerts, DMTN-028, URL https://dmtn-028.lsst.io, LSST Data Management Technical Note [LDM-230], Petravick, D., Butler, M., Gelman, M., 2018, Concept of Operations for the LSST Data Facility Services, LDM-230, URL https://ls.st/LDM-230 [LDM-534], Swinbank, J.D., 2017, Level 2 System Software Test Specification, LDM-534, URL https://ls.st/LDM-534 [LDM-151], Swinbank, J.D., et al., 2017, Data Management Science Pipelines Design, LDM-151, URL https://ls.st/LDM-151 [DMTR-17], Thukral, V., 2018, Qserv Fall 17 Large Scale Tests/KPMs, DMTR-17, URL https://ls.st/DMTR-17 [LDM-532], Unknown, 2017, NCSA Enclave Test Specification, LDM-532, URL https://ls.st/LDM-532 [LDM-535], Unknown, 2017, Data Backbone Test Specification, LDM-535, URL https://ls.st/LDM-535 [LDM-536], Unknown, 2017, Data Backbone Data Services Test Specification, LDM-536, URL https://ls.st/LDM-536 [LDM-537], Unknown, 2017, Data Backbone Infrastructure Test Specification, LDM-537, URL https://ls.st/LDM-537 [LDM-539], Unknown, 2017, Data Access Center Enclave Test Specification, LDM-539, URL https://ls.st/LDM-539 [LDM-541], Unknown, 2017, Commissioning Cluster Enclave Test Specification, LDM-541, URL https://ls.st/LDM-541 [LPM-261], Willman, B., Graham, M., O’Mullane, W., Petravick, D., 2018, Access Policy for LSST Data and Data Access Center, LPM-261, URL https://ls.st/LPM-261 WilliamO’Mullane•Rio Brazil•September 2018 50