Data Curation at Large Experimental Facilities with Open Source Software
Line Pouchard, Pavol Juhas, Kerstin Kleese Van Dam
Computational Science Initiative
Stuart Campbell
National Synchrotron Light Source – II Brookhaven National Laboratory
Data Curation at Large Experimental Facilities with Open Source - - PowerPoint PPT Presentation
Data Curation at Large Experimental Facilities with Open Source Software Line Pouchard, Pavol Juhas, Kerstin Kleese Van Dam Computational Science Initiative Stuart Campbell National Synchrotron Light Source II Brookhaven National
Computational Science Initiative
National Synchrotron Light Source – II Brookhaven National Laboratory
BNL provides supports data-rich experimental facilities:
(NSLS-II)
(CFN)
experiment
computing facilities for BNL, RIKEN, & US QCD communities
ATLAS QCD CFN NSLS II RHIC
Cost and schedule
User Facility
X-rays
Hard X-Ray Spectroscopy 6-BM (BMM): Beamline for Mater. Measurement 7-ID-1 (SST-1): Spectroscopy Soft and Tender 7-ID-2 (SST-2): Spectroscopy Soft and Tender 7-BM (QAS): Quick X-ray Absorption and Scattering
8-ID (ISS): Inner Shell Spectroscopy
8-BM (TES): Tender X-ray Absorption Spectroscopy Imaging & Microscopy 3-ID (HXN): Hard X-ray Nanoprobe 4-BM (XFM): X-ray Fluorescence Microscopy 5-ID (SRX): Sub-micron Resolution X-ray Spectroscopy 18-ID (FXI): Full-Field X-ray Imaging Structural Biology 16-ID (LIX): X-ray Scattering for Biology 17-ID-1 (AMX): Highly Automated MX 17-ID-2 (FMX): Frontier Microfocusing MX 17-BM (XFP): X-ray Footprinting for Bio Macromolecules 19-ID (NYX): Microdiffraction Beamline Soft X-Ray Scattering & Spectroscopy 2-ID (SIX): Soft Inelastic X-ray Scattering 21-ID (ESM): Photoemission-Microscopy Facility 22-IR (FIS/MET): Magneto, Ellips, High-P Infrared 23-ID-1 (CSX-1): Coherent Soft X-ray Scattering 23-ID-2 (CSX-2): Soft X-ray Spectr & Polarization Complex Scattering 10-ID (IXS): Inelastic X-ray Scattering 11-ID (CHX): Coherent Hard X-ray Scattering 11-BM (CMS): Complex Materials Scattering 12-ID (SMI): Soft Matter Interfaces Diffraction & In Situ Scattering 4-ID (ISR): In-Situ & Resonant X-Ray Studies 27-ID (HEX): High Energy X-ray Diffraction 28-ID-1 (PDF): X-Ray Atomic Pair Distribution Function
28-ID-2 (XPD): X-Ray Powder Diffraction
Experiments:
This workflow independently runs at each of the beamlines Bluesky interacts with proprietary detector software Once acquired, metadata is stored in a Databroker database and data in a file system partition for each beam line Used at all of the operational NSLS-II beamlines so far Facilitates sharing code across beamlines and facilities. Supported by copious user-friendly documentation at https://nsls-ii.github.io
Promotes discovery
sources Supports searches
sources to discover new relations Presents integrated results to NSLSII user
https://github.com/NSLS-II/sciprovenance
NSLSII and external data
1) iss – beamline 8-ID, Inner Shell Spectroscopy
2) xpd – beamline 28-ID-2, X-Ray Powder Diffraction
3) COD – subset of CIF fields from Crystallography Open Database http://www.crystallography.net/cod
full-text search engine https://elastic.co
Note: presented collections are a subset of what exists in DataBroker
Computational Scientists Data Scientists Experimentalists
The authors gratefully acknowledge the funding support from the U.S. Department of Energy Office of Science/ Office of Advanced Scientific Computing Research. This research used resources of the National Synchrotron Light Source II, a U.S. Department of Energy (DOE) Office of Science User Facility operated for the DOE Office of Science by Brookhaven National Laboratory. This manuscript has been authored by employees of Brookhaven Science Associates, LLC operated under Contract No. DESC0012704.
Van Dam
Data Acquisition, Management, and Analysis Group, S. Campbell