The Helmholtz Association Project „Large Scale Data Management and Analysis“ (LSDMA)
Kilian Schwarz, GSI; Christopher Jung, KIT
2 05.10.2012 Christopher Jung SCC, KIT
Overview
Why is Scientific Big Data important?
Examples of Scientific Big Data in non-HEP
Examples for sciences with Big Data:
throughput microscopy (zebra fish embryos)
mapping
needs yet
Challenges of Big Data
Data Life Cycle
Inspiration for LSDMA: support the whole data life cycle!
Dual approach: community-specific and generic
Data Life Cycle Labs
communities
– Optimization of the data life cycle
– Community-specific data analysis tools and services
Data Services Integration Team
– Interface between federated data infrastructures and DLCLs/communities
– Integration of data services into the scientific working process
Facts and numbers
Helmholtz Association in 2015
German climate research center
Initial communities
– Smart grids, battery research, fusion research
– Climate model, environmental satellite data
– Virtual human brain map
– Synchrotron radiation, nanoscopy, systems biology, electron-microscopical imaging techniques
– Photon Science: Petra 3, XFEL
– FAIR@GSI (14 experiments with big and small communities)
LHC Computing – Prototype for FAIR
within an already running experiment
FAIR
way, and to some extent they already go back to ALICE
(funding, network architecture, software development and more ...)
– triggerless “online” system
GPU
– Grid/Cloud infrastructure
compute jobs to Clouds
– create interfaces to existing environments (AliEn, ...)
– long term data archives
gStore
– metadata catalog and data analysis
To be developed within LSDMA (DLCL: structure of matter) in collaboration with LSDMA-DSIT, the FAIR community, and ALICE (wherever synergy can be found)
Goals for GSI/FAIR in LSDMA
– Include the distributed FAIR T0/T1 centre into a global Grid/Cloud infrastructure
– Federated Identity Management
– Global File System
– Optimization of Data Storage
Additional synergies via DSIT
Next Steps at GSI
know candidates?
– GSI DSIT has already started to hire people
CBM, based on the experiences with ALICE (AliEn/xrootd/...)
also in close collaboration with DSIT and ALICE
Summary and Outlook
the whole data life cycle, using a community-specific and a generic approach