maximizing data accessibility at the esac science data
play

Maximizing data accessibility at the ESAC Science Data Centre J. - PowerPoint PPT Presentation

Maximizing data accessibility at the ESAC Science Data Centre J. Gonzlez-Nez, C. Arviset, J. Salgado ESAC Science Data Center (ESDC) Issue/Revision: 1.0 Reference: Asteriscs EDP ESDC Archives Status: Issued ESA UNCLASSIFIED - Releasable


  1. Maximizing data accessibility at the ESAC Science Data Centre J. González-Núñez, C. Arviset, J. Salgado ESAC Science Data Center (ESDC) Issue/Revision: 1.0 Reference: Asteriscs EDP ESDC Archives Status: Issued ESA UNCLASSIFIED - Releasable to the Public

  2. ESAC Science Data Centre The Digital Library of the Universe Ø Large set of science archives co-located at ESAC are a major research asset for community : • Astronomy, Planetary, Solar Heliospheric Ø Different types of data: • Raw data, calibrated processed data, high level data products, … • All data public and available on-line after a short proprietary period Ø Need to be kept readily available for future users and novel uses by various types of users: • Scientific Community (public access) • PI team and observers (controlled access) • Science Operations Team (privileged access) Ø Archive Strategy Plan for 5-20+ years Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 2 ESA UNCLASSIFIED - Releasable to the Public

  3. ESAC Science Archives Strategy Enable maximum scientific exploitation of data sets Enable efficient long-term preservation of data, software and knowledge, using modern technology Enable cost-effective archive production by integration in, and across, projects Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 3 ESA UNCLASSIFIED - Releasable to the Public

  4. International Collaboration and Interoperability amongst Archives IVOA - International Virtual Observatory Alliance • Formed in 2002, 20 member projects • Defines interoperability standards (VO framework) amongst astronomical (ground and space based) archives • Working Groups and Interest Groups per technical domain (data access, data model, registry, applications, semantics, operations, …) • http://www.ivoa.net/ IPDA - International Planetary Data Alliance • Formed in 2004, 12 space agencies • Defines archiving guidelines for planetary data • Defines interoperability standards amongst planetary archives • http://planetarydata.org/ Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 4 ESA UNCLASSIFIED - Releasable to the Public

  5. Data Architecture @ ESDC archives Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 5 ESA UNCLASSIFIED - Releasable to the Public

  6. OAIS to ABSI OAIS (2003) Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 6 ESA UNCLASSIFIED - Releasable to the Public

  7. OAIS to ABSI Archives Building System Infrastructure (2006) Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 7 ESA UNCLASSIFIED - Releasable to the Public

  8. ABSI based architecture Programmatic AIO science archive Browser GUI RPC Database SAMP VO services S*AP, SIAP Data Repository VO Apps Euro-VO Registry Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 8 ESA UNCLASSIFIED - Releasable to the Public

  9. Recent Drivers (Gaia, Euclid) 1. SOC integrated archives: a. Archive as a science operations building block b. Data processing metadata & intermediate products 2. Archive consortiums (DPAC, Euclid Consortium) a. Integration with WP-developed modules: Visualization, DM, Xmatch, Validation etc. 3. Large data volumes: a. Distributed data storage b. ‘Big Data’ analysis techniques Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 9 ESA UNCLASSIFIED - Releasable to the Public

  10. Archives Integration within Consortium Archive is fully part of Science Operations, from the start Strong collaboration with SGS and Consortium Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 10 ESA UNCLASSIFIED - Releasable to the Public

  11. Archives Integration within Consortium Archive is fully part of Science Operations, from the start Strong collaboration with SGS and Consortium Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 11 ESA UNCLASSIFIED - Releasable to the Public

  12. Data collections stats for Gaia Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 12 ESA UNCLASSIFIED - Releasable to the Public

  13. The Gaia Archive Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 13 ESA UNCLASSIFIED - Releasable to the Public

  14. Gaia Archive Current Architecture Programmatic TAP+, VOSpace science archive Browser GUI VO services TAP+ RDBMS SAMP TAP+, S*AP Distributed Data Repository VO Apps Euro-VO Registry Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 14 ESA UNCLASSIFIED - Releasable to the Public

  15. Extending VO protocols: TAP + TAP+ I/F Command Public area line tools • Publicly released catalogues Restricted area External • Catalogues during validation or Apps proprietary exploitation Data User Space Validation • User-uploaded data Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 15 ESA UNCLASSIFIED - Releasable to the Public

  16. ESASky : Multi-mission visualization ESASky backend Obs. metadata MOCs Catalogues TAP, HTTP All-Sky HIPS Maps sky.esa.int science archive science archive Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 16 ESA UNCLASSIFIED - Releasable to the Public

  17. ASTERICS at ESDC archives Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 17 ESA UNCLASSIFIED - Releasable to the Public

  18. Ongoing collaborations 1. ADQL Auto-complete Provide suggestions in the Gaia Archive ADQL • Query editor Stelios Voutsinas (WFAU, University of Edinburgh) • GENIUS - ASTERICS 2. pgSphere development Extension maintenance, configuration • management Implementation of Gaia CU9 requested features • Markus Nullmeier (ARI, Universität Heidelberg) - • ASTERICS Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 18 ESA UNCLASSIFIED - Releasable to the Public

  19. Challenges Gaia Archive Preparation (2011) Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 19 ESA UNCLASSIFIED - Releasable to the Public

  20. Challenges 1. “Bringing the code to the data” a. GAVIP project (Docker) b. TAP+: user areas, user defined functions 2. Cloud Services a. ESAC Cloud 3. Large scale processing (Spark, Hadoop): a. Data Mining Work Package b. ‘Grand Challenges’ c. Prototype simple use cases Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 20 ESA UNCLASSIFIED - Releasable to the Public

  21. Thanks! http://www.cosmos.esa.int/web/esdc Juan Gonzalez | ASTERICS European Data Provider Forum | 16/06/2016 | Slide 21 ESA UNCLASSIFIED - Releasable to the Public

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend