Data Preservation lesperienza di ESA e il caso studio in EVER-EST - - PowerPoint PPT Presentation

data preservation l esperienza di esa e il caso studio in
SMART_READER_LITE
LIVE PREVIEW

Data Preservation lesperienza di ESA e il caso studio in EVER-EST - - PowerPoint PPT Presentation

Data Preservation lesperienza di ESA e il caso studio in EVER-EST ISMAR 05 July 2016 ESA UNCLASSIFIED For Official Use Data lifecycle: sensed data needs to be acquired Level 0 001 4100 002 4102 003 4102 004 6144 005 6150


slide-1
SLIDE 1

ESA UNCLASSIFIED – For Official Use

Data Preservation l’esperienza di ESA e il caso studio in EVER-EST ISMAR 05 July 2016

slide-2
SLIDE 2

Data lifecycle: sensed data needs to be acquired…

001 4100 002 4102 003 4102 004 6144 005 6150 006 7168 007 6146

Level 0

slide-3
SLIDE 3

...to be combined and processed to get this

3

Level 2 Level 0 Level 1 Processing Processing/c

  • mbining
slide-4
SLIDE 4

...or even further to get this

Scientific Publications

slide-5
SLIDE 5

...through complex processing schemes

Algorithm Manual

slide-6
SLIDE 6

Earth Medical History

2

Since when are you feeling like this ?

3

I don’t remember but luckily someone has preserved and updated my health records; here they are

4

Great!! We have new instruments today but knowing your past is crucial. Let me consult with my colleagues …

5

We identified the cause and have the right treatment for you !!!

  • 1. EO Heritage Data :

2.

  • the health record of our planet

Heritage Data Programme (LTDP+)

  • preserve and valorize the past to better shape the future

Heritage Data Programme (LTDP+)

  • preserve and valorize the past to better shape the future

1

I don’t feel very well

6

Thank you !!! I feel much better now !!!

slide-7
SLIDE 7

Heritage Data Long Term Preservation Programme Objectives

  • 1. ESA EO heritage missions data & knowledge holdings preservation

Secure state-of-art preservation of ESA EO heritage data assets and associated information holdings relying on state of art technologies, systems and standards, and ensuring data authenticity and understandability in the long term (Authority Entity) with the assumption/awareness that one can never anticipate the exploitation value

  • f the data asset in the future
  • 2. Coordination and cooperation with MS on

data preservation

establish an harmonised approach in cooperation with data owners in the MS to ensure European EO data preservation coordination with non-European actors

  • 3. European interest heritage data assets

preservation

prevent loss of non-ESA EO data at risk and judged of European long-term interest

ESA UNCLASSIFIED – For Official Use

slide-8
SLIDE 8

Heritage Data: Valorise the past to better understand the future

ESA UNCLASSIFIED – For Official Use

Spot 5 map

slide-9
SLIDE 9

ESA Long Time Data Series (excerpt)

Covered by LTDP Covered by LTDP+

slide-10
SLIDE 10

ESA UNCLASSIFIED – For Official Use

Data Management Plan

Discoverability

DMP-1: METADATA FOR DISCOVERY

Accessibility

DMP-2: ONLINE ACCESS

Usability

DMP-3: DATA ENCODING DMP-4: DATA DOCUMENTATION DMP-5: DATA TRACEABILITY DMP-6: DATA QUALITY-CONTROL

Preservation

DMP-7: DATA PRESERVATION DMP-8: DATA AND METADATA VERIFICATION

Curation

DMP-9: DATA REVIEW AND REPROCESSING DMP-10: PERSISTENT AND RESOLVABLE IDENTIFIERS

slide-11
SLIDE 11

ESA UNCLASSIFIED – For Official Use

Digital Objects Preservation

Generic definition:

Digital Preservation is the management and maintenance of digital objects so they can be accessed and used by future users. In Earth Observation Context

Digital Objects:

Data Records: these include raw data and/or Level-0 data, higher-level products, browse images, auxiliary and ancillary data, calibration and validation data sets, and descriptive metadata; Associated Knowledge: this includes all the Tools used in the Data Records generation, quality control, visualization and value adding, and all the Information needed to make the Data Records understandable and usable by the Designated Community.

slide-12
SLIDE 12

ESA UNCLASSIFIED – For Official Use

Data Records Preservation

slide-13
SLIDE 13

DATA Records Preservation

Data in different formats, quality, accuracy, etc… Heritage Data Programme:

  • Ensures preservation of data and knowledge
  • Ensure data discoverability and accessibility
  • Valorise and aligns old data to new data to generate time series
slide-14
SLIDE 14

Data Preservation Heritage Missions

SEASAT ESA DATA HOLDINGS RECOVERY

  • Data recovered from an LTO Tape in 2012 (received from DLR), information needed to understand and open

data retrieved from paper documents.

  • Processor created starting from JERS-1 SAR one: alignment of SEASAT, ALOS and JERS-1 SAR L-band data. Full

reprocessing campaign, metadata catalogue, web page creation with all information for users. Data archiving and dissemination.

  • JERS-1 (1992-1998) SAR and OPS will follow in Q2 2016.

Comparison of images from Seasat, ERS-2 and Sentinel-1 to map etreat

  • f two large glaciers in

southeast Greenland

  • ver a 36-year period.
slide-15
SLIDE 15

SEASAT SAR data recovery:

  • Greenland glaciers as seen by 3 generations of radar missions

Comparison of images from Seasat on 16 August 1978, ERS-2 on August 1996 and Sentinel-1 on 20 August 2014 to map the retreat of two large glaciers in southeast Greenland over a 36-year period. Retreat has been estimated at a rate of about 180 m/yr (top glacier) and 61.5 m/yr (bottom glacier)

Seasat L-band SAR ESA data holdings (European coverage Jul-Oct 1978) fully recovered and accessible online for the first time ever. SEASAT, ALOS and JERS-1 data aligned to generate a long time data series in L-band.

slide-16
SLIDE 16

NOAA AVHRR

  • 1987-2011

Nimbus-7

  • Coastal Zone Color Scanner (CZCS)
  • Nov 1978 – Jun 1986

Marine Observation Satellites MOS-1 & MOS-1b

  • MESSR (Multi-Spectral Electronic Self-Scanning Radiometer)
  • VTIR (Visible and Thermal Infrared Radiometer)
  • Feb 1987 – Nov 1995 & Feb 1990 – April 1996

Adeos-1

  • OCTS (Ocean Colour and Temperature Scanner)
  • Aug 1996 – Jun 1997

SPOT-1 & SPOT-2

  • Feb 1986 – Dec 1990 & Jan 1990 – Jun 2009
  • HRV (High Resolution Visible) & HRVIR (High Resolution Visible IR)

Priority order being confirmed with CCI representatives

Heritage datasets

slide-17
SLIDE 17

Preserving Bits: Technology Timeline

Archives evolution Robotic Libraries

slide-18
SLIDE 18

Preserving Bits: Technology Timeline

Archive Media evolution CCT 140MB --------- T10000 8TB

slide-19
SLIDE 19

Discoverability: Outputs Timeline

Discoverability evolution

On-request access (via phone or fax). Paper catalogue quicklook to check availability, followed by generation of image on paper and physical delivery (e.g. courier)

Web Catalogues Data Retrieval Time: from days to seconds

slide-20
SLIDE 20

Accessibility: Outputs Timeline

Meteosat 1984

Paper / Photographic Paper Data Delivery Format evolution

http://esamultimedia.esa.int/docs/corporate/ESA50_slideshow.pdf

Digital Images

First ERS image Sentinel -1

Film / Printouts

slide-21
SLIDE 21

Accessibility: Outputs Timeline

Media dissemination evolution HDD/CD/DVD Exabyte DLT CCT CCT 140MB --------- Unlimited

slide-22
SLIDE 22

From Selected Scientist to Crowd Science Outputs Timeline

Usability evolution Selected and expert users (restricted access and usability)

Medium and Large registered Designated community Open & Free on TEP- Thematic Exploitation Platform Open & Free on App & Scientific Cloud

Any user can access the data Selected scientist Restricted Communities

slide-23
SLIDE 23

ESA UNCLASSIFIED – For Official Use

Associated Knowledge Preservation

slide-24
SLIDE 24

ESA UNCLASSIFIED – For Official Use

Associated Knowledge elements

Software/Tools:

  • Software Applications:
  • Data Product generation
  • Quality control
  • Product visualization
  • Value adding

Information:

  • Documentation
  • Images
  • Metadata file (information on creation,

access rights, restrictions, preservation history, and rights management)

  • Multimedia (Video/Audio)
  • SW related “IT Infrastructure”:
  • Compiler
  • Programming language
  • Storage system
  • Operative System
  • Libraries
  • Databases
  • Workflows
  • Bi directional links
  • Schemas
  • Email
slide-25
SLIDE 25

ESA UNCLASSIFIED – For Official Use

Information Preservation

slide-26
SLIDE 26

Information Format for Digital Preservation

  • Text documents (often MS Word, Excel Files, txt, etc.) can

be preserved as:

  • PostScript, PDF, DSSSL, RTF, ASCII, SGML, TIFF, CGM
  • PostScript, PDF, RTF are proprietary
  • DSSSL, SGML not (yet?) widely used
  • CGM has multiple variants in use
  • Images can be preserved as:
  • Loss of Quality JPEG, JPEG2000
  • Lossless compression TIFF, PBM, PNG, FITS.
  • Metadata can be preserved as:
  • ASCII, the most durable format for metadata because it is

widespread, backwards compatible when used with Unicode (superset of ASCII), and utilizes human- readable characters, not numeric codes.

  • For higher functionality, SGML or XML should be used.
  • Multimedia can be preserved as:
  • AVI, QuickTime, MPEG, WMV, MJ2.

ESA UNCLASSIFIED – For Official Use

PDF , PDF/A, FITS TIFF , FITS ASCII, XML

MJ2

From various Standards and Publications Sources

slide-27
SLIDE 27

Preservation metadata: PREMIS

  • 1. Descriptive metadata (domain specific EAD for documents, FITs,

SAFE, etc, ISAR, ISAD)

  • 2. Administrative (including rights and permissions)
  • 3. Technical (physical needed for implementing most static preservation

functions)

  • 4. Structural
  • 5. Documenting digital provenance (history)
  • 6. Documenting relationship in the preservation repository
  • 7. Collaborative
  • 8. Generated during the life cycle of the asset to be preserved
slide-28
SLIDE 28

ESA UNCLASSIFIED – For Official Use

Data Stewardship Best Practices/Guidelines

slide-29
SLIDE 29

Data Stewardship Best Practices

CEOS-WGISS DSIG ESA LTDP Team and GSCB LTDP WG OGC CCSDS CEOS

USA Japan Brazil Other Europe

GEO

? ?

CEOS CEOS CEOS

Spaceborne Earth Observation Spaceborne Earth Observation Spaceborne, airborne, and in- situ Earth Observation Spaceborne, airborne, and in- situ Earth Observation NEEDS

slide-30
SLIDE 30

Technical Documents Policy Documents

CEOS Data Stewardship Best Practices Document Tree

Preservation Workflow EO Data Set Consolidation Process

EO Data Preservation Guidelines

Glossary of Acronyms and Terms CEOS EO Space Data Sets EO Data Stewardship Cooperative Framework

Preserved Data Set Content Persistent Identifiers Best Practice

Data Purge Alert Data Purge Alert Procedure & White Paper Individual

  • rganizations'

policies (stewardship, access, ...)

Applied to

Support Technical implementation procedures Guidelines and best practices on specific topics General guidelines and best practices

High level framework documents Applied to

Associated Knowledge Preservation

slide-31
SLIDE 31

EVER-EST

slide-32
SLIDE 32

Thanks for your attention

ESA UNCLASSIFIED – For Official Use

slide-33
SLIDE 33

Backup Slides

ESA UNCLASSIFIED – For Official Use

slide-34
SLIDE 34

Heritage Mission Preservation Preserved Data Set Content

EO Missions/Sensors Dataset is defined as:

  • 1. Data Records: these include raw data and/or Level-0

data, higher-level products, browse images, auxiliary and ancillary data, calibration and validation data sets, and descriptive metadata;

  • 2. Associated Knowledge: this includes all the Tools

used in the Data Records generation, quality control, visualization and value adding, and all the Information needed to make the Data Records understandable and usable by the Designated Community

ESA UNCLASSIFIED – For Official Use

slide-35
SLIDE 35

Data Records

  • Raw data
  • Level 0 data (L0)
  • Level 1 (L1) to higher levels mission data products when generated

as part of the mission requirements and/or reprocessed

  • Browses whenever generated
  • Ancillary data (spacecraft ephemeris information, attitude, etc.)
  • Auxiliary data (required to process the telemetry payload data to

generate the nominal mission products)

  • Calibration and validation datasets (needed to calibrate the satellite

instruments and monitor data quality)

  • Metadata

ESA UNCLASSIFIED – For Official Use

slide-36
SLIDE 36

Tools

  • 1. L0 consolidation software
  • 2. Data processing software (for products generation from

Level 0 to higher levels according to mission requirements)

  • 3. Quality control software
  • 4. Data/products visualization tools
  • 5. Value adding tools

ESA UNCLASSIFIED – For Official Use

slide-37
SLIDE 37

Information

  • 1. Mission architecture documents describing purpose, scope and

performances of the mission and of the on-board instruments,

  • 2. Data and product format specifications.
  • 3. Measurement requirements and/or measurement performances

(theoretical models).

  • 4. Instruments characteristics, performances and instrument

description (physical implementations).

  • 5. Reports concerned with measurement trends, failures, changes of

performances, un-availabilities

  • 6. Reports and outcomes from events such as: congresses, studies,

communities and investigators concerned with models’ review, algorithm changes, and Cal/Val changes affecting data processing chains.

ESA UNCLASSIFIED – For Official Use

slide-38
SLIDE 38

Information

  • 1. Documents related to the process of data qualification: precision,

numerical representations, formats, uncertainties, errors, adjustment/correction methods (e.g. Cal/Val procedures and documents).

  • 2. Document related to workflows, work procedure, documentation

three and bi-directional link

  • 3. Scientific publications based on the data exploitation or relevant to

them (properly linked to the data) and outreach material.

  • 4. Administrative (Memorandum, Intellectual Property Rights, etc.)
  • 5. Mission Data Records and Documentation Tree

ESA UNCLASSIFIED – For Official Use

slide-39
SLIDE 39
  • Data records and knowledge consolidation, maturity assessment and improvements

implementation, based on new user requirements and value of the data set for societal application domains

  • Metadata: for accessibility and interoperability with family of sensors, data

management, citation (persistent identifiers)

  • Documentation as need to fill in gaps of broadened target community knowledge

base and maximize exploitation

  • Alignment to output format of latest sensor family product to facilitate massive long

term data series processing

  • Reprocessing baseline alignment and characterisation to latest family sensors

algorithm, product model, and/or improved ancillary and auxiliary data (navigation/calibration data)

  • Quality information generation and extraction in accordance to guidelines

Heritage Mission Curation