Processing and analysis of Earth Observation data
Carsten Brockmann, Brockmann Consult GmbH
ESA Climate Change Initiative Toolbox Science Lead
Big Data Analytics & GIS, Münster 20.-21. September 2017.
Processing and analysis of Earth Observation data Carsten - - PowerPoint PPT Presentation
Processing and analysis of Earth Observation data Carsten Brockmann, Brockmann Consult GmbH ESA Climate Change Initiative Toolbox Science Lead Big Data Analytics & GIS, Mnster 20.-21. September 2017. Earth Observation Managing big EO
Carsten Brockmann, Brockmann Consult GmbH
ESA Climate Change Initiative Toolbox Science Lead
Big Data Analytics & GIS, Münster 20.-21. September 2017.
Earth Observation Managing big EO data is increasingly complex. But not just technically.
Lake MacKay, Australia
Tonga, Pacific
Arctic Ocean
Karavasta Lagoon, Albania
Satellite Images = Measurement Data
Turning image data into information
Satellite Images = Measurement Data
Generic Tools
Instrument specific tools
Thematic processing, synergy
SNAP Architecture
SNAP Engine
Java SE 8 Platform NetBeans RCP
SNAP Desktop Sentinel-3 Toolbox (S3TBX) Sentinel-2 Toolbox (S2TBX) Sentinel-1 Toolbox (S1TBX)
Python GeoTools JAI NetCDF …
Any combination
add-ons is allowed, even none, as SNAP Desktop is a already a useful stand-alone application for EO data exploitation. Programming language layer 3rd-party library layer SNAP layer
SNAP Architecture
SNAP Engine
Java SE 8 Platform NetBeans RCP
SNAP Desktop Sentinel-3 Toolbox (S3TBX) Sentinel-2 Toolbox (S2TBX) Sentinel-1 Toolbox (S1TBX)
Python GeoTools JAI NetCDF …
Any combination
add-ons is allowed, even none, as SNAP Desktop is a already a useful stand-alone application for EO data exploitation. Programming language layer 3rd-party library layer SNAP layer
SNAP Application Modes
Golden Age of Earth Observation
By the end of 2017, the operational Sentinel-1, -2, -3 and -5p satellites alone will continuously collect a volume of 27 Terabytes per day / 10PB per year. It could take around 2.5 years to download 1 Petabyte
process it on your own computer (Wagner, 2015)
Data Local Processing
Optimising data transfer Sharing input data Sharing result Rapid turn-around cycles
Hadoop approach Concurrent data-local processing Tasks are transferred over the network Good scalability Archive-centric approach Network storage Data are transferred over the network Risk of network bottleneck
Data Local Processing
Calvalus Adapters
Calvalus Processing System for EO Data
Calvalus On-demand Portal Calvalus Bulk Production Calvalus Adapters SNAP GPF Operators and Graphs SNAP Aggre- gators Linux Executables
Apply Apache Hadoop to earth observation
Transfer the algorithm to the data (data-local in a narrower sense) Avoid an archive-centric approach
Add Calvalus software layers for EO data processing and validation
EO processing workflows Data processor plug-in framework Bulk production control Portal
Integrate data processors
Linux executables SNAP & BEAM GPF operators and aggregators Ppen for other frameworks
access
L1 File L2 Processor (Mapper Task) L2 File L1 File L2 Processor (Mapper Task) L2 File L1 File L2 Processor (Mapper Task) L2 File L1 File L2 Processor (Mapper Task) L2 File L1 File L2 Processor (Mapper Task) L2 File
step necessary
Data Local Processing
access
concatenate intermediate results to final result (reduce)
Map-Reduce on Calvalus
Map-Reduce on Calvalus
(Reducer Task)
access
L1 File
L2 Proc. + Spat. Binning (Mapper Task)
Spatial Bins L1 File
L2 Proc. + Spat. Binning (Mapper Task)
Spatial Bins L1 File
L2 Proc. + Spat. Binning (Mapper Task)
Spatial Bins L1 File
L2 Proc. + Spat. Binning (Mapper Task)
Spatial Bins L1 File
L2 Proc. + Spat. Binning (Mapper Task)
Spatial Bins
(Reducer Task) Temp. Bins Temp. Bins L3 Formatting L3 File
L1 File L2 Proc. & Matcher (Mapper Task) Output Records L1 File L2 Proc. & Matcher (Mapper Task) Output Records L1 File L2 Proc. & Matcher (Mapper Task) Output Records L1 File L2 Proc. & Matcher (Mapper Task) Output Records
Matchup Analysis (Reducer Task) MA Report Input Records L1 File L2 Proc. & Matcher (Mapper Task) Output Records
Map-Reduce on Calvalus
Supported by SNAP Graph Processing Framework
Streaming on Calvalus
EO Data & Data-Processing Platforms
European Space Agency & national Space Agencies
European Commission:
Copernicus Collaborative Ground Segments Private offers
The Urban Thematic Exploitation Platform
Visualisation & Analysis Urban TEP Processing Centres Urban TEP portal + gateway
gateway to ...
Datasets and services Geo-browser Processing request forms and result access
Portal functions
Analysis and visualisation
Combination of satellite products and socio-economic data Derivation of new criteria
Processing request form
Istanbul Moscow Sao Paulo
Global binary raster mask showing location of human settlements (12m/75m)
GUF
▪ SAR4Urban (2015-2016)
Beijing
ERS-2 PRI & ASAR IMP VV 2002-2003 15m spatial resolution 48 scenes
Urban growth
Beijing
S1A IW GRDH VV 2014-2015 10m spatial resolution 31 scenes
Urban growth
Urban TEP is ...
... that meet space, time and feature dimensions of the domain
... to access and use them, ... to generate more of them
Package Upload Local test processing Deployment Processing Request Concurrent processing VM for download Browser for request submission Urban TEP portal Urban TEP processing centres
Processor development model
Systematic or on-demand processing
Urban TEP processing centres
IT4Innovations Brockmann Consult DLR
cluster (Salomon HPC) cluster (Calvalus/Hadoop) YARN scheduler virtualised env. (GeoFarm) +cluster(Calvalus/Hadoop)
OLCI, MERIS Sentinel-1 and other datasets Geoserver WPS + own backend implementation Calvalus WPS + Urban TEP config+extension
Geoserver WMS
processing GUF subsetting, Sentinel-2 timescan processing GUF and other Urban datasets fast internet access, HPC. host of portal and analysis/visualisation distributed data-local processing and concurrent aggregation systematic generation of datasets
Copernicus Data and Exploitation Platform – Deutschland National entry point to the EU Copernicus Sentinel Satellite Systems, their data products and the products of the Copernicus Services Processing facilities on the platform
EU DIAS
Confusing?
Climate Monitoring Data Climate change is a global challenge. Open climate data is crucial.
The objective of the Climate Change Initiative (CCI) is to realise the full potential of the long-term global Earth Observation archives that ESA together with its Member States have established over the last 30 years, as a significant and timely contribution to the Essential Climate Variable databases required by the United Nations Framework Convention on Climate Change (UNFCCC).
ESA Climate Change Initiative (CCI)
ESA Climate Change Initiative (CCI) 16 projects >300 scientists >100 organisations 18 countries Since 2009
7 years. X individuals X organisations X projects
Climate Monitoring Data An Overview of Climate Data Production.
Essential Climate Variables have been defined by the global science community to support the United Nations Framework Convention on Climate Change (UNFCCC). Step 1. Deciding what to actually measure.
Criteria of Essential Climate Variables (ECV)
Cost effective. Using proven technology.
Satellites can help
Rapid Measurement. Constant watch.
Step 2. Get the raw satellite data. Current data. Archived data. Planning for the future.
Example
Step 3. Process the data. Gridding, Homogenisation, Calibration & Validation, Quality. Scientific processing - application of state-of-the–art algorithms distilled from the very latest scientific reasoning.
Step 4. Distribution of Climate Data Products. “Just give me the data”.
ESA Climate Change Initiative (CCI) Managing Complexity of Climate Data Production. Open Data Challenges & Approaches
Managing Open Data Complexity
Managing Open Data Complexity
Managing Open Data Complexity
Managing Open Data Complexity
Managing Open Data Complexity
Managing Open Data Complexity
“Climate Analysis Toolbox for ESA” A software to facilitate processing and analysis of all the data products generated by the ESA Climate Change Initiative Programme (CCI).
ESA UNCLASSIFIED - For Official Use
61
Web Service (WebAPI) { RESTful }
Command-Line App (CLI)
Desktop App (GUI)
Python Core Lib (API) Python Core Lib (API)
Plugin 1 Plugin 2 Plugin 1 Plugin 2 ESA Open Data Portal and other data services
Process 1 Process 2 Process 3
Cate Desktop
62
published by CCI Open Data Portal
just subsets
▪ Temporal subset ▪ Spatial subset ▪ Variable subset
local data sources
63
64
65
66
67
workflow step
executed from Python
Command-Line Interface (CLI)
68
Python Programming & Batch Processing
Exported CLI Calls
69
Execution Scenarios
70
Cate Desktop Cate Desktop Cate Desktop
71
Processing and analysis of Earth Observation data
Managing big EO data is increasingly complex. But not just technically.
Sentinel Toolbox SNAP: step.esa.int CCI Toolbox: github.com/CCI-Tools/CCI-Tools.github.io ect-core.readthedocs.io cci-tools.github.io Earth System Datacube: earthsystemdatacube.net/