
SLIDE 1

Treating geospatial complex data by compression and reduced order methods¹

Stefano De Marchi

Department of Mathematics “Tullio Levi-Civita” University of Padova (UNIPD) – Italy

Wroclaw, September 19, 2018

¹ GeoEssential ERA-PLANET team at UNIPD: F. Aiolli, S. De Marchi, W. Erb, E. Perracchione, F. Piazzon, M. Polato, M. Putti, A. Sperduti, M. Vianello.
SLIDE 2

Complexity of data

Complexity is a tougher nut to crack!

→ The complexity of the data is as much of a difficulty in making use of the data as is their size/dimension. ←

Although people understand intuitively that complexity is a real problem in data analysis, it is not always an easy notion to define. In many cases, we recognize complexity when we see it!

Complex and topological data analysis:

https://www.ayasdi.com/blog/author/gunnar-carlsson/

and the plenary by Marian Mrozek on Combinatorial topological dynamics.

SLIDE 3

Outline

1. Goals
2. Image compression
3. Time evolution and prediction by RBF-based model
4. Reduced order or model reduction methods
5. Time evolution and prediction: Machine Learning
6. Future work

SLIDE 4

Goals of the GeoEssential project

1. An efficient method for image compression, well suited for geospatial data modelling.
2. From several data sources (temperature, soil humidity, satellite images, ...), create a model to forecast the evolution of the dynamics in time and evaluate the related uncertainties.

For the first item, we have developed an efficient polynomial-based scheme, which enables us to compress images. For the second item, both Radial Basis Function (RBF)-based reduced order methods and machine learning tools are used.

SLIDE 5

Image compression

Theoretical basis [Piazzon et al. 2017]

Theorem (Discrete Caratheodory-Tchakaloff)

Let $\mu := \sum_{i=1}^{M} \lambda_i \delta_{x_i}$, $\lambda_i > 0$, $x_i \in \mathbb{R}^d$, be a discrete multivariate measure on $\mathbb{R}^d$, supported in $X = \{x_1, \dots, x_M\} \subset \mathbb{R}^d$, and let $S := \mathrm{span}\{\phi_1, \phi_2, \dots, \phi_L\}$ be a linear space of functions continuous on a compact neighborhood of $X$, with $N = \dim(S|_X) \le L$.

Then there exists a quadrature rule for $\mu$ such that, for all $f \in S|_X$,
$$\int_X f \, d\mu := \sum_{i=1}^{M} f(x_i)\,\lambda_i = \sum_{j=1}^{m} f(t_j)\,\omega_j,$$
with nodes $\{t_j\}_{j=1}^{m} \subset X$ and positive weights $\omega_j$, where $m \le N \le L$.

Obs: this is a subsampling of discrete measures.

SLIDE 6

Image compression

Computational aspect, I

The problem of finding the subspace is “suggested” by Tchakaloff's theorem (1959): choose any $c \in \mathbb{R}^M$ linearly independent w.r.t. the columns of $V^t$ ($V$ = Vandermonde-like matrix) and solve
$$\min_{\tilde\omega \ge 0} \langle c, \tilde\omega \rangle \quad \text{s.t.} \quad V^t \tilde\omega = b \;(:= V^t \lambda).$$

Obs

The feasible region is a polytope. The minimum of the objective is achieved at a vertex (i.e., sparsity).
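As a concrete illustration, here is a minimal sketch of this linear program in Python with SciPy; the point set, the monomial basis, and all sizes are our own assumptions, not the GeoEssential implementation.

```python
# Sketch (not the authors' code): Caratheodory-Tchakaloff subsampling as a
# linear program min <c, w> s.t. V^t w = b, w >= 0, solved with SciPy.
import numpy as np
from scipy.optimize import linprog

rng = np.random.default_rng(0)
M = 2000                     # number of support points of the discrete measure
X = rng.random((M, 2))       # hypothetical points x_i in R^2
lam = np.full(M, 1.0 / M)    # weights lambda_i

# Vandermonde-like matrix V (rows: points, columns: a basis of S); a plain
# monomial basis of degree 10 (N = 66 in 2D) is used only for illustration.
deg = 10
V = np.column_stack([X[:, 0]**a * X[:, 1]**b
                     for a in range(deg + 1) for b in range(deg + 1 - a)])

b = V.T @ lam                # moments b := V^t lambda
c = rng.random(M)            # any generic cost vector
res = linprog(c, A_eq=V.T, b_eq=b, bounds=(0, None), method="highs")
support = res.x > 1e-12      # vertex solution => at most N nonzero weights
print(f"kept {support.sum()} of {M} points, r = {M / support.sum():.0f}")
```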

SLIDE 7

Image compression

Computational aspect, II

The minimum problem can be solved in two (alternative) ways:

1. The simplex method (which is the standard solver), or a basis pursuit algorithm.
2. The Lawson-Hanson (non-negative least squares) algorithm for the relaxed problem
$$\min_{\tilde\omega \ge 0} \| V^t \tilde\omega - b \|_2.$$

The Lawson-Hanson algorithm finds sparse solutions ... and this is the case.

⟹ Both algorithms are thinning procedures for image compression with $r = M/m \ge M/N \gg 1$.
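A sketch of the relaxed problem via SciPy's Lawson-Hanson solver, reusing the hypothetical V and b from the previous sketch:

```python
# Sketch: the relaxed problem solved by Lawson-Hanson NNLS.
from scipy.optimize import nnls

w_ls, residual = nnls(V.T, b)     # min ||V^t w - b||_2  s.t.  w >= 0
kept = w_ls > 1e-12               # NNLS returns a sparse solution
print(f"NNLS kept {kept.sum()} points, residual {residual:.2e}")
```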

SLIDE 8

Example 1

Figure: Example of image compression. The compression factor is about 70.

SLIDE 9

Example 2

Figure: Example of image compression. The compression factor is about 100.

SLIDE 10

Example 3

Figure: Example of image compression. The compression factor is about 100.

SLIDE 11

Time evolution, prediction by RBF-based model

Notation

$X_N = \{x_i, i = 1, \dots, N\} \subseteq \Omega$: set of distinct, scattered data sites (nodes) of $\Omega \subseteq \mathbb{R}^M$;
$F_N = \{f_i = f(x_i), i = 1, \dots, N\}$: data values (or measurements), obtained by sampling some (unknown) function $f : \Omega \to \mathbb{R}$ at the nodes $x_i$.

Scattered data interpolation problem

Find a function $R : \Omega \to \mathbb{R}$ s.t. $R|_{X_N} = F_N$.

RBF interpolation: consider $\phi : [0, \infty) \to \mathbb{R}$ and form
$$R(x) = \sum_{k=1}^{N} c_k \, \phi(\| x - x_k \|_2), \quad x \in \Omega.$$

SLIDE 12

Uniqueness of the solution

The problem reduces to solving a linear system $Ac = f$, with
$$(A)_{ik} = \phi(\| x_i - x_k \|_2), \quad i, k = 1, \dots, N.$$
The problem is well-posed if $\phi$ is strictly positive definite².

Kernel notation

Let $\Phi : \mathbb{R}^M \times \mathbb{R}^M \to \mathbb{R}$ be a strictly positive definite kernel. Then $A$ becomes $(A)_{ik} = \Phi(x_i, x_k)$, $i, k = 1, \dots, N$.

² We remark that the uniqueness of the interpolant can also be ensured in the general case of strictly conditionally positive definite functions of order L, by adding a polynomial term.
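A minimal sketch of plain RBF interpolation, assuming a Gaussian kernel and a 1-D toy function of our own choosing (not the talk's data):

```python
# Sketch: RBF interpolation by solving A c = f and evaluating R(x).
import numpy as np
from scipy.spatial.distance import cdist

eps = 3.0                                   # shape parameter
phi = lambda r: np.exp(-(eps * r) ** 2)     # Gaussian RBF (strictly pos. def.)

x = np.linspace(0, 1, 25).reshape(-1, 1)    # nodes X_N
f = np.sin(2 * np.pi * x).ravel()           # sampled values F_N

A = phi(cdist(x, x))                        # (A)_ik = phi(||x_i - x_k||_2)
c = np.linalg.solve(A, f)                   # interpolation coefficients

xe = np.linspace(0, 1, 200).reshape(-1, 1)  # evaluation points
R = phi(cdist(xe, x)) @ c                   # R(x) = sum_k c_k phi(||x - x_k||)
print("max error:", np.abs(R - np.sin(2 * np.pi * xe).ravel()).max())
```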

SLIDE 13

Popular radial basis functions

In Table 1 we present several RBFs. Here $r := \| \cdot \|_2$ and $\varepsilon$ is the shape parameter.

φ(r) = e^{-(εr)²}                          Gaussian C∞              G
φ(r) = (1 + (εr)²)^{-1/2}                  Inverse MultiQuadric C∞  IMQ
φ(r) = e^{-εr}                             Matérn C0                M0
φ(r) = e^{-εr}(1 + εr)                     Matérn C2                M2
φ(r) = e^{-εr}(3 + 3εr + (εr)²)            Matérn C4                M4
φ(r) = (1 - εr)²₊                          Wendland C0              W0
φ(r) = (1 - εr)⁴₊ (4εr + 1)                Wendland C2              W2
φ(r) = (1 - εr)⁶₊ (35(εr)² + 18εr + 3)     Wendland C4              W4

Table: the most popular RBFs
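For experimentation, the table can be encoded as a small dictionary of callables; a sketch in NumPy (the naming is ours):

```python
# Sketch: the RBFs of the table as Python callables (r >= 0, eps > 0).
import numpy as np

def rbf_table(eps):
    pos = lambda u: np.maximum(u, 0.0)  # truncation (.)_+ for Wendland kernels
    return {
        "G":   lambda r: np.exp(-(eps * r) ** 2),
        "IMQ": lambda r: (1 + (eps * r) ** 2) ** -0.5,
        "M0":  lambda r: np.exp(-eps * r),
        "M2":  lambda r: np.exp(-eps * r) * (1 + eps * r),
        "M4":  lambda r: np.exp(-eps * r) * (3 + 3 * eps * r + (eps * r) ** 2),
        "W0":  lambda r: pos(1 - eps * r) ** 2,
        "W2":  lambda r: pos(1 - eps * r) ** 4 * (4 * eps * r + 1),
        "W4":  lambda r: pos(1 - eps * r) ** 6
                         * (35 * (eps * r) ** 2 + 18 * eps * r + 3),
    }
```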

SLIDE 14

Model Reduction methods

Motivation

Smaller model dimension, reduced requirements.
Similar precision, error control.
Automatic reduction, not “manual”.
Applications: parametric PDEs, ODEs, adaptive grids, parallel computing and HPC, ...

Reference: www.haasdonk.de/data/drwa2018, a tutorial given at the Dolomites Research Week on Approximation 2018, Canazei (I), 10-14/9/2018.

SLIDE 15

Reduced order methods

Definition

The problem can be visualized in a nested diagram. Given a set of N data, the aim is to find a suitable (reduced) subspace, spanned by m ≪ N (functions and) centers.

Figure: Communication diagram for macro- and microscale models.

SLIDE 16

Point selection procedure

Greedy-based approach [Haasdonk, Santin 2017]

Consider a function $f : \Omega \to \mathbb{R}$ and denote by $R$ its RBF interpolant on the centers $X_N$. The procedure can be summarized as follows:

start: $X_0 = \emptyset$;
for $k \ge 1$: determine
$$x_k = \arg\max_{x \in X_N \setminus X_{k-1}} \frac{|f(x) - R(x)|}{P_{X_N,\phi}(x)},$$
and form $X_k = X_{k-1} \cup \{x_k\}$;
repeat: continue until a suitable maximal subspace of $m$ terms, $m \ll N$, is found.

$P_{X_N,\phi}$: power function.
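A sketch of the selection loop in Python. For brevity this simplified version omits the power-function denominator (i.e., it is a plain residual-based greedy, not the full scheme above), and the kernel `phi` is any callable such as those in the table; none of this is the authors' code.

```python
# Sketch: greedy center selection by largest interpolation residual.
import numpy as np
from scipy.spatial.distance import cdist

def greedy_centers(X, f_vals, phi, m, tol=1e-8):
    """Pick at most m centers from the rows of X by largest |f - R|."""
    idx = [int(np.argmax(np.abs(f_vals)))]   # X_1: point of largest |f|
    while len(idx) < m:
        A = phi(cdist(X[idx], X[idx]))       # kernel matrix on current centers
        c = np.linalg.solve(A, f_vals[idx])  # interpolant coefficients
        R = phi(cdist(X, X[idx])) @ c        # evaluate R on all of X_N
        res = np.abs(f_vals - R)
        res[idx] = 0.0                       # never re-select a center
        k = int(np.argmax(res))
        if res[k] < tol:                     # residual small enough: stop
            break
        idx.append(k)
    return idx

# usage sketch (x, f, rbf_table as in the previous sketches):
# idx = greedy_centers(x, f, rbf_table(3.0)["G"], m=10)
```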

SLIDE 17

Filtering by Ensemble Kalman filter (EnKF)

The EnKF works for non-linear models, takes into account the unavoidable uncertainty (noise) in the measurements, and enables us to get an estimate for the next step, say $t_{k+1}$.
The EnKF is a generalization of the well-known Extended KF.
When the dynamics is linear, the Kalman filter provides an optimal estimate of the state, while the EnKF for non-linear models is suboptimal.
More details in [GeoEssential Report 1, Sept. 2018].
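A minimal sketch of one EnKF forecast/analysis step (the stochastic, perturbed-observations variant); the model, the observation operator `H`, and the noise level `r_obs` are illustrative assumptions, not the GeoEssential setup.

```python
# Sketch: one EnKF step on an (n_members, n_state) ensemble.
import numpy as np

def enkf_step(ensemble, model, H, y, r_obs, rng):
    """Forecast each member, then update it with a perturbed observation y."""
    # forecast: propagate each member through the (possibly non-linear) model
    Ef = np.array([model(x) for x in ensemble])
    n = Ef.shape[0]
    Xf = Ef - Ef.mean(axis=0)                    # forecast anomalies
    Yf = Ef @ H.T - (Ef @ H.T).mean(axis=0)      # anomalies in observation space
    # Kalman gain from ensemble covariances
    p = H.shape[0]
    K = (Xf.T @ Yf / (n - 1)) @ np.linalg.inv(Yf.T @ Yf / (n - 1)
                                              + r_obs * np.eye(p))
    # analysis: each member assimilates a perturbed copy of the observation
    y_pert = y + rng.normal(0.0, np.sqrt(r_obs), size=(n, p))
    return Ef + (y_pert - Ef @ H.T) @ K.T
```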

SLIDE 18

VOSS

As data set for the current study of time series, we consider the data collected in the South-Eastern part of the Veneto Region and available at

http://voss.dmsa.unipd.it/.

This data set was created for an experimental study of the organic soil compaction and prediction of the land subsidence related to climate changes in the south-eastern area of the Venice Lagoon catchment (VOSS - Venice Organic Soil Subsidence). The data were collected with the contribution of the University of Padova (UNIPD) from 2001 to 2006.

SLIDE 19

Example I

RBF: Matérn M6.

Figure: Graphical results via RBF-reduced order methods coupled with the EnKF. Left: temperature data; right: potentiometer samples. (Plot legend: data, reduced bases, model, prevision.)

Figure: Accuracy for RBF-reduced order methods and Kalman filter. B indicates the index of the last basis extracted.

SLIDE 20

Example II

RBF: Matérn M6.

Figure: The RBF-reduced order methods and Kalman filter applied iteratively to the temperature data set. The figure shows the progress of the algorithm for two different time steps (i.e., data assimilation). (Plot legend: data, reduced bases, prevision.)

SLIDE 21

Support Vector Machine (SVM)

Kernel-based methods are among the most used machine learning approaches, and the Support Vector Machine (SVM) is the most famous and successful kernel method! The basic idea behind this kind of scheme is the so-called kernel trick, which allows one to implicitly compute vector similarities (defined in terms of dot products) for classification.

SLIDE 22

Time evolution-prediction by ML

Learning with kernels [Schölkopf and Smola 2001]

In ML, kernels are defined as $\Phi(x, y) = \langle \phi(x), \phi(y) \rangle$, where the feature map $\phi : \Omega \to H$ maps the vectors $x, y$ to a (higher dimensional) feature (or embedding) space $H$ [Shawe-Taylor and Cristianini 2004]. The main idea consists in using kernels to project data points into a higher dimensional space where the task is “easier” (for example, linear): the “kernel trick”.

Figure: “Kernel trick”: binary classification via a feature map $\phi : \mathbb{R}^2 \to \mathbb{R}^3$.
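A tiny numerical check of the kernel trick for the degree-2 polynomial feature map (our illustrative choice, not the talk's kernel): the dot product in feature space equals a simple function of the dot product in input space.

```python
# Sketch: kernel trick for phi(x) = (x1^2, sqrt(2) x1 x2, x2^2),
# for which <phi(x), phi(y)> = <x, y>^2.
import numpy as np

def phi(x):                      # explicit feature map R^2 -> R^3
    return np.array([x[0]**2, np.sqrt(2) * x[0] * x[1], x[1]**2])

x, y = np.array([1.0, 2.0]), np.array([3.0, 0.5])
lhs = phi(x) @ phi(y)            # dot product in feature space
rhs = (x @ y) ** 2               # same value, computed implicitly in R^2
print(lhs, rhs)                  # both equal 16.0
```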

SLIDE 23

Regression with SVM: the SVR machine

When considering a regression problem, the idea is to define an ε-tube: predictions with absolute error > ε are linearly penalized; otherwise they are not counted as errors.

Figure: Graphical illustration of the SVR ε-tube. Data points inside the tube do not contribute to the loss, while points outside the tube have a cost proportional to their distance from the tube.

SLIDE 24

SVR model problem

Given the training set $\{x_i, y_i\}_{i=1}^{n}$ (the $x_i$ are the support vectors), the ε-tube idea leads to the following primal problem for the linear model $R(x) = x^\top w + b$. To determine $w$ and $b$, we solve the constrained optimization problem with regularization parameter $C$ and slack variables $\xi_i, \xi_i^*$, $i = 1, \dots, N$:
$$\min_{w, b, \xi, \xi^*} \; \frac{1}{2} w^\top w + C \sum_{i=1}^{N} (\xi_i + \xi_i^*),$$
subject to
$$R(x_i) - f_i \le \varepsilon + \xi_i, \qquad f_i - R(x_i) \le \varepsilon + \xi_i^*, \qquad \xi_i, \xi_i^* \ge 0, \qquad i = 1, \dots, N.$$
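In practice this problem is solved by off-the-shelf tools; a usage sketch with scikit-learn's SVR on toy data of our own making (the values of C, ε and the kernel width are illustrative, not the talk's settings):

```python
# Sketch: epsilon-SVR on a noisy 1-D toy problem.
import numpy as np
from sklearn.svm import SVR

rng = np.random.default_rng(0)
X = np.sort(rng.uniform(0, 1, 200)).reshape(-1, 1)
y = np.sin(2 * np.pi * X).ravel() + rng.normal(0, 0.05, 200)  # noisy samples

model = SVR(kernel="rbf", C=10.0, epsilon=0.1, gamma=20.0)
model.fit(X, y)
print("support vectors:", model.support_vectors_.shape[0], "of", len(X))
print("prediction at x=0.25:", model.predict([[0.25]])[0])
```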

SLIDE 25

Remarks

C ≥ 0 represents the so-called trade-off parameter, and it is indeed a smoothing parameter. The parameter ε ≥ 0 indicates the width of the tube into which the samples may fall without being counted as errors.

SLIDE 26

Data pre-processing: Multi SVR, sliding window

Take $t_1, \dots, t_n$ and a window size $k \in \mathbb{N}$, $k > 0$. Determine the $n - k + 1$ training vectors $t_j = (t_j, \dots, t_{j+k-1})^\top$, for $j = 1, \dots, n - k + 1$. The predicted value, with associated target value $f_j$, is $t_{j+k}$. In general, if we want to build a model for predicting the data point $\Delta t$ steps in the future, the target value associated with $t_j$ will be $f_j = t_{j+k+\Delta t-1}$. A sketch of this pre-processing step is given below.

Figure: Illustration of the sliding window mechanism (e.g., $t = (t_2, \dots, t_{k+1})^\top \in \mathbb{R}^k$).
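A sketch of the window construction in Python (0-based indexing; the toy series is illustrative):

```python
# Sketch: building the sliding-window training set for (M)SVR.
import numpy as np

def sliding_window(t, k, dt=1):
    """Windows t_j..t_{j+k-1} with targets t_{j+k+dt-1} (dt steps ahead)."""
    n = len(t)
    m = n - k - dt + 1                    # number of usable windows
    X = np.stack([t[j:j + k] for j in range(m)])
    y = t[k + dt - 1: k + dt - 1 + m]     # f_j = t_{j+k+dt-1}
    return X, y

t = np.arange(10.0)                       # toy series t_1, ..., t_n
X, y = sliding_window(t, k=3, dt=1)
print(X[0], y[0])                         # [0. 1. 2.] 3.0
```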

SLIDE 27

An experiment

Figure: Prediction using a single SVR model, with RMSE = 0.04.

Figure: Prediction using 24 SVR models, with RMSE = 0.14.

Note: the RBF used is the Gaussian.

SLIDE 28

Another experiment

The total number of data points is 16114, of which 12 are not considered in the training phase since they are the points we want to forecast. Of the 16102 training points, only 14637 are valued; the remaining ones are missing. Using RBF we extract 393 bases.

Figure: (a) Temperature raw data points (t vs. °C); (b) extracted bases.

SLIDE 29

SVR

Figure: SVR with reduced (left) and classical (right) training for environmental data (t vs. °C).

SLIDE 30

MSVR

Figure: MSVR, with a 48h timing window, with reduced (left) and classical (right) training for environmental data (t vs. °C).

Obs: due to the “moderate” smoothness of the data, SVR performs slightly better than MSVR.

SLIDE 31

Financial example

The data come from http://tsetmc.ir (Tehran Securities Exchange Technology Management Co.): the daily closing price of a stock named Behran Oil. Period: 16/04/2001 to 01/04/2018, for a total of 4172 data points, of which only 3369 are valued. The prediction time window is the last 10 data points. From the training data we extract 1471 bases using the RBF algorithm. For MSVR we employ a 30-weekday window. (In this case, since financial data vary quickly and without any real trend, a larger sliding window would not be useful.)

SLIDE 32

Graphs

Figure: Left: financial market raw data points (t vs. Rial). Right: the extracted bases and the RB model.

SLIDE 33

To do

Apply the compression algorithm to satellite images.
Use fewer kernel predictors and integrate with interpolation.
Use a data generation approach to get smoother functions, and apply SVR on top of that.
Use this approach to mix data from observations with data from physically-based model simulations.

SLIDE 34

Main references

1. Image compression [Piazzon et al. 2017].
2. RBF-based reduced order methods (greedy approach) [Wirtz et al. 2015].
3. Learning with kernels [Fasshauer & McCourt 2015].

F. Piazzon, A. Sommariva, M. Vianello, Caratheodory-Tchakaloff Subsampling, Dolom. Res. Notes Approx. 10 (2017), pp. 5-14.
D. Wirtz, N. Karajan, B. Haasdonk, Surrogate modelling of multiscale models using kernel methods, Int. J. Numer. Meth. Eng. 101 (2015), pp. 1-28.
G.E. Fasshauer, M.J. McCourt, Kernel-based Approximation Methods Using Matlab, World Scientific, Singapore, 2015.

SLIDE 35

Thank you, Grazie, Dziękuję

Rete ITaliana di Approssimazione Italian Network on Approximation https://sites.google.com/site/italianapproximationnetwork/
