Kriging a.k.a. Gaussian Process Regression(GPR) Yubo Paul Yang, - PowerPoint PPT Presentation

Kriging a.k.a. Gaussian Process Regression(GPR) Yubo “Paul” Yang, Algorithm Interest Group, Jan. 17 2019

What is kriging? Kriging is an interpolation method. Kriging minimizes the variance of prediction error at data points. Kriging provides uncertainty estimates to its predictions. Problem: Given samples of a random scalar field, predict value at un-sampled location. Solution: Design unbiased estimator with minimum prediction error variance. 𝑧 0 = 𝒙 𝑈 𝒛 ො 𝐹[ො 𝑧 0 ] = 𝐹[𝑧 0 ] Minimize 𝑊[ො 𝑧 0 − 𝑧 0 ]

Why krige? Interpolation is generally useful. Error estimates on predictions. Gold! Photo by Sharon McCutcheon on Unsplash

How to krige? : ordinary Derive ordinary kriging equation: assume stationary mean 𝐹 𝑧 𝑗 = 𝜈, ∀𝑗 𝑂 𝑧 0 = 𝒙 𝑈 𝒛 = ෍ ො 𝑥 𝑗 𝑧 𝑗 𝑗=1 1. Design unbiased estimator 𝐹[ො 𝑧 0 ] = 𝐹 𝑧 0 = 𝜈 𝑂 𝑂 𝑂 𝑂 ෍ 𝑥 𝑗 = 1 𝐹 ො 𝑧 0 = 𝐹 ෍ 𝑥 𝑗 𝑧 𝑗 = ෍ 𝑥 𝑗 𝐹 𝑧 𝑗 = 𝜈 ෍ 𝑥 𝑗 𝑗=1 𝑗=1 𝑗=1 𝑗=1 2. Minimize prediction error variance 𝑊[ො 𝑧 0 − 𝑧 0 ] 𝑊 𝑧 0 − ො 𝑧 0 = 𝐷𝑝𝑤 𝑧 0 , 𝑧 0 − 2𝐷𝑝𝑤 𝑧 0 , ො 𝑧 0 + 𝐷𝑝𝑤[ො 𝑧 0 , ො 𝑧 0 ] 𝑂 𝑂 𝑥 𝑗 𝐷𝑝𝑤[𝑧 0 , 𝑧 𝑗 ] = 𝒙 𝑈 𝒆 𝑧 0 = 𝐷𝑝𝑤 𝑧 0 , σ 𝑗=1 𝑥 𝑗 𝑧 𝑗 = σ 𝑗=1 𝐷𝑝𝑤 𝑧 0 , ො 𝑒 𝑗 ≡ 𝐷𝑝𝑤[𝑧 0 , 𝑧 𝑗 ] 𝑧 0 = 𝒙 𝑈 𝑫 𝒙 𝐷𝑝𝑤 ො 𝑧 0 , ො 𝐷 𝑗𝑘 ≡ 𝐷𝑝𝑤[𝑧 𝑗 , 𝑧 𝑘 ] 𝑧 0 = 𝐷 00 − 2𝒙 𝑈 𝒆 + 𝒙 𝑈 𝑫 𝒙 𝑊 𝑧 0 − ො [1] Wikipedia [2] Kriging Example [3] Ordinary Kriging by MSU Ashton Shortridge

How to krige? : ordinary Ordinary kriging equation: assume stationary mean 𝐹 𝑧 𝑗 = 𝜈, ∀𝑗 𝑧 0 = 𝐷 00 − 2𝒙 𝑈 𝒆 + 𝒙 𝑈 𝑫 𝒙 minimize 𝑊 𝑧 0 − ො 𝑂 With the constaint σ 𝑗=1 𝑥 𝑗 = 1 Solve ordinary kriging equation using Lagrange multiplier 𝜇 minimize 𝑧 0 − 2𝜇(𝟐 𝑈 𝒙 − 1) 𝑊 𝑧 0 − ො 𝒙 𝑫 𝟐 𝜇 = 𝒆 𝟐 𝑈 0 1 [1] Wikipedia [2] Kriging Example [3] Ordinary Kriging by MSU Ashton Shortridge

How to krige? : ordinary Ordinary kriging equation: assume stationary mean 𝐹 𝑧 𝑗 = 𝜈, ∀𝑗 𝜇 = 𝟐 𝑈 𝑫 −1 𝒆 − 1 𝒙 𝑫 𝟐 𝜇 = 𝒆 𝒙 = 𝑫 −1 (𝒆 − 𝜇𝟐) solution 𝟐 𝑈 𝟐 𝑈 𝑫 −1 𝟐 0 1 implementation result 𝑧 0 = 𝒙 𝑈 𝒛 𝐹 ො 𝑧 0 − 𝑧 0 = 𝐷 00 − 𝒙 𝑈 𝒆 − 𝜇 𝑊 ො [1] Wikipedia [2] Kriging Example [3] Ordinary Kriging by MSU Ashton Shortridge

How to krige? : simple Simple kriging: assume 𝐹 𝑧 𝑗 = 0, ∀𝑗 ⇒ no constraint on weights 𝒙 𝑫 𝟐 𝜇 = 𝒆 𝑫𝒙 = 𝒆 becomes 𝟐 𝑈 0 1 implementation result 𝑧 0 = 𝒙 𝑈 𝒛 𝐹 ො 𝑧 0 − 𝑧 0 = 𝐷 00 − 𝒙 𝑈 𝒆 𝑊 ො

The secret sauce: correlation function The correlation function 𝐷𝑝𝑤(𝒚 1 , 𝒚 2 ) should capture covariance in data. (CM people think g( r )) 𝐷𝑝𝑤(𝒚 1 , 𝒚 2 ) is used to build the 𝑫 matrix and the 𝒆 vector. 𝐷𝑝𝑤(𝒚 1 , 𝒚 2 ) is related to the so-called variogram by 𝛿 𝒚 1 , 𝒚 2 = 𝐷𝑝𝑤 𝒚 0 , 𝒚 0 − 𝐷𝑝𝑤(𝒚 1 , 𝒚 2 ) Exponential sine squared Squared exponential 2 𝜌 𝐷𝑝𝑤 𝒚 1 , 𝒚 2 = exp − 𝒚 1 − 𝒚 2 2 𝑈 𝒚 1 − 𝒚 2 𝐷𝑝𝑤 𝒚 1 , 𝒚 2 = exp −2 sin 2𝑀 2 𝑀

The secret sauce: correlation function A kriging expert knows how to choose the correlation function form and parameters . 𝑀 =1.5 𝑀 =5.0 Squared exponential Squared exponential 𝐷𝑝𝑤 𝒚 1 , 𝒚 2 = exp − 𝒚 1 − 𝒚 2 2 𝐷𝑝𝑤 𝒚 1 , 𝒚 2 = exp − 𝒚 1 − 𝒚 2 2 2𝑀 2 2𝑀 2

Historical Review Kriging was used for time series analysis back in the 1940s. Kriging got its name from Danie G. Krige’s master thesis for predicting the location of gold deposits in South Africa in 1960. Kriging is used extensively in geostatistics and meteorology. Kriging was reformulated in the context of Baysian inference in the late 1990s. Kriging is now known as Gaussian process regression. The choice of correlation function is phrased as a machine learning problem. [1] Wikipedia [2] Chapter 2.8 RW2006

Gaussian Process Gaussian process is the generalization of multivariate distribution to infinite variables. Gaussian process is probability distribution over functions. Gaussian variable Normal distribution Gaussian vector Multivariate distribution Gaussian process [1] Chapter 2.2 RW2006

Gaussian Process Regression [1] Chris Fonnsbeck blog

Recent Applications [1] A. P. Bartok et. al., “Machine Learning a General -Purpose Interatomic Potential for Silicon,” Phys. Rev. X 8 , 041048 (2018). [2] A. Kamath et. al., “Neural networks vs Gaussian process regression for representing potential energy surfaces: A comparative study of fit quality and vibrational spectrum accuracy,” J. Chem. Phys. 148 , 241702 (2018). [3] A. Denzel and J. Kastner, “Gaussian Process Regression for Transition State Search,” J. Chem. Theory Comput., 14 (11), pp 5777-5786 (2018). [4] G. Schmitz and O. Christiansen, “Gaussian process regression to accelerate geometry optimization relying on numerical differentiation,” J. Chem. Phys. 148 , 241704 (2018).

Conclusions Kriging is a minimal-variance unbiased interpolation algorithm. Kriging result depends critically on the choice of correlation function (variogram). Kriging outputs a Gaussian process. Recently combined with Baysian inference and machine learning. Problem: Given samples of a random scalar field, predict value at un-sampled location. Solution: Design unbiased estimator with minimum prediction error variance. 𝑧 0 = 𝒙 𝑈 𝒛 ො 𝐹[ො 𝑧 0 ] = 𝐹[𝑧 0 ] Minimize 𝑊[ො 𝑧 0 − 𝑧 0 ]

Kriging a.k.a. Gaussian Process Regression(GPR) Yubo Paul Yang, - PowerPoint PPT Presentation

Kriging a.k.a. Gaussian Process Regression(GPR) Yubo Paul Yang, Algorithm Interest Group, Jan. 17 2019 What is kriging? Kriging is an interpolation method. Kriging minimizes the variance of prediction error at data points. Kriging

Estimation of a rare event probability on a complex system modeled with a kriging algorithm.

Better Simulation Metamodeling: The Why, What and How of Stochastic Kriging Jeremy Staum

An Introduction to Spatial Autocorrelation and Kriging Matt Robinson and Sebastian Dietrich RenR

Visual comparison of Moving Window Kriging models Dr Urka Demar National Centre for

MAPPING MINIMUM DAILY TEMPERATURE IN SPAIN USING KRIGING WITH EXTERNAL DRIFT Andrs CHAZARRA,

The DiceKriging and DiceOptim packages: kriging-based metamodeling and optimization for computer

Comparing and generating Latin Hypercube designs in Kriging models Giovanni Pistone, Grazia

Gaussian Process Lei Tang Arizona State University Jul. 31th, 2007 Lei Tang (ASU) Gaussian

Spatial covariance-robust minimax prediction based on experimental design ideas Gunter Spoeck

Comparing different interpolation methods Kuhnt on two-dimensional test functions Introduction

15.1 Last Lecture Want to solve a regression problem. confidence band f = argmin f 2

Reconst nstruct ruct Radio o Map with Automatic atically ally Constru tructed cted Gaussia

I ntroduction to Mobile Robotics Gaussian Processes Wolfram Burgard Cyrill Stachniss Giorgio

Kernel Methods for Regression Support Vector Regression Gaussian Mixture Regression Gaussian

Introduction to Nonparametric Bayesian Modeling and Gaussian Process Regression Piyush Rai Dept.

GP-BayesFilters Bayes Filters CSE-571 u(k-1) u(k) u(k+1)

Gaussian Processes to Speed up Hamiltonian Monte Carlo Matthieu L Murray, Iain

DD2434 - Advanced Machine Learning Gaussian Processes Carl Henrik Ek { chek } @csc.kth.se Royal

Safe model-based learning for robot control Breaking your robot is only fun in simulation Felix

NONLINEAR REGRESSION Sylvain Calinon Robot Learning & Interaction Group Idiap Research

Repeatability Simulations Definition: repeatability A (distributed) simulation program is Each

Nonlinear dynamic stochastic general equilibrium models David Schenck Senior Econometrician

The Government Revenue Dataset 2017 Toward Closer Cohesion of International Tax Statistics

Implementation Training for CDBG Projects

Kriging a.k.a. Gaussian Process Regression(GPR) Yubo Paul Yang, - PowerPoint PPT Presentation

Kriging a.k.a. Gaussian Process Regression(GPR) Yubo Paul Yang, Algorithm Interest Group, Jan. 17 2019 What is kriging? Kriging is an interpolation method. Kriging minimizes the variance of prediction error at data points. Kriging

Estimation of a rare event probability on a complex system modeled with a kriging algorithm.

Better Simulation Metamodeling: The Why, What and How of Stochastic Kriging Jeremy Staum

An Introduction to Spatial Autocorrelation and Kriging Matt Robinson and Sebastian Dietrich RenR

Visual comparison of Moving Window Kriging models Dr Urka Demar National Centre for

MAPPING MINIMUM DAILY TEMPERATURE IN SPAIN USING KRIGING WITH EXTERNAL DRIFT Andrs CHAZARRA,

The DiceKriging and DiceOptim packages: kriging-based metamodeling and optimization for computer

Comparing and generating Latin Hypercube designs in Kriging models Giovanni Pistone, Grazia

Gaussian Process Lei Tang Arizona State University Jul. 31th, 2007 Lei Tang (ASU) Gaussian

Spatial covariance-robust minimax prediction based on experimental design ideas Gunter Spoeck

Comparing different interpolation methods Kuhnt on two-dimensional test functions Introduction

15.1 Last Lecture Want to solve a regression problem. confidence band f = argmin f 2

Reconst nstruct ruct Radio o Map with Automatic atically ally Constru tructed cted Gaussia

I ntroduction to Mobile Robotics Gaussian Processes Wolfram Burgard Cyrill Stachniss Giorgio

Kernel Methods for Regression Support Vector Regression Gaussian Mixture Regression Gaussian

Introduction to Nonparametric Bayesian Modeling and Gaussian Process Regression Piyush Rai Dept.

GP-BayesFilters Bayes Filters CSE-571 u(k-1) u(k) u(k+1)

Gaussian Processes to Speed up Hamiltonian Monte Carlo Matthieu L Murray, Iain

DD2434 - Advanced Machine Learning Gaussian Processes Carl Henrik Ek { chek } @csc.kth.se Royal

Safe model-based learning for robot control Breaking your robot is only fun in simulation Felix

NONLINEAR REGRESSION Sylvain Calinon Robot Learning &amp; Interaction Group Idiap Research

Repeatability Simulations Definition: repeatability A (distributed) simulation program is Each

Nonlinear dynamic stochastic general equilibrium models David Schenck Senior Econometrician

The Government Revenue Dataset 2017 Toward Closer Cohesion of International Tax Statistics

Implementation Training for CDBG Projects

NONLINEAR REGRESSION Sylvain Calinon Robot Learning & Interaction Group Idiap Research