SLIDE 1

A Unifying Framework for Sparse Gaussian Process Approximation using Power Expectation Propagation

  • Dr. Richard E. Turner (ret26@cam.ac.uk)

Computational and Biological Learning Lab, Department of Engineering, University of Cambridge

...joint work with Thang Bui, Cuong Nguyen and Josiah Yan

1 / 22

SLIDE 2

Manfred Opper is a God

2 / 22

SLIDES 3–7

Motivation: Gaussian Process Regression

  • inputs
  • outputs

inference & learning: intractabilities, both computational and analytic

3 / 22
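The computational intractability above can be made concrete with a few lines of NumPy. This is an illustrative sketch (the RBF kernel and helper names are my choices, not from the slides): the Cholesky factorisation of the N × N Gram matrix costs O(N³), which is exactly the bottleneck sparse approximations target.

```python
import numpy as np

def rbf(X1, X2, lengthscale=1.0, variance=1.0):
    """Squared-exponential (RBF) covariance matrix."""
    d2 = np.sum(X1**2, 1)[:, None] + np.sum(X2**2, 1)[None, :] - 2.0 * X1 @ X2.T
    return variance * np.exp(-0.5 * d2 / lengthscale**2)

def gp_predict(X, y, Xs, noise=0.1):
    """Exact GP regression: posterior mean and variance at test inputs Xs.
    The Cholesky of the N x N Gram matrix is the O(N^3) step."""
    K = rbf(X, X) + noise * np.eye(len(X))
    L = np.linalg.cholesky(K)                       # O(N^3)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
    Ks = rbf(X, Xs)
    mean = Ks.T @ alpha
    v = np.linalg.solve(L, Ks)
    var = np.diag(rbf(Xs, Xs)) - np.sum(v**2, axis=0)
    return mean, var
```

With near-zero noise the posterior mean interpolates the training targets, which makes a convenient sanity check.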

SLIDES 8–16

A Brief History of Gaussian Process Approximations

Two families of methods employing pseudo-data:
  • approximate generative model, exact inference: FITC, PITC, DTC
  • exact generative model, approximate inference: VFE, EP, PP

  • FITC: Snelson et al., “Sparse Gaussian Processes using Pseudo-inputs”
  • PITC: Snelson et al., “Local and global sparse Gaussian process approximations”
  • EP: Csató and Opper, 2002; Qi et al., “Sparse-posterior Gaussian Processes for general likelihoods”
  • VFE: Titsias, “Variational Learning of Inducing Variables in Sparse Gaussian Processes”
  • DTC / PP: Seeger et al., “Fast Forward Selection to Speed Up Sparse Gaussian Process Regression”

Unifying views:
  • “A Unifying View of Sparse Approximate Gaussian Process Regression”, Quiñonero-Candela & Rasmussen, 2005 (FITC, PITC, DTC)
  • “A Unifying Framework for Sparse Gaussian Process Approximation using Power Expectation Propagation”, Bui, Yan and Turner, 2016 (VFE, EP, FITC, PITC, ...)

4 / 22

SLIDES 17–24

EP pseudo-point approximation

[Diagram: the exact joint yields the marginal likelihood and the true posterior; the true posterior is replaced by an approximate posterior, namely the posterior of a new GP regression model built on 'pseudo' data. The free parameters are the input locations of the 'pseudo' data and their outputs and covariance.]

5 / 22
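The pseudo-data idea can be sketched concretely. Below is a minimal DTC-style sparse predictor, one member of the family the talk unifies (the kernel and function names are illustrative assumptions, not the talk's own code): the posterior is that of a small GP regression model on M pseudo-inputs Z, so prediction costs O(NM²) rather than O(N³).

```python
import numpy as np

def rbf(X1, X2, lengthscale=1.0, variance=1.0):
    d2 = np.sum(X1**2, 1)[:, None] + np.sum(X2**2, 1)[None, :] - 2.0 * X1 @ X2.T
    return variance * np.exp(-0.5 * d2 / lengthscale**2)

def sparse_gp_predict(X, y, Z, Xs, noise=0.1, jitter=1e-6):
    """DTC-style sparse GP prediction with M pseudo-inputs Z.
    Only M x M matrices are factorised, giving O(N M^2) cost."""
    M = len(Z)
    Kuu = rbf(Z, Z) + jitter * np.eye(M)
    Kuf = rbf(Z, X)          # (M, N)
    Kus = rbf(Z, Xs)         # (M, S)
    # Gaussian posterior over the pseudo-outputs u
    A = Kuu + Kuf @ Kuf.T / noise
    A_inv = np.linalg.inv(A)
    mean = Kus.T @ (A_inv @ (Kuf @ y)) / noise
    # predictive variance: prior - Nystrom term + pseudo-posterior term
    Kuu_inv = np.linalg.inv(Kuu)
    var = (np.diag(rbf(Xs, Xs))
           - np.sum(Kus * (Kuu_inv @ Kus), axis=0)
           + np.sum(Kus * (A_inv @ Kus), axis=0))
    return mean, var
```

Placing the pseudo-inputs at all training inputs recovers the exact GP mean, a useful sanity check on the algebra.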

SLIDES 25–31

EP algorithm

  • 1. remove: take out one pseudo-observation likelihood (forming the cavity)
  • 2. include: add in one true observation likelihood (forming the tilted distribution)
  • 3. project: onto the approximating family (KL between unnormalised stochastic processes; a rank-1 update)
  • 4. update: the pseudo-observation likelihood

At the projection:
  • 1. minimum: moments matched at pseudo-inputs
  • 2. Gaussian regression: matches moments everywhere

6 / 22
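The four-step loop is easiest to see in a toy model where every step is analytic. The sketch below is a hypothetical stand-in, not the GP case from the slides: EP on a scalar conjugate model θ ~ N(0, v₀), yₙ ~ N(θ, v), with sites stored as natural parameters. Because everything is Gaussian, step 3's moment-matching projection is exact.

```python
import numpy as np

def ep_gaussian_mean(y, prior_var=1.0, lik_var=0.5, n_sweeps=5):
    """EP for theta ~ N(0, prior_var), y_n ~ N(theta, lik_var).
    Each site n is an approximate likelihood stored as natural
    parameters (precision tau, precision-times-mean nu)."""
    y = np.asarray(y, dtype=float)
    tau_site = np.zeros(len(y))
    nu_site = np.zeros(len(y))
    tau_prior = 1.0 / prior_var
    for _ in range(n_sweeps):
        for n in range(len(y)):
            tau_post = tau_prior + tau_site.sum()
            nu_post = nu_site.sum()
            # 1. remove one pseudo-observation likelihood -> cavity
            tau_cav = tau_post - tau_site[n]
            nu_cav = nu_post - nu_site[n]
            # 2. include the true observation likelihood -> tilted
            # 3. project: moment matching (exact for a Gaussian tilted)
            tau_tilt = tau_cav + 1.0 / lik_var
            nu_tilt = nu_cav + y[n] / lik_var
            # 4. update the site: tilted minus cavity
            tau_site[n] = tau_tilt - tau_cav
            nu_site[n] = nu_tilt - nu_cav
    tau_post = tau_prior + tau_site.sum()
    return nu_site.sum() / tau_post, 1.0 / tau_post
```

In this conjugate toy EP lands on the exact posterior; in the GP setting the projection becomes the rank-1 KL step against the pseudo-point approximation described above.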

SLIDES 32–46

Fixed points of EP = FITC approximation

[Derivation slides establishing that the fixed points of EP coincide with the FITC approximation; the two constructions are equivalent.]

Csató & Opper (2002); Qi, Abdel-Gawad & Minka (2010)

Interpretation resolves philosophical issues with FITC (increase M with N). FITC likelihood > GP likelihood ⇒ EP over-estimates the (marginal) likelihood.

10 / 22
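The FITC view can be checked numerically: FITC is exact inference in a modified generative model whose marginal covariance is the Nyström approximation with a corrected diagonal, so its log marginal likelihood is directly computable. A small sketch under assumed helper names and an RBF kernel, with the sanity check that putting pseudo-inputs at every training input recovers the exact GP marginal likelihood (it does not test the over-estimation claim, which holds for sparser pseudo-point sets):

```python
import numpy as np

def rbf(X1, X2, lengthscale=1.0, variance=1.0):
    d2 = np.sum(X1**2, 1)[:, None] + np.sum(X2**2, 1)[None, :] - 2.0 * X1 @ X2.T
    return variance * np.exp(-0.5 * d2 / lengthscale**2)

def gauss_log_ml(y, cov):
    """log N(y | 0, cov) via slogdet and a linear solve."""
    n = len(y)
    _, logdet = np.linalg.slogdet(cov)
    quad = y @ np.linalg.solve(cov, y)
    return -0.5 * (n * np.log(2.0 * np.pi) + logdet + quad)

def fitc_log_ml(X, y, Z, noise=0.1, jitter=1e-6):
    """FITC marginal likelihood: exact inference in the approximate
    model y ~ N(0, Qnn + diag(Knn - Qnn) + noise * I), where
    Qnn = Kfu Kuu^{-1} Kuf is the Nystrom approximation."""
    Kuu = rbf(Z, Z) + jitter * np.eye(len(Z))
    Kuf = rbf(Z, X)
    Knn = rbf(X, X)
    Qnn = Kuf.T @ np.linalg.solve(Kuu, Kuf)
    cov = Qnn + np.diag(np.diag(Knn - Qnn)) + noise * np.eye(len(X))
    return gauss_log_ml(y, cov)
```

Building the dense covariance here is only for clarity; practical implementations exploit the low-rank-plus-diagonal structure.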


SLIDE 48

Power EP algorithm (as tractable as EP)

  • 1. remove: take out a fraction of one pseudo-observation likelihood (forming the cavity)
  • 2. include: add in the same fraction of one true observation likelihood (forming the tilted distribution)
  • 3. project: onto the approximating family (KL between unnormalised stochastic processes; a rank-1 update)
  • 4. update: the pseudo-observation likelihood

At the projection:
  • 1. minimum: moments matched at pseudo-inputs
  • 2. Gaussian regression: matches moments everywhere

12 / 22
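The fractional updates cost nothing extra, which is the "as tractable as EP" point. A sketch on a hypothetical scalar conjugate model (θ ~ N(0, v₀), yₙ ~ N(θ, v), not the GP case): the cavity removes a fraction α of the site, the tilted distribution includes the likelihood raised to the power α, and the site update divides the change by α. In this toy the family contains the true posterior, so every α reaches the same fixed point; in sparse GPs the restricted family makes α interpolate between VFE-like and EP-like solutions.

```python
import numpy as np

def pep_gaussian_mean(y, alpha=0.5, prior_var=1.0, lik_var=0.5, n_sweeps=5):
    """Power EP for theta ~ N(0, prior_var), y_n ~ N(theta, lik_var).
    Same structure as EP, but only a fraction alpha of a site is
    removed and only the alpha-powered likelihood is included."""
    y = np.asarray(y, dtype=float)
    tau_site = np.zeros(len(y))
    nu_site = np.zeros(len(y))
    tau_prior = 1.0 / prior_var
    for _ in range(n_sweeps):
        for n in range(len(y)):
            tau_post = tau_prior + tau_site.sum()
            nu_post = nu_site.sum()
            # 1. remove a fraction alpha of the site -> cavity
            tau_cav = tau_post - alpha * tau_site[n]
            nu_cav = nu_post - alpha * nu_site[n]
            # 2./3. include p(y_n | theta)^alpha and moment match;
            # a Gaussian to the power alpha is an unnormalised
            # Gaussian with precision alpha / lik_var
            tau_tilt = tau_cav + alpha / lik_var
            nu_tilt = nu_cav + alpha * y[n] / lik_var
            # 4. update the site: divide the change by alpha
            tau_site[n] = (tau_tilt - tau_cav) / alpha
            nu_site[n] = (nu_tilt - nu_cav) / alpha
    tau_post = tau_prior + tau_site.sum()
    return nu_site.sum() / tau_post, 1.0 / tau_post
```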

SLIDE 49

Power EP: a unifying framework

  • α = 1: EP / FITC (Csató and Opper, 2002; Snelson and Ghahramani, 2005)
  • α → 0: VFE (Titsias, 2009)

13 / 22

SLIDES 50–56

Power EP: a unifying framework

  • Approximate blocks of data: structured approximations. PITC / BCM (Schwaighofer & Tresp, 2002; Snelson, 2006); VFE (Titsias, 2009)
  • Place pseudo-data in a different space: interdomain transformations (a linear transform puts the pseudo-data in the new space). Figueiras-Vidal & Lázaro-Gredilla, 2009; Tobar et al., 2015; Matthews et al., 2016

14 / 22

SLIDE 57

Power EP: a unifying framework

[Table: GP regression and GP classification methods arranged by inference scheme (PEP / VFE / EP) and by inter-domain and structured-approximation variants, placing each published method in the framework; * = optimised pseudo-inputs, ** = structured versions of VFE recover VFE.]

  • [4] Quiñonero-Candela et al., 2005
  • [5] Snelson et al., 2005
  • [6] Snelson, 2006
  • [7] Schwaighofer, 2002
  • [8] Titsias, 2009
  • [9] Csató, 2002
  • [10] Csató et al., 2002
  • [11] Seeger et al., 2003
  • [12] Naish-Guzman et al., 2007
  • [13] Qi et al., 2010
  • [14] Hensman et al., 2015
  • [15] Hernández-Lobato et al., 2016
  • [16] Matthews et al., 2016
  • [17] Figueiras-Vidal et al., 2009

15 / 22

SLIDE 58

How should I set the power parameter α?

  • 6 UCI classification datasets, 20 random splits, M = 10, 50, 100, hypers and inducing inputs optimised
  • 8 UCI regression datasets, 20 random splits, M = 0 - 200, hypers and inducing inputs optimised

[Plots: MSE rank, error rank, and log-loss rank as a function of α over [0, 1].] α = 0.5 does well on average.

16 / 22

SLIDE 59

How should I set the power parameter α?

[Per-dataset comparisons across VFE, α = 0.01 / 0.05 / 0.1 / 0.2 / 0.4 / 0.5 / 0.6 / 0.8, and EP: MSE / error rank and log-loss rank on the 6 UCI classification and 8 UCI regression datasets.] α = 0.5 does well on average; EP beats VFE in 40% of tests.

17 / 22

SLIDES 60–64

Streaming / Online Sparse Approximations

Goal: online posterior update (using the old posterior and the new data batch). Two new innovations for online learning and inducing-input optimisation.

  • 1. naïve approach: use the previous approximate posterior as the prior,
    q^(new)(f) ≈ p(y^(new) | f) · q^(old)(f)
    (new posterior ≈ new likelihood × old posterior)
  • 1. better approach: only take the likelihood terms from the old posterior,
    q^(new)(f) ≈ p(y^(new) | f) · [q^(old)(f) / p(f | θ^(old))] · p(f | θ^(new))
    (new likelihood × old likelihoods × original prior, which allows the hyperparameters to move from θ^(old) to θ^(new))
  • 2. naïve approach: use the same pseudo-points throughout,
    q^(old)(f) = p(f_{≠u} | u, θ^(old)) q(u),  q^(new)(f) = p(f_{≠u} | u, θ^(new)) q(u)
  • 2. better approach: decouple the sets of pseudo-points,
    q^(old)(f) = p(f_{≠u^(old)} | u^(old), θ^(old)) q(u^(old))
    q^(new)(f) = p(f_{≠u^(new)} | u^(new), θ^(new)) q(u^(new))

VFE is now the best Power EP method (inducing-point clumping).

18 / 22
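The "better approach" to the prior can be sketched in natural parameters for a hypothetical conjugate scalar model (θ ~ N(0, v₀), y ~ N(θ, v), standing in for the GP pseudo-point posterior): subtract the old prior from the old posterior to recover the accumulated likelihood terms, then combine them with the new batch's likelihood and the current prior. In this exact-conjugate case streaming matches the batch posterior; in the GP case the same bookkeeping runs through q(u) at the pseudo-points.

```python
import numpy as np

def streaming_posterior(batches, prior_var=1.0, lik_var=0.5):
    """Online update in natural parameters (precision tau, nu = tau*mean):
    new posterior = new likelihood * (old posterior / old prior) * prior."""
    tau_prior = 1.0 / prior_var
    tau_post, nu_post = tau_prior, 0.0   # start from the prior
    for y in batches:
        y = np.asarray(y, dtype=float)
        # old likelihoods = old posterior divided by the old prior
        tau_lik, nu_lik = tau_post - tau_prior, nu_post - 0.0
        # reinstate the prior and add the new batch's likelihood
        tau_post = tau_prior + tau_lik + len(y) / lik_var
        nu_post = nu_lik + y.sum() / lik_var
    return nu_post / tau_post, 1.0 / tau_post
```

If the prior (hyperparameters) were updated between batches, only `tau_prior` at the reinstatement step would change, which is the point of keeping the likelihood terms separate.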

SLIDE 65

Online Sparse Approximations: Regression and Classification

[Figure: posterior snapshots on a 1-D regression task and a two-input (x1, x2) classification task.]

19 / 22

SLIDE 66

Streaming / Online Sparse Approximations: Time-series Regression

[Figure: mean held-out log-likelihood against accumulated running time (seconds, 1 to 1000) for online variational, exact batch VFE, and minibatch VFE.]

20 / 22

SLIDE 67

Summary

  • Provided a unifying framework for Gaussian process approximation methods using pseudo-points via PEP.
  • FITC and PITC are EP in disguise, and they use the same approximating distribution as VFE.
  • Intermediate powers in PEP perform best on average in the batch setting (more theory and empirical work needed).
  • VFE methods perform best in the online setting.

Core material:
  • A Unifying Framework for Sparse Gaussian Process Approximation using Power Expectation Propagation, arXiv preprint, 2016
  • Streaming Sparse Gaussian Process Approximations, arXiv preprint, 2017

21 / 22

SLIDE 68

VFE is best for online inference and learning

22 / 22