SLIDE 1

OED for KRR, and the MVCE

Gaussian Processes for Active Sensor Management

Alexander N. Dolia, University of Southampton

This poster is based on:

  • A.N. Dolia, C.J. Harris, J. Shawe-Taylor, D.M. Titterington, "Kernel Ellipsoidal Trimming", submitted to the Special Issue of the Journal Computational Statistics and Data Analysis on Machine Learning and Robust Data Mining, under review.
  • A.N. Dolia, T. De Bie, C.J. Harris, J. Shawe-Taylor, D.M. Titterington, "Optimal experimental design for kernel ridge regression, and the minimum volume covering ellipsoid", Workshop on Optimal Experimental Design, Southampton, 22-26 September 2006.

Joint work with:

  • Dr. Tijl De Bie, Katholieke Universiteit Leuven
  • Prof. John Shawe-Taylor, University of Southampton
  • Prof. Chris Harris, University of Southampton
  • Prof. Mike Titterington, University of Glasgow
SLIDE 2

Problem Statement

[Figure: sensor network with sensors at locations $x_1, x_2, x_3, \ldots$]

The aim is to estimate the sensor locations and the number of repetitions, given a set of possible sensor locations, the cost of measurements, and an upper bound on the number of repetitions at each sensor location, in order to obtain a good prediction $f(x)$.

  • Sensor network: $N$ sensors measure signals at positions $x_i$
  • Sensors measure the function $y_i = f(x_i) = x_i^\top w + n_i$
  • The weight vector $w$ gives information about the 'system'
  • Position the sensors optimally at $X_D$
  • Estimate $w$ based on $X_D$
SLIDE 3

Optimal experiment design?

[Figure: measurement model $y_i = x_i^\top w + n_i$]

Optimal experiment design (OED) idea:

  • Given a set of $n$ data points $X = \{x_i\}$
  • Choose a multiset $X_D = \{x_{D,i}\} \subseteq X$ with $N$ data points, $N_i$ times $x_i$
  • Measure at $x_{D,i} \rightarrow y_D = \{y_{D,i}\}$ with $y_{D,i} = x_{D,i}^\top w + n_i$
  • Estimate $w$ based on $\{X_D, y_D\} \rightarrow \hat{w}$
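The measurement model above can be simulated in a few lines. A minimal numpy sketch (all names, sizes, and noise levels are hypothetical): draw a weight vector $w$, generate noisy measurements $y_{D,i} = x_{D,i}^\top w + n_i$, and recover $w$ by least squares.

```python
import numpy as np

rng = np.random.default_rng(4)
n, d, sigma = 100, 3, 0.1
w_true = rng.standard_normal(d)                       # unknown 'system' weight vector w
XD = rng.standard_normal((n, d))                      # chosen measurement locations x_{D,i}
yD = XD @ w_true + sigma * rng.standard_normal(n)     # y_{D,i} = x_{D,i}' w + n_i

# Estimate w from {X_D, y_D} by least squares.
w_hat, *_ = np.linalg.lstsq(XD, yD, rcond=None)

assert np.linalg.norm(w_hat - w_true) < 0.1           # close to the true weights
```

OED then asks which rows of $X_D$ (and how many repetitions of each) make this estimate as accurate as possible.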

SLIDE 4

Optimal experiment design for RR

  • The result is thus a non-convex optimization problem:

$$\min_\alpha\; -\log\det\!\left(\sum_i \alpha_i x_i x_i^\top + \gamma I + \tfrac{1}{4}\gamma^2 \Big(\sum_i \alpha_i x_i x_i^\top\Big)^{-1}\right) \quad \text{s.t.}\;\; \alpha^\top e = 1,\; \alpha \ge 0$$

  • Minimize a tight upper bound:

$$\alpha^*_\gamma = \arg\min_\alpha\; -\log\det\!\left(\sum_i \alpha_i x_i x_i^\top + \gamma I\right) \quad \text{s.t.}\;\; \alpha^\top e = 1,\; \alpha \ge 0$$

  • This is a convex optimization problem again
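The convex relaxation can be attacked with a multiplicative reweighting of the kind studied by Titterington for D-optimal design. A minimal numpy sketch (not necessarily the authors' algorithm; all names and constants are hypothetical, and a general-purpose convex solver would do equally well): each candidate point's weight is rescaled by its score $x_i^\top M(\alpha)^{-1} x_i$, which keeps $\alpha$ on the simplex.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d, gamma = 50, 3, 0.1
X = rng.standard_normal((n, d))        # candidate sensor locations x_i (rows)
alpha = np.full(n, 1.0 / n)            # start from the uniform design

def design_matrix(alpha):
    # M(alpha) = sum_i alpha_i x_i x_i' + gamma I
    return X.T @ (alpha[:, None] * X) + gamma * np.eye(d)

def objective(alpha):
    # -log det M(alpha), the quantity being minimized
    return -np.linalg.slogdet(design_matrix(alpha))[1]

obj0 = objective(alpha)
for _ in range(500):
    Minv = np.linalg.inv(design_matrix(alpha))
    s = np.einsum('ij,jk,ik->i', X, Minv, X)   # scores x_i' M^{-1} x_i
    alpha = alpha * s / (alpha @ s)            # renormalizes: alpha stays on the simplex

assert abs(alpha.sum() - 1.0) < 1e-8
assert objective(alpha) < obj0                 # the design improved over uniform
```

In line with the "sparse solution" point in the conclusions, most weights shrink towards zero, concentrating the measurement budget on a few informative locations.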
SLIDE 5

Regularized MVCE

  • What about the dual of the regularized D-OED?

$$\min_{M,\mu}\; \log\det(M) + \mu + \gamma\,\operatorname{trace}(M^{-1}) \quad \text{s.t.}\;\; x_i^\top M^{-1} x_i \le \mu$$

  • The optimum is given by

$$M^*_\gamma = \sum_i \alpha^*_{\gamma,i}\, x_i x_i^\top + \gamma I,$$

where $\alpha^*_\gamma$ is the solution of the regularized D-OED problem.

  • Interpretation: $\operatorname{trace}(M^{-1}) = \sum_i \tfrac{1}{\lambda_i}$ → fit an ellipsoid, but make sure none of the eigenvalues of $M^*_\gamma$ is too small...
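The eigenvalue interpretation is easy to check numerically. A small sketch (stand-in simplex weights replace the optimal $\alpha^*_\gamma$, which would require solving the D-OED problem): since $\sum_i \alpha_i x_i x_i^\top$ is positive semidefinite, adding $\gamma I$ floors every eigenvalue of $M^*_\gamma$ at $\gamma$, and $\operatorname{trace}(M^{-1})$ is exactly $\sum_i 1/\lambda_i$.

```python
import numpy as np

rng = np.random.default_rng(1)
n, d, gamma = 30, 4, 0.2
X = rng.standard_normal((n, d))

# Stand-in simplex weights (on the poster these are the optimal alpha*_gamma).
alpha = rng.dirichlet(np.ones(n))

M = X.T @ (alpha[:, None] * X) + gamma * np.eye(d)   # M*_gamma
lam = np.linalg.eigvalsh(M)

# gamma floors every eigenvalue: no axis of the ellipsoid can collapse.
assert lam.min() >= gamma - 1e-8

# trace(M^{-1}) = sum_i 1/lambda_i, the quantity the regularizer penalizes.
assert np.isclose(np.trace(np.linalg.inv(M)), np.sum(1.0 / lam))
```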

SLIDE 6

Kernel ridge regression (KRR)

[Diagram: least squares → ridge regression → kernel RR]

  • Kernel ridge regression (KRR):

$$K_D = X_D X_D^\top, \qquad \beta = (K_D + \tilde{\gamma} I)^{-1} y$$

$$\hat{w}_{RR} = X_D^\top \beta = \sum_i \beta_i x_{D,i}, \qquad f(x) = x^\top \hat{w}_{RR} = \sum_i \beta_i\, x^\top x_{D,i} = \sum_i \beta_i\, k(x, x_{D,i})$$

  • Everything is expressed in terms of $K_D$ (i.e. in terms of inner products/kernels): the 'kernel trick'
  • If we want to do OED for KRR, we need to write it entirely in terms of kernel evaluations/inner products. Can we?
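The kernel-trick claim can be verified numerically for the linear kernel: the dual coefficients $\beta$ yield the same weight vector and predictions as primal ridge regression, while using only $K_D$ and kernel evaluations $k(x, x_{D,i}) = x^\top x_{D,i}$. A minimal sketch (names hypothetical):

```python
import numpy as np

rng = np.random.default_rng(2)
N, d, gam = 20, 5, 0.5
XD = rng.standard_normal((N, d))                  # design points x_{D,i}
y = rng.standard_normal(N)                        # measured responses

# Dual / kernel form: K_D = X_D X_D', beta = (K_D + gam I)^{-1} y
KD = XD @ XD.T
beta = np.linalg.solve(KD + gam * np.eye(N), y)
w_dual = XD.T @ beta                              # w_RR = sum_i beta_i x_{D,i}

# Primal ridge regression: w = (X_D' X_D + gam I)^{-1} X_D' y
w_primal = np.linalg.solve(XD.T @ XD + gam * np.eye(d), XD.T @ y)

assert np.allclose(w_dual, w_primal)

# Prediction uses only kernel evaluations k(x, x_{D,i}) = x' x_{D,i}
x = rng.standard_normal(d)
f_kernel = beta @ (XD @ x)
assert np.isclose(f_kernel, x @ w_primal)
```

The equivalence rests on the identity $(X_D^\top X_D + \gamma I)^{-1} X_D^\top = X_D^\top (X_D X_D^\top + \gamma I)^{-1}$.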

SLIDE 7

Kernel MVCE

  • Can the Mahalanobis distances $x^\top\big(\sum_i \alpha^*_{\gamma,i} x_i x_i^\top + \gamma I\big)^{-1} x$ be written in terms of inner products/kernel evaluations?
  • Let $AKA = V \Lambda V^\top$ (eigenvalue decomposition); then (derivation not shown...):

$$x^\top\Big(\sum_i \alpha^*_{\gamma,i} x_i x_i^\top + \gamma I\Big)^{-1} x \;=\; \frac{1}{\gamma}\Big(x^\top x \;-\; x^\top X^\top A\, V (\Lambda + \gamma I)^{-1} V^\top A\, X x\Big)$$

  • Express this in terms of $k(x,x) = x^\top x$ and $k = Xx$; then

$$x^\top\Big(\sum_i \alpha^*_{\gamma,i} x_i x_i^\top + \gamma I\Big)^{-1} x \;=\; \frac{1}{\gamma}\Big(k(x,x) \;-\; k^\top A\, V (\Lambda + \gamma I)^{-1} V^\top A\, k\Big),$$

completely expressed in terms of kernels.
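The kernelised distance can be checked against the direct input-space computation. A sketch under stated assumptions: it takes $A = \operatorname{diag}(a)$ with $\alpha_i = a_i^2$ (matching the kernel problem's constraint $a^\top a \le 1$ on the summary slide) and uses arbitrary weights in place of the optimal $\alpha^*_\gamma$; both sides agree exactly by the Woodbury identity.

```python
import numpy as np

rng = np.random.default_rng(3)
n, d, gamma = 15, 4, 0.3
X = rng.standard_normal((n, d))
a = rng.random(n)
a /= np.linalg.norm(a)                 # a' a = 1, so alpha_i = a_i^2 lies on the simplex
A = np.diag(a)

# Direct computation: x' (sum_i a_i^2 x_i x_i' + gamma I)^{-1} x
M = X.T @ ((a**2)[:, None] * X) + gamma * np.eye(d)
x = rng.standard_normal(d)
direct = x @ np.linalg.solve(M, x)

# Kernel computation: only k(x,x) = x'x and k = Xx are needed.
K = X @ X.T
lam, V = np.linalg.eigh(A @ K @ A)     # AKA = V Lambda V'
k = X @ x
z = V.T @ (A @ k)
kernelised = (x @ x - z @ (z / (lam + gamma))) / gamma

assert np.isclose(direct, kernelised)
```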


SLIDE 8

OED: summary

|             | D-OED | MVCE |
|-------------|-------|------|
| standard    | $\min_\alpha -\log\det\big(\sum_i \alpha_i x_i x_i^\top\big)$ s.t. $\alpha^\top \mathbf{1} = 1,\ \alpha \ge 0$ | $\min_{M,\mu} \log\det(M) + \mu$ s.t. $x_i^\top M^{-1} x_i \le \mu$ |
| regularized | $\min_\alpha -\log\det\big(\sum_i \alpha_i x_i x_i^\top + \gamma I\big)$ s.t. $\alpha^\top \mathbf{1} = 1,\ \alpha \ge 0$ | $\min_{M,\mu} \log\det(M) + \mu + \gamma\,\operatorname{trace}(M^{-1})$ s.t. $x_i^\top M^{-1} x_i \le \mu$ |
| kernel      | $\min_a -\log\det(AKA + \gamma I)$ s.t. $a^\top a \le 1,\ a \ge 0$ | |

SLIDE 9

Experiment

SLIDE 10

Generalised D-optimal Experimental Design

SLIDE 11

Conclusions

  • Two seemingly very different algorithms within one optimization framework
  • A way to perform optimal experimental design in high-dimensional spaces, such as kernel-induced feature spaces
  • A way to perform minimum volume covering ellipsoid estimation in high-dimensional spaces, to perform novelty detection
  • Nice features: convex optimisation and a sparse solution