Scalable Training of Inference Networks for Gaussian-Process Models


SLIDE 1

Scalable Training of Inference Networks for Gaussian-Process Models

Jiaxin Shi

Tsinghua University

Joint work with Mohammad Emtiyaz Khan and Jun Zhu

SLIDE 2

Gaussian Process

  • mean function, covariance function / kernel
  • Sparse variational GP [Titsias, 09; Hensman et al., 13]: approximate the posterior Gaussian field through M inducing points
  • Posterior inference at reduced O(NM^2) complexity; closed-form solutions require conjugate likelihoods
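
For reference, the setup behind this slide's equations (a sketch in my own LaTeX notation, not copied from the deck):

% GP prior with mean function m and covariance function / kernel k
f \sim \mathcal{GP}\big(m(x),\, k(x, x')\big)
% sparse variational posterior with M inducing points Z, u = f(Z)
q(f) = \int p(f \mid u)\, q(u)\, du, \qquad q(u) = \mathcal{N}(\mu, S)
% cost: O(N M^2) per step instead of O(N^3) for exact inference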

SLIDE 3

Inference Networks for GP Models

  • Remove the sparse (inducing-point) assumption
[Figure: an inference network maps the data (inputs, observations) to a Gaussian field used for prediction]


SLIDE 5

Examples of Inference Networks

weight space vs. function space

  • Bayesian neural networks:

○ intractable output density
[Figure: a random-feature network with sin/cos activations, frequencies (s), and weights (w)]

[Sun et al., 19]

  • Inference network architectures can be derived from the weight-space posterior:

○ random feature expansions
○ deep neural nets

[Cutajar et al., 18]
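
To make the sin/cos diagram concrete, the standard random Fourier feature view it depicts (a sketch; the symbols J, s_j, w_j are my naming):

% a stationary GP with kernel variance sigma^2 is approximated by a
% Bayesian linear model over J random features, with frequencies
% s_j ~ spectral density of k and weights w_j ~ N(0, 1)
f(x) \approx \sqrt{\sigma^2 / J} \sum_{j=1}^{J}
    \big[ w_j^{\cos} \cos(s_j^\top x) + w_j^{\sin} \sin(s_j^\top x) \big]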

SLIDE 6

Minibatch Training is Difficult

Functional Variational Bayesian Neural Networks (Sun et al., 19)

  • Full-batch fELBO: matches the variational and the true posterior process at arbitrary finite sets of measurement points
  • Practical fELBO: estimated on minibatches
○ this objective performs an improper minibatch approximation of the KL divergence term
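
For context, the fELBO in rough form (my notation, assuming X^M denotes the measurement points; see Sun et al., 19 for the exact statement):

% full-batch fELBO: likelihood over all N points, functional KL at X^M
\mathcal{L}(q) = \mathbb{E}_q\Big[\sum_{i=1}^{N} \log p(y_i \mid f(x_i))\Big]
    - \mathrm{KL}\big[\, q(f^{X^M}) \,\|\, p(f^{X^M}) \,\big]
% subsampling the likelihood sum is unbiased, but the functional KL term
% has no unbiased minibatch estimator, hence the "improper minibatch" issue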

SLIDE 7

Scalable Training of Inference Networks for GP Models

Stochastic, functional mirror descent

  • work with the functional density directly

○ natural gradient in the density space
○ minibatch approximation with a stochastic functional gradient

  • closed-form solution as an adaptive Bayesian filter

[Dai et al., 16; Cheng & Boots, 16]
[Figure: upon seeing the next data point, the adapted prior is updated by Bayes' rule]

  • sequentially applying Bayes' rule is the most "natural" gradient step

○ in conjugate models: equivalent to natural gradient for exponential families

[Raskutti & Mukherjee, 13; Khan & Lin, 17]
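
A sketch of the mirror-descent step these bullets describe (my notation: step size beta_t, minibatch B of size |B|, N data points in total):

% one step of stochastic functional mirror descent: a Bayes update of the
% "adapted prior" with the minibatch likelihood tempered by beta_t N / |B|
q_{t+1}(f) \propto \underbrace{q_t(f)^{1 - \beta_t}\, p(f)^{\beta_t}}_{\text{adapted prior}}
    \prod_{i \in B} p\big(y_i \mid f(x_i)\big)^{\beta_t N / |B|}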

SLIDE 8

Scalable Training of Inference Networks for GP Models

Minibatch training of inference networks

[Figure: teacher-student setup: the mirror-descent update provides the teacher; the inference network is the student trained to match it]

  • an idea from filtering: bootstrap

○ similar idea: temporal difference (TD) learning with function approximation
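
In symbols, the bootstrapped objective might be written as follows (a sketch; the divergence D and the teacher \hat q_{t+1} from the mirror-descent step above are my naming):

% the student network q_gamma is fit to the teacher's marginals at random
% measurement points X^M, bootstrapping from the current estimate
\gamma_{t+1} = \arg\min_{\gamma}\; \mathbb{E}_{X^M}\,
    D\big[\, \hat q_{t+1}(f^{X^M}),\; q_{\gamma}(f^{X^M}) \,\big]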

SLIDE 9

Scalable Training of Inference Networks for GP Models

Minibatch training of inference networks

  • (Gaussian likelihood case) closed-form marginals of the teacher at the measurement locations

○ equivalent to GP regression

  • (Non-conjugate case) optimize an upper bound of the KL divergence to the teacher
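
To illustrate the Gaussian-likelihood case, a self-contained toy sketch of the closed-form teacher update at a fixed grid of measurement points (all names, and the simplification that the batch lies on the grid, are my assumptions, not the authors' code):

import numpy as np

def rbf(x1, x2, ls=0.5, var=1.0):
    """RBF kernel matrix between two sets of 1-D points."""
    d = x1[:, None] - x2[None, :]
    return var * np.exp(-0.5 * (d / ls) ** 2)

rng = np.random.default_rng(0)
X = np.linspace(-1, 1, 30)              # measurement points
N, B, sigma2, beta = 100, 10, 0.1, 0.3  # data size, batch size, noise, step size

# Prior p and current approximation q_t at X, in natural parameters
# (precision Lam, precision-times-mean eta); q_t starts at the prior.
K = rbf(X, X) + 1e-6 * np.eye(len(X))
Lam_p = np.linalg.inv(K)
eta_p = np.zeros(len(X))
Lam_t, eta_t = Lam_p.copy(), eta_p.copy()

# A minibatch observed at a subset of the measurement points.
idx = rng.choice(len(X), size=B, replace=False)
y = np.sin(3 * X[idx]) + np.sqrt(sigma2) * rng.standard_normal(B)

# Teacher: adapted prior q_t^{1-beta} p^{beta} (a convex combination in
# natural parameters), then a Bayes step with the tempered likelihood.
Lam = (1 - beta) * Lam_t + beta * Lam_p
eta = (1 - beta) * eta_t + beta * eta_p
scale = beta * N / B
Lam[idx, idx] += scale / sigma2
eta[idx] += scale * y / sigma2

teacher_cov = np.linalg.inv(Lam)
teacher_mean = teacher_cov @ eta  # marginals the student network is trained to match
print(teacher_mean[idx])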
SLIDE 10

Scalable Training of Inference Networks for GP Models

Measurement points vs. inducing points
[Figure: GPNet with M = 2, 5, 20 measurement points, compared with SVGP]

  • inducing points determine the expressiveness of the variational approximation
  • measurement points determine the variance of training
SLIDE 11

Scalable Training of Inference Networks for GP Models

Effect of proper minibatch training
[Figure: FBNN (M=20) vs. GPNet (M=20) on Airline Delay (700K)]

  • Fixes underfitting (N=100, batch size 20)
  • Better performance with more measurement points
SLIDE 12

Scalable Training of Inference Networks for GP Models

Regression & Classification

  • Regression benchmarks
  • GP classification with a prior derived from infinite-width Bayesian ConvNets

SLIDE 13

Poster #227

Code: https://github.com/thjashin/gp-infer-net