Piecewise Bounds for Estimating Bernoulli- Logistic Latent Gaussian - - PowerPoint PPT Presentation

piecewise bounds for estimating bernoulli logistic latent
SMART_READER_LITE
LIVE PREVIEW

Piecewise Bounds for Estimating Bernoulli- Logistic Latent Gaussian - - PowerPoint PPT Presentation

Piecewise Bounds for Estimating Bernoulli- Logistic Latent Gaussian Models Mohammad Emtiyaz Khan Joint work with Benjamin Marlin, and Kevin Murphy University of British Columbia June 29, 2011 Piecewise Bounds for Binary Latent Gaussian Models


slide-1
SLIDE 1

Piecewise Bounds for Estimating Bernoulli- Logistic Latent Gaussian Models

Mohammad Emtiyaz Khan

Joint work with Benjamin Marlin, and Kevin Murphy University of British Columbia June 29, 2011

slide-2
SLIDE 2

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Modeling Binary Data

Main Topic of our paper

Bernoulli-Logistic Latent Gaussian Models (bLGMs)

Image from http:/ / thenextweb.com/ in/ 2011/ 06/ 06/ india-to-join-the-open-data-revolution-in-july/

slide-3
SLIDE 3

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

bLGMs - Classification Models

Bayesian Logistic Regression and Gaussian Process Classification

(Jaakkola and Jordan 1996, Rasmussen 2004, Gibbs and Mackay 2000, Kuss and Rasmussen 2006, Nickisch and Rasmussen 2008, Kim and Ghahramani, 2003).

Figures reproduced using GPML toolbox

slide-4
SLIDE 4

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

bLGMs - Latent Factor Models

Probabilistic PCA and Factor Analysis models (Tipping 1999, Collins,

Dasgupta and Schapire 2001, Mohammed, Heller, and Ghahramani 2008, Girolami 2001, Yu and Tresp 2004).

slide-5
SLIDE 5

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Parameter Learning is Intractable

Logistic Likelihood is not conjugate to the Gaussian prior.

x

We propose piecewise bounds to obtain tractable lower bounds to marginal likelihood.

slide-6
SLIDE 6

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Learning in bLGMs

slide-7
SLIDE 7

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Bernoulli-Logistic Latent Gaussian Models

Parameter Set

z1n y1n n=1:N µ Σ W z2n zLn y2n yDn y3n

slide-8
SLIDE 8

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Learning Parameters of bLGMs

x

slide-9
SLIDE 9

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

some other tractable terms in m and V

+

Variational Lower Bound (Jensen’s)

x

slide-10
SLIDE 10

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Quadratic Bounds

  • Bohning’s bound (Bohning, 1992)

x

slide-11
SLIDE 11

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Quadratic Bounds

  • Bohning’s bound (Bohning, 1992)

x

  • Jaakkola’s bound (Jaakkola and Jordan, 1996)
  • Both bounds have unbounded error.
slide-12
SLIDE 12

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Problems with Quadratic Bounds

1-D example with µ = 2, σ = 2 Generate data, fix µ = 2, and compare marginal likelihood and lower bound wrt σ

zn yn n=1:N µ σ

As this is a 1-D problem, we can compute lower bounds without Jensen’s inequality. So plots that follow have errors only due to error in bounds.

slide-13
SLIDE 13

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Problems with Quadratic Bounds

Bohning Jaakkola Piecewise

Q1(x) Q2(x) Q3(x)

slide-14
SLIDE 14

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Piecewise Bounds

slide-15
SLIDE 15

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Finding Piecewise Bounds

  • Find Cut points, and

parameters of each pieces by minimizing maximum error.

  • Linear pieces (Hsiung, Kim and

Boyd, 2008)

  • Quadratic Pieces (Nelder-

Mead method)

  • Fixed Piecewise Bounds!
  • Increase accuracy by

increasing number of pieces.

slide-16
SLIDE 16

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Linear Vs Quadratic

slide-17
SLIDE 17

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Results

slide-18
SLIDE 18

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Binary Factor Analysis (bFA)

Z1n Y1n n=1:N W Z2n ZLn Y2n YDn Y3n

  • UCI voting dataset with

D=15, N=435.

  • Train-test split 80-20%
  • Compare cross-entropy

error on missing value prediction on test data.

slide-19
SLIDE 19

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

bFA – Error vs Time

Bohning Jaakkola Piecewise Linear with 3 pieces Piecewise Quad with 3 pieces Piecewise Quad with 10 pieces

slide-20
SLIDE 20

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

bFA – Error Across Splits

Error with Piecewise Quadratic Error with Bohning and Jaakkola

Bohning Jaakkola

slide-21
SLIDE 21

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Gaussian Process Classification

  • We repeat the experiments

described in Kuss and Rasmussen, 2006

  • We set µ =0 and squared

exponential Kernel Σij =σ exp[(xi-xj)^2/s]

  • Estimate σ and s.
  • We run experiments on

Ionoshphere (D = 200)

  • Compare Cross-entropy Prediction

Error for test data.

z1 y1 µ Σ z2 zD yD y2 X s σ

slide-22
SLIDE 22

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

GP – Marginal Likelihood

slide-23
SLIDE 23

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

GP – Prediction Error

slide-24
SLIDE 24

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

EP vs Variational

  • We see that the variational approach underestimates the

marginal likelihood in some regions of parameter space.

  • However, both methods give comparable results for

prediction error.

  • In general, the variational EM algorithm for parameter

learning is guaranteed to converge when appropriate numerical methods are used,

  • Nickisch and Rasmussen (2008) describe the

variational approach as more principled than EP.

slide-25
SLIDE 25

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Conclusions

slide-26
SLIDE 26

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

  • Fixed piecewise bounds can give a significant

improvement in estimation and prediction accuracy relative to variational quadratic bounds.

  • We can drive the error in the logistic-log-partition

bound to zero by letting the number of pieces increase.

  • This increase in accuracy comes with a

corresponding increase in computation time.

  • Unlike many other frameworks, we have a very fine

grained control over the speed-accuracy trade-off through controlling the number of pieces in the bound.

Conclusions

slide-27
SLIDE 27

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Thank You

slide-28
SLIDE 28

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Piecewise-Bounds: Optimization Problem

slide-29
SLIDE 29

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

slide-30
SLIDE 30

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

slide-31
SLIDE 31

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

slide-32
SLIDE 32

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Latent Gaussian Graphical Model

LED dataset, 24 variables, N=2000

z1n y1n n=1:N µ Σ z2n zDn yDn y3n

slide-33
SLIDE 33

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Sparse Version

slide-34
SLIDE 34

Piecewise Bounds for Binary Latent Gaussian Models

ICML 2011.

Mohammad Emtiyaz Khan

Binary Latent Gaussian Models

We are interested in maximum likelihood estimate of parameters

Zln Ydn

n=1:N

µ Σ W

l =1:L d=1:D