Random Function Priors for Correlation Modeling Aonan Zhang John - - PowerPoint PPT Presentation

random function priors for correlation modeling
SMART_READER_LITE
LIVE PREVIEW

Random Function Priors for Correlation Modeling Aonan Zhang John - - PowerPoint PPT Presentation

Random Function Priors for Correlation Modeling Aonan Zhang John Paisley Columbia University Setup Model exchangeable data X = [ X 1 , , X N ] collection of features = ( k ) k K Z n X n k Z n = [ Z n 1 , , Z nk , ,


slide-1
SLIDE 1

Random Function Priors for Correlation Modeling

Aonan Zhang John Paisley Columbia University

slide-2
SLIDE 2

Model exchangeable data

the extent is used to express .

Setup

X = [X1, …, XN] Zn = [Zn1, …, Znk, …, ZnK] ∈ ℝK

+

θ = (θk)k∈K

θk Xn

θk Zn Xn

N K

collection of features

Zn ∈ {0,1}K Zn1, …, ZnK E.g. Sparse factor models: Topic models: Zn ∈ ΔK−1 Problem: model flexible correlations among Complexity: 2O(K)

α

Solution: random function priors Exponential family?

slide-3
SLIDE 3

Joint distribution Workflow to derive p(Z)

p(X, Z, θ) = p(Z) ⋅

K

k=1

p(θk) ⋅

N

n=1

p(Xn|Zn, θ)

application dependent i.i.d.

??? Exchangeability assumptions on p(Z) ————————————> p(Z) is a random function model Z

N rows K columns representation theorems

Model the matrix Z

slide-4
SLIDE 4

Representation theorem

ξ = ∑

n,k

Znkδτn,σk

Assumption: is separately exchangeable.

  • Proposition. A discrete random measure on is separately exchangeable, if and only if almost

surely, ξ S

ξ = ∑

n,k

fn(ϑk)δτn,σk +

R+ R2

+

trivial terms

random functions on Poisson process on

ξ

Trick: Transform Z (random matrix) to (random measure) on S.

ξ

Znk = fn(ϑk)

slide-5
SLIDE 5

The power of random function priors

  • 1. fn(ϑk) → f(hn, ϑk)

decoder network learned via inference networks hn = g(Xn)

  • 2. f(hn, ϑk) → f(hn, ϑk, ℓk)

augment the 2d Poisson process to higher dimension (ϑk, σk) (ϑk, σk, ℓk)

Model correlations through arbitrary moments Znk = f(h⊤

n ℓk)

𝔽[Znk1Znk2…Znkj] = 𝔽[f(h⊤

n ℓk1)⋯f(h⊤ n ℓkj)]

Assume Then

ℓ1 ℓ2 ℓ3 hn

Prototype to applicable models

ℓ4

slide-6
SLIDE 6

Visualize correlations via paintboxes

Each paintbox is a heatmap.

slide-7
SLIDE 7

Visualize correlations via paintboxes

Correlated Topics Un-correlated Topics

Zn,20 = f(hn, ℓ20) Zn,47 = f(hn, ℓ47) Zn,54 = f(hn, ℓ54) Zn,64 = f(hn, ℓ64) Zn,67 = f(hn, ℓ67) Zn,71 = f(hn, ℓ71)

hn hn

slide-8
SLIDE 8

Model performance

Our model: PRME

slide-9
SLIDE 9

Summarize

Code: https://github.com/zan12/prme

Poster: #222

A deeper understanding of IBP beyond the Beta-Bernoulli process. Connections to random graphs. A generalization of Kingman’s and Broderick’s paintbox models.

… More details

A representation theorem for correlation modeling.