Variational Inference of Sparse Network from Count Data Julien - - PowerPoint PPT Presentation

variational inference of sparse network from count data
SMART_READER_LITE
LIVE PREVIEW

Variational Inference of Sparse Network from Count Data Julien - - PowerPoint PPT Presentation

Variational Inference of Sparse Network from Count Data Julien Chiquet, Mahendra Mariadasou, St ephane Robin AgroParisTech/INRA (French National Institute for Agricultural Research) ICML, Long Beach Convention Center, June the 11th 2019 R/C++


slide-1
SLIDE 1

Variational Inference of Sparse Network from Count Data

Julien Chiquet, Mahendra Mariadasou, St´ ephane Robin

AgroParisTech/INRA (French National Institute for Agricultural Research) ICML, Long Beach Convention Center, June the 11th 2019

R/C++ PLNmodels package, development version on github install.packages("PLNmodels") https://jchiquet.github.io/PLNmodels/

1

slide-2
SLIDE 2

Sparse multivariate Poisson Lognormal model

A sparse latent multivariate Gaussian model Zi iid ∼ Np(0p, Ω−1), Ω sparse, Ω1,offdiagonal < c Yi | Zi ∼ P(exp{Oi + X⊤

i B + Zi})

(i, j) / ∈ E ⇔ Zi ⊥ ⊥ Zj|Z\{i,j} ⇔ Ωij = 0. Interpretation ◮ Dependency structure (network) encoded in the latent space (Ω) ◮ Additional effects due to covariates X are fixed ◮ Conditional Poisson distribution = noise model

2

slide-3
SLIDE 3

Sparse Variational Inference

Variational approximation Take qi ≡ N(mi, diag(si)) to approximate of p(Zi|Yi) model parameters θ = (B, Ω) variational parameters ψ = (M, S) where M = [m⊤

1 . . . m⊤ n ]⊤, S = [(s2 1)⊤ . . . (s2 n)⊤]⊤

Sparse lower bound of the likelihood J(θ, ψ) − λ Ω1,off = Eq[log pθ(Y, Z)] + H[qψ(Z)]−λ Ω1,off. Alternate optimization – objective is biconcave in (B, M, S) and Ω

  • 1. ( ˆ

B, ˆ M, ˆ S): gradient ascent

  • 2. ˆ

Ω: graphical-Lasso problem Selection of λ – StARS (Stability Approach to Regularization Selection)

3

slide-4
SLIDE 4

Illustration: first round of French Presidential 2017

source: https://data.gouv.fr

◮ data: votes cast for each of the 11 candidates in the more than 63,000 polling stations ◮ offset: log-registered population of voter (account for different station sizes) ◮ covariate: ” department”(administrative division, a proxy for geography) find competing candidates, who appeal to different voters, and compatible candidates Inferred network of partial correlation Latent Positions (PCA) (blue: negative, red: positive)

  • ARTHAUD

ASSELINEAU CHEMINADE

DUPONT−AIGNAN

FILLON

HAMON

LASSALLE

LE PEN

MACRON MÉLENCHON

POUTOU

4