Mixtures of Tree-Structured Probabilistic Graphical Models for Density Estimation in High Dimensional Spaces
- F. Schnitzler
University of Li` ege
24 September 2012
- F. Schnitzler (ULG)
Mixtures of Markov trees PhD defense 1 / 45
Mixtures of Tree-Structured Probabilistic Graphical Models for - - PowerPoint PPT Presentation
Mixtures of Tree-Structured Probabilistic Graphical Models for Density Estimation in High Dimensional Spaces F. Schnitzler University of Li` ege 24 September 2012 F. Schnitzler (ULG) Mixtures of Markov trees PhD defense 1 / 45 Density
Mixtures of Markov trees PhD defense 1 / 45
◮ A realisation belongs to {1, 2, 3, 4, 5, 6}. ◮ 10 realisations: D = (2, 3, 1, 3, 6, 1, 4, 2, 6, 2). ◮ A possible estimate, based on these realisations:
◮ high number of discrete variables p (thousands or more), ◮ low number of samples N (hundreds).
Mixtures of Markov trees PhD defense 2 / 45
Mixtures of Markov trees PhD defense 3 / 45
Mixtures of Markov trees PhD defense 4 / 45
Mixtures of Markov trees PhD defense 5 / 45
◮ D1 = (2, 3, 1, 3, 6, 1, 4, 2, 6, 2) : P1(”5”) = 0/10 ◮ D2 = (1, 5, 2, 6, 1, 1, 6, 1, 6, 5) : P2(”5”) = 2/10
Mixtures of Markov trees PhD defense 6 / 45
◮ D1 = (2, 3, 1, 3, 6, 1, 4, 2, 6, 2) : P1(”5”) = 0/10 ◮ D2 = (1, 5, 2, 6, 1, 1, 6, 1, 6, 5) : P2(”5”) = 2/10
Mixtures of Markov trees PhD defense 6 / 45
Mixture µ1 µ2 µ3 B C A D A C B D C B A D
Mixtures of Markov trees PhD defense 7 / 45
1 Background 2 Contributions (x3) 3 Final words
Mixtures of Markov trees PhD defense 8 / 45
1 Background 2 Contributions (x3) 3 Final words
Mixtures of Markov trees PhD defense 8 / 45
Mixtures of Markov trees PhD defense 9 / 45
Mixtures of Markov trees PhD defense 10 / 45
Mixtures of Markov trees PhD defense 11 / 45
Mixtures of Markov trees PhD defense 11 / 45
◮ bias tends to decrease, ◮ variance to increase.
Mixtures of Markov trees PhD defense 12 / 45
Mixtures of Markov trees PhD defense 13 / 45
Mixtures of Markov trees PhD defense 14 / 45
Mixtures of Markov trees PhD defense 14 / 45
Mixtures of Markov trees PhD defense 14 / 45
Mixtures of Markov trees PhD defense 14 / 45
Mixtures of Markov trees PhD defense 14 / 45
1 Background 2 Contributions ◮ Repeatedly learning a perturbed Markov tree ◮ Building a sequence of Markov trees ◮ Combining mixtures 3 Final words
Mixtures of Markov trees PhD defense 15 / 45
1 Learn the best structure T , given the observations. ◮ Define a score on the structures, and find the structure maximizing it. 2 Learn the parameters θ for the selected structure G. ◮ It amounts to counting observations.
Parameter learning in this thesis: P(Xi = x|PaXi = a) = 1 + ND(a, x) |Val(Xi )| +
x∈Val(Xi ) ND(a, x)
. ND(a, x) is the number of samples where Xi = x and PaXi = a.
Mixtures of Markov trees PhD defense 16 / 45
1
2
3
Mixtures of Markov trees PhD defense 17 / 45
ID(C; D) ID(A; B)
1
2
3
Mixtures of Markov trees PhD defense 17 / 45
ID(C; D) ID(A; B)
1
2
3
Mixtures of Markov trees PhD defense 17 / 45
ID(C; D) ID(A; B)
1
2
3
Mixtures of Markov trees PhD defense 17 / 45
ID(C; D) ID(A; B)
1
2
3
Mixtures of Markov trees PhD defense 17 / 45
ID(C; D) ID(A; B)
1
2
3
Mixtures of Markov trees PhD defense 17 / 45
ID(C; D) ID(A; B)
1
2
3
Mixtures of Markov trees PhD defense 17 / 45
ID(C; D) ID(A; B)
1
2
3
Mixtures of Markov trees PhD defense 17 / 45
ID(C; D) ID(A; B)
1
2
3
Mixtures of Markov trees PhD defense 17 / 45
Mixtures of Markov trees PhD defense 18 / 45
Mixtures of Markov trees PhD defense 19 / 45
◮ Construction of an uncomplete graph: O(KN) ◮ Computation of the maximum width spanning
Mixtures of Markov trees PhD defense 20 / 45
Mixtures of Markov trees PhD defense 21 / 45
Mixtures of Markov trees PhD defense 22 / 45
Mixtures of Markov trees PhD defense 23 / 45
Mixtures of Markov trees PhD defense 23 / 45
Mixtures of Markov trees PhD defense 23 / 45
Mixtures of Markov trees PhD defense 23 / 45
Mixtures of Markov trees PhD defense 23 / 45
Mixtures of Markov trees PhD defense 23 / 45
Mixtures of Markov trees PhD defense 23 / 45
Mixtures of Markov trees PhD defense 24 / 45
Mixtures of Markov trees PhD defense 25 / 45
1 Background 2 Contributions ◮ Repeatedly learning a perturbed Markov tree ◮ Building a sequence of Markov trees ◮ Combining mixtures 3 Final words
A C B D 1 1 1 1 1 1 1 1 1 1
A C B D 1 1 1 1 1 1 1 1 1 1
Mixtures of Markov trees PhD defense 26 / 45
Mixtures of Markov trees PhD defense 27 / 45
Mixtures of Markov trees PhD defense 27 / 45
◮ not likelily to be part of a tree (even if weights are perturbed), ◮ probably not meaningful (noise or not direct relation).
Mixtures of Markov trees PhD defense 28 / 45
Mixtures of Markov trees PhD defense 29 / 45
Mixtures of Markov trees PhD defense 30 / 45
T )
Mixtures of Markov trees PhD defense 31 / 45
T )
◮ Increase of the bias.
◮ The bias of these Markov trees is also smaller.
Mixtures of Markov trees PhD defense 32 / 45
T )
◮ Increase of the bias.
◮ The bias of these Markov trees is also smaller.
Mixtures of Markov trees PhD defense 32 / 45
100 200 206 208 210 212 214 216 Number of trees (mixture) Nagative log-likelihood Mixture of bagged Chow-Liu trees Skeleton method (α = 0.05) Chow-Liu tree Chow-Liu forest regularized by an oracle 100 200 548 550 552 554 556 558 560 546 Number of trees (mixture) Mixture of bagged Chow-Liu trees Chow-Liu tree Chow-Liu forest regularized by an oracle Skeleton method (α = 0.05)
Mixtures of Markov trees PhD defense 33 / 45
1 Background 2 Contributions ◮ Repeatedly learning a perturbed Markov tree ◮ Building a sequence of Markov trees ◮ Combining mixtures 3 Final words
Mixtures of Markov trees PhD defense 34 / 45
T )
Mixtures of Markov trees PhD defense 35 / 45
1 Build an EM mixture and associated soft partition {Dk}m1
2 Replace each tree Tk by a variance reducing mixture learnt on Dk.
Mixtures of Markov trees PhD defense 36 / 45
1 Build an EM mixture and associated soft partition {Dk}m1
2 Replace each tree Tk by a variance reducing mixture learnt on Dk.
Mixtures of Markov trees PhD defense 36 / 45
2 4 6 8 10 12 14 102 103 104 ˆ DKL(P || P ˆ
T )
Number of samples (N, logarithmic scale) EM mixture Mixture of 3 ensembles of 10 bagged CL trees
Mixtures of Markov trees PhD defense 37 / 45
1 2 3 4 5
10.7 11 11.3 ˆ DKL(P || P ˆ
T )
Mixtures of Markov trees PhD defense 38 / 45
EM mixture mixture of ensembles of 10 bagged CL trees Chow-Liu tree ensemble of 10 bagged CL trees 1 2 3 4 5 16 20 24 28 Number of trees (m1) ˆ DKL(P || P ˆ
T )
1 2 3 4 5 10 10.3 10.6 10.9 Number of trees (m1)
Mixtures of Markov trees PhD defense 39 / 45
1 Background 2 Contributions ◮ Repeatedly learning a perturbed Markov tree ◮ Building a sequence of Markov trees ◮ Combining mixtures 3 Final words
Mixtures of Markov trees PhD defense 40 / 45
◮ In particular, comparison to regularization
Mixtures of Markov trees PhD defense 41 / 45
◮ true Bayesian approaches ◮ other two-level mixtures
◮ application related: automatically set a value for the parameters
◮ tool: better measure the bias and the 2 types of variance of the
Mixtures of Markov trees PhD defense 42 / 45
◮ Regularization on each term of the mixture ◮ New models: ⋆ bounded tree-width models ⋆ conditional random fields
◮ Other class of target densities ◮ Other types of variables
Mixtures of Markov trees PhD defense 43 / 45
Mixtures of Markov trees PhD defense 44 / 45
Mixtures of Markov trees PhD defense 44 / 45
Mixtures of Markov trees PhD defense 44 / 45
◮ Regularization on each term of the mixture ◮ New models: ⋆ bounded tree-width models ⋆ conditional random fields
◮ Other class of target densities ◮ Other types of variables
Mixtures of Markov trees PhD defense 45 / 45