Latent Classification Models
Classification in continuous domains
Helge Langseth and Thomas D. Nielsen
LCM – p.1/??
Latent Classification Models Classification in continuous domains - - PowerPoint PPT Presentation
Latent Classification Models Classification in continuous domains Helge Langseth and Thomas D. Nielsen LCM p.1/ ?? Outline Probabilistic classifiers Nave Bayes classifiers Relaxing the assumptions Latent classification
LCM – p.1/??
LCM – p.2/??
n to the class:
y∈sp(Y) y′∈sp(Y)
1, . . . ,
✄N}.
LCM – p.3/??
LCM – p.4/??
LCM – p.5/??
LCM – p.6/??
LCM – p.7/??
LCM – p.8/??
0.2 0.4 0.6 0.8 1 1.2 1.4 70 71 72 73 74 75 SI
LCM – p.9/??
LCM – p.10/??
i = (
LCM – p.11/??
1
2
3
4
LCM – p.12/??
LCM – p.13/??
Proposition:
j=1
Proof outline:
j=1
j=1
j=1
n .
j=1
jΛj
✁j ;
✁j contains the eigenvectors of Σj and Λj is the
j matrices, and let Γj be the block diagonal matrix constructed from
LCM – p.14/??
71.5 72 72.5 73 73.5 0.1 0.2 0.3 0.4 0.5 0.6 0.7
LCM – p.15/??
m
m,
LCM – p.16/??
m
✁LCM – p.17/??
Proposition:
n and {Ψj}|sp(Y)| j=1
Proof outline:
m = 0, for all m. Thus:
LCM – p.18/??
LCM – p.19/??
For possible values of q and |sp (M)|:
For w = 1 . . . W:
Return classifier learned with these parameters.
LCM – p.19/??
LCM – p.20/??
LCM – p.21/??
LCM – p.21/??
X1 Z1 M Y X1 Z1 M Y
X1 Z1 M Y X1 Z1 M Y LCM – p.21/??
1 αk
j)
j)
1 αk
j) ·
j
✂m,i ←
☎j=1 P(Mi = m|
✄j)
j, Mi = m)
✆−1
✁j=1 xi,jP(Mi = m|
✄j)
j, Mi = m)
✂1 N
j=1
☎m=1 P(M = m|
✄j) ˆ
✄m,i
j, M = m)
✆1 N
j=1 P(M = m|
✄j)
☎m,i
j, M = m)
✆1 αy
j)
j) follows by Bayes rule):
j, M = m) =
✝y,m
✄↓
y,m
✁m)
j, M = m
✟y,m
✁m
✡j, M = m)
j, M = m) T,
y,m = (
✁mΓy)T
✞ ✁mΓy
✁m + Θm
✟−1
LCM – p.22/??
balance-scale
liver
breast
pima
crabs
sonar
diabetes
tae
glass
thyroid
glass2
vehicle
heart
wine
iris
LCM – p.23/??
LCM – p.24/??
NB: Naive Bayes with Gaussian leaves. NB/M: NB with mixture of Gaussians. FA/BIC: NB; factor analysis; q chosen according to BIC score. PCA/λ: NB; PCA; only eigenvectors with eigenvalue larger than the average were included. PCA/n: NB; data transformed by PCA where all n eigenvectors were included. CW/PCA/n: One PCA/n fitted per class; classification by Bayes rule. CG/PCA/n: Unsuper. clust. #clusters decided by BIC; One PCA/n per cluster; classification by voting. NB/D: Naive Bayes with discretized data. TAN/D: TAN classifier with discretized data. LCM(q): Linear LCM model where q was found by the wrapper approach. LCM(q)/S: Structural learning of LCM(q). LCM(q, m; T): Non-linear LCM, q and |sp (M)| found by wrappers. Covariances tied. LCM(q, m; U): As LCM(q, m; T), but covariances untied. LCM(q, m; T)/S: Structural learning of LCM(q, m; T).
LCM – p.25/??
Database NB NB/M FA/BIC PCA/λ PCA/n CW/PCA/n CG/PCA/n
balance-scale
−86.9 + / − 1.4 ∗53.9 + / − 2.0 ∗53.3 + / − 2.0 −86.9 + / − 1.4 −86.9 + / − 1.4
90.9 + / − 1.2
−76.3 + / − 1.7
breast
−96.2 + / − 0.7
97.1 + / − 0.6
−95.9 + / − 0.8 −94.9 + / − 0.8 −94.9 + / − 0.8 ∗90.6 + / − 1.1 −94.6 + / − 0.9
crabs
∗39.5 + / − 3.5 ∗39.5 + / − 3.5 ∗63.5 + / − 3.4 ∗83.0 + / − 2.7
93.5 + / − 1.7 94.0 + / − 1.7 94.5 + / − 1.6
diabetes
75.9 + / − 1.5
−72.5 + / − 1.6 ∗69.1 + / − 1.7
74.5 + / − 1.6 74.3 + / − 1.6
−69.8 + / − 1.7
74.5 + / − 1.6
glass
∗36.4 + / − 3.3 −65.9 + / − 3.2 ∗48.2 + / − 3.4 ∗46.3 + / − 3.4 −61.2 + / − 3.3 −58.4 + / − 3.4
72.0 + / − 3.1
glass2
∗62.0 + / − 3.8
78.0 + / − 3.2
∗60.8 + / − 3.8 ∗58.3 + / − 3.9 ∗65.7 + / − 3.7 ∗67.6 + / − 3.7 −72.4 + / − 3.5
heart
84.8 + / − 2.2 82.2 + / − 2.3 82.6 + / − 2.3
−77.0 + / − 2.6
80.7 + / − 2.4 81.5 + / − 2.4
−77.4 + / − 2.5
iris
−95.3 + / − 1.7 ∗92.7 + / − 2.1 ∗88.7 + / − 2.6
96.7 + / − 1.5
∗92.7 + / − 2.1
96.7 + / − 1.5 96.7 + / − 1.5
liver
−58.0 + / − 2.7 −67.0 + / − 2.5 ∗52.8 + / − 2.7 −60.3 + / − 2.6 −60.3 + / − 2.6
66.7 + / − 2.5 69.0 + / − 2.5
pima
75.0 + / − 1.6 74.2 + / − 1.6
∗68.6 + / − 1.7
74.7 + / − 1.6 74.2 + / − 1.6
−70.8 + / − 1.6
72.8 + / − 1.6
sonar
∗70.7 + / − 3.2 ∗63.9 + / − 3.3 ∗51.2 + / − 3.5 ∗54.3 + / − 3.5 ∗57.8 + / − 3.4 ∗70.2 + / − 3.2 ∗70.7 + / − 3.2
tae
54.3 + / − 4.1 52.3 + / − 4.1
−39.7 + / − 4.0 −39.7 + / − 4.0
52.9 + / − 4.1 55.0 + / − 4.0 53.6 + / − 4.1
thyroid
96.3 + / − 1.3 95.8 + / − 1.4 94.9 + / − 1.5
−90.2 + / − 2.0 −89.8 + / − 2.1
95.3 + / − 1.4
−89.8 + / − 2.1
vehicle
∗44.3 + / − 1.7 ∗59.5 + / − 1.7 ∗67.6 + / − 1.6 ∗75.5 + / − 1.5 ∗76.0 + / − 1.5 ∗66.3 + / − 1.6 −77.3 + / − 1.4
wine
−97.2 + / − 1.2 −97.2 + / − 1.2 −93.8 + / − 1.8 −94.4 + / − 1.7 −96.1 + / − 1.5
98.9 + / − 0.8
−97.2 + / − 1.2
Average
71.5 72.8 68.7 73.8 77.1 78.2 79.2 Database NB/D TAN/D LCM(q) LCM(q)/S LCM(q, m; T) LCM(q, m; U) LCM(q, m; T)/S
balance-scale
−65.8 + / − 1.9 −66.4 + / − 1.9
88.0 + / − 1.3 87.8 + / − 1.3 90.4 + / − 1.2 89.0 + / − 1.3 87.4 + / − 1.3
breast
97.4 + / − 0.6
−96.2 + / − 0.7
96.8 + / − 0.7 96.5 + / − 0.7 96.5 + / − 0.7 96.5 + / − 0.7 96.5 + / − 0.7
crabs
∗44.5 + / − 3.5 ∗46.5 + / − 3.5
94.5 + / − 1.6 95.5 + / − 1.5 95.5 + / − 1.5 95.5 + / − 1.5 94.5 + / − 1.6
diabetes
75.6 + / − 1.5 75.3 + / − 1.6 75.9 + / − 1.5 76.6 + / − 1.5 75.9 + / − 1.6 75.5 + / − 1.6 72.9 + / − 1.6
glass
71.0 + / − 3.1 71.0 + / − 3.1
−57.0 + / − 3.4
62.1 + / − 3.3 70.1 + / − 3.2 64.5 + / − 3.3 67.3 + / − 3.2
glass2
81.6 + / − 3.0 81.7 + / − 3.0
∗66.9 + / − 3.7 ∗67.6 + / − 3.7
85.3 + / − 3.0 81.0 + / − 3.3 79.8 + / − 3.1
heart
83.7 + / − 2.2 84.1 + / − 2.2 85.2 + / − 2.2 83.7 + / − 2.2 83.3 + / − 2.3 83.3 + / − 2.3 82.6 + / − 2.3
iris
∗93.3 + / − 2.0 ∗93.3 + / − 2.0
98.0 + / − 1.1 96.7 + / − 1.5 96.7 + / − 1.6 97.3 + / − 1.3
−95.3 + / − 1.7
liver
−58.0 + / − 2.7 −58.0 + / − 2.7
65.2 + / − 2.6 66.4 + / − 2.5 68.4 + / − 2.5 69.0 + / − 2.5 69.0 + / − 2.5
pima
76.2 + / − 1.5 74.7 + / − 1.6 74.0 + / − 1.6 74.4 + / − 1.6 75.0 + / − 1.6 75.6 + / − 1.5 73.7 + / − 1.6
sonar
−76.5 + / − 2.9 −77.9 + / − 2.9
81.2 + / − 2.7
−76.4 + / − 2.9
80.2 + / − 2.8 84.1 + / − 2.5 83.2 + / − 2.6
tae
−49.6 + / − 4.1 −51.6 + / − 4.1
56.9 + / − 4.0 53.6 + / − 4.1 61.5 + / − 4.0 58.9 + / − 4.0 56.9 + / − 4.0
thyroid
−92.6 + / − 1.8
93.0 + / − 1.7 95.3 + / − 1.4 95.8 + / − 1.4 94.4 + / − 1.6 95.4 + / − 1.4
−89.3 + / − 2.1
vehicle
∗59.1 + / − 1.7 ∗68.4 + / − 1.6
84.3 + / − 1.3
−80.4 + / − 1.4
83.5 + / − 1.3 84.2 + / − 1.3
−79.2 + / − 1.4
wine
98.9 + / − 0.8
−96.0 + / − 1.5
100.0 + / − 0.0 99.4 + / − 0.6 99.4 + / − 1.0 98.9 + / − 0.8 97.7 + / − 1.1
Average
74.9 75.6 81.3 80.9 83.8 83.2 81.7 LCM – p.26/??