Data Warehousing and Machine Learning
Probabilistic Classifiers Thomas D. Nielsen
Aalborg University Department of Computer Science
Spring 2008
DWML Spring 2008 1 / 34
Data Warehousing and Machine Learning Probabilistic Classifiers - - PowerPoint PPT Presentation
Data Warehousing and Machine Learning Probabilistic Classifiers Thomas D. Nielsen Aalborg University Department of Computer Science Spring 2008 DWML Spring 2008 1 / 34 Probabilistic Classifiers Conditional class probabilities Id. Savings
Aalborg University Department of Computer Science
DWML Spring 2008 1 / 34
Probabilistic Classifiers DWML Spring 2008 2 / 34
Probabilistic Classifiers DWML Spring 2008 2 / 34
Probabilistic Classifiers DWML Spring 2008 3 / 34
(a1,...,an,c)∈States(A1,...,An,C)
c′ P(a1, . . . , an, c′)
Probabilistic Classifiers DWML Spring 2008 4 / 34
c′∈States(C)
c∈States(C)
Probabilistic Classifiers DWML Spring 2008 5 / 34
c′∈States(C)
c∈States(C)
Probabilistic Classifiers DWML Spring 2008 5 / 34
Probabilistic Classifiers DWML Spring 2008 6 / 34
Probabilistic Classifiers DWML Spring 2008 7 / 34
Probabilistic Classifiers DWML Spring 2008 8 / 34
Probabilistic Classifiers DWML Spring 2008 9 / 34
Probabilistic Classifiers DWML Spring 2008 10 / 34
Probabilistic Classifiers DWML Spring 2008 10 / 34
Probabilistic Classifiers DWML Spring 2008 11 / 34
4
i=1
4
i=1
Probabilistic Classifiers DWML Spring 2008 12 / 34
Probabilistic Classifiers DWML Spring 2008 13 / 34
A B Class yes yes ⊕ yes no ⊖ no yes ⊖ no no ⊕
Probabilistic Classifiers DWML Spring 2008 14 / 34
Probabilistic Classifiers DWML Spring 2008 15 / 34
1
2
3
4
Ai ,Aj
Probabilistic Classifiers DWML Spring 2008 15 / 34
Cold Sore Throat? See spots? Fever? Angina
1
2
3
4
Probabilistic Classifiers DWML Spring 2008 16 / 34
Cold Sore Throat? See spots? Fever? Angina
MI(Cold, Angina) = 0 MI(Fever?, Angina) = 0.015076 MI(SoreThroat?, Angina) = 0.018016 MI(SeeSpots?, Angina) = 0.0180588 MI(Cold, Fever?) = 0.014392 MI(Cold, SoreThroat?) = 0.0210122 MI(Cold, SeeSpots?) = 0 MI(SoreThroat, Fever?) = 0.0015214 MI(Fever?, SeeSpots?) = 0.0017066 MI(SeeSpots?, SoreThroat?) = 0.0070697 1
2
3
4
Cold,Sore
Probabilistic Classifiers DWML Spring 2008 16 / 34
Cold Sore Throat? See spots? Fever? Angina
1
2
3
4
0.014 0.007 0.018 0.021 0.002 0.015 0.018 0.002
Cold Sore Throat? See spots? Fever? Angina
Probabilistic Classifiers DWML Spring 2008 16 / 34
Cold Sore Throat? See spots? Fever? Angina
1
2
3
4
0.014 0.007 0.018 0.021 0.002 0.015 0.018 0.002
Cold Sore Throat? See spots? Fever? Angina
Probabilistic Classifiers DWML Spring 2008 16 / 34
Cold Sore Throat? See spots? Fever? Angina
1
2
3
4
Cold Sore Throat? See spots? Fever? Angina
Probabilistic Classifiers DWML Spring 2008 16 / 34
Cold Sore Throat? See spots? Fever? Angina
1
2
3
4
Cold Sore Throat? See spots? Fever? Angina
Probabilistic Classifiers DWML Spring 2008 16 / 34
Cold Sore Throat? See spots? Fever? Angina
1
2
3
4
Cold Sore Throat? See spots? Fever? Angina
Probabilistic Classifiers DWML Spring 2008 16 / 34
1
2
3
4
5
C
Ai ,Aj
Probabilistic Classifiers DWML Spring 2008 17 / 34
C A1 A2 A3 A4 A5 L1 L2 L3
Probabilistic Classifiers DWML Spring 2008 18 / 34
Aalborg University Department of Computer Science
Evaluating Classifiers Evaluating Classifiers Spring 2008 19 / 34
1, . . . , c′
i }|/N
Evaluating Classifiers Evaluating Classifiers Spring 2008 20 / 34
1
2
3
Evaluating Classifiers Evaluating Classifiers Spring 2008 21 / 34
1
2
3
Evaluating Classifiers Evaluating Classifiers Spring 2008 21 / 34
20 40 60 80 100 0,01 0,02 0,03 0,04 0,05 0,06 0,07 0,08 Evaluating Classifiers Evaluating Classifiers Spring 2008 22 / 34
Evaluating Classifiers Evaluating Classifiers Spring 2008 22 / 34
Evaluating Classifiers Evaluating Classifiers Spring 2008 22 / 34
Evaluating Classifiers Evaluating Classifiers Spring 2008 23 / 34
10 20 30 0,02 0,04 0,06 0,08 0,10 0,12 0,14 0,16
Evaluating Classifiers Evaluating Classifiers Spring 2008 23 / 34
α/2 ± Zα/2
α/2 + 4Na − 4Na2
α/2)
Evaluating Classifiers Evaluating Classifiers Spring 2008 23 / 34
Evaluating Classifiers Evaluating Classifiers Spring 2008 23 / 34
Evaluating Classifiers Evaluating Classifiers Spring 2008 23 / 34
x,y∈{a,b,c}
Evaluating Classifiers Evaluating Classifiers Spring 2008 24 / 34
Evaluating Classifiers Evaluating Classifiers Spring 2008 25 / 34
Evaluating Classifiers Evaluating Classifiers Spring 2008 26 / 34
Evaluating Classifiers Evaluating Classifiers Spring 2008 26 / 34
Evaluating Classifiers Evaluating Classifiers Spring 2008 26 / 34
0.8 1 1.2 1.4 1.6 1.8 2 10 20 30 40 50 60 70 80 90 100
Lift(C, c)
Evaluating Classifiers Evaluating Classifiers Spring 2008 27 / 34
0.8 1 1.2 1.4 1.6 1.8 2 10 20 30 40 50 60 70 80 90 100
Lift(C, c) Lift(C′, c)
Evaluating Classifiers Evaluating Classifiers Spring 2008 27 / 34
Evaluating Classifiers Evaluating Classifiers Spring 2008 28 / 34
Evaluating Classifiers Evaluating Classifiers Spring 2008 29 / 34
always classify positive
Evaluating Classifiers Evaluating Classifiers Spring 2008 29 / 34
always classify negative always classify positive
Evaluating Classifiers Evaluating Classifiers Spring 2008 29 / 34
always classify negative classify positive with probability always classify positive q
Evaluating Classifiers Evaluating Classifiers Spring 2008 29 / 34
always classify negative classify positive with probability always classify positive q perfect classification
Evaluating Classifiers Evaluating Classifiers Spring 2008 29 / 34
Evaluating Classifiers Evaluating Classifiers Spring 2008 30 / 34
Evaluating Classifiers Evaluating Classifiers Spring 2008 31 / 34
Evaluating Classifiers Evaluating Classifiers Spring 2008 31 / 34
Evaluating Classifiers Evaluating Classifiers Spring 2008 32 / 34
Evaluating Classifiers Evaluating Classifiers Spring 2008 33 / 34
Evaluating Classifiers Evaluating Classifiers Spring 2008 34 / 34