Data mining
Machine Intelligence Thomas D. Nielsen September 2008
Data mining September 2008 1 / 37
Data mining Machine Intelligence Thomas D. Nielsen September 2008 - - PowerPoint PPT Presentation
Data mining Machine Intelligence Thomas D. Nielsen September 2008 Data mining September 2008 1 / 37 What is Data Mining? ? Introduction Data mining September 2008 2 / 37 What is Data Mining? ? Introduction Data mining September 2008
Data mining September 2008 1 / 37
Introduction Data mining September 2008 2 / 37
Introduction Data mining September 2008 2 / 37
Introduction Data mining September 2008 2 / 37
Introduction Data mining September 2008 3 / 37
Introduction Data mining September 2008 3 / 37
Introduction Data mining September 2008 3 / 37
Introduction Data mining September 2008 3 / 37
Introduction Data mining September 2008 4 / 37
Classification Data mining September 2008 5 / 37
SubAllCap yes/no TrustSend yes/no InvRet yes/no Body’adult’ yes/no Body’zambia’ yes/no
Classification Data mining September 2008 5 / 37
Cell-1 1..64 Cell-2 1..64 Cell-3 1..64 Cell-324 1..64
Classification Data mining September 2008 5 / 37
Classification Data mining September 2008 6 / 37
Classification Data mining September 2008 7 / 37
PL PW SL SW
Classification Data mining September 2008 8 / 37
Classification Data mining September 2008 9 / 37
Classification Data mining September 2008 9 / 37
Classification Data mining September 2008 10 / 37
Classification Data mining September 2008 11 / 37
Decision trees Data mining September 2008 12 / 37
Decision trees Data mining September 2008 13 / 37
Decision trees Data mining September 2008 14 / 37
Decision trees Data mining September 2008 15 / 37
Decision trees Data mining September 2008 16 / 37
Decision trees Data mining September 2008 16 / 37
Decision trees Data mining September 2008 17 / 37
Decision trees Data mining September 2008 18 / 37
c∈states(C)
3
i=1
Decision trees Data mining September 2008 18 / 37
Decision trees Data mining September 2008 19 / 37
Decision trees Data mining September 2008 20 / 37
a∈states(A)
Decision trees Data mining September 2008 21 / 37
Decision trees Data mining September 2008 22 / 37
Decision trees Data mining September 2008 23 / 37
Decision trees Data mining September 2008 24 / 37
Decision trees Data mining September 2008 25 / 37
Decision trees Data mining September 2008 26 / 37
Decision trees Data mining September 2008 27 / 37
k Nearest Neighbor K Nearest Neighbor September 2008 28 / 37
k Nearest Neighbor K Nearest Neighbor September 2008 29 / 37
k Nearest Neighbor K Nearest Neighbor September 2008 29 / 37
k Nearest Neighbor K Nearest Neighbor September 2008 29 / 37
k Nearest Neighbor K Nearest Neighbor September 2008 29 / 37
j=1(a1,j − a2,j)2
j=1 |a1,j − a2,j|
k Nearest Neighbor K Nearest Neighbor September 2008 30 / 37
k
j=1
k Nearest Neighbor K Nearest Neighbor September 2008 31 / 37
0.2 0.4 0.6 0.8 1
20 40 60 80 100 120 normalized values
A1 A2
k Nearest Neighbor K Nearest Neighbor September 2008 32 / 37
1 2 3
20 40 60 80 100 120 standardized values
A1 A2
n
j=1 aj,i
1 n−1
j=1(aj,i − mean(Ai))2
k Nearest Neighbor K Nearest Neighbor September 2008 33 / 37
k Nearest Neighbor K Nearest Neighbor September 2008 34 / 37
k Nearest Neighbor K Nearest Neighbor September 2008 35 / 37
k
i=1:ci=c
k Nearest Neighbor K Nearest Neighbor September 2008 36 / 37
k Nearest Neighbor K Nearest Neighbor September 2008 37 / 37