Machine Learning Probabilistic KNN.
Mark Girolami
girolami@dcs.gla.ac.uk
Department of Computing Science University of Glasgow
Probabilistic KNN June 21, 2007 – p. 1/3
Machine Learning Probabilistic KNN. Mark Girolami - - PowerPoint PPT Presentation
Machine Learning Probabilistic KNN. Mark Girolami girolami@dcs.gla.ac.uk Department of Computing Science University of Glasgow Probabilistic KNN June 21, 2007 p. 1/3 Probabilistic KNN KNN is a remarkably simple algorithm with proven
Mark Girolami
girolami@dcs.gla.ac.uk
Department of Computing Science University of Glasgow
Probabilistic KNN June 21, 2007 – p. 1/3
Probabilistic KNN June 21, 2007 – p. 2/3
Probabilistic KNN June 21, 2007 – p. 2/3
Probabilistic KNN June 21, 2007 – p. 2/3
Probabilistic KNN June 21, 2007 – p. 2/3
Probabilistic KNN June 21, 2007 – p. 2/3
Probabilistic KNN June 21, 2007 – p. 3/3
Probabilistic KNN June 21, 2007 – p. 3/3
k Mθ
k Mθ
Mθ
Probabilistic KNN June 21, 2007 – p. 4/3
Probabilistic KNN June 21, 2007 – p. 5/3
Probabilistic KNN June 21, 2007 – p. 5/3
Probabilistic KNN June 21, 2007 – p. 5/3
Probabilistic KNN June 21, 2007 – p. 5/3
Probabilistic KNN June 21, 2007 – p. 6/3
Probabilistic KNN June 21, 2007 – p. 6/3
Ns
Probabilistic KNN June 21, 2007 – p. 6/3
Probabilistic KNN June 21, 2007 – p. 7/3
Probabilistic KNN June 21, 2007 – p. 7/3
Probabilistic KNN June 21, 2007 – p. 7/3
Probabilistic KNN June 21, 2007 – p. 7/3
Probabilistic KNN June 21, 2007 – p. 8/3
Probabilistic KNN June 21, 2007 – p. 8/3
Probabilistic KNN June 21, 2007 – p. 8/3
0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 x 10
4
2 4 6 8 10 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 x 10
4
20 40 60 80 100
Probabilistic KNN June 21, 2007 – p. 9/3
10 20 30 40 50 60 70 80 90 2000 4000 6000 8000 10000 12000 14000 16000 18000
K
10 20 30 40 50 60 70 80 90 12 14 16 18 20 22 24 26 28
K %CV−ERROR
Probabilistic KNN June 21, 2007 – p. 10/3
50 100 150 200 250 8 10 12 14 16 18 Size of Data Set Percentage Test Error PKNN KNN
25 to 250 data points. For each sub-sample size, 50 random subsets were sampled and each of these used to obtain a KNN and PKNN classifier which were then used to make predictions on the 1000 independent test points. The mean percentage performance and associated standard error obtained for each training set are shown in the above figure for each classifier.
Probabilistic KNN June 21, 2007 – p. 11/3
Probabilistic KNN June 21, 2007 – p. 12/3
Probabilistic KNN June 21, 2007 – p. 13/3
Probabilistic KNN June 21, 2007 – p. 14/3
Probabilistic KNN June 21, 2007 – p. 14/3
Probabilistic KNN June 21, 2007 – p. 14/3
Probabilistic KNN June 21, 2007 – p. 14/3
Probabilistic KNN June 21, 2007 – p. 14/3