Clustering by Support Vector Manifold Learning
Marcin Orchel
AGH University of Science and Technology in Poland
1 / 12
Clustering by Support Vector Manifold Learning Marcin Orchel AGH - - PowerPoint PPT Presentation
Clustering by Support Vector Manifold Learning Marcin Orchel AGH University of Science and Technology in Poland 1 / 12 Problem and My Contributions Problem Characterizations of clusters are boundary, center (prototype), cluster core,
AGH University of Science and Technology in Poland
1 / 12
2 / 12
(a)
(b)
(c)
C = 100.0, σ = 0.9, t = 0.01, thin dotted line—solution of kernel principal component analysis (KPCA) for σ = 0.9. (b) For points generated from a Lissajous curve. Solid line—solution of OCSVM for C = 1000.0, σ = 0.5, dashed line—solution of SVML for C = 100000.0, σ = 0.8, t = 0.01, thin dotted line—solution of KPCA for σ = 0.5. (c) Solid line— solution of SVML for c = 0, C = 100.0, σ = 0.9, t = 0.01, dashed line—solution of SVML for random values of c, C = 100.0, σ = 0.9, t = 0.01.
3 / 12
4 / 12
5 / 12
6 / 12
7 / 12
8 / 12
9 / 12
10 / 12
(a)
(b)
(c)
line—solution of support vector clustering (SVCL) for C = 10000.0, σ = 0.35. (b) Solid line—solution of support vector manifold learning clustering (SVMLC) for C = 100000.0, σ = 1.1, t = 0.01. (c) Solid line—solution of KPCA.
11 / 12
Table 1: Performance of SVMLC, SVCL, KPCA, SVML, OCSVM for real world data, part 2. The numbers in
descriptions of the columns mean the methods: 1 - SVMLC, 2 - SVCL, 3 - KPCA for the first row, 1 - SVML, 2 - OCSVM, 3 - KPCA for the second row. The test with id=0 is for all tests for the clustering experiment. The test with id=1 is for all tests for the manifold learning experiment. Column descriptions: rs – an average rank of the method for the mean error; the best method is in bold, tsf – the Friedman statistic for average ranks for the mean error; the significant value is in bold, tsn – the Nemenyi statistic for average ranks for the mean error, reported when the Friedman statistic is significant, the significant value is in bold, svr – the average rank for the number of nonzero coefficients (support vectors for support vector machines (SVM) methods); the smallest value is in bold. id rs1 rs2 rs3 tsf tsn12 tsn13 tsn23 sv1 sv2 sv3 1.71 1.93 2.36 4.5 – – – 2.83 1.67 1.5 1 1.49 2.98 1.53 33.09 -4.82 0.3 5.13 1.51 2.38 2.11
12 / 12