Less is More: Non-Redundant Subspace Clustering
Ira Assent ◦ Emmanuel Müller • Stephan Günnemann • Ralph Krieger • Thomas Seidl •
- Aalborg University, Denmark
- RWTH Aachen University, Germany
Less is More: Non-Redundant Subspace Clustering Ira Assent Emmanuel - - PowerPoint PPT Presentation
Less is More: Non-Redundant Subspace Clustering Ira Assent Emmanuel Mller Stephan Gnnemann Ralph Krieger Thomas Seidl Aalborg University, Denmark RWTH Aachen University, Germany MultiClust Workshop at SIGKDD 2010
Effective Models Efficient Computation Evaluation and Exploration of Results
Less is More: Non-Redundant Subspace Clustering 1 / 11
Effective Models Efficient Computation Evaluation and Exploration of Results
Less is More: Non-Redundant Subspace Clustering 2 / 11
Effective Models Efficient Computation Evaluation and Exploration of Results
1
2
3
Less is More: Non-Redundant Subspace Clustering 3 / 11
Effective Models Efficient Computation Evaluation and Exploration of Results
1 4 3 2 1,2 1,4 2,3 3,4 1,2,3 2,3,4 1,3,4 1,2,4
1,2,3,4
1 4 3 2 1,2 1,3 1,4 2,3 2,4 3,4 1,2,3 2,3,4 1,3,4 1,2,4
1,2,3,4
Less is More: Non-Redundant Subspace Clustering 4 / 11
Effective Models Efficient Computation Evaluation and Exploration of Results
[1] Assent, Krieger, Müller and Seidl: DUSC: Dimensionality Unbiased Subspace Clustering, in ICDM 2007. [2] Assent, Krieger, Müller and Seidl: INSCY: Indexing Subspace Clusters with In-Process-Removal of Redundancy, in ICDM 2008. Less is More: Non-Redundant Subspace Clustering 5 / 11
Effective Models Efficient Computation Evaluation and Exploration of Results
all possible clusters ALL relevance model interestingness
redundancy
relevant clustering M ALL
[3] Müller, Assent, Günnemann, Krieger and Seidl: Relevant Subspace Clustering: Mining the Most Interesting Non-Redundant Concepts in High Dimensional Data, in ICDM 2009. Less is More: Non-Redundant Subspace Clustering 6 / 11
Effective Models Efficient Computation Evaluation and Exploration of Results
1 4 3 2 1,2 1,4 2,3 3,4 1,2,3 2,3,4 1,3,4 1,2,4
1,2,3,4
1 4 3 2 1,2 1,3 1,4 2,3 2,4 3,4 1,2,3 2,3,4 1,3,4 1,2,4
1,2,3,4
1 4 3 2 1,2 1,4 2,3 3,4 1,2,3 2,3,4 1,3,4 1,2,4
1,2,3,4
1 4 3 2 1,2 1,3 1,4 2,3 2,4 3,4 1,2,3 2,3,4 1,3,4 1,2,4
1,2,3,4
depth-first breadth-first
Less is More: Non-Redundant Subspace Clustering 7 / 11
Effective Models Efficient Computation Evaluation and Exploration of Results
1 4 3 2 1,2 1,4 2,3 3,4 1,2,3 2,3,4 1,3,4 1,2,4
1,2,3,4
1 4 3 2 1,2 1,3 1,4 2,3 2,4 3,4 1,2,3 2,3,4 1,3,4 1,2,4
1,2,3,4
direct jump
Dimension 2 Dimension 1 D i m e n s i
4 interval 1 interval 2 1 2 1 2
[4] Müller, Assent, Krieger, Günnemann and Seidl: DensEst: Density Estimation for Data Mining in High Dimensional Spaces, in SDM 2009. Less is More: Non-Redundant Subspace Clustering 8 / 11
Effective Models Efficient Computation Evaluation and Exploration of Results
[5] Müller, Günnemann, Assent and Seidl: Evaluating Clustering in Subspace Projections of High Dimensional Data, in VLDB 2009. Less is More: Non-Redundant Subspace Clustering 9 / 11
Effective Models Efficient Computation Evaluation and Exploration of Results
unified algorithm repository
re-implementation rare case: common implementation
OpenSubspace
unified evaluation repository
[6] Müller, Assent, Krieger, Jansen and Seidl: Morpheus: Interactive Exploration of Subspace Clustering, in KDD 2008. [7] Assent, Müller, Krieger, Jansen and Seidl: Pleiades: Subspace Clustering and Evaluation, in PKDD 2008. [8] Günnemann, Färber, Kremer, Seidl: CoDA: Interactive Cluster Based Concept Discovery, in VLDB 2010 [9] Müller, Schiffer, Gerwert, Hannen, Jansen, Seidl: SOREX: Subspace Outlier Ranking Exploration Toolkit, in PKDD 2010. Less is More: Non-Redundant Subspace Clustering 10 / 11
Effective Models Efficient Computation Evaluation and Exploration of Results
Less is More: Non-Redundant Subspace Clustering 11 / 11
Effective Models Efficient Computation Evaluation and Exploration of Results
Less is More: Non-Redundant Subspace Clustering 11 / 11