ASCLU Alternative Subspace Clustering Stephan Gnnemann Ines Frber - - PowerPoint PPT Presentation

asclu alternative subspace clustering
SMART_READER_LITE
LIVE PREVIEW

ASCLU Alternative Subspace Clustering Stephan Gnnemann Ines Frber - - PowerPoint PPT Presentation

ASCLU Alternative Subspace Clustering Stephan Gnnemann Ines Frber Emmanuel Mller Thomas Seidl Data management and data exploration group RWTH Aachen University, Germany MultiClust at KDD 2010 July 25, 2010 Introduction Model


slide-1
SLIDE 1

ASCLU – Alternative Subspace Clustering

Stephan Günnemann Ines Färber Emmanuel Müller Thomas Seidl

Data management and data exploration group RWTH Aachen University, Germany

MultiClust at KDD 2010 July 25, 2010

slide-2
SLIDE 2

Introduction Model Conclusion

Why Subspace Clustering?

data allows to be clustered through different perspectives each object in various groupings based on different attributes ⇒ multiple views due to locally relevant dimensions of clusters ⇒ subspace clustering

ASCLU – Alternative Subspace Clustering 1 / 6

slide-3
SLIDE 3

Introduction Model Conclusion

Why Alternative Clustering?

  • ften trivial groupings or already detected clusters given

user not satisfied with previous results ⇒ aiming for alternative, yet comparable good groupings ⇒ avoid re-detection of already known clusters → combination of subspace clustering and alternative clustering

ASCLU – Alternative Subspace Clustering 2 / 6

slide-4
SLIDE 4

Introduction Model Conclusion

What To Do? – The General Picture

input: subspace clustering Known = {K1, . . . , Km}

Cluster selection approach

set of possible subspace clusters All → select optimal subset Res ⊆ All fulfilling specific properties avoid redundancy → w.r.t. known clusters → among novel clusters select alternative clusters

3 2 4 1 1 3 2

dim 3 dim 4 dim 1 dim 2

ASCLU – Alternative Subspace Clustering 3 / 6

slide-5
SLIDE 5

Introduction Model Conclusion

Alternative Subspace Clustering Model – I

Each cluster C ∈ Res ⊆ All should deviate from Known

1

deviating w.r.t. subspaces

known clusters in alternative subspaces are already different enough InAlterSubspace(Known, C) = {(Oi, Si) ∈ Known | |S ∩ Si| < β · |S|}

2

deviating w.r.t. objects

known clusters in similar subspaces should cover different objects |O\CoveredInSimilar(Known, C)| |O| ≥ α

3 2 4 1 1 3 2

dim 3 dim 4 dim 1 dim 2

Is C1 an alternative to Known?

ASCLU – Alternative Subspace Clustering 4 / 6

slide-6
SLIDE 6

Introduction Model Conclusion

Alternative Subspace Clustering Model – I

Each cluster C ∈ Res ⊆ All should deviate from Known

1

deviating w.r.t. subspaces

known clusters in alternative subspaces are already different enough InAlterSubspace(Known, C) = {(Oi, Si) ∈ Known | |S ∩ Si| < β · |S|}

2

deviating w.r.t. objects

known clusters in similar subspaces should cover different objects |O\CoveredInSimilar(Known, C)| |O| ≥ α

3 2 4 1 1 3 2

dim 3 dim 4 dim 1 dim 2

C1 is a valid alternative to Known!

ASCLU – Alternative Subspace Clustering 4 / 6

slide-7
SLIDE 7

Introduction Model Conclusion

Alternative Subspace Clustering Model – II

Avoiding redundancy

so far: C ∈ Res different to given clusters ⇒ non-redundant w.r.t. Known redundancy between X, Y ∈ Res still possible solution: C ∈ Res valid alternative to remaining novel clusters Res\{C}

a l t e r n a t i v e alternative

ASCLU – Alternative Subspace Clustering 5 / 6

slide-8
SLIDE 8

Introduction Model Conclusion

Alternative Subspace Clustering Model – II

Avoiding redundancy

so far: C ∈ Res different to given clusters ⇒ non-redundant w.r.t. Known redundancy between X, Y ∈ Res still possible solution: C ∈ Res valid alternative to remaining novel clusters Res\{C}

Optimal alternative subspace clustering

Given previous clustering Known and set of possible subspace clusters All, choose Res ⊆ All such that

1

∀C ∈ Res : C is a valid alternative to Known

2

∀C ∈ Res : C is a valid alternative to Res\{C}

3

Res is the most interesting clustering fulfilling 1 & 2

ASCLU – Alternative Subspace Clustering 5 / 6

slide-9
SLIDE 9

Introduction Model Conclusion

Conclusion ASCLU – Alternative Subspace Clustering

ASCLU detects alternatives based on → deviating subspaces → deviating object sets ASCLU avoids redundant clusters

ASCLU – Alternative Subspace Clustering 6 / 6

slide-10
SLIDE 10

Introduction Model Conclusion

Conclusion ASCLU – Alternative Subspace Clustering

ASCLU detects alternatives based on → deviating subspaces → deviating object sets ASCLU avoids redundant clusters Thank you for your attention. Questions?

ASCLU – Alternative Subspace Clustering 6 / 6