Descriptive clustering Christel VRAIN, Thi-Bich-Hanh DAO LIFO - - PowerPoint PPT Presentation

descriptive clustering
SMART_READER_LITE
LIVE PREVIEW

Descriptive clustering Christel VRAIN, Thi-Bich-Hanh DAO LIFO - - PowerPoint PPT Presentation

Descriptive clustering Christel VRAIN, Thi-Bich-Hanh DAO LIFO Universit dOrlans Workshop on Machine Learning and Explainability Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 1 / 29 Motivation Clustering used extensively in


slide-1
SLIDE 1

Descriptive clustering

Christel VRAIN, Thi-Bich-Hanh DAO

LIFO Université d’Orléans

Workshop on Machine Learning and Explainability

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 1 / 29

slide-2
SLIDE 2

Motivation

Clustering used extensively in AI applications In many domains, data have very good features/attributes to form compact clusters, but

◮ features cannot explain the clustering well ◮ data also described by another set of (potentially sparse and noisy)

descriptors/tags that are useful for explanation

Setting Features/attributes Descriptors/tags Twitter network mention/retweet graph hashtag usage Images SIFT features tags Needs to balance compact clusters (w.r.t. to a distance between

  • bjects) with

◮ their consistency with human expectations ◮ their explanations to human

Aims:

1

find clusters close to the expert expectations by leveraging knowledge

2

discover simultaneously explanations during the clustering process

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 2 / 29

slide-3
SLIDE 3

Mainly two frameworks for clustering

Conceptual Clustering:

◮ introduced in the 80’s [Michalski & Stepp, 1983, Fisher, 1985] ◮ presently based on closed patterns (FCA and pattern mining) ◮ based on qualitative properties ◮ does not take into account quantitative attributes, nor distance

between objects (no notion of compactness, e.g. clusters diameter)

Distance-based clustering:

◮ based on dissimilarities between objects ◮ appropriate for quantitative data ◮ qualitative properties must be encapsulated in a distance Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 3 / 29

slide-4
SLIDE 4

A declarative framework for constrained clustering in CP

Dao, Duong, Vrain, AIJ 2017

Input: a dataset or a dissimilarity measure between pairs of points Clusters are defined by an assignment of points to clusters: G[o] = c, c ∈ [1, k] Optimization criterion, e.g. minimizing the maximum diameter Constraints are put

◮ for representing a partition ◮ for breaking symmetries ◮ user constraints: size, diameter, split, . . .

G1 = 1 Gi ≤ maxj∈[1,i−1](Gj) + 1, for i ∈ [2, n] #{i | Gi = kmin} ≥ 1

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 4 / 29

slide-5
SLIDE 5

How to make clustering interpretable?

Before the clustering process: leverage human knowledge before clustering → actionable clustering After the clustering process → explain the cluster :

◮ Characterization ◮ Generalization ◮ Statistics

During the clustering process. Two assumptions

◮ Clustering and explanations are in the same representation space

→ conceptual clustering

◮ Clustering and explanations are in two different representation

spaces.

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 5 / 29

slide-6
SLIDE 6

Actionable clustering

Dao, Vrain, Duong, Davidson, ECAI 2016

Express constraints that makes the clustering useful for a given purpose Find useful groups each of which you can invite to a different dinner party equal number of males and females width of a cluster in terms of age at most 10 each person in a cluster should have at least r other people with the same hobby Instances 3, 9 are in the same cluster if 11, 15 are in different clusters. B1 ↔ (G11 = G15) B2 ↔ (G3 = G9) B1 ≤ B2

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 6 / 29

slide-7
SLIDE 7

Unifying conceptual and distance clustering

Dao, Lesaint, Vrain, JFPC 2015

  • taking into account quantitative and qualitative data
  • combining conditions/criteria from both frameworks

Data:

◮ a set O of objects, a set I of Boolean properties ◮ a dissimilarity measure d(o, o′) for any o, o′ in O ◮ a binary database D: Dop = 1, when o satisfies property p

Clusters are defined by:

1

assignment of points to clusters: G[o] = c, c ∈ [1, k]

2

description of clusters: A[c, p] = 1 iff p is in the description of cluster c.

Constraints

◮ Constraints of the distance-based model: partition, breaking

symmetries

◮ Constraints from the conceptual model: an object is in a cluster iff it

satisfies all its properties. ∀o ∈ O, ∀c ∈ C G[o] = c ⇔

p∈I A[c, p](1 − Dop) = 0

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 7 / 29

slide-8
SLIDE 8

Car Dataset

193 objects technical properties (22 attributes) :

◮ motorization (diesel or not) ◮ drive wheels (4, 2 front, 2 rear) ◮ power (between 48 and 288) ◮ etc.

discretization : 64 qualitative attributes price (quantitative attribute)

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 8 / 29

slide-9
SLIDE 9

Car dataset

Conceptual setting → (e) concepts + maximizing min. size of clusters → (f) concepts + maximizing min. size of concepts Price distribution not convincing Distance-based setting → (g) minimizing max diameter No convincing concepts Unified framework → (h) concepts + minimizing max diameter A better modeling of the 3 car ranges with concepts based on size, engine power, fuel consumption, . . .

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 9 / 29

slide-10
SLIDE 10

Descriptive clustering formulation

Dao, Kuo, Ravi, Vrain, Davidson, IJCAI 2018

Data: n data instances described by numerical features X and interpretable boolean descriptors/tags D Aims: Simultaneously look for clusters which are both

◮ good/compact in one modality (e.g. SIFT features for images or

graph distance)

◮ useful/descriptive in another modality (e.g. tags)

The objectives are not compatible → computation of a Pareto front corresponding to Pareto optimal solutions, allowing to model a trade-off with both objectives f: feature-focused objective to minimize compactness g: descriptor-focused objective to maximize interpretability

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 10 / 29

slide-11
SLIDE 11

Pareto optimal solutions and Pareto front

g f Criterion space

Partition P′ dominates P iff better in one criterion and not worse in the other P is a Pareto optimal solution iff there is no P′ which dominates P Pareto front = {(f(P), g(P)) | P is a Pareto optimal solution}

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 11 / 29

slide-12
SLIDE 12

Compute the complete Pareto front

P ← ∅; sf

1 ← minimize f subject to C;

i ← 1; while sf

i = NULL do

sg

i ← maximize g subject to C ∪ {f ≤ f(sf i )};

P ← P ∪ {sg

i };

i ← i + 1; sf

i ← minimize f subject to C ∪ {g > g(sg i−1)};

return P;

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 12 / 29

slide-13
SLIDE 13

Data and variables

Data:

◮ X: n × f matrix of n data instances with f numerical features ◮ D: n × r matrix of the same n instances with r tag indicators

Variables:

◮ cluster indication matrix Z: n × k boolean matrix

Zic = 1 indicates the i-th instance is in the c-cluster

◮ cluster description matrix S: k × r boolean matrix

Scp = 1 means the p-th tag is included in the description of the c-th cluster

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 13 / 29

slide-14
SLIDE 14

Partitioning constraints

Each instance is in one cluster Each cluster has at least one element Breaking symmetries between clusters ∀i = 1, . . . , n, k

c=1 Zic = 1

∀c = 1, . . . , k, n

i=1 Zic ≥ 1

Z11 = 1 ∀i = 2, . . . , n, ∀c = 2, . . . , k, i−1

j=1 Zjc−1 ≥ Zic

Each cluster description has at least one tag ∀c = 1, . . . , k,

n

  • i=1

Scp ≥ 1

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 14 / 29

slide-15
SLIDE 15

Cluster description constraints

Each cluster is described by a non empty subset of tags An instance in a cluster must satisfy most of its descriptions (up to α exceptions): ∀c = 1, . . . , k, ∀i = 1, . . . , n, Zic = 1 = ⇒

r

  • p=1

Scp(1 − Dip) ≤ α A tag is included in a cluster description if and only if most of the instances in the cluster (up to β exceptions) possess it: ∀c = 1, . . . , k, ∀p = 1, . . . , r, Scp = 1 ⇐ ⇒

n

  • i=1

Zic(1 − Dip) ≤ β With dense tags dataset, stronger version with α = β = 0

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 15 / 29

slide-16
SLIDE 16

Feature-focused optimization criteria

Finding compact clusters, based on their distance d(·, ·) defined over pairs of instances

f(Z, S) =

n

max

i<j,i,j=1 ZiZT j d(Xi, Xj)

<latexit sha1_base64="jSVOGtCK+Jy8ajRs97jD8Ie1fDs=">AC83icjVHLSsNAFD3G97vq0s1gESqUmoigoLoxqWi1VIfIUmnOjUvkokoxc9w50rc+gVu9RvEP9C/8M4YwQeiE5KcOfeM3PvdWNfpNI0nzuMzq7unt6+/oHBoeGR0cLY+G4aZYnHq17kR0nNdVLui5BXpZA+r8UJdwLX53vu6bqK753xJBVRuCMvYn4YOMehaArPkUTZhdlmqV7enmEr7CBwzu2WG6VmSi3VqzLo5DVbVG3W0c7jVLNFuWa3ZqxC0WzYurFfgIrB0XkazMqPOEADUTwkCEARwhJ2IeDlJ59WDARE3eINnEJIaHjHJcYIG1GWZwyHGJP6XtMu/2cDWmvPFOt9ugUn96ElAzTpIkoLyGsTmM6nmlnxf7m3dae6m4X9Hdzr4BYiRNi/9J9ZP5Xp2qRaGJR1yCoplgzqjovd8l0V9TN2aeqJDnExCncoHhC2NPKjz4zrUl17aq3jo6/6EzFqr2X52Z4VbekAVvfx/kTVOcqSxVra764upZPug+TmEKJxrmAVWxgE1WyvsI9HvBonBnXxo1x+5qdOSaCXxZxt0bkufqQ=</latexit><latexit sha1_base64="jSVOGtCK+Jy8ajRs97jD8Ie1fDs=">AC83icjVHLSsNAFD3G97vq0s1gESqUmoigoLoxqWi1VIfIUmnOjUvkokoxc9w50rc+gVu9RvEP9C/8M4YwQeiE5KcOfeM3PvdWNfpNI0nzuMzq7unt6+/oHBoeGR0cLY+G4aZYnHq17kR0nNdVLui5BXpZA+r8UJdwLX53vu6bqK753xJBVRuCMvYn4YOMehaArPkUTZhdlmqV7enmEr7CBwzu2WG6VmSi3VqzLo5DVbVG3W0c7jVLNFuWa3ZqxC0WzYurFfgIrB0XkazMqPOEADUTwkCEARwhJ2IeDlJ59WDARE3eINnEJIaHjHJcYIG1GWZwyHGJP6XtMu/2cDWmvPFOt9ugUn96ElAzTpIkoLyGsTmM6nmlnxf7m3dae6m4X9Hdzr4BYiRNi/9J9ZP5Xp2qRaGJR1yCoplgzqjovd8l0V9TN2aeqJDnExCncoHhC2NPKjz4zrUl17aq3jo6/6EzFqr2X52Z4VbekAVvfx/kTVOcqSxVra764upZPug+TmEKJxrmAVWxgE1WyvsI9HvBonBnXxo1x+5qdOSaCXxZxt0bkufqQ=</latexit><latexit sha1_base64="jSVOGtCK+Jy8ajRs97jD8Ie1fDs=">AC83icjVHLSsNAFD3G97vq0s1gESqUmoigoLoxqWi1VIfIUmnOjUvkokoxc9w50rc+gVu9RvEP9C/8M4YwQeiE5KcOfeM3PvdWNfpNI0nzuMzq7unt6+/oHBoeGR0cLY+G4aZYnHq17kR0nNdVLui5BXpZA+r8UJdwLX53vu6bqK753xJBVRuCMvYn4YOMehaArPkUTZhdlmqV7enmEr7CBwzu2WG6VmSi3VqzLo5DVbVG3W0c7jVLNFuWa3ZqxC0WzYurFfgIrB0XkazMqPOEADUTwkCEARwhJ2IeDlJ59WDARE3eINnEJIaHjHJcYIG1GWZwyHGJP6XtMu/2cDWmvPFOt9ugUn96ElAzTpIkoLyGsTmM6nmlnxf7m3dae6m4X9Hdzr4BYiRNi/9J9ZP5Xp2qRaGJR1yCoplgzqjovd8l0V9TN2aeqJDnExCncoHhC2NPKjz4zrUl17aq3jo6/6EzFqr2X52Z4VbekAVvfx/kTVOcqSxVra764upZPug+TmEKJxrmAVWxgE1WyvsI9HvBonBnXxo1x+5qdOSaCXxZxt0bkufqQ=</latexit><latexit sha1_base64="jSVOGtCK+Jy8ajRs97jD8Ie1fDs=">AC83icjVHLSsNAFD3G97vq0s1gESqUmoigoLoxqWi1VIfIUmnOjUvkokoxc9w50rc+gVu9RvEP9C/8M4YwQeiE5KcOfeM3PvdWNfpNI0nzuMzq7unt6+/oHBoeGR0cLY+G4aZYnHq17kR0nNdVLui5BXpZA+r8UJdwLX53vu6bqK753xJBVRuCMvYn4YOMehaArPkUTZhdlmqV7enmEr7CBwzu2WG6VmSi3VqzLo5DVbVG3W0c7jVLNFuWa3ZqxC0WzYurFfgIrB0XkazMqPOEADUTwkCEARwhJ2IeDlJ59WDARE3eINnEJIaHjHJcYIG1GWZwyHGJP6XtMu/2cDWmvPFOt9ugUn96ElAzTpIkoLyGsTmM6nmlnxf7m3dae6m4X9Hdzr4BYiRNi/9J9ZP5Xp2qRaGJR1yCoplgzqjovd8l0V9TN2aeqJDnExCncoHhC2NPKjz4zrUl17aq3jo6/6EzFqr2X52Z4VbekAVvfx/kTVOcqSxVra764upZPug+TmEKJxrmAVWxgE1WyvsI9HvBonBnXxo1x+5qdOSaCXxZxt0bkufqQ=</latexit>

Diameter arg min f(Z, S)

f(Z, S) = Σn

i<j,i,j=1ZiZT j d(Xi, Xj)

<latexit sha1_base64="JtFLvyJc6iUfIS5RvTB+qWpS4zo=">AC9XicjVHLSsNAFD3G97vq0s1gESqUkoigoLoxqWi1VIfIUmndWpeJBNRit/hzpW49Qvc6ieIf6B/4Z0xg9EJyQ5c+49Z+be68a+SKVpPncZ3T29f0Dg0PDI6Nj4WJyb0yhKPV73Ij5Ka6TcFyGvSiF9XosT7gSuz/fd0w0V3z/jSqicFdexPwocFqhaArPkUTZBatZqpd35tgqO9wRrcCxO2KlXWai3F61Lo9DVrdF3W4f7zZKNVuUa3Z7zi4UzYqpF/sJrBwUka+tqPCEQzQwUOGABwhJGEfDlJ6DmDBREzcETrEJYSEjnNcYoi0GWVxynCIPaVvi3YHORvSXnmWu3RKT69CSkZkTUV5CWJ3GdDzTzor9zbujPdXdLujv5l4BsRInxP6l+8j8r07VItHEkq5BUE2xZlR1Xu6S6a6om7NPVUlyiIlTuEHxhLCnlR9ZlqT6tpVbx0df9GZilV7L8/N8KpuSQO2vo/zJ6jOV5Yr1vZCcW09n/QApjGDEo1zEWvYxBaqZH2Fezg0Tg3ro0b4/Y91ejKNVP4soy7N5MDoGg=</latexit><latexit sha1_base64="JtFLvyJc6iUfIS5RvTB+qWpS4zo=">AC9XicjVHLSsNAFD3G97vq0s1gESqUkoigoLoxqWi1VIfIUmndWpeJBNRit/hzpW49Qvc6ieIf6B/4Z0xg9EJyQ5c+49Z+be68a+SKVpPncZ3T29f0Dg0PDI6Nj4WJyb0yhKPV73Ij5Ka6TcFyGvSiF9XosT7gSuz/fd0w0V3z/jSqicFdexPwocFqhaArPkUTZBatZqpd35tgqO9wRrcCxO2KlXWai3F61Lo9DVrdF3W4f7zZKNVuUa3Z7zi4UzYqpF/sJrBwUka+tqPCEQzQwUOGABwhJGEfDlJ6DmDBREzcETrEJYSEjnNcYoi0GWVxynCIPaVvi3YHORvSXnmWu3RKT69CSkZkTUV5CWJ3GdDzTzor9zbujPdXdLujv5l4BsRInxP6l+8j8r07VItHEkq5BUE2xZlR1Xu6S6a6om7NPVUlyiIlTuEHxhLCnlR9ZlqT6tpVbx0df9GZilV7L8/N8KpuSQO2vo/zJ6jOV5Yr1vZCcW09n/QApjGDEo1zEWvYxBaqZH2Fezg0Tg3ro0b4/Y91ejKNVP4soy7N5MDoGg=</latexit><latexit sha1_base64="JtFLvyJc6iUfIS5RvTB+qWpS4zo=">AC9XicjVHLSsNAFD3G97vq0s1gESqUkoigoLoxqWi1VIfIUmndWpeJBNRit/hzpW49Qvc6ieIf6B/4Z0xg9EJyQ5c+49Z+be68a+SKVpPncZ3T29f0Dg0PDI6Nj4WJyb0yhKPV73Ij5Ka6TcFyGvSiF9XosT7gSuz/fd0w0V3z/jSqicFdexPwocFqhaArPkUTZBatZqpd35tgqO9wRrcCxO2KlXWai3F61Lo9DVrdF3W4f7zZKNVuUa3Z7zi4UzYqpF/sJrBwUka+tqPCEQzQwUOGABwhJGEfDlJ6DmDBREzcETrEJYSEjnNcYoi0GWVxynCIPaVvi3YHORvSXnmWu3RKT69CSkZkTUV5CWJ3GdDzTzor9zbujPdXdLujv5l4BsRInxP6l+8j8r07VItHEkq5BUE2xZlR1Xu6S6a6om7NPVUlyiIlTuEHxhLCnlR9ZlqT6tpVbx0df9GZilV7L8/N8KpuSQO2vo/zJ6jOV5Yr1vZCcW09n/QApjGDEo1zEWvYxBaqZH2Fezg0Tg3ro0b4/Y91ejKNVP4soy7N5MDoGg=</latexit><latexit sha1_base64="JtFLvyJc6iUfIS5RvTB+qWpS4zo=">AC9XicjVHLSsNAFD3G97vq0s1gESqUkoigoLoxqWi1VIfIUmndWpeJBNRit/hzpW49Qvc6ieIf6B/4Z0xg9EJyQ5c+49Z+be68a+SKVpPncZ3T29f0Dg0PDI6Nj4WJyb0yhKPV73Ij5Ka6TcFyGvSiF9XosT7gSuz/fd0w0V3z/jSqicFdexPwocFqhaArPkUTZBatZqpd35tgqO9wRrcCxO2KlXWai3F61Lo9DVrdF3W4f7zZKNVuUa3Z7zi4UzYqpF/sJrBwUka+tqPCEQzQwUOGABwhJGEfDlJ6DmDBREzcETrEJYSEjnNcYoi0GWVxynCIPaVvi3YHORvSXnmWu3RKT69CSkZkTUV5CWJ3GdDzTzor9zbujPdXdLujv5l4BsRInxP6l+8j8r07VItHEkq5BUE2xZlR1Xu6S6a6om7NPVUlyiIlTuEHxhLCnlR9ZlqT6tpVbx0df9GZilV7L8/N8KpuSQO2vo/zJ6jOV5Yr1vZCcW09n/QApjGDEo1zEWvYxBaqZH2Fezg0Tg3ro0b4/Y91ejKNVP4soy7N5MDoGg=</latexit>

Sum of within-cluster distances

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 16 / 29

slide-17
SLIDE 17

Descriptor-focused optimization criteria

Tags may be dense, sparse or noisy ⇒ different objective functions

1

Minimize tag disagreement (MTD)

◮ minimize α + β ◮ useful when the tags contain noise 2

Max-min complete tag agreement (MMCTA):

◮ the tags of a cluster are shared by all its instances (α = β = 0) ◮ maximize the tag set of each cluster (size of the smallest) ◮ Use: tags well populated with little noise 3

Max-min neighborhood agreement (MMNA):

◮ each pair of instances in a same cluster must share at least q tags ◮ maximize q ◮ Use: tags are sparse Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 17 / 29

slide-18
SLIDE 18

Minimize tag disagreement (MTD)

allows disagreements α number of tags an instance may not posses β number of instances a tag may not cover useful when tags are sparse and/or noisy A B C D E F G 1 3 4 5

Inst. Tags

2 α = 1, β = 2

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 18 / 29

slide-19
SLIDE 19

Max-min complete tag agreement (MMCTA)

the tags of a cluster are shared by all its instances (α = β = 0) useful when the tags are well populated with little noise A B C D E F G 1 2 3 4 5

Inst. Tags

MMCTA=1

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 19 / 29

slide-20
SLIDE 20

Max-min neighborhood agreement (MMNA)

Every pair of instances in a cluster must share at least q tags useful when tags are sparse A B C D E F G 1 2 3 4 5

Inst. Tags

MMNA=1

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 20 / 29

slide-21
SLIDE 21

Two methods

Integer linear programming (ILP): all the constraints and

  • bjectives can be transformed into linear form

Constraint programming (CP): using global constraints, reified constraints and restart search

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 21 / 29

slide-22
SLIDE 22

CP formulation

Supplementary variables Gi ∈ {1, . . . , k} for i = 1, .., n Gi = c means i-th instance is in c-cluster Channeling constraints: Zic = 1 ⇐ ⇒ Gi = c Global constraints: precede(G, [1, .., k]) atleast(G, 1, k) diameter(G, f, d) Reified constraints to express description constraints For MMCTA, MMNA, new global constraints to enforce: ∀i, j = 1, .., n, Gi = Gj = ⇒

r

  • p=1

DipDjp ≥ q

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 22 / 29

slide-23
SLIDE 23

Clustering tagged images

[Lampert C.H, Nickisch H., Harmeling S., CVPR 2009] 30000 images from 50 classes of animals Each image described by 2000 SIFT features and variable number of tags (black, fast, timid, etc.) Animal names are not given to algorithm Randomly sample 100 images from 10 first animal classes

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 23 / 29

slide-24
SLIDE 24

Trade off compactness vs. useful description

7,200 7,600 8,000 8,400 12 14 16 18

Diameter MMNA

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 24 / 29

slide-25
SLIDE 25

First Pareto point: Diameter minimized. MMCTA=4. MMNA=11

Cl# Composition by animals Description by tags C1 1 grizzly bear, 2 dalmatian, 1 horse, 2 blue whale big, fast, strong, muscle, newworld, smart C2 5 antelope, 2 grizzly bear, 2 beaver, 5 dalmatian, 5 persian cat, 5 horse, 6 german shepherd, 3 siamese cat furry, chewteeth, fast, quadrapedal, newworld, ground C3 69 beaver, 64 dalmatian, 42 persian cat, 29 blue whale, 42 siamese cat tail, fast, newworld, timid, smart, solitary C4 100 killer whale, 69 blue whale, 1 siamese cat tail, fast, fish, smart C5 95 antelope, 97 grizzly bear, 29 beaver, 29 dalma- tian, 53 persian cat, 94 horse, 94 german shep- herd, 54 siamese cat furry, chewteeth, fast, quadrapedal, newworld, ground

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 25 / 29

slide-26
SLIDE 26

Third Pareto point. MMCTA=9, MMNA=15

Cl# Composition by animals Description by tags C1 2 antelope, 4 dalmatian, 2 horse, 3 german shepherd, 4 siamese cat furry, lean, longleg, tail, chewteeth, walks, fast, muscle, quadrapedal, active, agility, newworld,

  • ldworld, ground

C2 2 beaver, 1 persian cat, 1 horse, 1 german shepherd furry, tail, chewteeth, fast, quadrapedal, agility, newworld, ground, smart C3 100 grizzly bear, 98 beaver, 99 persian cat, 1 siamese cat furry, paws, chewteeth, claws, fast, quadrapedal, fish, newworld, ground, smart, solitary C4 100 killer whale, 100 blue whale spots, hairless, toughskin, big, bulbous, flip- pers, tail, strainteeth, swims, fast, strong, fish, plankton, arctic, ocean, water, smart, group C5 98 antelope, 96 dalmatian, 97 horse, 96 german shepherd, 95 siamese cat furry, lean, longleg, tail, chewteeth, walks, fast, muscle, quadrapedal, active, agility, newworld,

  • ldworld, ground

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 26 / 29

slide-27
SLIDE 27

Fifth Pareto point: MMNA maximized. MMCTA=15, MMNA=18

Cl# Composition by animals Description by tags C1 100 antelope, 100 dalmatian furry, big, lean, longleg, tail, chewteeth, walks, fast, strong, muscle, quadrapedal, active, agility, newworld, oldworld, ground, timid, group C2 100 horse, 99 german shepherd, 98 siamese cat black, brown, gray, patches, furry, lean, lon- gleg, tail, chewteeth, walks, fast, muscle, quadrapedal, active, agility, newworld,

  • ld-

world, ground, smart, domestic C3 100 grizzly bear, 100 beaver, 1 siamese cat brown, furry, paws, chewteeth, claws, fast, mus- cle, quadrapedal, active, nocturnal, fish, new- world, ground, smart, solitary C4 100 killer whale, 100 blue whale spots, hairless, toughskin, big, bulbous, flip- pers, tail, strainteeth, swims, fast, strong, fish, plankton, arctic, ocean, water, smart, group C5 100 persian cat, 1 german shep- herd, 1 siamese cat gray, furry, pads, paws, tail, chewteeth, meat- teeth, claws, walks, fast, quadrapedal, agility, meat, newworld, oldworld, ground, smart, soli- tary, domestic

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 27 / 29

slide-28
SLIDE 28

Towards actionable/explainable clustering

Constrainability Explanability

Classic clustering Constrained Clustering AIJ 2017 Actionable clustering ECAI 2016 HIL clustering AAAI 2017 Conceptual clustering Descriptive clustering IJCAI 2018

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 28 / 29

slide-29
SLIDE 29

Future work

Framework is extendable to other types of compactness and description Scalability

◮ Smart data ◮ Sampling ?? ◮ Relaxing the search for an optimal solution

Learning constraints and preferences Leveraging knowledge by means of constraints in other frameworks

Dao - Vrain (LIFO) Descriptive clustering 08/10/2018 29 / 29