Demystifying Relational Latent Representations Sebastijan Dumani, - - PowerPoint PPT Presentation

demystifying relational latent representations
SMART_READER_LITE
LIVE PREVIEW

Demystifying Relational Latent Representations Sebastijan Dumani, - - PowerPoint PPT Presentation

Demystifying Relational Latent Representations Sebastijan Dumani, Hendrik Blockeel DTAI, KU Leuven September 6, ILP 2017 1 Outline 2/24 1 Introduction 2 Understanding latent features 3 Properties of latent spaces Demystifying


slide-1
SLIDE 1

Demystifying Relational Latent Representations

Sebastijan Dumančić, Hendrik Blockeel DTAI, KU Leuven September 6, ILP 2017

slide-2
SLIDE 2

1 – Outline

2/24

1 Introduction 2 Understanding latent features 3 Properties of latent spaces Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-3
SLIDE 3

1 – Learning Relational Latent Representations

3/24 Learning versatile relational latent features with clustering and variety of similarities (CUR2LED)

[Dumančić and Blockeel, IJCAI 2017]

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-4
SLIDE 4

1 – Learning Relational Latent Representations

4/24 Benefits: better performance simpler models [with some overhead] Questions to be answered

1

Can we interpret latent features?

(approximate) definition of latent features/relations

2

What makes them effective?

distinctive properties?

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-5
SLIDE 5

2 – Outline

5/24

1 Introduction 2 Understanding latent features 3 Properties of latent spaces Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-6
SLIDE 6

2 – Understanding latent features

6/24 latent features = clusters of vertices (instances) and edges (relationships) key idea: cluster prototype represents the meaning of a feature

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-7
SLIDE 7

2 – Understanding features - technicalities (1)

7/24 CUR2LED uses ReCeNT as a similarity measure for relational data ⇒ views instances as neighbourhood trees Data Neighbourhood tree

[Dumančić and Blockeel, MLJ 2017]

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-8
SLIDE 8

2 – Understanding features - technicalities (1)

8/24 CUR2LED uses ReCeNT as a similarity measure for relational data ⇒ views instances as neighbourhood trees Data Neighbourhood tree key idea: mean tree represents the meaning of a feature

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-9
SLIDE 9

2 – Understanding features - technicalities (2)

9/24 CUR2LED requires a (set of) similarity interpretation(s) → a specification what similarity reflects attribute sim neighbourhood attributes sim neighbourhood identity edge labels connectedness key idea: find a mean tree given the similarity interpretation

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-10
SLIDE 10

2 – Understanding features - technicalities (3)

10/24 CUR2LED compares neighbourhood trees by comparing distributions of elements within them elements selected by the similarity interpretation attributes values, edge types, identities ... key idea: mean tree ≈ elements that appear in all NTs (in a cluster) with similar frequency

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-11
SLIDE 11

2 – Understanding features - outline

11/24 Given a set of neighbourhood tree and a similarity interpretation ... 1.

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-12
SLIDE 12

2 – Understanding features - outline

12/24 Calculate the relative frequencies of elements within a tree 1. 2.

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-13
SLIDE 13

2 – Understanding features - outline

13/24 Summarize the relative frequencies of unique elements across trees 1. 2. 3.

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-14
SLIDE 14

2 – Understanding features - outline

14/24 Select elements with low standard deviation 1. 2. 3. (θ-confidence) An element with mean value µ and standard deviation σ in a cluster, is said to be θ-confident if σ < θ · µ.

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-15
SLIDE 15

2 – Understanding features - results

15/24 Use case: IMDB

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-16
SLIDE 16

2 – Understanding features - results

16/24 Use case: UWCSE

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-17
SLIDE 17

3 – Outline

17/24

1 Introduction 2 Understanding latent features 3 Properties of latent spaces Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-18
SLIDE 18

3 – Why are they effective?

18/24 Properties of latent spaces: label entropy

distribution of labels within true instantiations of predicates proxy to a quantification of learning difficulty

sparsity

modelling local vs. global concept spread across a small number of local regions is easier to capture

redundancy

CUR2LED creates many features - are all of them necessary?

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-19
SLIDE 19

3 – Why are they effective? - label entropy

19/24

improved performance (3) no improvement (1)

when performance increases, latent representation has many predicates of low label entropy

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-20
SLIDE 20

3 – Why are they effective? - sparsity

20/24

improved performance (3) no improvement (1)

when performance increases, latent representation is sparser than the original one

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-21
SLIDE 21

3 – Why are they effective? - entropy vs. sparsity

21/24 Trivial explanation: many predicates with a very small number of true instantiations (not helpful)

improved performance (3) no improvement (1)

... not what’s happening here: latent predicates have a comparable number of true instantiations

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-22
SLIDE 22

3 – Why are they effective?

22/24

latent features successfully identify local regions in the instance space that match well with the provided labels

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-23
SLIDE 23

3 – Why are they effective? - redundancy

23/24

CUR2LED creates a lot of features similarity interpretations considered independently but many instances might be identical in several similarity interpretations every time a new clustering is obtained, check how much it

  • verlaps with an existing

clusterings using the adjusted Rand index

Demystifying Relational Latent Representations – Dumančić, Blockeel

slide-24
SLIDE 24

Thank you! Questions?

Demystifying Relational Latent Representations – Dumančić, Blockeel