demystifying relational latent representations
play

Demystifying Relational Latent Representations Sebastijan Dumani, - PowerPoint PPT Presentation

Demystifying Relational Latent Representations Sebastijan Dumani, Hendrik Blockeel DTAI, KU Leuven September 6, ILP 2017 1 Outline 2/24 1 Introduction 2 Understanding latent features 3 Properties of latent spaces Demystifying


  1. Demystifying Relational Latent Representations Sebastijan Dumančić, Hendrik Blockeel DTAI, KU Leuven September 6, ILP 2017

  2. 1 – Outline 2/24 1 Introduction 2 Understanding latent features 3 Properties of latent spaces Demystifying Relational Latent Representations – Dumančić, Blockeel

  3. 1 – Learning Relational Latent Representations 3/24 Learning versatile relational latent features with clustering and variety of similarities (CUR 2 LED) [Dumančić and Blockeel, IJCAI 2017] Demystifying Relational Latent Representations – Dumančić, Blockeel

  4. 1 – Learning Relational Latent Representations 4/24 Benefits: better performance simpler models [with some overhead] Questions to be answered Can we interpret latent features? 1 (approximate) definition of latent features/relations What makes them effective? 2 distinctive properties? Demystifying Relational Latent Representations – Dumančić, Blockeel

  5. 2 – Outline 5/24 1 Introduction 2 Understanding latent features 3 Properties of latent spaces Demystifying Relational Latent Representations – Dumančić, Blockeel

  6. 2 – Understanding latent features 6/24 latent features = clusters of vertices (instances) and edges (relationships) key idea: cluster prototype represents the meaning of a feature Demystifying Relational Latent Representations – Dumančić, Blockeel

  7. 2 – Understanding features - technicalities (1) 7/24 CUR 2 LED uses ReCeNT as a similarity measure for relational data ⇒ views instances as neighbourhood trees Data Neighbourhood tree [Dumančić and Blockeel, MLJ 2017] Demystifying Relational Latent Representations – Dumančić, Blockeel

  8. 2 – Understanding features - technicalities (1) 8/24 CUR 2 LED uses ReCeNT as a similarity measure for relational data ⇒ views instances as neighbourhood trees Data Neighbourhood tree key idea: mean tree represents the meaning of a feature Demystifying Relational Latent Representations – Dumančić, Blockeel

  9. 2 – Understanding features - technicalities (2) 9/24 CUR 2 LED requires a (set of) similarity interpretation(s) → a specification what similarity reflects attribute sim neighbourhood attributes sim neighbourhood identity edge labels connectedness key idea: find a mean tree given the similarity interpretation Demystifying Relational Latent Representations – Dumančić, Blockeel

  10. 2 – Understanding features - technicalities (3) 10/24 CUR 2 LED compares neighbourhood trees by comparing distributions of elements within them elements selected by the similarity interpretation attributes values, edge types, identities ... key idea: mean tree ≈ elements that appear in all NTs (in a cluster) with similar frequency Demystifying Relational Latent Representations – Dumančić, Blockeel

  11. 2 – Understanding features - outline 11/24 Given a set of neighbourhood tree and a similarity interpretation ... 1. Demystifying Relational Latent Representations – Dumančić, Blockeel

  12. 2 – Understanding features - outline 12/24 Calculate the relative frequencies of elements within a tree 1. 2. Demystifying Relational Latent Representations – Dumančić, Blockeel

  13. 2 – Understanding features - outline 13/24 Summarize the relative frequencies of unique elements across trees 1. 2. 3. Demystifying Relational Latent Representations – Dumančić, Blockeel

  14. 2 – Understanding features - outline 14/24 Select elements with low standard deviation 1. 2. 3. ( θ -confidence) An element with mean value µ and standard deviation σ in a cluster, is said to be θ -confident if σ < θ · µ . Demystifying Relational Latent Representations – Dumančić, Blockeel

  15. 2 – Understanding features - results 15/24 Use case: IMDB Demystifying Relational Latent Representations – Dumančić, Blockeel

  16. 2 – Understanding features - results 16/24 Use case: UWCSE Demystifying Relational Latent Representations – Dumančić, Blockeel

  17. 3 – Outline 17/24 1 Introduction 2 Understanding latent features 3 Properties of latent spaces Demystifying Relational Latent Representations – Dumančić, Blockeel

  18. 3 – Why are they effective? 18/24 Properties of latent spaces: label entropy distribution of labels within true instantiations of predicates proxy to a quantification of learning difficulty sparsity modelling local vs. global concept spread across a small number of local regions is easier to capture redundancy CUR 2 LED creates many features - are all of them necessary? Demystifying Relational Latent Representations – Dumančić, Blockeel

  19. 3 – Why are they effective? - label entropy 19/24 no improvement (1) improved performance (3) when performance increases, latent representation has many predicates of low label entropy Demystifying Relational Latent Representations – Dumančić, Blockeel

  20. 3 – Why are they effective? - sparsity 20/24 improved performance (3) no improvement (1) when performance increases, latent representation is sparser than the original one Demystifying Relational Latent Representations – Dumančić, Blockeel

  21. 3 – Why are they effective? - entropy vs. sparsity 21/24 Trivial explanation: many predicates with a very small number of true instantiations (not helpful) no improvement (1) improved performance (3) ... not what’s happening here: latent predicates have a comparable number of true instantiations Demystifying Relational Latent Representations – Dumančić, Blockeel

  22. 3 – Why are they effective? 22/24 latent features successfully identify local regions in the instance space that match well with the provided labels Demystifying Relational Latent Representations – Dumančić, Blockeel

  23. 3 – Why are they effective? - redundancy 23/24 CUR 2 LED creates a lot of features similarity interpretations considered independently but many instances might be identical in several similarity interpretations every time a new clustering is obtained, check how much it overlaps with an existing clusterings using the adjusted Rand index Demystifying Relational Latent Representations – Dumančić, Blockeel

  24. Thank you! Questions? Demystifying Relational Latent Representations – Dumančić, Blockeel

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend