spatio temporal veracity assessment
play

SPATIO-TEMPORAL VERACITY ASSESSMENT JOANA GONZALES MALAVERRI - PowerPoint PPT Presentation

SPATIO-TEMPORAL VERACITY ASSESSMENT JOANA GONZALES MALAVERRI (LAHDAK) FATIHA SAS (LAHDAK) GIANLUCA QUERCINI (MODHEL) LABORATOIRE DE RECHERCHE EN INFORMATIQUE (LRI) {MALAVERRI, FATIHA.SAIS,GIANLUCA.QUERCINI}@LRI.FR JOURNEE ROD MOTIVATION


  1. SPATIO-TEMPORAL VERACITY ASSESSMENT JOANA GONZALES MALAVERRI (LAHDAK) FATIHA SAÏS (LAHDAK) GIANLUCA QUERCINI (MODHEL) LABORATOIRE DE RECHERCHE EN INFORMATIQUE (LRI) {MALAVERRI, FATIHA.SAIS,GIANLUCA.QUERCINI}@LRI.FR JOURNEE ROD

  2. MOTIVATION facts facts knowledge Base facts facts facts Extracted from https://goo.gl/B2i5aG 2

  3. REAL-WORLD SCENARIO: WHAT IS THE BARACK OBAMA’S CITIZENSHIP Indonesian American I don’t know! But the probability that all errors in the certificate were inadvertent is 1 in 75 quadrillion. British Kenyan https://goo.gl/ 5YTBxK 3

  4. facts facts knowledge Base facts facts facts Extracted from https://goo.gl/B2i5aG 4

  5. VERACITY ASSESSMENT APPROACHES [Beretta et al. 16] • Majority voting • Basic approaches • Extended approaches • Extra knowledge (ontologies) • Source dependency detection 5

  6. VERACITY ASSESSMENT What is the nationality of those US presidents? American American d1. Edward British Kenyan Dickinson American Baker American American d2. American USA USA USA Barack British S3 USA Obama American S1 S2 S4 d3. Bill Clinton d1:nat d2:nat d3:nat S1 <British, Kenyan, American> S2 <American, British, American> S3 <American, American, American> S4 <USA, USA, American> 6

  7. VERACITY ASSESSMENT What is the nationality of those US presidents? American American d1. Edward British Kenyan Dickinson American Baker American American d2. American USA Barack British S3 USA Obama American S1 S2 S4 d3. Bill Clinton d3:nat d1:nat d2:nat Fact confidence S1 <British, Kenyan, American> Source S2 <American, British, American> + reliability S3 <American, American, American> S4 <USA, USA, American> 7

  8. VERACITY ASSESSMENT APPROACHES: LIMITATIONS  Single-Truth d3:nat d1:nat d2:nat S1: <d1:nat, British> is also true. S1 <British, Kenyan, American> S2 <American, British, American> S3 <American, American, American> S4 <USA, USA, American> 8

  9. VERACITY ASSESSMENT APPROACHES: LIMITATIONS  Single-Truth d3:nat d1:nat d2:nat S1: <d1:nat, British> is also true. S1 <British, Kenyan, American> S2 <American, British, American> • No contextual information S3 <American, American, American> S1: <d1:nat, British> is true in the temporal context [1811-1816] S4 <USA, USA, American> S1: <d1:birthdate, 1811> S1: <d1:birthPlace, London> S1: <d1:immigrationDate, 1816> 9

  10. VERACITY ASSESSMENT APPROACHES: LIMITATIONS  Single-Truth S1: <d1:nat, British> is also true. d3:nat d1:nat d2:nat S1 <British, Kenyan, American> • No contextual information S2 <American, British, American> S1: <d1:nat, British> is true in the temporal S3 <American, American, American> context [1811-1816] S4 <USA, USA, American> S1: <d1:birthdate, 1811> S1: <d1:birthPlace, London> S1: <d1:immigrationDate, 1816> • Some sources mainly created from data extracted from Wikipedia How reliable S1: <d1:nat, British> is? S1: <d1:birthPlace, London> 10 S1: <d1:immigrationDate, 1816>

  11. VERACITY ASSESSMENT APPROACHES: LIMITATIONS • No explanations d3:nat d1:nat d2:nat Why S1: <d1:nat, British> is true? S1 <British, Kenyan, American> S1: <d1:birthPlace, London> S2 <American, British, American> S3 <American, American, American> S4 <USA, USA, American> 11

  12. GOAL Build an approach to assess the veracity of facts taken from knowledge bases based on spatio-temporal information. 12

  13. YAGO KNOWLEDGE BASE  General purpose semantic knowledge base (KB) Integrates information extracted  from Wikipedia infoboxes, WordNet, and GeoNames > 10 million entities (persons,  cities, organizations), > 120 million facts about these  entities  Attaches temporal and spatial dimensions to many of its facts and Yago structure entities – meta facts. 13

  14. APPROACH: RULE BASED TEMPORAL META FACTS GENERATION 14

  15. RULE BASED TEMPORAL META FACTS GENERATION Focus on facts that may change over time: Brad Pitt acted in   the Fight Club in 1999  the Curious Case of Benjamin Button in 2008 Paul McCartney was/is married with   Heather Mills from 2002 to 2008  Nancy Shevell since 2011 15

  16. BASIC ALGORITHM TO INFER META FACTS 16

  17. TIMESTAMP GENERATION RULE 17

  18. TIMESTAMP GENERATION RULE 18

  19. TIMESTAMP GENERATION RULE 19

  20. TIMESTAMP GENERATION RULE 20

  21. TEMPORAL INTERVAL RULE 21

  22. TEMPORAL INTERVAL RULE 22

  23. TEMPORAL INTERVAL RULE validAfter 23

  24. TEMPORAL INTERVAL RULE 24

  25. TEMPORAL INTERVAL RULE 25

  26. TEMPORAL INTERVAL RULE 26

  27. CASE STUDY: MOVIE DOMAIN (YAGO) 27

  28. MOVIE DOMAIN (YAGO)  Yago: # of films: 151427  # of actors in Yago: 47800  # of release dates available: 136234  28

  29. RESULTS: actedIn 29

  30. RESULTS: actedIn - Results vs Yago MFs & IMDb: - 23 share the same information (Yago MFs & IMDb) - 16 have the same IMDb information Notice: Yago MFs are more accurate. - 8 MFs not inferred because some facts don’t have release date associated. 30

  31. RESULTS: actedIn - Results vs Yago MFs & IMDb: - 23 share the same information (Yago MFs & IMDb) - 16 have the same IMDb information Notice: Yago MFs are more accurate. - 8 MFs not inferred because some facts don’t have release date associated. 31

  32. RESULTS: wroteMusicFor 32

  33. RESULTS: wroteMusicFor - Results vs Yago MFs & IMDb: - 3 share the same information (Yago MFs & IMDb) - 118 MFs not inferred because some facts don’t have release date associated. 33

  34. ONGOING WORK:  Qualitative evaluation of the meta facts inferred.  Creating new set of rules.  Extend the approach to reason more globally on the whole graph while inferring meta facts. FUTURE WORK:  Spatial reasoning.  E.g.: Film release dates are associated to specific locations (country, city).  (Semi-)automatic approach for rule generation. 34

  35. MERCI ! 35

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend