Autonomously Reviewing and Validating the Knowledge Base of a - PowerPoint PPT Presentation

Autonomously Reviewing and Validating the Knowledge Base of a Never-Ending Learning System Saulo D. S. Pedro 1 , Ana Paula Appel 2 and Estevam R. Hruschka Jr. 1 1 Department of Computer Science – Federal University of S˜ ao Carlos, Brazil 2 IBM Research Brazil May, 2013 1 / 22

Outline Introduction 1 Motivation 2 Proposed Work 3 Experiments 4 Conclusion 5 2 / 22

Introduction Outline Introduction 1 Motivation 2 Proposed Work 3 Experiments 4 Conclusion 5 3 / 22

Introduction NELL (Never-Ending Language Learner) A computer system that runs 24/7; Gather knowledge from web pages to acquire knowledge to become a better learner each day; The content available on the web is not always reliable - can lead a false beliefs propagation because of noisy data; Part of the knowledge extracted by NELL should be supervised by humans to be incorporated definitely in KB. 4 / 22

Introduction Prophet Implements link prediction on NELL to finding new relations in the NELL’s KB and identifying the anomalies, → misplaced edges The relations and categories extracted by NELL are mapped as an ontology → complex network Use graph properties to investigate if the knowledge learned by NELL is correct or not. 5 / 22

Introduction Prophet Example Madison Square Garden Stadium Home to League Sport Uses Stadium Michael Redd A t h l e t e P l a s y r s e Milwaukee Bucks y i a n l P L Team Plays in League e a g u e Sport Team Basketball NBA 6 / 22

Introduction Prophet Rules R12a(sport, sportsleague):- players(sport, athlete), athleteplaysinleague(athlete, sportsleague), numberof(athlete) ≥ 10; R12b(sport, sportsleague):- sportteam(sport, sportsteam), teamplaysinleague(sportsteam, sportsleague), numberof(sportsteam) ≥ 10; R12c(sport, sportsleague):- sportusesstadium(sport, stadiumoreventvenue), stadiumhometoleague(stadiumoreventvenue, sportsleague), numberof(stadiumoreventvenue) ≥ 10 R12d(sport, sportsleague):- players(sport, athlete), athleteplaysinleague(athlete, sportsleague),sportteam(sport, sportsteam), teamplaysinleague(sportsteam, sportsleague), sportusesstadium(sport, stadiumoreventvenue), stadiumhometoleague(stadiumoreventvenue, sportsleague); 7 / 22

Introduction Prophet Misplaced Edges Cristiano Ronaldo Athlete Plays in League s r e y a l P Soccer NBA When Prophet identifies an outliers, it means that the its algorithm was able to determine a new rule but there are a few instances that do not match all the requirements of rule found by Prophet → misplaced edges. 8 / 22

Motivation Outline Introduction 1 Motivation 2 Proposed Work 3 Experiments 4 Conclusion 5 9 / 22

Motivation Motivation There are two possible scenarios for the anomalies: at least one relation (edge) in the anomaly should be wrong the two rules are right but because of combination made by Prophet the relation predicted is wrong 10 / 22

Motivation Motivation There are two possible scenarios for the anomalies: at least one relation (edge) in the anomaly should be wrong the two rules are right but because of combination made by Prophet the relation predicted is wrong The information gathered by Prophet could be just sent to human supervision. But we want to take: best profit from these anomalies advantage human opinion through Web communities thus configuring a self-supervision approach 10 / 22

Proposed Work Outline Introduction 1 Motivation 2 Proposed Work 3 Experiments 4 Conclusion 5 11 / 22

Proposed Work Prophet + SS-Crowd Problem Description How Conversing Learning techniques can be used to help reviewing and validating facts that were learned by NELL and were flagged as possible mistakes by Prophet . 12 / 22

Proposed Work Prophet + SS-Crowd Problem Description How Conversing Learning techniques can be used to help reviewing and validating facts that were learned by NELL and were flagged as possible mistakes by Prophet . Proposed Work A method to combine the knowledge gathered from web communities through the SS-Crowd component with the outliers identified by Prophet , i.e., use web QA users opinion to validate the anomalies. 12 / 22

Proposed Work Conversing Learning Based on Active Learning and Interactive Learning Allow machines to convert knowledge into content understandable by humans Autonomously ask people to take part in the knowledge acquisition and labelling process 13 / 22

Proposed Work Reaching web users assessment through SS-Crowd The proposed approach can be summarized by the following steps: Converting KB’s facts into human understandable sentences; Generating questions that will prompt users to decide whether the facts are correct or not; Receiving all the answers for an specific question; Combining the answers to produce a single result; Returning to Prophet that will use it as a parameter to create or not a new link in NELL’s KB. 14 / 22

Proposed Work Experiment with SS-Crowd Edges of an outlier identified by Prophet TeamPlaysSport(Manchester United, basketball) TeamWonTrophy(Manchester United, UEFA Champions League) Edges converted into human understandable questions: Manchester United is a team that plays sport basketball Manchester United is a team that won trophy UEFA Champions League: 15 / 22

Proposed Work Experiment with SS-Crowd Edges of an outlier identified by Prophet TeamPlaysSport(Manchester United, basketball) TeamWonTrophy(Manchester United, UEFA Champions League) Edges converted into human understandable questions: Manchester United is a team that plays sport basketball Manchester United is a team that won trophy UEFA Champions League: Expectation: At least one of the edges is wrong, confirming the health of outliers identification algorithm. 15 / 22

Experiments Outline Introduction 1 Motivation 2 Proposed Work 3 Experiments 4 Conclusion 5 16 / 22

Experiments Experiments Set up We used NELL’s KB at the 100th iteration → undirected graph 9,419 nodes and 24,132 edges.; We ran Prophet that found new rules and instances and misplace edges; all misplaced edges were sent to SS-Crowd to start the human assessment process; 17 / 22

Experiments Experiments Set up We used NELL’s KB at the 100th iteration → undirected graph 9,419 nodes and 24,132 edges.; We ran Prophet that found new rules and instances and misplace edges; all misplaced edges were sent to SS-Crowd to start the human assessment process; Table : Distribution of the relations considered in our tests Relations # of outliers # of answers 9 72 AthletePlaysInLeague & Players TeamPlaysSport & TeamPlaysInLeague 20 144 53 386 TeamPlaysSport & TeamWonTrophy 17 / 22

Experiments Results The rate of outliers with at least one wrong edge indicates the health of the anomalies detection algorithm; 18 / 22

Experiments Results The rate of outliers with at least one wrong edge indicates the health of the anomalies detection algorithm; Table : Numbers for edges evaluated as suitable or not to the real world through the web community eyes. Outliers at least one wrong edge 39 (47.56%) both edges correct 40 (48.19%) unresolved edges 3 (03.65%) 18 / 22

Experiments Experiment with SS-Crowd Edges of an outlier identified by Prophet TeamPlaysSport(Manchester United, basketball) TeamWonTrophy(Manchester United, UEFA Champions League) Edges converted into human understandable questions: Manchester United is a team that plays sport basketball; Manchester United is a team that won trophy UEFA Champions League: 19 / 22

Experiments Experiment with SS-Crowd Edges of an outlier identified by Prophet TeamPlaysSport(Manchester United, basketball) TeamWonTrophy(Manchester United, UEFA Champions League) Edges converted into human understandable questions: Manchester United is a team that plays sport basketball; Manchester United is a team that won trophy UEFA Champions League: Both relations are right!! 19 / 22

Experiments Experiment with SS-Crowd Edges of an outlier identified by Prophet TeamPlaysSport(Manchester United, basketball) TeamWonTrophy(Manchester United, UEFA Champions League) Edges converted into human understandable questions: Manchester United is a team that plays sport basketball; Manchester United is a team that won trophy UEFA Champions League: Both relations are right!! Manchester United is a basketball team and also a soccer team NELL was not successful to decide which one it should chose; 19 / 22

Conclusion Outline Introduction 1 Motivation 2 Proposed Work 3 Experiments 4 Conclusion 5 20 / 22

Conclusion Conclusion The results obtained in the performed experiments have shown that the combination of Prophet and SS-Crowd allows a never-ending learning system - NELL - to identify which edges are really wrong and which edges needs more time (NELL iterations) to fill the gaps on information to be considered valid. The experiments show that Prophet has a great accuracy. Most of the combination of edges that produce a misplaced connection are related to a co-reference problem restricted to NELL and is not a misbehaviour of Prophet itself. The validation of a learning machine with SS-Crowd is a useful approach to help self-supervision and self-revision in NELL. 21 / 22

Autonomously Reviewing and Validating the Knowledge Base of a - PowerPoint PPT Presentation

Autonomously Reviewing and Validating the Knowledge Base of a Never-Ending Learning System Saulo D. S. Pedro 1 , Ana Paula Appel 2 and Estevam R. Hruschka Jr. 1 1 Department of Computer Science Federal University of S ao Carlos, Brazil 2 IBM

Validating Procedural Knowledge in the Validating Procedural Knowledge in the Open Virtual

Knowledge-Based Agents knowledge knowledge representation, knowledge base, types of knowledge

TOWN OF SACKVILLE 2017 Tax Base $629,240,300 2018 Tax Base $619,997,885 2019 Tax Base

Reviewing and Approving Offset Reviewing and Approving Offset Projects and Protocols Projects

Applying Random Testing to a Base Type Environment Experience Report Vincent St-Amour Neil

Reviewing papers by Xingjian, Tolik Goal Reviewing - a public service: Conference/journal

Knowledge-Based Agents (Logical Agents) A knowledge-based agent needs (at least): A

Validating CDI Data for Report Integrity Fran Jurcak, MSN, RN, CCDS Clinical Documentation

VEA: Validating, Evolving & Anonymizing Data in Real Time Albert Franzi Cros, Data Engineer |

Validating Formal Descriptions of TCP/IP Introduction Beginning a TCP Experimental Formal

Random Walk Inference and Learning in A Large Scale Knowledge Base in A Large Scale Knowledge Base

VU @ D2.1.1 Part 1: Approximation Reasoning method Knowledge Knowledge base Base

Knowledge Base Exchange Marcelo Arenas 1 Elena Botoeva 2 Diego Calvanese 2 1 Dept. of Computer

Expanding the YAGO knowledge base Regexes Answering Queries with Unix Shell Thomas Rebele

ARS-seq: High-Resolution Mapping and Mutational Scanning of Autonomously Replicating Sequences

On Ability to Autonomously Execute Agent Programs with Sensing Sebastian Sardi na Giuseppe De

Social'Data'Science' David'Dreyer'Lassen' UCPH'ECON' September'24,'2015' In'God'we'trust,'

A Comparison of Covariate-based Predictition Methods for FIFA World Cups A. Groll Faculty of

Data from each single match ... <tackle,15.4,41.1,112> <pass,25.0,67.1,113>

Salt Lake Community College Veterans Services Basic Orientation to Support Services and VA

Random Variable Models of Computation Michael W. Mislove Tulane University New Orleans, LA

Portable Enforcement Solution International Product Marketing Department Portable PTZ Dome Body

+ 2. Model Selection Scores 3. New Stuff: fNML Score 2/30 + Bayesian Networks 3/30 Conditional

World Cup draw: quantifying (un)fairness and (im)balance Julien Guyon Bloomberg L.P.,

Autonomously Reviewing and Validating the Knowledge Base of a - PowerPoint PPT Presentation

Autonomously Reviewing and Validating the Knowledge Base of a Never-Ending Learning System Saulo D. S. Pedro 1 , Ana Paula Appel 2 and Estevam R. Hruschka Jr. 1 1 Department of Computer Science Federal University of S ao Carlos, Brazil 2 IBM

Validating Procedural Knowledge in the Validating Procedural Knowledge in the Open Virtual

Knowledge-Based Agents knowledge knowledge representation, knowledge base, types of knowledge

TOWN OF SACKVILLE 2017 Tax Base $629,240,300 2018 Tax Base $619,997,885 2019 Tax Base

Reviewing and Approving Offset Reviewing and Approving Offset Projects and Protocols Projects

Applying Random Testing to a Base Type Environment Experience Report Vincent St-Amour Neil

Reviewing papers by Xingjian, Tolik Goal Reviewing - a public service: Conference/journal

Knowledge-Based Agents (Logical Agents) A knowledge-based agent needs (at least): A

Validating CDI Data for Report Integrity Fran Jurcak, MSN, RN, CCDS Clinical Documentation

VEA: Validating, Evolving &amp; Anonymizing Data in Real Time Albert Franzi Cros, Data Engineer |

Validating Formal Descriptions of TCP/IP Introduction Beginning a TCP Experimental Formal

Random Walk Inference and Learning in A Large Scale Knowledge Base in A Large Scale Knowledge Base

VU @ D2.1.1 Part 1: Approximation Reasoning method Knowledge Knowledge base Base

Knowledge Base Exchange Marcelo Arenas 1 Elena Botoeva 2 Diego Calvanese 2 1 Dept. of Computer

Expanding the YAGO knowledge base Regexes Answering Queries with Unix Shell Thomas Rebele

ARS-seq: High-Resolution Mapping and Mutational Scanning of Autonomously Replicating Sequences

On Ability to Autonomously Execute Agent Programs with Sensing Sebastian Sardi na Giuseppe De

Social'Data'Science' David'Dreyer'Lassen' UCPH'ECON' September'24,'2015' In'God'we'trust,'

A Comparison of Covariate-based Predictition Methods for FIFA World Cups A. Groll Faculty of

Data from each single match ... &lt;tackle,15.4,41.1,112&gt; &lt;pass,25.0,67.1,113&gt;

Salt Lake Community College Veterans Services Basic Orientation to Support Services and VA

Random Variable Models of Computation Michael W. Mislove Tulane University New Orleans, LA

Portable Enforcement Solution International Product Marketing Department Portable PTZ Dome Body

+ 2. Model Selection Scores 3. New Stuff: fNML Score 2/30 + Bayesian Networks 3/30 Conditional

World Cup draw: quantifying (un)fairness and (im)balance Julien Guyon Bloomberg L.P.,

VEA: Validating, Evolving & Anonymizing Data in Real Time Albert Franzi Cros, Data Engineer |

Data from each single match ... <tackle,15.4,41.1,112> <pass,25.0,67.1,113>