clausie clause based open information extraction
play

ClausIE: Clause-Based Open Information Extraction Luciano Del Corro - PowerPoint PPT Presentation

ClausIE: Clause-Based Open Information Extraction Luciano Del Corro Rainer Gemulla Max-Planck-Institut fr Informatik May 2013 Del Corro, Gemulla (MPI) ClausIE May 2013 1 / 18 Open Information Extraction: From sentences to propositions


  1. ClausIE: Clause-Based Open Information Extraction Luciano Del Corro Rainer Gemulla Max-Planck-Institut für Informatik May 2013 Del Corro, Gemulla (MPI) ClausIE May 2013 1 / 18

  2. Open Information Extraction: From sentences to propositions GOAL: Extract information from natural text Del Corro, Gemulla (MPI) ClausIE May 2013 2 / 18

  3. Open Information Extraction: From sentences to propositions GOAL: Extract information from natural text Sentence Bell, a telecommunication company, which is based in Los Angeles, makes and distributes electronic, computer and building products. Del Corro, Gemulla (MPI) ClausIE May 2013 2 / 18

  4. Open Information Extraction: From sentences to propositions GOAL: Extract information from natural text Sentence Bell, a telecommunication company, which is based in Los Angeles, makes and distributes electronic, computer and building products. Extractions/Propositions (Bell, ’is’, a telecommunication company) (Bell, is based in, Los Angeles) (Bell, makes, electronic products) (Bell, distributes, electronic products) . . . Del Corro, Gemulla (MPI) ClausIE May 2013 2 / 18

  5. Open Information Extraction: From sentences to propositions GOAL: Extract information from natural text Sentence Bell, a telecommunication company, which is based in Los Angeles, makes and distributes electronic, computer and building products. Extractions/Propositions (Bell, ’is’, a telecommunication company) (Bell, is based in, Los Angeles) (Bell, makes, electronic products) (Bell, distributes, electronic products) . . . Most OIE extractors Propositions expressed as triples ( arg 1 , relation, arg 2 ) Verb based relation Arguments restricted to noun phrases Del Corro, Gemulla (MPI) ClausIE May 2013 2 / 18

  6. Open Information Extraction: challenges and applications Challenges/Requirements Domain independent Unbounded set of relations No filtering of information Structured output Scalable Del Corro, Gemulla (MPI) ClausIE May 2013 3 / 18

  7. Open Information Extraction: challenges and applications Challenges/Requirements Domain independent Unbounded set of relations No filtering of information Structured output Scalable Applications Structured search Automatic ontology construction Question answering Semantic role labeling, discourse parsing, ... ? Del Corro, Gemulla (MPI) ClausIE May 2013 3 / 18

  8. Outline Information and Representation 1 Open Information Extractors and Language Technology 2 ClausIE 3 Clauses in the English Language From clauses to propositions Results 4 Conclusions and Future Directions 5 Del Corro, Gemulla (MPI) ClausIE May 2013 4 / 18

  9. Information and Representation Outline Information and Representation 1 Open Information Extractors and Language Technology 2 ClausIE 3 Clauses in the English Language From clauses to propositions Results 4 Conclusions and Future Directions 5 Del Corro, Gemulla (MPI) ClausIE May 2013 5 / 18

  10. Information and Representation Information and Representation: a two-step approach Information What information is expressed? How much to retain? How to identify it? (e.g. non-verb mediated propositions‘) ⋆ Messi, a golden ball winner, plays in Barcelona Del Corro, Gemulla (MPI) ClausIE May 2013 5 / 18

  11. Information and Representation Information and Representation: a two-step approach Information What information is expressed? How much to retain? How to identify it? (e.g. non-verb mediated propositions‘) ⋆ Messi, a golden ball winner, plays in Barcelona Representation What is the form of the relation? ⋆ Messi plays in Barcelona → plays or plays in Triples or n-ary propositions? ⋆ (Messi, plays football in, Barcelona) or (Messi, plays, football, in Barcelona) What should be the scope of the arguments? ⋆ Gandhi was vegetarian Del Corro, Gemulla (MPI) ClausIE May 2013 5 / 18

  12. Information and Representation Information and Representation: a two-step approach Information What information is expressed? How much to retain? How to identify it? (e.g. non-verb mediated propositions‘) ⋆ Messi, a golden ball winner, plays in Barcelona Representation What is the form of the relation? ⋆ Messi plays in Barcelona → plays or plays in Triples or n-ary propositions? ⋆ (Messi, plays football in, Barcelona) or (Messi, plays, football, in Barcelona) What should be the scope of the arguments? ⋆ Gandhi was vegetarian We aim to separate these two phases Del Corro, Gemulla (MPI) ClausIE May 2013 5 / 18

  13. Open Information Extractors and Language Technology Outline Information and Representation 1 Open Information Extractors and Language Technology 2 ClausIE 3 Clauses in the English Language From clauses to propositions Results 4 Conclusions and Future Directions 5 Del Corro, Gemulla (MPI) ClausIE May 2013 6 / 18

  14. Open Information Extractors and Language Technology Open Information Extractors and Language Technology nsubj root dobj rcmod amod appos DP conj and prep in nsubjpass det conj and conj and nn auxpass nn Bell , a telecommunication company , which is based in Los Angeles , makes and distributes electronic , computer and building products chunks B-NP B-NP I-NP I-NP , B-NP B-VP I-VP B-PP B-NP I-NP , B-VP I-VP I-VP B-ADJP , B-NP I-NP I-NP I-NP POS NNP DT JJ NN , WDT VBZ VBN IN NNP NNP , VBZ CC VBZ JJ , NN CC NN NNS Del Corro, Gemulla (MPI) ClausIE May 2013 6 / 18

  15. Open Information Extractors and Language Technology Open Information Extractors and Language Technology nsubj root dobj rcmod amod appos DP conj and prep in nsubjpass det conj and conj and nn auxpass nn Bell , a telecommunication company , which is based in Los Angeles , makes and distributes electronic , computer and building products chunks B-NP B-NP I-NP I-NP , B-NP B-VP I-VP B-PP B-NP I-NP , B-VP I-VP I-VP B-ADJP , B-NP I-NP I-NP I-NP POS NNP DT JJ NN , WDT VBZ VBN IN NNP NNP , VBZ CC VBZ JJ , NN CC NN NNS Chunks/POS Dependency Parser TextRunner Wanderlust WOE pos WOE parse Reverb KrakeN OLLIE Del Corro, Gemulla (MPI) ClausIE May 2013 6 / 18

  16. ClausIE Outline Information and Representation 1 Open Information Extractors and Language Technology 2 ClausIE 3 Clauses in the English Language From clauses to propositions Results 4 Conclusions and Future Directions 5 Del Corro, Gemulla (MPI) ClausIE May 2013 7 / 18

  17. ClausIE Clauses in the English Language Clause Essentials A clause is like a simple sentence ⋆ Paul eats a chocolate bar Del Corro, Gemulla (MPI) ClausIE May 2013 7 / 18

  18. ClausIE Clauses in the English Language Clause Essentials A clause is like a simple sentence ⋆ Paul eats a chocolate bar A sentence can be composed by more than one clause ⋆ Anna drinks coffee and Bob plays football Del Corro, Gemulla (MPI) ClausIE May 2013 7 / 18

  19. ClausIE Clauses in the English Language Clause Essentials A clause is like a simple sentence ⋆ Paul eats a chocolate bar A sentence can be composed by more than one clause ⋆ Anna drinks coffee and Bob plays football Each clause encodes one or more propositions Del Corro, Gemulla (MPI) ClausIE May 2013 7 / 18

  20. ClausIE Clauses in the English Language Clause Essentials A clause is like a simple sentence ⋆ Paul eats a chocolate bar A sentence can be composed by more than one clause ⋆ Anna drinks coffee and Bob plays football Each clause encodes one or more propositions Clauses can have optional adverbials ⋆ He will take the exam in May Del Corro, Gemulla (MPI) ClausIE May 2013 7 / 18

  21. ClausIE Clauses in the English Language Clause Essentials A clause is like a simple sentence ⋆ Paul eats a chocolate bar A sentence can be composed by more than one clause ⋆ Anna drinks coffee and Bob plays football Each clause encodes one or more propositions Clauses can have optional adverbials ⋆ He will take the exam in May A minimal clause is a clause without its optional adverbials ⋆ He will take the exam Del Corro, Gemulla (MPI) ClausIE May 2013 7 / 18

  22. ClausIE Clauses in the English Language The seven clauses SV i → Albert Einstein died. 1 S: Subject, V: Verb, A: Adverbial, C: Complement, O i : Indirect Object, O: Direct Object Del Corro, Gemulla (MPI) ClausIE May 2013 8 / 18

  23. ClausIE Clauses in the English Language The seven clauses SV i → Albert Einstein died. 1 SV e A → Albert Einstein remained in Princeton. 2 S: Subject, V: Verb, A: Adverbial, C: Complement, O i : Indirect Object, O: Direct Object Del Corro, Gemulla (MPI) ClausIE May 2013 8 / 18

  24. ClausIE Clauses in the English Language The seven clauses SV i → Albert Einstein died. 1 SV e A → Albert Einstein remained in Princeton. 2 SV c C → Albert Einstein is smart. 3 S: Subject, V: Verb, A: Adverbial, C: Complement, O i : Indirect Object, O: Direct Object Del Corro, Gemulla (MPI) ClausIE May 2013 8 / 18

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend