ClausIE: Clause-Based Open Information Extraction
Luciano Del Corro Rainer Gemulla
Max-Planck-Institut für Informatik
May 2013
Del Corro, Gemulla (MPI) ClausIE May 2013 1 / 18
ClausIE: Clause-Based Open Information Extraction Luciano Del Corro - - PowerPoint PPT Presentation
ClausIE: Clause-Based Open Information Extraction Luciano Del Corro Rainer Gemulla Max-Planck-Institut fr Informatik May 2013 Del Corro, Gemulla (MPI) ClausIE May 2013 1 / 18 Open Information Extraction: From sentences to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 1 / 18
Del Corro, Gemulla (MPI) ClausIE May 2013 2 / 18
Del Corro, Gemulla (MPI) ClausIE May 2013 2 / 18
Del Corro, Gemulla (MPI) ClausIE May 2013 2 / 18
Del Corro, Gemulla (MPI) ClausIE May 2013 2 / 18
Del Corro, Gemulla (MPI) ClausIE May 2013 3 / 18
Del Corro, Gemulla (MPI) ClausIE May 2013 3 / 18
Del Corro, Gemulla (MPI) ClausIE May 2013 4 / 18
Information and Representation
Del Corro, Gemulla (MPI) ClausIE May 2013 5 / 18
Information and Representation
Del Corro, Gemulla (MPI) ClausIE May 2013 5 / 18
Information and Representation
Del Corro, Gemulla (MPI) ClausIE May 2013 5 / 18
Information and Representation
Del Corro, Gemulla (MPI) ClausIE May 2013 5 / 18
Open Information Extractors and Language Technology
Del Corro, Gemulla (MPI) ClausIE May 2013 6 / 18
Open Information Extractors and Language Technology
Del Corro, Gemulla (MPI) ClausIE May 2013 6 / 18
Bell , a telecommunication company , which is based in Los Angeles , makes and distributes electronic , computer and building products B-NP B-NP I-NP I-NP , B-NP B-VP I-VP B-PP B-NP I-NP , B-VP I-VP I-VP B-ADJP , B-NP I-NP I-NP I-NP NNP DT JJ NN , WDT VBZ VBN IN NNP NNP , VBZ CC VBZ JJ , NN CC NN NNS
nsubj det nn appos nsubjpass auxpass rcmod nn prep in conj and amod conj and conj and dobj root
Open Information Extractors and Language Technology
Del Corro, Gemulla (MPI) ClausIE May 2013 6 / 18
Bell , a telecommunication company , which is based in Los Angeles , makes and distributes electronic , computer and building products B-NP B-NP I-NP I-NP , B-NP B-VP I-VP B-PP B-NP I-NP , B-VP I-VP I-VP B-ADJP , B-NP I-NP I-NP I-NP NNP DT JJ NN , WDT VBZ VBN IN NNP NNP , VBZ CC VBZ JJ , NN CC NN NNS
nsubj det nn appos nsubjpass auxpass rcmod nn prep in conj and amod conj and conj and dobj root
ClausIE
Del Corro, Gemulla (MPI) ClausIE May 2013 7 / 18
ClausIE Clauses in the English Language
Del Corro, Gemulla (MPI) ClausIE May 2013 7 / 18
ClausIE Clauses in the English Language
Del Corro, Gemulla (MPI) ClausIE May 2013 7 / 18
ClausIE Clauses in the English Language
Del Corro, Gemulla (MPI) ClausIE May 2013 7 / 18
ClausIE Clauses in the English Language
Del Corro, Gemulla (MPI) ClausIE May 2013 7 / 18
ClausIE Clauses in the English Language
Del Corro, Gemulla (MPI) ClausIE May 2013 7 / 18
ClausIE Clauses in the English Language
1
Del Corro, Gemulla (MPI) ClausIE May 2013 8 / 18
ClausIE Clauses in the English Language
1
2
Del Corro, Gemulla (MPI) ClausIE May 2013 8 / 18
ClausIE Clauses in the English Language
1
2
3
Del Corro, Gemulla (MPI) ClausIE May 2013 8 / 18
ClausIE Clauses in the English Language
1
2
3
4
5
6
7
Del Corro, Gemulla (MPI) ClausIE May 2013 8 / 18
ClausIE Clauses in the English Language
1
2
3
4
5
6
7
Del Corro, Gemulla (MPI) ClausIE May 2013 8 / 18
ClausIE Clauses in the English Language
Pattern Clause Type Example Derived clauses Some extended patterns SViAA SV AE died in Princeton in 1955. (AE, died) (AE, died, in Princeton) (AE, died, in 1955) (AE, died, in Princeton, in 1955)
ClausIE Clauses in the English Language
Pattern Clause Type Example Derived clauses Some extended patterns SViAA SV AE died in Princeton in 1955. (AE, died) (AE, died, in Princeton) (AE, died, in 1955) (AE, died, in Princeton, in 1955) SVeAA SVA AE remained in Princeton until his death. (AE, remained, in Princeton) (AE, remained, in Princeton, until his death)
ClausIE Clauses in the English Language
Pattern Clause Type Example Derived clauses Some extended patterns SViAA SV AE died in Princeton in 1955. (AE, died) (AE, died, in Princeton) (AE, died, in 1955) (AE, died, in Princeton, in 1955) SVeAA SVA AE remained in Princeton until his death. (AE, remained, in Princeton) (AE, remained, in Princeton, until his death) SVcCA SVC AE is a scientist of the 20th century. (AE, is, a scientist) (AE, is, a scientist, of the 20th century) SVmtOA SVO AE has won the Nobel Prize in 1921. (AE, has won, the Nobel Prize) (AE, has won, the Nobel Prize, in 1921) ASVmtO SVO In 1921, AE has won the Nobel Prize. (AE, has won, the Nobel Prize) (AE, has won, the Nobel Prize, in 1921)
Del Corro, Gemulla (MPI) ClausIE May 2013 9 / 18
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 10 / 18
nsubj cop root
DP
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 10 / 18
nsubj cop root
DP Clause
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 10 / 18
nsubj cop root
DP Clause Object? Q1
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 10 / 18
nsubj cop root
DP Clause Object? Q1 Complement? Q2 No
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 10 / 18
nsubj cop root
DP Clause Object? Q1 Complement? Q2 Copular (SVC) No Yes
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 10 / 18
nsubj cop root
DP Clause Object? Q1 Complement? Q2 Copular (SVC) No Yes
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 11 / 18
nn nsubj prep in root
DP
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 11 / 18
nn nsubj prep in root
DP Clause
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 11 / 18
nn nsubj prep in root
DP Clause Object? Q1
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 11 / 18
nn nsubj prep in root
DP Clause Object? Q1 Complement? Q2 No
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 11 / 18
nn nsubj prep in root
DP Clause Object? Q1 Complement? Candidate adverbial? Q2 No No
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 11 / 18
nn nsubj prep in root
DP Clause Object? Q1 Complement? Candidate adverbial? Known non-
Q2 No No Yes
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 11 / 18
nn nsubj prep in root
DP Clause Object? Q1 Complement? Candidate adverbial? Known non-
Q2 Intransitive (SV) No No Yes Yes
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 11 / 18
nn nsubj prep in root
DP Clause Object? Q1 Complement? Candidate adverbial? Known non-
Q2 Intransitive (SV) No No Yes Yes
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 11 / 18
DP Clause Object? Q1 Complement? Candidate adverbial? Known non-
Known
Conservative? Q2 Q3 Q4 Q5 Q6 Copular (SVC) Intransitive (SV) Extended copular (SVA) No Yes No Yes No No Yes No yes no yes
direct object? Complement? Cand.
Potentially compl.-trans.? Conservative? Q7 Q8 Q9 Q10 Q11 Ditransitive (SVOO) Complex tran- sitive (SVOC) Monotransitive (SVO) Complex tran- sitive (SVOA) Yes No Yes No Yes Yes No No Yes No Yes
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 11 / 18
DP Clause Object? Q1 Complement? Candidate adverbial? Known non-
Known
Conservative? Q2 Q3 Q4 Q5 Q6 Copular (SVC) Intransitive (SV) Extended copular (SVA) No Yes No Yes No No Yes No yes no yes
direct object? Complement? Cand.
Potentially compl.-trans.? Conservative? Q7 Q8 Q9 Q10 Q11 Ditransitive (SVOO) Complex tran- sitive (SVOC) Monotransitive (SVO) Complex tran- sitive (SVOA) Yes No Yes No Yes Yes No No Yes No Yes
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 11 / 18
DP Clause Object? Q1 Complement? Candidate adverbial? Known non-
Known
Conservative? Q2 Q3 Q4 Q5 Q6 Copular (SVC) Intransitive (SV) Extended copular (SVA) No Yes No Yes No No Yes No yes no yes
direct object? Complement? Cand.
Potentially compl.-trans.? Conservative? Q7 Q8 Q9 Q10 Q11 Ditransitive (SVOO) Complex tran- sitive (SVOC) Monotransitive (SVO) Complex tran- sitive (SVOA) Yes No Yes No Yes Yes No No Yes No Yes
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 12 / 18
Bell , a telecommunication company , which is based in Los Angeles , makes and distributes electronic , computer and building products B-NP B-NP I-NP I-NP , B-NP B-VP I-VP B-PP B-NP I-NP , B-VP I-VP I-VP B-ADJP , B-NP I-NP I-NP I-NP NNP DT JJ NN , WDT VBZ VBN IN NNP NNP , VBZ CC VBZ JJ , NN CC NN NNS
nsubj det nn appos nsubjpass auxpass rcmod nn prep in conj and amod conj and conj and dobj root
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 12 / 18
Bell , a telecommunication company , which is based in Los Angeles , makes and distributes electronic , computer and building products B-NP B-NP I-NP I-NP , B-NP B-VP I-VP B-PP B-NP I-NP , B-VP I-VP I-VP B-ADJP , B-NP I-NP I-NP I-NP NNP DT JJ NN , WDT VBZ VBN IN NNP NNP , VBZ CC VBZ JJ , NN CC NN NNS
nsubj det nn appos nsubjpass auxpass rcmod nn prep in conj and amod conj and conj and dobj root
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 12 / 18
Bell , a telecommunication company , which is based in Los Angeles , makes and distributes electronic , computer and building products B-NP B-NP I-NP I-NP , B-NP B-VP I-VP B-PP B-NP I-NP , B-VP I-VP I-VP B-ADJP , B-NP I-NP I-NP I-NP NNP DT JJ NN , WDT VBZ VBN IN NNP NNP , VBZ CC VBZ JJ , NN CC NN NNS
nsubj det nn appos nsubjpass auxpass rcmod nn prep in conj and amod conj and conj and dobj root
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 12 / 18
Bell , a telecommunication company , which is based in Los Angeles , makes and distributes electronic , computer and building products B-NP B-NP I-NP I-NP , B-NP B-VP I-VP B-PP B-NP I-NP , B-VP I-VP I-VP B-ADJP , B-NP I-NP I-NP I-NP NNP DT JJ NN , WDT VBZ VBN IN NNP NNP , VBZ CC VBZ JJ , NN CC NN NNS
nsubj det nn appos nsubjpass auxpass rcmod nn prep in conj and amod conj and conj and dobj root
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 12 / 18
Bell , a telecommunication company , which is based in Los Angeles , makes and distributes electronic , computer and building products B-NP B-NP I-NP I-NP , B-NP B-VP I-VP B-PP B-NP I-NP , B-VP I-VP I-VP B-ADJP , B-NP I-NP I-NP I-NP NNP DT JJ NN , WDT VBZ VBN IN NNP NNP , VBZ CC VBZ JJ , NN CC NN NNS
nsubj det nn appos nsubjpass auxpass rcmod nn prep in conj and amod conj and conj and dobj root
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 12 / 18
Bell , a telecommunication company , which is based in Los Angeles , makes and distributes electronic , computer and building products B-NP B-NP I-NP I-NP , B-NP B-VP I-VP B-PP B-NP I-NP , B-VP I-VP I-VP B-ADJP , B-NP I-NP I-NP I-NP NNP DT JJ NN , WDT VBZ VBN IN NNP NNP , VBZ CC VBZ JJ , NN CC NN NNS
nsubj det nn appos nsubjpass auxpass rcmod nn prep in conj and amod conj and conj and dobj root
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 12 / 18
Bell , a telecommunication company , which is based in Los Angeles , makes and distributes electronic , computer and building products B-NP B-NP I-NP I-NP , B-NP B-VP I-VP B-PP B-NP I-NP , B-VP I-VP I-VP B-ADJP , B-NP I-NP I-NP I-NP NNP DT JJ NN , WDT VBZ VBN IN NNP NNP , VBZ CC VBZ JJ , NN CC NN NNS
nsubj det nn appos nsubjpass auxpass rcmod nn prep in conj and amod conj and conj and dobj root
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 12 / 18
Bell , a telecommunication company , which is based in Los Angeles , makes and distributes electronic , computer and building products B-NP B-NP I-NP I-NP , B-NP B-VP I-VP B-PP B-NP I-NP , B-VP I-VP I-VP B-ADJP , B-NP I-NP I-NP I-NP NNP DT JJ NN , WDT VBZ VBN IN NNP NNP , VBZ CC VBZ JJ , NN CC NN NNS
nsubj det nn appos nsubjpass auxpass rcmod nn prep in conj and amod conj and conj and dobj root
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 13 / 18
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 13 / 18
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 13 / 18
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 13 / 18
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 13 / 18
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 13 / 18
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 14 / 18
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 14 / 18
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 14 / 18
ClausIE From clauses to propositions
Del Corro, Gemulla (MPI) ClausIE May 2013 14 / 18
Results
Del Corro, Gemulla (MPI) ClausIE May 2013 15 / 18
Results
Del Corro, Gemulla (MPI) ClausIE May 2013 15 / 18
Results
Del Corro, Gemulla (MPI) ClausIE May 2013 16 / 18
Results
200 400 600 800 1000 0.0 0.2 0.4 0.6 0.8 1.0 Number of extractions Precision
ClausIE ClausIE (non− red.) ClausIE w/o CC ClausIE w/o CC (non− red.) Reverb OLLIE
200 400 600 800 1000 1200 0.0 0.2 0.4 0.6 0.8 1.0 Number of extractions Precision
ClausIE ClausIE (non− red.) ClausIE w/o CC ClausIE w/o CC (non− red.) Reverb OLLIE
Del Corro, Gemulla (MPI) ClausIE May 2013 17 / 18
Conclusions and Future Directions
Del Corro, Gemulla (MPI) ClausIE May 2013 18 / 18
Conclusions and Future Directions
Del Corro, Gemulla (MPI) ClausIE May 2013 18 / 18
Conclusions and Future Directions
Del Corro, Gemulla (MPI) ClausIE May 2013 18 / 18
Conclusions and Future Directions
Del Corro, Gemulla (MPI) ClausIE May 2013 18 / 18