Joint Posterior Revision of NLP Annotations via Ontological - - PowerPoint PPT Presentation
Joint Posterior Revision of NLP Annotations via Ontological - - PowerPoint PPT Presentation
#4479 Joint Posterior Revision of NLP Annotations via Ontological Knowledge Marco Rospocher Francesco Corcoglioniti Context: Knowledge Extraction Kia has hired Peter Schreyer as chief design officer. Joint Posterior Revision of NLP
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Context: Knowledge Extraction
Kia has hired Peter Schreyer as chief design officer.
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Context: Knowledge Extraction
Kia has hired Peter Schreyer as chief design officer.
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Context: Knowledge Extraction
Kia has hired Peter Schreyer as chief design officer.
NLP Tasks:
- Named Entity Recognition and Classification (NERC)
Organization
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Context: Knowledge Extraction
Kia has hired Peter Schreyer as chief design officer.
NLP Tasks:
- Named Entity Recognition and Classification (NERC)
Organization
- Entity Linking (EL)
dbpedia:Kia_Motors
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Context: Knowledge Extraction
Kia has hired Peter Schreyer as chief design officer.
NLP Tasks:
- Named Entity Recognition and Classification (NERC)
Organization
- Entity Linking (EL)
dbpedia:Kia_Motors
- Semantic Role Labeling (SRL)
…
framenet:employer
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Motivating Examples
- Mr. Washington was runner-up at Wimbledon in 1996.
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Motivating Examples
- Mr. Washington was runner-up at Wimbledon in 1996.
http://nlp.stanford.edu:8080/corenlp
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Motivating Examples
- Mr. Washington was runner-up at Wimbledon in 1996.
http://nlp.stanford.edu:8080/corenlp http://demo.dbpedia-spotlight.org
http://dbpedia.org/resource/ Washington_(state)
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Motivating Examples
- Mr. Washington was runner-up at Wimbledon in 1996.
http://nlp.stanford.edu:8080/corenlp http://demo.dbpedia-spotlight.org
http://dbpedia.org/resource/ Washington_(state)
The GW Bridge is a double-decked suspension bridge over the Hudson.
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Motivating Examples
- Mr. Washington was runner-up at Wimbledon in 1996.
http://nlp.stanford.edu:8080/corenlp http://demo.dbpedia-spotlight.org
http://dbpedia.org/resource/ Washington_(state)
The GW Bridge is a double-decked suspension bridge over the Hudson.
http://demo.dbpedia-spotlight.org
http://dbpedia.org/resource/ George_Washington_Bridge
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Motivating Examples
- Mr. Washington was runner-up at Wimbledon in 1996.
http://nlp.stanford.edu:8080/corenlp http://demo.dbpedia-spotlight.org
http://dbpedia.org/resource/ Washington_(state)
The GW Bridge is a double-decked suspension bridge over the Hudson.
http://nlp.stanford.edu:8080/corenlp http://demo.dbpedia-spotlight.org
http://dbpedia.org/resource/ George_Washington_Bridge
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Abstracting
… token1 token2 token3 token4 token5 token6 ….
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Abstracting
… token1 token2 token3 token4 token5 token6 ….
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Abstracting
… token1 token2 token3 token4 token5 token6 ….
Task1 Taskn Task2
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Abstracting
… token1 token2 token3 token4 token5 token6 ….
Task1 Taskn Task2 a1,1 a1,2 a1,k … a2,1 a2,2 a2,i … an,1 an,2 an,j …
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Abstracting
… token1 token2 token3 token4 token5 token6 ….
Task1 Taskn Task2 a1,1 a1,2 a1,k … a2,1 a2,2 a2,i … an,1 an,2 an,j …
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Abstracting
… token1 token2 token3 token4 token5 token6 ….
Task1 Taskn Task2 a1,1 a1,2 a1,k … a2,1 a2,2 a2,i … an,1 an,2 an,j …
How can we assess and improve the coherence of the various NLP annotations on an entity mention?
RESEARCH PROBLEM
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
In a nutshell
… token1 token2 token3 token4 token5 token6 ….
Task1 Taskn Task2 a1,1 a1,2 a1,k … a2,1 a2,2 a2,i … an,1 an,2 an,j …
- ntological background knowledge
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
In a nutshell
… token1 token2 token3 token4 token5 token6 ….
Task1 Taskn Task2 a1,1 a1,2 a1,k … a2,1 a2,2 a2,i … an,1 an,2 an,j …
- ntological background knowledge
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
In a nutshell
… token1 token2 token3 token4 token5 token6 ….
Task1 Taskn Task2 a1,1 a1,2 a1,k … a2,1 a2,2 a2,i … an,1 an,2 an,j …
- ntological background knowledge
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
In a nutshell
… token1 token2 token3 token4 token5 token6 ….
Task1 Taskn Task2 a1,1 a1,2 a1,k … a2,1 a2,2 a2,i … an,1 an,2 an,j …
- ntological background knowledge
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Contributions
1. JPARK: a probabilistic model capable to estimate a posteriori the overall confidence of NLP annotations 2. A concrete instantiation of the model for NERC and EL (using YAGO as ontological knowledge) 3. Application of the NERC and EL model to revise the annotations of Stanford NER and DBpedia Spotlight
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
The Model
P (a|m, B, K )
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
The Model
P (a|m, B, K )
(ai , … , an) NLP Annotations
entity mention NLP Background Knowledge “The” Ontological Knowledge
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
The Model
P (a|m, B, K )
(ai , … , an) NLP Annotations
entity mention NLP Background Knowledge “The” Ontological Knowledge
P (a ,C|m, B, K )
set of classes from K
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
The Model
P (a|m, B, K ) P (ai|m, B ) P (C|ai , K )
(ai , … , an) NLP Annotations
entity mention NLP Background Knowledge “The” Ontological Knowledge
P (a ,C|m, B, K )
set of classes from K
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
The Model
confidence score P (a|m, B, K ) P (ai|m, B ) P (C|ai , K )
(ai , … , an) NLP Annotations
entity mention NLP Background Knowledge “The” Ontological Knowledge
P (a ,C|m, B, K )
set of classes from K
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
The Model
confidence score learned from data P (a|m, B, K ) P (ai|m, B ) P (C|ai , K )
(ai , … , an) NLP Annotations
entity mention NLP Background Knowledge “The” Ontological Knowledge
P (a ,C|m, B, K )
set of classes from K
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
The Model
confidence score learned from data P (a|m, B, K ) = arg maxa P (ai|m, B ) P (C|ai , K )
(ai , … , an) NLP Annotations
entity mention NLP Background Knowledge “The” Ontological Knowledge
P (a ,C|m, B, K )
set of classes from K
NERC and EL Model
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Ingredients
- Ontological Knowledge
- Estimating
- Estimating P (C|aEL , K )
P (C|aNERC , K )
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Ingredients
- Ontological Knowledge
- Estimating
- Estimating P (C|aEL , K )
P (C|aNERC , K )
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Ingredients
- Ontological Knowledge
- Estimating
- Estimating
Leverage a gold standard corpus G annotated with NERC types and
- ntological classes (or EL annotations)
P (C|aEL , K ) P (C|aNERC , K )
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Ingredients
- Ontological Knowledge
- Estimating
- Estimating
Leverage a gold standard corpus G annotated with NERC types and
- ntological classes (or EL annotations)
# co-occurences
P (C|aEL , K ) P (C|aNERC , K ) _ ∑C nG(C , aNERC) nG(C , aNERC) ~
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Ingredients
- Ontological Knowledge
- Estimating
- Estimating
Leverage a gold standard corpus G annotated with NERC types and
- ntological classes (or EL annotations)
# co-occurences
P (C|aEL , K ) P (C|aNERC , K ) _ ∑C nG(C , aNERC) nG(C , aNERC) ~
Leverage alignments between EL Knowledge Base and
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Ingredients
- Ontological Knowledge
- Estimating
- Estimating
Leverage a gold standard corpus G annotated with NERC types and
- ntological classes (or EL annotations)
# co-occurences
P (C|aEL , K ) P (C|aNERC , K ) _ ∑C nG(C , aNERC) nG(C , aNERC) ~
Leverage alignments between EL Knowledge Base and
{
1 entity aEL is instance of C 0 otherwise
Application and Evaluation
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Tools
- NERC: [Finkel et al., 2005]
- EL: [Daiber et al., 2013]
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
NERC+EL Datasets
- AIDA CoNLL-YAGO [Hoffart et al., 2011]
- MEANTIME [Minard et al., 2016]
- TAC-KBP [ Ji et al., 2011]
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Does the posteriori joint revision of the annotations from Stanford NER and DBpedia Spotlight, via YAGO, improve their NERC and EL performances?
Research Question
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Does the posteriori joint revision of the annotations from Stanford NER and DBpedia Spotlight, via YAGO, improve their NERC and EL performances?
Research Question
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Results
Bold = statistical significant (approx. rand. test)
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Results
Bold = statistical significant (approx. rand. test)
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Results
Bold = statistical significant (approx. rand. test)
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Does the posteriori joint revision of the annotations from Stanford NER and DBpedia Spotlight, via YAGO, improve their NERC and EL performances?
Research Question
Joint Posterior Revision of NLP Annotations via Ontological Knowledge - M. Rospocher & F. Corcoglioniti
Conclusions
- Novel probabilistic model, leveraging ontological knowledge,
for improving NLP entity annotations
- Instantiation of the model for the NERC and EL tasks
- Empirical confirmation (3 datasets) of the capability of the
model to improve the quality of the annotations
- Future Work: extension to other tasks (e.g., SRL)
Marco Rospocher
rospocher@fbk.eu dkm.fbk.eu/rospocher @marcorospocher
KE4IR
PIKES powered bypikes.fbk.eu/ke4ir premon.fbk.eu
RDFpro
rdfpro.fbk.eu
MoKi
- moki.fbk.eu
knowledgestore.fbk.eu pikes.fbk.eu
PIKES
pikes.fbk.eu/jpark pikes.fbk.eu/psl4ea
BPMN Ontology
dkm.fbk.eu/bpmn-ontology bit.ly/pescado-onto github.com/dkmfbk/TexOwl
Event & Situation Ontology
github.com/newsreader/eso