Unsupervised Coreference Resolution in a Nonparametric Bayesian - - PowerPoint PPT Presentation
Unsupervised Coreference Resolution in a Nonparametric Bayesian - - PowerPoint PPT Presentation
Unsupervised Coreference Resolution in a Nonparametric Bayesian Model Aria Haghighi and Dan Klein Computer Science Division University of California Berkeley Coreference Resolution The Weir Group , whose headquarters is in the U.S , is a
Coreference Resolution
The Weir Group , whose headquarters is in the U.S , is a large specialized corporation . This power plant ,which , will be situated in Jiangsu , has a large generation capacity.
Coreference Resolution
The Weir Group , whose headquarters is in the U.S , is a large specialized corporation . This power plant ,which , will be situated in Jiangsu , has a large generation capacity.
Coreference Resolution
The Weir Group , whose headquarters is in the U.S , is a large specialized corporation . This power plant ,which , will be situated in Jiangsu , has a large generation capacity.
Coreference Resolution
.............. .......... .......... ... . ... ... . . ....... .......
Generative Story
Coreference Resolution
.............. .......... .......... ... . ... ... . . ....... .......
Coreference Resolution
Weir Group Weir HQ United States Weir Group Weir Group Weir Plant Weir Plant Jiangsu .............. .......... .......... ... . ... ... . . ....... .......
Coreference Resolution
Weir Group Weir HQ United States Weir Group Weir Group Weir Plant Weir Plant Jiangsu .............. .......... .......... ... . ... ... . . ....... .......
Entity
Coreference Resolution
Weir Group Weir HQ United States Weir Group Weir Group Weir Plant Weir Plant Jiangsu .............. .......... .......... ... . ... ... . . ....... .......
Coreference Resolution
“Weir group” “whose” “headquarters” “U.S” “corporation” “power plant” “which” “Jiangsu” Weir Group Weir HQ United States Weir Group Weir Group Weir Plant Weir Plant Jiangsu .............. .......... .......... ... . ... ... . . ....... .......
Coreference Resolution
“Weir group” “whose” “headquarters” “U.S” “corporation” “power plant” “which” “Jiangsu” Weir Group Weir HQ United States Weir Group Weir Group Weir Plant Weir Plant Jiangsu .............. .......... .......... ... . ... ... . . ....... .......
Mention
Coreference Resolution
“Weir group” “whose” “headquarters” “U.S” “corporation” “power plant” “which” “Jiangsu” Weir Group Weir HQ United States Weir Group Weir Group Weir Plant Weir Plant Jiangsu .............. .......... .......... ... . ... ... . . ....... .......
Coreference Resolution
.............. .......... .......... ... . ... ... . . ....... .......
Inference Time
Coreference Resolution
.............. .......... .......... ... . ... ... . . ....... .......
Coreference Resolution
“Weir group” “whose” “headquarters” “U.S” “corporation” “power plant” “which” “Jiangsu” .............. .......... .......... ... . ... ... . . ....... .......
Coreference Resolution
“Weir group” “whose” “headquarters” “U.S” “corporation” “power plant” “which” “Jiangsu” Weir Group Weir HQ United States Weir Group Weir Group Weir Plant Weir Plant Jiangsu .............. .......... .......... ... . ... ... . . ....... .......
Finite Mixture Model
Finite Mixture Model
Z1=
Weir Group
Z2=
Weir Group
Z3=
Weir HQ
Finite Mixture Model
P(Weir Group) = 0.2, ..... P(Weir HQ) = 0.5, Entity Distribution Z1=
Weir Group
Z2=
Weir Group
Z3=
Weir HQ
Finite Mixture Model
P(Weir Group) = 0.2, ..... P(Weir HQ) = 0.5, Entity Distribution Z1=
Weir Group
Z2=
Weir Group
Z3=
Weir HQ
W1=
“Weir Group”
W2=
“whose”
W3=
“headquart.”
Finite Mixture Model
P(Weir Group) = 0.2, ..... P(Weir HQ) = 0.5, Entity Distribution P(W | Weir Group): “Weir Group”=0.4, “whose”=0.2, ....... Mention Distribution Z1=
Weir Group
Z2=
Weir Group
Z3=
Weir HQ
W1=
“Weir Group”
W2=
“whose”
W3=
“headquart.”
Bayesian Finite Mixture Model
β
K Entity Distribution Z1=
Weir Group
W1=
“Weir Group”
Z2=
Weir Group
W2=
“whose”
Z3=
Weir HQ
W3=
“headquart.”
P(W | Weir Group): “Weir Group”=0.4, “whose”=0.2, ....... Mention Distribution
Bayesian Finite Mixture Model
β
K Entity Distribution Z1=
Weir Group
W1=
“Weir Group”
Z2=
Weir Group
W2=
“whose”
Z3=
Weir HQ
W3=
“headquart.”
P(W | Weir Group): “Weir Group”=0.4, “whose”=0.2, ....... Mention Distribution
This is how many entities there are
Bayesian Finite Mixture Model
β
K Entity Distribution Z1=
Weir Group
W1=
“Weir Group”
Z2=
Weir Group
W2=
“whose”
Z3=
Weir HQ
W3=
“headquart.”
P(W | Weir Group): “Weir Group”=0.4, “whose”=0.2, ....... Mention Distribution
Bayesian Finite Mixture Model
β
K K
φ
Z1=
Weir Group
W1=
“Weir Group”
Z2=
Weir Group
W2=
“whose”
Z3=
Weir HQ
W3=
“headquart.”
Mention Distribution Entity Distribution
Bayesian Finite Mixture Model
β
K K
φ
Z1=
Weir Group
W1=
“Weir Group”
Z2=
Weir Group
W2=
“whose”
Z3=
Weir HQ
W3=
“headquart.”
Mention Distribution Entity Distribution
How do you choose K?
Bayesian Finite Mixture Model
β
K K
φ
Z1=
Weir Group
W1=
“Weir Group”
Z2=
Weir Group
W2=
“whose”
Z3=
Weir HQ
W3=
“headquart.”
Mention Distribution Entity Distribution
Infinite Mixture Model
β
φ
Z1=
Weir Group
W1=
“Weir Group”
Z2=
Weir Group
W2=
“whose”
Z3=
Weir HQ
W3=
“headquart.”
Mention Distribution Entity Distribution
∞ ∞
Infinite Mixture Model
Drawn from a Dirichlet Process (DP) prior [Teh et al., 2006]
β
φ
Z1=
Weir Group
W1=
“Weir Group”
Z2=
Weir Group
W2=
“whose”
Z3=
Weir HQ
W3=
“headquart.”
Mention Distribution Entity Distribution
∞ ∞
Infinite Mixture Model
β
φ
Z1=
Weir Group
W1=
“Weir Group”
Z2=
Weir Group
W2=
“whose”
Z3=
Weir HQ
W3=
“headquart.”
Mention Distribution Entity Distribution
∞ ∞
Experimental Setup
Experimental Setup
- ACE 2004 English translations of
Arabic and Chinese Treebanks
Experimental Setup
- ACE 2004 English translations of
Arabic and Chinese Treebanks
- 95 Documents and 3,905 Mentions
Experimental Setup
- ACE 2004 English translations of
Arabic and Chinese Treebanks
- 95 Documents and 3,905 Mentions
- Given mention boundaries
Experimental Setup
- ACE 2004 English translations of
Arabic and Chinese Treebanks
- 95 Documents and 3,905 Mentions
- Given mention boundaries
- Evaluate on MUC F1 Measure
Infinite Mixture Model
MUC F1
50 60 70 80 90 100 Mixture
54.5
Infinite Mixture Model
MUC F1
The Weir Group , whose headquarters is in the U.S is a large specialized corporation. This power plant , which , will be situated in Jiangsu, has a large generation capacity.
50 60 70 80 90 100 Mixture
54.5
Infinite Mixture Model
MUC F1
The Weir Group , whose headquarters is in the U.S is a large specialized corporation. This power plant , which , will be situated in Jiangsu, has a large generation capacity.
50 60 70 80 90 100 Mixture
54.5
Infinite Mixture Model
MUC F1
The Weir Group , whose headquarters is in the U.S is a large specialized corporation. This power plant , which , will be situated in Jiangsu, has a large generation capacity.
50 60 70 80 90 100 Mixture
54.5
Infinite Mixture Model
MUC F1
The Weir Group , whose headquarters is in the U.S is a large specialized corporation. This power plant , which , will be situated in Jiangsu, has a large generation capacity.
50 60 70 80 90 100 Mixture
54.5 Pronouns lumped into
their own clusters!
Enriching Mention Model
W Z
Enriching Mention Model
W Z
Non-Pronoun
Enriching Mention Model
Pronoun
W Z W Z
Non-Pronoun
Enriching Mention Model
Pronoun
W Z N W Z
Non-Pronoun
Enriching Mention Model
Pronoun
W Z N
Number Sing, Plural
W Z
Non-Pronoun
Enriching Mention Model
Pronoun
W Z N W Z
Non-Pronoun
Enriching Mention Model
Pronoun
W Z G N W Z
Non-Pronoun
Enriching Mention Model
Pronoun
W Z G N
Gender M,F,N
W Z
Non-Pronoun
Enriching Mention Model
Pronoun
W Z G N W Z
Non-Pronoun
Enriching Mention Model
Pronoun
W Z T G N W Z
Non-Pronoun
Enriching Mention Model
Pronoun
W Z T G N W Z
Non-Pronoun
EntityType PERS, LOC, ORG, MISC
Enriching Mention Model
Pronoun
W Z T G N W Z
Non-Pronoun
Enriching Mention Model
Pronoun
W Z T G N
Enriching Mention Model
Entity Parameters
φ ∞ Pronoun
W Z T G N
Enriching Mention Model
W | SING, MALE, PERS “he”:0.5, “him”: 0.3,... W | PL, NEUT, ORG “they”:0.3, “it”: 0.2,... Entity Parameters
φ ∞
Pronoun Parameters
Pronoun
W Z T G N
Enriching Mention Model
Entity Parameters
φ ∞
Pronoun Parameters ψ
Pronoun
W Z T G N
Enriching Mention Model
W Z
Non-Pronoun Pronoun
W Z T G N
Enriching Mention Model
W Z G N T Pronoun Non-pronoun
Enriching Mention Model
M W Z G N T Pronoun Non-pronoun
Enriching Mention Model
M W Z G N T Pronoun Non-pronoun
Mention Type Proper, Pronoun, Nominal
Enriching Mention Model
M W Z G N T Pronoun Non-pronoun
Enriching Mention Model
β ∞
φ ∞
W W Z Z
..... .....
Enriching Mention Model
β ∞
φ ∞
Z W Z W G N T G N T M M
..... .....
Pronoun Head Model
50 60 70 80 90 100 Mixture Pronoun
64.1 54.5
MUC F1
Pronoun Head Model
50 60 70 80 90 100 Mixture Pronoun
64.1 54.5
MUC F1
The Weir Group , whose headquarters is in the U.S is a large specialized corporation. This power plant , which , will be situated in Jiangsu, has a large generation capacity.
Pronoun Head Model
50 60 70 80 90 100 Mixture Pronoun
64.1 54.5
MUC F1
The Weir Group , whose headquarters is in the U.S is a large specialized corporation. This power plant , which , will be situated in Jiangsu, has a large generation capacity.
Pronoun Head Model
50 60 70 80 90 100 Mixture Pronoun
64.1 54.5
MUC F1
The Weir Group , whose headquarters is in the U.S is a large specialized corporation. This power plant , which , will be situated in Jiangsu, has a large generation capacity.
Should be coreferent with recent “power plant” entity.
Pronoun Head Model
50 60 70 80 90 100 Mixture Pronoun
64.1 54.5
MUC F1
The Weir Group , whose headquarters is in the U.S is a large specialized corporation. This power plant , which , will be situated in Jiangsu, has a large generation capacity.
Salience Model
Salience Model
L
Salience Model
Entity Activation 1 1.0 2 0.0 L
Salience Model
Entity Activation 1 1.0 2 0.0 Z L
Salience Model
Entity Activation 1 1.0 2 0.0 Z L S
Salience Model
Entity Activation 1 1.0 2 0.0 Salience Values TOP, HIGH, MED, LOW, NONE Z L S
Salience Model
Entity Activation 1 1.0 2 0.0 Salience Values TOP, HIGH, MED, LOW, NONE Z L M S
Salience Model
Entity Activation 1 1.0 2 0.0 Salience Values TOP, HIGH, MED, LOW, NONE Z L M S Mention Type Proper, Pronoun, Nominal
Salience Model
Z1 M1 L1 S1
Salience Model
Z1 M1 L1 S1 Entity Activation 1 0.0 2 0.0
Salience Model
Z1 M1 L1 S1 Entity Activation 1 0.0 2 0.0 Ent 1
Salience Model
Z1 M1 L1 S1 Entity Activation 1 0.0 2 0.0 Ent 1 NONE
Salience Model
Z1 M1 L1 S1 Entity Activation 1 0.0 2 0.0 Ent 1 NONE PROPER
Salience Model
Z1 M1 L1 S1 Z2 M2 L2 S2 Entity Activation 1 0.0 2 0.0
Salience Model
Z1 M1 L1 S1 Entity Activation 1 1.0 2 0.0 Z2 M2 L2 S2 Entity Activation 1 0.0 2 0.0
Salience Model
Z1 M1 L1 S1 Entity Activation 1 1.0 2 0.0 Ent 2 Z2 M2 L2 S2 Entity Activation 1 0.0 2 0.0
Salience Model
Z1 M1 L1 S1 NONE Entity Activation 1 1.0 2 0.0 Ent 2 Z2 M2 L2 S2 Entity Activation 1 0.0 2 0.0
Salience Model
Z1 M1 L1 S1 NONE Entity Activation 1 1.0 2 0.0 PROPER Ent 2 Z2 M2 L2 S2 Entity Activation 1 0.0 2 0.0
Salience Model
Z1 M1 L1 S1 Z2 M2 L2 S2 Z3 M3 L3 S3 Entity Activation 1 1.0 2 0.0 Entity Activation 1 0.0 2 0.0
Salience Model
Entity Activation 1 0.5 2 1.0 Z1 M1 L1 S1 Z2 M2 L2 S2 Z3 M3 L3 S3 Entity Activation 1 1.0 2 0.0 Entity Activation 1 0.0 2 0.0
Salience Model
Entity Activation 1 0.5 2 1.0 Ent 2 Z1 M1 L1 S1 Z2 M2 L2 S2 Z3 M3 L3 S3 Entity Activation 1 1.0 2 0.0 Entity Activation 1 0.0 2 0.0
Salience Model
TOP Entity Activation 1 0.5 2 1.0 Ent 2 Z1 M1 L1 S1 Z2 M2 L2 S2 Z3 M3 L3 S3 Entity Activation 1 1.0 2 0.0 Entity Activation 1 0.0 2 0.0
Salience Model
TOP Entity Activation 1 0.5 2 1.0 Ent 2 PRONOUN Z1 M1 L1 S1 Z2 M2 L2 S2 Z3 M3 L3 S3 Entity Activation 1 1.0 2 0.0 Entity Activation 1 0.0 2 0.0
Enriching Mention Model
β ∞
φ ∞
Z W Z W G N T G N T M M
..... .....
Enriching Mention Model
β ∞
φ ∞
Z W Z W G N T G N T M M
..... .....
S L S L
Enriching Mention Model
β ∞
φ ∞
Z W Z W G N T G N T M M
..... .....
S L S L
Salience Model
50 60 70 80 90 100 Mixture Pronoun Salience
71.5 61.5 54.5
MUC F1
Salience Model
50 60 70 80 90 100 Mixture Pronoun Salience
71.5 61.5 54.5
MUC F1
The Weir Group , whose headquarters is in the U.S is a large specialized corporation. This power plant , which , will be situated in Jiangsu, has a large generation capacity.
Salience Model
TOP HIGH MID LOW NONE 1.000000000000000205
Proper Pronoun Nominal
Salience Model
TOP HIGH MID LOW NONE 1.000000000000000205
Proper Pronoun Nominal
Salience Model
TOP HIGH MID LOW NONE 1.000000000000000205
Proper Pronoun Nominal
Salience Model
TOP HIGH MID LOW NONE 1.000000000000000205
Proper Pronoun Nominal
Salience Model
TOP HIGH MID LOW NONE 1.000000000000000205
Proper Pronoun Nominal
Salience Model
TOP HIGH MID LOW NONE 1.000000000000000205
Proper Pronoun Nominal
Salience Model
TOP HIGH MID LOW NONE 1.000000000000000205
Proper Pronoun Nominal
Global Coreference Resolution
Global Coreference Resolution
Global Coreference Resolution
Global Coreference Resolution
Global Coreference Resolution
Global Entities
HDP Model
β ∞
φ ∞
Z W Z W G N T G N T M M
..... .....
S L S L
HDP Model
ψ N
HDP Model
β ∞
φ ∞
Z W Z W G N T G N T M M
.. ..
S L S L
HDP Model
∞
β0
ψ ψ N
β ∞
Z W Z W G N T G N T M M
.. ..
S L S L φ ∞
HDP Model
∞
β0
ψ
Global Entity Distribution drawn from a DP
ψ N
β ∞
Z W Z W G N T G N T M M
.. ..
S L S L φ ∞
HDP Model
∞
β0
ψ ψ N
β ∞
Z W Z W G N T G N T M M
.. ..
S L S L φ ∞
HDP Model
∞
β0
ψ ψ N
β ∞
Z W Z W G N T G N T M M
.. ..
S L S L φ ∞
Document Entity Distribution subsampled from Global Distr.
HDP Model
∞
β0
ψ ψ N
β ∞
Z W Z W G N T G N T M M
.. ..
S L S L φ ∞
HDP Model
50 60 70 80 90 100 Mixture Pronoun Salience HDP
72.5 71.5 64.1 54.5
MUC F1
HDP Model
50 60 70 80 90 100 Mixture Pronoun Salience HDP
72.5 71.5 64.1 54.5
MUC F1
The Weir Group , whose headquarters is in the U.S is a large specialized corporation. This power plant , which , will be situated in Jiangsu, has a large generation capacity.
HDP Model
HDP Model
Bush he Rice
HDP Model
Bush he Rice Rice Bush she
MUC6 Formal Experiments
Dataset # Docs. P R F MUC6 60 80.8 52.8 63.9 +DRYRUN 251 79.1 59.7 68.0 +NWIRE 381 80.4 62.4 70.3
- MUC6: 30 train / test documents
- Our Unsupervised Result
- Recent Supervised Result
- 73.4 F1 [McCallum and Wellner, 2004]
ACE Formal Experiments
- ACE 2004 English NWIRE
- 64.2 F1 [This paper] Unsupervised
- 67.5 F1 [Denis et al., 2007] Supervised
Summary
- Fully generative unsupervised
Bayesian nonparemetric coref model
- Sequential model of local attentional
state at the document level
- HDP global coreference model
- Broadly competitive with many
supervised results
Thanks! Questions?
Enriching Mention Model
W Z
Enriching Mention Model
W Z Entity Parameters “group”: 0.3, “its”: 0.2, ... Word | Entity
Enriching Mention Model
W Z “group”: 0.3, “its”: 0.2, ... Entity Parameters Word | Entity
Enriching Mention Model
W Z “group”: 0.3, “its”: 0.2, ... Entity Parameters Word | Entity Pronoun Non-pronoun
Enriching Mention Model
W Z “group”: 0.3, “its”: 0.2, ... Entity Parameters Word | Entity Pronoun Non-pronoun
Enriching Mention Model
W Z “group”: 0.3, “its”: 0.2, ... Entity Parameters T Word | Entity Pronoun Non-pronoun
Enriching Mention Model
W Z “group”: 0.3, “its”: 0.2, ... Entity Parameters T Word | Entity ORG: 0.5, LOC: 0.2, ... Entity Type | Entity Pronoun Non-pronoun
Enriching Mention Model
W Z “group”: 0.3, “its”: 0.2, ... Entity Parameters N T Word | Entity ORG: 0.5, LOC: 0.2, ... Entity Type | Entity Pronoun Non-pronoun
Enriching Mention Model
W Z “group”: 0.3, “its”: 0.2, ... Entity Parameters N T Word | Entity ORG: 0.5, LOC: 0.2, ... Entity Type | Entity PL: 0.6, SING: 0.4 Number | Entity Pronoun Non-pronoun
Enriching Mention Model
W Z “group”: 0.3, “its”: 0.2, ... Entity Parameters G N T Word | Entity ORG: 0.5, LOC: 0.2, ... Entity Type | Entity PL: 0.6, SING: 0.4 Number | Entity Pronoun Non-pronoun
Enriching Mention Model
W Z “group”: 0.3, “its”: 0.2, ... Entity Parameters G N T Word | Entity ORG: 0.5, LOC: 0.2, ... Entity Type | Entity PL: 0.6, SING: 0.4 Number | Entity
NEUT: 0.7, M: 0.2, F: 0.1