Probabilistic Frame-Semantic Parsing
Dipanjan Das Nathan Schneider Desai Chen
Noah A. Smith
NAACL-HLT June 4, 2010
School of Computer Science Carnegie Mellon University
Probabilistic Frame-Semantic Parsing Noah A. Smith Dipanjan Das - - PowerPoint PPT Presentation
Probabilistic Frame-Semantic Parsing Noah A. Smith Dipanjan Das Nathan Schneider Desai Chen School of Computer Science Carnegie Mellon University NAACL-HLT June 4, 2010 In a Nutshell Most models for semantics are very local (cascades of
Dipanjan Das Nathan Schneider Desai Chen
Noah A. Smith
NAACL-HLT June 4, 2010
School of Computer Science Carnegie Mellon University
Das, Schneider, Chen and Smith, NAACL-HLT 2010
2
(cascades of classifiers)
semantic processing (feature sharing among all semantic classes) (just two probabilistic models)
improvements
Das, Schneider, Chen and Smith, NAACL-HLT 2010
3
Das, Schneider, Chen and Smith, NAACL-HLT 2010
4
Das, Schneider, Chen and Smith, NAACL-HLT 2010
5
representations
Das, Schneider, Chen and Smith, NAACL-HLT 2010
elaborating the frame instance
6
Das, Schneider, Chen and Smith, NAACL-HLT 2010
elaborating the frame instance
7
the 1995 book by John Grisham
Das, Schneider, Chen and Smith, NAACL-HLT 2010
elaborating the frame instance
8
the 1995 book by John Grisham
TEXT
Das, Schneider, Chen and Smith, NAACL-HLT 2010
elaborating the frame instance
9
the 1995 book by John Grisham
TEXT
Das, Schneider, Chen and Smith, NAACL-HLT 2010
elaborating the frame instance
10
the 1995 book by John Grisham
Author Time_of_creation
elaborating the frame instance
TEXT
Das, Schneider, Chen and Smith, NAACL-HLT 2010
11
(Fillmore et al., 2003)
MAKE_NOISE Noisy_event Sound Sound_source Place Time
cough.v, gobble.v, hiss.v, ring.v, yodel.v, ...
Das, Schneider, Chen and Smith, NAACL-HLT 2010
MAKE_NOISE Noisy_event Sound Sound_source Place Time
cough.v, gobble.v, hiss.v, ring.v, yodel.v, ...
12
lexical units frame roles
(Fillmore et al., 2003)
Das, Schneider, Chen and Smith, NAACL-HLT 2010
13
relationships between frames and between roles
EVENT Place Time Event TRANSITIVE_ACTION Agent Patient Event Cause Place Time OBJECTIVE_INFLUENCE Dependent_entity Influencing_situation Place Time Influencing_entity CAUSE_TO_MAKE_NOISE Agent Sound_maker Cause Place Time MAKE_NOISE Noisy_event Sound Sound_source Place Time
cough.v, gobble.v, hiss.v, ring.v, yodel.v, ... blare.v, honk.v, play.v, ring.v, toot.v, ... — affect.v, effect.n, impact.n, impact.v, ... event.n, happen.v,
Inheritance relation Causative_of relation Excludes relation
Purpose
(Fillmore et al., 2003)
Das, Schneider, Chen and Smith, NAACL-HLT 2010
14
annotation per sentence
Das, Schneider, Chen and Smith, NAACL-HLT 2010
15
Marco Polo wrote an account of Asian society during the 13th century .
TEXT Author Topic
Das, Schneider, Chen and Smith, NAACL-HLT 2010
16
Marco Polo wrote an account of Asian society during the 13th century .
TEXT Author Topic
here, the ambiguous word evokes the TEXT frame
Das, Schneider, Chen and Smith, NAACL-HLT 2010
17
Marco Polo wrote an account of Asian society during the 13th century .
TEXT Author Topic
participants in the event or scenario
Das, Schneider, Chen and Smith, NAACL-HLT 2010
18
Marco Polo wrote an account of Asian society during the 13th century .
TEXT Author Topic
frame-specific
participants in the event or scenario
Das, Schneider, Chen and Smith, NAACL-HLT 2010
19
(Yi et al. 2007, Matsubayashi et al. 2009)
(Matsubayashi et al. 2009)
Das, Schneider, Chen and Smith, NAACL-HLT 2010
20
Das, Schneider, Chen and Smith, NAACL-HLT 2010
21
Das, Schneider, Chen and Smith, NAACL-HLT 2010
22
SemEval 2007
frame structure extraction
frame-semantic parses
performing system
Das, Schneider, Chen and Smith, NAACL-HLT 2010
23
SemEval 2007
Das, Schneider, Chen and Smith, NAACL-HLT 2010
24
SemEval 2007
frames and arguments
identification
related frames in the FrameNet hierarchy
Das, Schneider, Chen and Smith, NAACL-HLT 2010
25
Das, Schneider, Chen and Smith, NAACL-HLT 2010
26
shallow semantic parsing
syntactic annotation
lexical units
Das, Schneider, Chen and Smith, NAACL-HLT 2010
27
Desired Structure
Everyone in Dublin seems intent on changing places with everyone else .
Das, Schneider, Chen and Smith, NAACL-HLT 2010
28
Desired Structure
Everyone in Dublin seems intent on changing places with everyone else .
LOCATIVE_RELATION Figure Ground LOCALE Locale EXCHANGE Exchanger_1 Exchanger_2 Themes
PURPOSE
Agent Goal Phenomenon
APPEARANCE
Inference
Das, Schneider, Chen and Smith, NAACL-HLT 2010
29
Three Subtasks:
(nontrivial!)
(795 possibilities; ~WSD)
(~SRL; roleset is frame-specific)
Das, Schneider, Chen and Smith, NAACL-HLT 2010
29
Three Subtasks:
(nontrivial!)
(795 possibilities; ~WSD)
(~SRL; roleset is frame-specific)
sentence
predicates frames frames and arguments
Das, Schneider, Chen and Smith, NAACL-HLT 2010
29
Three Subtasks:
(nontrivial!)
(795 possibilities; ~WSD)
(~SRL; roleset is frame-specific)
sentence
predicates frames frames and arguments
rule-based
probabilistic probabilistic
Das, Schneider, Chen and Smith, NAACL-HLT 2010
30
Das, Schneider, Chen and Smith, NAACL-HLT 2010
31
Target Identification
Everyone in Dublin seems intent on changing places with everyone else .
Das, Schneider, Chen and Smith, NAACL-HLT 2010
32
Target Identification
Everyone in Dublin seems intent on changing places with everyone else .
Das, Schneider, Chen and Smith, NAACL-HLT 2010
33
Das, Schneider, Chen and Smith, NAACL-HLT 2010
34
Frame Identification
Everyone in Dublin seems intent on changing places with everyone else .
LOCATIVE_RELATION LOCALE EXCHANGE
PURPOSE APPEARANCE
Das, Schneider, Chen and Smith, NAACL-HLT 2010
35
Frame Identification
Everyone in Dublin seems intent on changing places with everyone else .
LOCATIVE_RELATION LOCALE EXCHANGE
PURPOSE APPEARANCE
J&N’07 used several classifiers for this subtask
Das, Schneider, Chen and Smith, NAACL-HLT 2010
36
Unseen LUs
from WordNet-extended set
sixth ∈ ORDINAL_NUMBERS? Y N
Seen LUs
sixth ∈ RELATIVE_TIME? sixth ∈ INGESTION?
… 1 classifier … 795 classifiers
Frame Identification
(Johansson and Nugues, 2007)
place
LOCALE PLACING
book in
Das, Schneider, Chen and Smith, NAACL-HLT 2010
37
Unseen LUs
from WordNet-extended set
intent ∈ PURPOSE? Y N
Seen LUs
intent ∈ AIMING? intent ∈ INGESTION?
… 1 classifier … 795 classifiers
Frame Identification
(Johansson and Nugues, 2007)
place
LOCALE PLACING
book in
Das, Schneider, Chen and Smith, NAACL-HLT 2010
38
Frame Identification
Our approach: One single model for frame identification
Everyone in Dublin seems intent on changing places with everyone else .
LOCATIVE_RELATION LOCALE EXCHANGE
PURPOSE APPEARANCE
Das, Schneider, Chen and Smith, NAACL-HLT 2010
39
NN IN NNP VBZ NN IN VBG NNS IN NN RB .
Frame Identification
Assume POS tags and dependency trees to be given
Everyone in Dublin seems intent on changing places with everyone else .
Das, Schneider, Chen and Smith, NAACL-HLT 2010
40
Frame Identification
Assume that target is connected to the frame through a prototype unit
t ℓ f t ℓ
f
NN IN NNP VBZ NN IN VBG NNS IN NN RB .
Everyone in Dublin seems intent on changing places with everyone else .
Das, Schneider, Chen and Smith, NAACL-HLT 2010
NN
41
Frame Identification
PURPOSE
Assume that target is connected to the frame through a prototype unit
t ℓ f
?
NN IN NNP VBZ IN VBG NNS IN NN RB .
Everyone in Dublin seems intent on changing places with everyone else .
t ℓ
f
Das, Schneider, Chen and Smith, NAACL-HLT 2010
42
Frame Identification
PURPOSE Goal Means Agent
aim.n, goal.n, object.n,
target.n
Attribute Value
Das, Schneider, Chen and Smith, NAACL-HLT 2010
43
Frame Identification
PURPOSE Goal Means Agent
aim.n, goal.n, object.n,
target.n
Attribute Value
Das, Schneider, Chen and Smith, NAACL-HLT 2010
44
Frame Identification
PURPOSE Goal Means Agent
aim.n, goal.n, object.n,
target.n
Attribute Value
note that the target intent is unseen
Das, Schneider, Chen and Smith, NAACL-HLT 2010
45
Frame Identification
PURPOSE Goal Means Agent
aim.n, goal.n, object.n,
target.n
Attribute Value
note that the target intent is unseen
but lexical semantic relationships between some and the target exist
ℓ
purpose ≈ intent
Das, Schneider, Chen and Smith, NAACL-HLT 2010
46
Thus, we define a probabilistic model: Frame Identification
pθ(f, ℓ | t, x) ∝ exp θ⊤g(f, ℓ, t, x)
NN IN NNP VBZ NN IN VBG NNS IN NN RB .
Everyone in Dublin seems intent on changing places with everyone else .
Das, Schneider, Chen and Smith, NAACL-HLT 2010
47
Thus, we define a probabilistic model: Frame Identification
pθ(f, ℓ | t, x) ∝ exp θ⊤g(f, ℓ, t, x)
WordNet relationships!
some features looking at the lexical and semantic relationships between and
ℓ f
NN IN NNP VBZ NN IN VBG NNS IN NN RB .
Everyone in Dublin seems intent on changing places with everyone else .
Das, Schneider, Chen and Smith, NAACL-HLT 2010
whole sentence structure
48
Thus, we define a probabilistic model: Frame Identification
pθ(f, ℓ | t, x) ∝ exp θ⊤g(f, ℓ, t, x) x
NN IN NNP VBZ NN IN VBG NNS IN NN RB .
Everyone in Dublin seems intent on changing places with everyone else .
Das, Schneider, Chen and Smith, NAACL-HLT 2010
49
Thus, we define a probabilistic model: Frame Identification
pθ(f, ℓ | t, x) ∝ exp θ⊤g(f, ℓ, t, x)
Note that is unknown
NN IN NNP VBZ NN IN VBG NNS IN NN RB .
Everyone in Dublin seems intent on changing places with everyone else .
ℓ
Das, Schneider, Chen and Smith, NAACL-HLT 2010
50
Thus, we define a probabilistic model: Frame Identification
pθ(f, ℓ | t, x) ∝ exp θ⊤g(f, ℓ, t, x)
Marginalization of latent variable:
pθ(f | t, x) ∝
exp θ⊤g(f, ℓ, t, x)
NN IN NNP VBZ NN IN VBG NNS IN NN RB .
Everyone in Dublin seems intent on changing places with everyone else .
Das, Schneider, Chen and Smith, NAACL-HLT 2010
51
Frame Identification Inference:
NN IN NNP VBZ NN IN VBG NNS IN NN RB .
Everyone in Dublin seems intent on changing places with everyone else .
ˆ f ← argmaxf
exp θ⊤g(f, ℓ, t, x)
Das, Schneider, Chen and Smith, NAACL-HLT 2010
52
Frame Identification Inference: Training: maximum conditional likelihood
NN IN NNP VBZ NN IN VBG NNS IN NN RB .
Everyone in Dublin seems intent on changing places with everyone else .
ˆ f ← argmaxf
exp θ⊤g(f, ℓ, t, x)
Das, Schneider, Chen and Smith, NAACL-HLT 2010
53
Results
50 57 64 71 78 J&N’07 This Work This Work
74.2 68.3 64.0 74.2 61.0 56.4 74.2 77.5 73.9
Precision Recall F1
(Oracle Targets) (Model Targets)
Frame Identification
Das, Schneider, Chen and Smith, NAACL-HLT 2010
54
Results
50 57 64 71 78 J&N’07 This Work This Work
74.2 68.3 64.0 74.2 61.0 56.4 74.2 77.5 73.9
Precision Recall F1
(Oracle Targets) (Model Targets)
Frame Identification
significant improvement
Das, Schneider, Chen and Smith, NAACL-HLT 2010
55
Frame Identification
were unseen
frame matching
Das, Schneider, Chen and Smith, NAACL-HLT 2010
56
Das, Schneider, Chen and Smith, NAACL-HLT 2010
57
Argument Identification
Everyone in Dublin seems intent on changing places with everyone else .
EXCHANGE Exchanger_1 Exchanger_2 Themes
Das, Schneider, Chen and Smith, NAACL-HLT 2010
58
Candidate spans
Two steps:
Argument Identification: The traditional approach
Everyone in Dublin
......
in Dublin
changing places with everyone else places everyone
Das, Schneider, Chen and Smith, NAACL-HLT 2010
59
Everyone in Dublin
......
Candidate spans
Two steps:
Argument Identification: The traditional approach
in Dublin
changing places with everyone else places everyone
✔
binary filtering potential arguments
✗ ✗ ✗ ✗ ✔ ✔
Das, Schneider, Chen and Smith, NAACL-HLT 2010
60
Everyone in Dublin
......
Candidate spans
Two steps:
Argument Identification: The traditional approach
in Dublin
changing places with everyone else places everyone
✔ ✔ ✗ ✗ ✗ ✗ ✔
Exchanger_1 Exchanger_2 Themes
classification of arguments into different roles
Das, Schneider, Chen and Smith, NAACL-HLT 2010
61
Everyone in Dublin
......
Candidate spans
Two steps:
Argument Identification: The traditional approach
in Dublin
changing places with everyone else places everyone
✔ ✔ ✗ ✗ ✗ ✗ ✔
Exchanger_1 Exchanger_2 Themes
Two steps unnecessary
Das, Schneider, Chen and Smith, NAACL-HLT 2010
62
......
Candidate spans Roleset for EXCHANGE
Argument Identification: Our approach
...
Everyone in Dublin in Dublin
changing places with everyone else places everyone
Exchanger_1 Exchanger_2 Exchangers Theme_1 Theme_2 Themes Manner Means
Das, Schneider, Chen and Smith, NAACL-HLT 2010
63
......
Candidate spans
Exchanger_1 Exchanger_2 Exchangers Theme_1 Theme_2 Themes Manner Means
Roleset for EXCHANGE
Argument Identification: Our approach
...
Everyone in Dublin in Dublin
changing places with everyone else places everyone
Das, Schneider, Chen and Smith, NAACL-HLT 2010
64
A probabilistic model: Argument Identification
pψ(r → s | f, t, x) ∝ exp ψ⊤h(r, s, f, t, x)
Das, Schneider, Chen and Smith, NAACL-HLT 2010
65
A probabilistic model: Argument Identification
pψ(r → s | f, t, x) ∝ exp ψ⊤h(r, s, f, t, x)
features looking at the span, the frame, the role and the observed sentence structure
Das, Schneider, Chen and Smith, NAACL-HLT 2010
Argument Identification A probabilistic model:
pψ(r → s | f, t, x) ∝ exp ψ⊤h(r, s, f, t, x)
Decoding: Best span for each role is selected For each frame, the best set of non-
together
66
Das, Schneider, Chen and Smith, NAACL-HLT 2010
Argument Identification A probabilistic model:
pψ(r → s | f, t, x) ∝ exp ψ⊤h(r, s, f, t, x)
Training: Maximum conditional likelihood
67
Das, Schneider, Chen and Smith, NAACL-HLT 2010
68
Results
50.000 59.572 69.145 78.717 88.290 Model Spans Oracle Spans
81.0 68.5 74.8 60.6 88.3 78.7
Precision Recall F1
Argument identification only, with gold targets and frames Argument Identification
Das, Schneider, Chen and Smith, NAACL-HLT 2010
69
Das, Schneider, Chen and Smith, NAACL-HLT 2010
70
Results
30.00 38.75 47.50 56.25 65.00 J&N’07 This work
50.2 45.6 41.9 38.4 62.8 56.0
Precision Recall F1
full frame-semantic parsing Full Frame-Semantic Parsing
Das, Schneider, Chen and Smith, NAACL-HLT 2010
71
Results
30.00 38.75 47.50 56.25 65.00 J&N’07 This work
50.2 45.6 41.9 38.4 62.8 56.0
Precision Recall F1
full frame-semantic parsing
significant improvement
Full Frame-Semantic Parsing
Das, Schneider, Chen and Smith, NAACL-HLT 2010
72
classifiers for the frame-semantic parsing task
stage using only one model instead of two
http://www.ark.cs.cmu.edu/SEMAFOR
http://www.ark.cs.cmu.edu/SEMAFOR
JUDGMENT_DIRECT_ADDRESS