Semi-supervised Semantic Role Labeling Hagen Frstenau Department of - - PowerPoint PPT Presentation

semi supervised semantic role labeling
SMART_READER_LITE
LIVE PREVIEW

Semi-supervised Semantic Role Labeling Hagen Frstenau Department of - - PowerPoint PPT Presentation

Frame Semantics A Semi-supervised approach to role labeling Summary Semi-supervised Semantic Role Labeling Hagen Frstenau Department of Computational Linguistics Saarland University (joint work with Mirella Lapata) FEAST meeting


slide-1
SLIDE 1

Frame Semantics A Semi-supervised approach to role labeling Summary

Semi-supervised Semantic Role Labeling

Hagen Fürstenau

Department of Computational Linguistics Saarland University (joint work with Mirella Lapata)

“FEAST” meeting November 26th, 2008

1 / 17

slide-2
SLIDE 2

Frame Semantics A Semi-supervised approach to role labeling Summary

Outline

1

Frame Semantics

2

A Semi-supervised approach to role labeling

3

Summary

2 / 17

slide-3
SLIDE 3

Frame Semantics A Semi-supervised approach to role labeling Summary

Frame Semantics

Charles J. Fillmore, 1975 & 1981

Definition A frame describes a prototypical situation. It is evoked by a frame evoking element (FEE). It can have several frame elements (roles).

3 / 17

slide-4
SLIDE 4

Frame Semantics A Semi-supervised approach to role labeling Summary

Frame Semantics

Charles J. Fillmore, 1975 & 1981

Definition A frame describes a prototypical situation. It is evoked by a frame evoking element (FEE). It can have several frame elements (roles).

Matilde fried the catfish in a heavy iron skillet.

3 / 17

slide-5
SLIDE 5

Frame Semantics A Semi-supervised approach to role labeling Summary

Frame Semantics

Charles J. Fillmore, 1975 & 1981

Definition A frame describes a prototypical situation. It is evoked by a frame evoking element (FEE). It can have several frame elements (roles).

Apply_heat FEE Matilde fried the catfish in a heavy iron skillet.

3 / 17

slide-6
SLIDE 6

Frame Semantics A Semi-supervised approach to role labeling Summary

Frame Semantics

Charles J. Fillmore, 1975 & 1981

Definition A frame describes a prototypical situation. It is evoked by a frame evoking element (FEE). It can have several frame elements (roles).

Apply_heat FEE Matilde fried the catfish in a heavy iron skillet. Roles

Heating_instrument Food Cook 3 / 17

slide-7
SLIDE 7

Frame Semantics A Semi-supervised approach to role labeling Summary

Frame Semantics

Shallow semantic analysis Generalizes well across languages Avoids problem of “universal roles” Can benefit various NLP tasks (IR, QA, ...)

4 / 17

slide-8
SLIDE 8

Frame Semantics A Semi-supervised approach to role labeling Summary

Frame Semantics

Shallow semantic analysis Generalizes well across languages Avoids problem of “universal roles” Can benefit various NLP tasks (IR, QA, ...)

4 / 17

slide-9
SLIDE 9

Frame Semantics A Semi-supervised approach to role labeling Summary

Frame Semantics

Shallow semantic analysis Generalizes well across languages Avoids problem of “universal roles” Can benefit various NLP tasks (IR, QA, ...)

4 / 17

slide-10
SLIDE 10

Frame Semantics A Semi-supervised approach to role labeling Summary

Frame Semantics

Shallow semantic analysis Generalizes well across languages Avoids problem of “universal roles” Can benefit various NLP tasks (IR, QA, ...)

Commerce_goods-transfer Google snapped up YouTube for $1.65 billion.

Money Goods B u y e r

How much did Google pay for YouTube?

B u y e r Goods M

  • n

e y

4 / 17

slide-11
SLIDE 11

Frame Semantics A Semi-supervised approach to role labeling Summary

Frame Semantic Parsing

To automatically derive Frame Semantic analyses

1

Take an annotated corpus: FrameNet (135,000 sentences)

2

Train a classifier on this data

3

Use classifier as Frame Semantic parser

5 / 17

slide-12
SLIDE 12

Frame Semantics A Semi-supervised approach to role labeling Summary

Frame Semantic Parsing

To automatically derive Frame Semantic analyses

1

Take an annotated corpus: FrameNet (135,000 sentences)

2

Train a classifier on this data

3

Use classifier as Frame Semantic parser Annotation is expensive and time-consuming Must be repeated for new languages or domains Can we reduce this annotation effort?

5 / 17

slide-13
SLIDE 13

Frame Semantics A Semi-supervised approach to role labeling Summary

Semi-supervised learning

Goal: Try to make use of unlabeled data! Example: Binary classification

6 / 17

slide-14
SLIDE 14

Frame Semantics A Semi-supervised approach to role labeling Summary

Semi-supervised learning

Goal: Try to make use of unlabeled data! Example: Binary classification

?

6 / 17

slide-15
SLIDE 15

Frame Semantics A Semi-supervised approach to role labeling Summary

Semi-supervised learning

Goal: Try to make use of unlabeled data! Example: Binary classification

?

6 / 17

slide-16
SLIDE 16

Frame Semantics A Semi-supervised approach to role labeling Summary

Semi-supervised learning

Goal: Try to make use of unlabeled data! Example: Binary classification

?

6 / 17

slide-17
SLIDE 17

Frame Semantics A Semi-supervised approach to role labeling Summary

Semi-supervised learning

Goal: Try to make use of unlabeled data! Example: Binary classification

?

6 / 17

slide-18
SLIDE 18

Frame Semantics A Semi-supervised approach to role labeling Summary

Semi-supervised learning

Goal: Try to make use of unlabeled data! Example: Binary classification

?

6 / 17

slide-19
SLIDE 19

Frame Semantics A Semi-supervised approach to role labeling Summary

Applied to role labeling

To expand a Frame Semantic corpus

1

Find unlabeled sentences “similar” to labeled ones

2

Project annotations from labeled sentences

3

Add new labeled examples to annotation pool

4

Hope that the expanded corpus is “better” than the original one

7 / 17

slide-20
SLIDE 20

Frame Semantics A Semi-supervised approach to role labeling Summary

Applied to role labeling

To expand a Frame Semantic corpus

1

Find unlabeled sentences “similar” to labeled ones

2

Project annotations from labeled sentences

3

Add new labeled examples to annotation pool

4

Hope that the expanded corpus is “better” than the original one What’s “similar”? Take into account syntactic and semantic measures! What’s “better”? A supervised algorithm makes better predictions when trained on the expanded corpus.

7 / 17

slide-21
SLIDE 21

Frame Semantics A Semi-supervised approach to role labeling Summary

The General Framework

FrameNet

training test

Classifier

... ...

Syntactic parsing

8 / 17

slide-22
SLIDE 22

Frame Semantics A Semi-supervised approach to role labeling Summary

The General Framework

FrameNet

training test

BNC

Syntactic parsing Syntactic parsing

8 / 17

slide-23
SLIDE 23

Frame Semantics A Semi-supervised approach to role labeling Summary

The General Framework

FrameNet

training test

BNC

Syntactic parsing Syntactic parsing Annotation of similar sentences

8 / 17

slide-24
SLIDE 24

Frame Semantics A Semi-supervised approach to role labeling Summary

The General Framework

FrameNet

training test

Classifier BNC

Results improved?

8 / 17

slide-25
SLIDE 25

Frame Semantics A Semi-supervised approach to role labeling Summary

Similarity measure

Find best alignment between predicate-argument structures:

fry Mathilde catfish in the skillet iron Apply_heat

m

  • d

subj

  • bj

Cook H e a t i n g _ i n s t r u m e n t Food

a heavy boil we egg some

subj

  • bj

GR Lemma Role subj Mathilde Cook

  • bj

catfish Food mod_in skillet Heating_instrument GR Lemma Role subj we

  • bj

egg

9 / 17

slide-26
SLIDE 26

Frame Semantics A Semi-supervised approach to role labeling Summary

Similarity measure

Find best alignment between predicate-argument structures:

fry Mathilde catfish in the skillet iron Apply_heat

m

  • d

subj

  • bj

Cook H e a t i n g _ i n s t r u m e n t Food

a heavy boil we egg some

subj

  • bj

GR Lemma Role subj Mathilde Cook

  • bj

catfish Food mod_in skillet Heating_instrument GR Lemma Role subj we Cook

  • bj

egg Food Apply_heat

Cook Food

9 / 17

slide-27
SLIDE 27

Frame Semantics A Semi-supervised approach to role labeling Summary

Alignment

We can feel the blood coursing through our veins again. Adrenalin was still coursing through her veins.

Fluid

  • blood

SUBJ

  • adrenalin

SUBJ

Path

  • vein

IOBJ_THROUGH

  • be

AUX

again

MOD

  • still

MOD

vein

IOBJ_THROUGH 10 / 17

slide-28
SLIDE 28

Frame Semantics A Semi-supervised approach to role labeling Summary

Similarity measure

Consider a partial, injective alignment function σ : {1, . . . , m} → {ε, 1, . . . , n}

(σ(i) = σ(j) = ε ⇒ i = j)

and define similarity with respect to this alignment: sim(σ) :=

m

  • i=1

σ(i)=ε

  • A · δGRi,GRσ(i) + cos(

vi, vσ(i)) − B

  • The similarity of two predicate-argument structures is

max

σ

sim(σ)

11 / 17

slide-29
SLIDE 29

Frame Semantics A Semi-supervised approach to role labeling Summary

Choosing which sentences to label

We have a large corpus, therefore we can be picky: Annotate if similarity is above some threshold? Global threshold value doesn’t work! Pick k-NN unlabeled sentences for each labeled one? Neglects some of the global structure. Global graph optimization? Computationally expensive.

12 / 17

slide-30
SLIDE 30

Frame Semantics A Semi-supervised approach to role labeling Summary

Choosing which sentences to label

We have a large corpus, therefore we can be picky: Annotate if similarity is above some threshold? Global threshold value doesn’t work! Pick k-NN unlabeled sentences for each labeled one? Neglects some of the global structure. Global graph optimization? Computationally expensive.

12 / 17

slide-31
SLIDE 31

Frame Semantics A Semi-supervised approach to role labeling Summary

Choosing which sentences to label

We have a large corpus, therefore we can be picky: Annotate if similarity is above some threshold? Global threshold value doesn’t work! Pick k-NN unlabeled sentences for each labeled one? Neglects some of the global structure. Global graph optimization? Computationally expensive.

12 / 17

slide-32
SLIDE 32

Frame Semantics A Semi-supervised approach to role labeling Summary

Choosing which sentences to label

We have a large corpus, therefore we can be picky: Annotate if similarity is above some threshold? Global threshold value doesn’t work! Pick k-NN unlabeled sentences for each labeled one? Neglects some of the global structure. Global graph optimization? Computationally expensive.

12 / 17

slide-33
SLIDE 33

Frame Semantics A Semi-supervised approach to role labeling Summary

Evaluation

Compare labeling performance before and after expansion Independent supervised labeler: desert (dependency-based semantic role labeling toolkit)

1

argument recognition

2

argument labeling

Evaluate on 100 verb FrameNet sample (80% training, 20% test) Labeled precision:

# correctly labeled roles # all predicted roles

Labeled recall:

# correctly labeled roles # all real roles

Labeled F1 score: F1 = 2PR

P+R

13 / 17

slide-34
SLIDE 34

Frame Semantics A Semi-supervised approach to role labeling Summary

Results

TrainSet Size Frame Acc. Prec (%) Rec (%) F1 (%) ≤ 1 seed 95 81.6 24.9 31.3 27.7 + 2-NN 170 81.6 26.4 32.6 29.2∗ + self training 183 81.6 22.2 29.3 25.3 ≤ 5 seeds 450 83.6 29.7 38.4 33.5 + 2-NN 844 82.6 31.8 40.4 35.6∗ + self training 882 83.4 24.3 33.4 28.1 ≤ 10 seeds 849 83.1 35.5 42.0 38.5 + 2-NN 1549 82.8 38.1 44.1 40.9∗ + self training 1609 82.9 34.0 41.0 37.1 ≤ 20 seeds 1414 84.6 38.7 46.1 42.1 + 2-NN 2600 85.6 40.5 46.7 43.4 + self training 2686 85.3 34.1 42.0 37.6 all seeds 2323 84.6 38.3 47.0 42.2 + 2-NN 4387 84.6 39.5 46.7 42.8 + self training 4501 85.0 34.5 44.1 38.7

Significant improvements for 1, 5 and 10 seeds per verb Expanded ”10 seeds set“ almost as good as original ”20 seeds set“!

14 / 17

slide-35
SLIDE 35

Frame Semantics A Semi-supervised approach to role labeling Summary

Results (contd.)

TrainSet Size Frame Acc. Prec (%) Rec (%) F1 (%) 0-NN 849 83.1 35.5 42.0 38.5 1-NN 1205 83.1 36.4 43.3 39.5 2-NN 1549 82.8 38.1 44.1 40.9∗ 3-NN 1883 83.1 37.9 43.7 40.6∗ 4-NN 2204 82.3 38.0 43.9 40.7∗ 5-NN 2514 81.9 37.4 43.9 40.4∗

Significant improvements for all but 1-NN

15 / 17

slide-36
SLIDE 36

Frame Semantics A Semi-supervised approach to role labeling Summary

Example

[He]Theme stared and came [slowly]Manner [towards me]Goal. Nearest neighbors

1

[He]Theme had heard the shooting and come [rapidly]Manner [back towards the house]Goal.

2

Without answering, [she]Theme left the room and came [slowly]Manner [down the stairs]Goal.

3

[Then]Manner [he]Theme won’t come [to Salisbury]Goal.

4

Does [he]Theme always come round [in the morning]Goal [then]Manner?

16 / 17

slide-37
SLIDE 37

Frame Semantics A Semi-supervised approach to role labeling Summary

Summary

Our approach Only relies on seed corpus and dependency parser Uses a general, extensible similarity metric Shows encouraging first results Future Work Compare predicate-argument structures of different verbs Globally optimize similarity graph partition Automatically identify alternation behaviour Apply method to other languages (German) and corpora (PropBank)

17 / 17