VerbNet Overview Karin Kipper Schuler kipper@verbs.colorado.edu - - PowerPoint PPT Presentation

verbnet overview
SMART_READER_LITE
LIVE PREVIEW

VerbNet Overview Karin Kipper Schuler kipper@verbs.colorado.edu - - PowerPoint PPT Presentation

VerbNet Overview Karin Kipper Schuler kipper@verbs.colorado.edu May 31st, 2009 Overview Real world applications need resources with rich syntactic and se- mantic representations. Most existing broad-coverage resources provide only a


slide-1
SLIDE 1

VerbNet Overview

Karin Kipper Schuler

kipper@verbs.colorado.edu May 31st, 2009

slide-2
SLIDE 2

Overview Real world applications need resources with rich syntactic and se- mantic representations.

  • Most existing broad-coverage resources provide only a shallow

semantic representation

  • Much richer representations are needed
  • The verbs are key elements in providing this

1

slide-3
SLIDE 3

Overview Natural language applications are currently limited to specific do- mains with hand-crafted lexicons.

  • not available to the whole community
  • expensive and time-consuming to build

Most available broad-coverage resources either focus on syntax or on semantics and do not provide a clear association between the two.

2

slide-4
SLIDE 4

Semantic representation must be tied to the syntactic information:

  • Differences between syntactic frames can help:

Eng: John left the room. (exited) Port: John saiu do quarto. Eng: John left the book on the table. (left) Port: John deixou o livro na mesa.

  • But syntax alone is not sufficient:

Eng: John left the room. (exited) Port: John saiu do quarto. Eng: John left a fortune. (gave away) Port: John deixou uma fortuna.

3

slide-5
SLIDE 5

Overview Predicate argument relations are of interest for NLP, providing gen- eralizations over data:

  • Ronaldo scored a goal for the Brazilian team
  • A goal was scored by Ronaldo for the Brazilian team
  • Ronaldo wanted to score a goal for the Brazilian team

4

slide-6
SLIDE 6

VerbNet connects semantics to syntax Created with these ideas in mind

  • computational verb lexicon
  • broad-coverage and domain-independent
  • clear association between syntax and semantics

– lexical semantic information (pred argument structure) – syntactic frames and selectional restrictions – semantic predicates – links to WordNet senses, FrameNet frames, PropBank framesets

  • refinement of Levin classes to construct the entries

5

slide-7
SLIDE 7

Outline

  • Overview
  • Building blocks for VerbNet
  • VerbNet
  • Parameterized Action Representation (PARs)
  • Evaluation
  • Mappings to other Resources
  • Automatic techniques to extend coverage

6

slide-8
SLIDE 8

Levin classes

Levin (1993)

  • Verbs are grouped into classes
  • Each class is characterized by a set of syntactic patterns

John broke the jar / The jar broke / Jars break easily John cut the bread / *The bread cut / Bread cuts easily John hit the wall / *The wall hit / *Walls hit easily

  • Hypothesis: syntax reflects implicit semantic components

contact, directed motion, exertion of force, change of state

7

slide-9
SLIDE 9

Example Levin class

break

Break Levin class - Change-of-state

crack crash snap splinter split chip tear crush fracture smash shatter rip

8

slide-10
SLIDE 10

Levin classes

  • classes are not semantically homogeneous

{braid, clip, file, powder, etc..}

  • classes are not completely syntactically homogeneous
  • verbs can be in multiple class listings
  • alternation contradictions

– Carry verbs disallow conative but include {push, pull, shove, etc}

also in Push/pull class which does take conative

9

slide-11
SLIDE 11

Event Structure Verbs refer to events which can be decomposed into a tripartite structure in a manner similar to Moens and Steedman (1988)

consequent preparatory process state culmination

10

slide-12
SLIDE 12

Verb classes and event structure

consequent state preparatory process (activity)

(bounce, jog, jump, hop, run) (break, chip, crack, tear) (batter, kick, hit, slap)

RUN class BREAK class HIT class

culmination

11

slide-13
SLIDE 13

Outline

  • Overview
  • Building blocks for VerbNet
  • VerbNet
  • Parameterized Action Representation (PARs)
  • Evaluation
  • Mappings to other Resources
  • Automatic techniques to extend coverage

12

slide-14
SLIDE 14

Characteristics of verbs: Verbs have complex meaning: key components can be made explicit

  • have participants
  • space
  • verbs represent processes/events/states which are located in time
  • can be subdivided into sub-parts to capture during, end, results

13

slide-15
SLIDE 15

Examples of verbs and their components

  • RUN

– express iterative activity, no culmination, or consequent – one participant – motion of participant is a semantic component – path is optional

  • HIT

– express contact between two objects – happens momentarily, has a well defined end, has no consequent – has three participants

  • BREAK

– express a change of state

14

slide-16
SLIDE 16

VerbNet class entries

Kipper, Dang and Palmer, 2000

  • verb classes based on Levin’s classification
  • classes defined by syntactic properties
  • capture generalizations about verb behavior
  • for each verb class

– thematic roles – syntactic frames – selectional restrictions for the arguments in each frame – each frame includes semantic predicates with a time function

15

slide-17
SLIDE 17

Hit class

Class hit-18.1 Parent — Members bang (1,3), bash(1), batter(1,2,3), beat(2,5), ..., hit(2,4,7,10), kick(3), ... Themroles Agent Patient Instrument Selrestr Agent[+int control] Patient[+concrete] Instrument[+concrete] Frames Name Syntax Semantic Predicates Transitive Agent V Patient “Paula hit the ball” cause(Agent, E) ∧ manner(during(E),directedmotion,Agent) ∧ !contact(during(E), Agent, Patient) ∧ manner(end(E),forceful, Agent) ∧ contact(end(E), Agent, Patient) Transitive with Instrument Agent V Patient Prep(with) Instrument “Paula hit the ball with a stick” cause(Agent, E) ∧ manner(during(E),directedmotion,Agent) ∧ !contact(during(E),Instrument,Patient) ∧ manner(end(E),forceful, Agent) ∧ contact(end(E), Instrument,Patient)

16

slide-18
SLIDE 18

Thematic roles

  • use roles to provide as much information as possible for classes
  • thematic roles vs. generic arguments
  • specification of roles supplies part of the semantic description for

the class

Build-26.1 “The artist (Agent) carved a toy (Product) out of a piece of wood (Material)”

  • roles also help differentiate classes

Admire-31.2: Experiencer and Theme Hit-18.1: Agent and Patient

17

slide-19
SLIDE 19

Thematic roles

  • small set of roles (Agent, Theme, Location, ...)
  • roles used across classes
  • certain roles need specific characteristics to be present

(Patient → undergoes change)

  • thematic roles used are generally accepted

18

slide-20
SLIDE 20

Selectional Restrictions

  • semantic restrictions on the thematic roles

– based on EuroWordNet concepts (Vossen 2003) – associate VerbNet with an ontology publicly available, widely used – IS-A hierarchy with multiple inheritance and no cycles

  • syntactic restrictions for the syntactic frames

(e.g., sentential, plural)

19

slide-21
SLIDE 21

Selectional Restrictions

SelRestr concrete int-control force machine vehicle natural animate human animal body-part plant phys-obj comestible artifact machine tool garment solid rigid non-rigid shape pointed elongated substance abstract idea sound communication location regionPP place

  • bject

time state scalar currency

  • rganization

20

slide-22
SLIDE 22

Syntactic Frames Describe possible surface realizations for verbs in a class

  • constructions such as transitive, intransitive, resultative,

and a large set of Levin’s alternations

  • Examples:
  • 1. Agent V Patient

(John hit the ball)

  • 2. Agent V at Patient

(John hit at the window)

  • 3. Agent V Patient[+plural] together

(John hit the sticks together)

21

slide-23
SLIDE 23

Semantic Predicates Semantics of a syntactic frame captured through a conjunction of semantic predicates

  • each semantic predicate includes a time function showing at what

stage in the event the predicate holds

start(E), during(E), end(E), result(E)

  • similar to Moens and Steedman’s event decomposition
  • choice influenced because it was suitable for PARs (pre-conditions,

post-conditions, and results)

  • semantic predicates can be:

General (e.g.,motion and cause), Specific (e.g.,suffocate), or Variable (Prep)

22

slide-24
SLIDE 24

Semantic Predicates

  • relations between verbs (or verb classes) captured implicitly by

the predicates for the class

  • aspect captured by the temporal function present in the predi-

cates: – activities (e.g., run) have during(E) – bounded activities (e.g., hit) have during(E) and end(E) – accomplishments (e.g., break) have result(E)

23

slide-25
SLIDE 25

Hit class

Class hit-18.1 Parent — Members bang (1,3), bash(1), batter(1,2,3), beat(2,5), ..., hit(2,4,7,10), kick(3), ... Themroles Agent Patient Instrument Selrestr Agent[+int control] Patient[+concrete] Instrument[+concrete] Frames Name Syntax Semantic Predicates Transitive Agent V Patient “Paula hit the ball” cause(Agent, E) ∧ manner(during(E),directedmotion,Agent) ∧ !contact(during(E), Agent, Patient) ∧ manner(end(E),forceful, Agent) ∧ contact(end(E), Agent, Patient) Transitive with Instrument Agent V Patient Prep(with) Instrument “Paula hit the ball with a stick” cause(Agent, E) ∧ manner(during(E),directedmotion,Agent) ∧ !contact(during(E),Instrument,Patient) ∧ manner(end(E),forceful, Agent) ∧ contact(end(E), Instrument,Patient)

24

slide-26
SLIDE 26

Hierarchical organization Refinement of Levin classes

  • verb classes are hierarchically organized

– the original set of Levin classes has been further subdivided into additional

subclasses which are more syntactic and semantically coherent

– members have common semantic predicates, thematic roles, syntactic frames – a particular verb or subclass inherit from parent and may add more infor-

mation

25

slide-27
SLIDE 27

Transfer of Message

Class Transfer mesg-37.1 Parent — Members cite(1,3,4), demonstrate(1), ... Themroles Agent Topic Recipient Selrestr Agent[+animate] Topic[+message] Recipient[+animate] Frames Name Syntax Semantic Predicates Transitive Agent V Topic “Wanda cited the author” transfer info(during(E),Agent,?,Topic)∧ cause(Agent,E) Dative (to- PP variant) Agent V Topic Prep(to) Recipient “Wanda cited the author to her students” transfer info(during(E),Agent,Recipient,Topic) ∧ cause(Agent,E) Class Transfer mesg-37.1-1 Parent Transfer mesg-37.1 Members quote(1), read(3) Themroles Selrestr Frames Name Syntax Semantic Predicates Dative (di- transitive variant) Agent V Recipient Topic “Wanda quoted her students the author” transfer info(during(E),Agent,Recipient,Topic) ∧ cause(Agent,E)

26

slide-28
SLIDE 28

Transfer of Message – level 2

Class Transfer mesg-37.1-1 Parent Transfer mesg-37.1 Members quote(1), read(3) Themroles Agent Topic Recipient Selrestr Agent[+animate] Topic[+message] Recipient[+animate] Frames Name Syntax Semantic Predicates Transitive Agent V Topic transfer info(during(E),Agent,?,Topic)∧ cause(Agent,E) Dative (to- PP variant) Agent V Topic Prep(to) Recipient transfer info(during(E),Agent,Recipient,Topic) ∧ cause(Agent,E) Dative (di- transitive variant) Agent V Recipient Topic transfer info(during(E),Agent,Recipient,Topic) ∧ cause(Agent,E)

27

slide-29
SLIDE 29

Related Work

  • WordNet (Miller 1985; Fellbaum 1998)

– predicate-argument structure

  • FrameNet (Baker et al. 1998)

– verb groupings – frame elements vs. thematic roles

  • LCS database (Dorr 2001)

– classes based on Levin – syntactic frames not explicit

28

slide-30
SLIDE 30

Related Work

  • Xtag (Xtag Research Group 2001) and ComLex (Comlex 1994)

– provide detailed syntactic description

29

slide-31
SLIDE 31

Outline

  • Overview
  • Building blocks for VerbNet
  • VerbNet
  • Parameterized Action Representation (PARs)
  • Evaluation
  • Mappings to other Resources
  • Automatic techniques to extend coverage

30

slide-32
SLIDE 32

Parameterized Action Representation (PAR)

Badler et al. (1999)

Interface to agents in an animation system. Needs a semantically precise representation.

  • Representation of actions

– instructions to a virtual human – used in a simulated 3D environment

  • Represented as

– parameterized structures – hierarchical organization

31

slide-33
SLIDE 33

PARs include:

  • action participants (agents/objects)
  • restrictions on the types of objects
  • kinematic and dynamic properties (path, manner, ..., force)
  • stages of the action (like Moens and Steedman event decomposition)

– preparatory specifications – termination conditions – post-assertions

32

slide-34
SLIDE 34

Example of the PAR inheritance hierarchy

contact/(par:contact) hit/(manner:forcefully) kick/(OBJ2:foot) hammer/(OBJ2:hammer) touch/(manner:gently) A lexical/semantic hierarchy for actions of contact

33

slide-35
SLIDE 35

Instantiated PAR: John hit the ball with a stick

                                            

activity :

  • ACTION
  • participants :

agent : John

  • bjects : ball, stick
  • preparatory spec : [get control of(John, stick)]

termination cond : [contact(ball, stick)] post assertions : duration : [momentarily] path, motion, force manner :

  • forcefully

                                            34

slide-36
SLIDE 36

PARs and VerbNet PARs for animating agents require precise semantics associated with syntax provided by VerbNet.

  • participants of an action are the arguments of a verb
  • selectional restrictions on the arguments
  • event structure (during, end, result)
  • semantic components expressed by predicates

35

slide-37
SLIDE 37

Aggregates

(Allbeck et al. 2002)

  • VerbNet also used to describe actions of aggregate entities
  • actions decomposed by features based on Laban Movement

Analysis (EMOTE system)

  • used in a playground scenario with a teacher and 8 kids
  • Examples of aggregate actions:

Aggregate actions Gathering assemble congregate Dispersing dissipate scatter Obj refer surround encircle Formation Milling

36

slide-38
SLIDE 38

Aggregates

  • PAR entry for assemble

(as in “children assemble in the playground”)

assemble / ARG0-v / is_concrete(ARG0) is_plural(ARG0) !together_group(start(e),ARG0) transl_motion(during(e),ARG0) shape_enclosing(during(e),ARG0) effort_direct(during(e),ARG0) together_group(end(e),ARG0)

  • VerbNet entry

Class Herd-47.5.2 Parent — Members accumulate aggregate amass assemble cluster collect congregate convene flock gather group herd huddle mass Themroles Theme[+concrete +plural] Frames Name Example Syntax Semantics Intransitive The kids are assembling Theme V !together(start(E),physical,Theme) together(end(E),physical,Theme)

37

slide-39
SLIDE 39

Outline

  • Overview
  • Building blocks for VerbNet
  • VerbNet
  • Parameterized Action Representation (PARs)
  • Evaluation
  • Mappings to other Resources
  • Automatic techniques to extend coverage

38

slide-40
SLIDE 40

Syntactic coverage against PropBank

Kipper, Snyder, Palmer 2004

  • used Penn TreeBank data, with PropBank annotation to verify

coverage

  • 79k instances, 2,100 verbs, 185 classes
  • mapping between PB framesets and VN verb classes
  • mapping between PB argument labels and VN thematic roles
  • goal was to establish a baseline

39

slide-41
SLIDE 41

Syntactic coverage against PropBank

arg0 (giver) arg1 (thing given) arg2 (benefactive) Agent Recipient Theme

"give"

leave.02 future_having−13.3 keep−15.2 fulfill−13.4.1 leave.01

"move away from"

arg2 (attribute) arg1 (place left)

escape−51.1

Theme Source arg0 (entity leaving)

leave−51.2

40

slide-42
SLIDE 42

Syntactic coverage against PropBank

Example: verb LEAVE wsj/05/wsj 0568.mrg 12 4: The tax payments will leave Unisys with $ 225 million in loss carry-forwards that will cut tax payments in future quarters . [ARG0 The tax payments] [rel leave] [ARG2 Unisys] [ARG1 with 225 million] leave-51.2: Theme V NP Prep(with) Source future have-13.3: Agent V Recipient Prep(with) Theme

41

slide-43
SLIDE 43

Syntactic coverage against PropBank

  • VerbNet provided exact matches of syntactic frames

in 84.67% of the instances

(86.30% using a more relaxed criteria of matching, such as ignoring preposition mismatches, or allowing ARGMs to match against specific roles)

Why not a 100%?

  • few problems in the mappings
  • different sense coverage in PropBank and VerbNet
  • different granularity of the arguments

Calibratable cos-45.6 The price of oil soared ⇐ ⇒ Oil soared in price

42

slide-44
SLIDE 44

Outline

  • Overview
  • Building blocks for VerbNet
  • VerbNet
  • Parameterized Action Representation (PARs)
  • Evaluation
  • Mappings to other Resources
  • Automatic techniques to extend coverage

43

slide-45
SLIDE 45

Linking resources Many applications would benefit by merging the results of different lexical resources and annotation projects:

  • compatibility between resources
  • inherent theoretical differences
  • the different levels of representation may be more than the sum of

its parts, since inferences may be drawn from how the components interact Semlink: develop computationally explicit connections between FrameNet, PropBank, and VerbNet.

44

slide-46
SLIDE 46

Mappings between VerbNet and WordNet Each verb in VerbNet is mapped to its corresponding synset(s) in WordNet, if available.

escape−51.1 leave−51.2 fulfill−13.4.1 keep−15.2 wn5 wn9

motion, direction motion, direction, change location has_possession, transfer be Prep

future_having−13.3

has_possession, transfer, future_having

wn10 wn3 wn2 wn1 wn14 LEAVE

45

slide-47
SLIDE 47

PropBank/VerbNet/WordNet

leave.01 leave.02 escape−51.1 leave−51.2 fulfill−13.4.1 keep−15.2 wn5 wn9

motion, direction motion, direction, change location has_possession, transfer be Prep

future_having−13.3

has_possession, transfer, future_having

wn10 wn3 wn2 wn1 wn14

give move away from

46

slide-48
SLIDE 48

Mappings between VerbNet and FrameNet Two steps:

  • 1. mappings between VerbNet verb senses to FrameNet;

VN class VN member FN frame 9.1 arrange (diff. sense) 9.1 immerse Placing 9.1 lodge Placing 9.1 mount Placing 9.1 sling

  • 2. mappings from VerbNet thematic roles to the FrameNet frame

elements

VNclass 9.1 FNclass “Placing” VN role FN frame element Agent Agent Agent Cause Destination Goal Theme Theme

47

slide-49
SLIDE 49

Assigning Xtag trees to VerbNet

Ryant and Kipper (2004)

  • VerbNet only describes declarative frames
  • Xtag provides detailed account of syntactic transformations
  • mapping VerbNet syntactic frames to Xtag trees extends VerbNet

syntactic coverage while providing semantics for the Xtag trees

  • 104 VerbNet syntactic frames (out of 131) map to 19 Xtag tree

families

48

slide-50
SLIDE 50

Mappings These resources are:

  • Complementary
  • Redundancy is harmless, may even be useful
  • PropBank provides great training data
  • VerbNet provides clear links between syntax and semantics
  • FrameNet provides rich semantics
  • Together they give us the most comprehensive coverage

49

slide-51
SLIDE 51

Outline

  • Overview
  • Building blocks for VerbNet
  • VerbNet
  • Parameterized Action Representation (PARs)
  • Evaluation
  • Mappings to other Resources
  • Automatic techniques to extend coverage

50

slide-52
SLIDE 52

Extending VerbNet’s members – LCS

Dorr (2001)

Addition of members from the LCS database

  • inspected 1,266 verbs present in the LCS database and not in

VerbNet

  • 429 (426 lemmas) were initially integrated into our lexicon
  • verbs had been acquired automatically, data noisy

51

slide-53
SLIDE 53

Automatic acquisition of verbs – Clusters

Kingsbury and Kipper (2003); Kingsbury (2004)

  • used PropBank subcategorization frames (e.g., Arg0.V.Arg1)
  • 121 clusters from the EM algorithm (0 to 45 elements each)
  • 1,278 verbs which occurred at least 10 times in the PropBank

annotation were used as data

  • 484 verbs were already in VerbNet class

(824 potential candidates for inclusion in VerbNet classes)

52

slide-54
SLIDE 54

Automatic acquisition of verbs – Clusters Results:

  • 5.6% of the candidates were included in VerbNet
  • large clusters were not predictive of any classes
  • small clusters did not offer many candidates
  • 12.6% if using only “good clusters”
  • need better way to filter the clusters
  • impoverished features
  • senses predicted in VerbNet and PropBank are different

53

slide-55
SLIDE 55

Extending VerbNet with WordNet

(Loper, Kipper and Palmer)

  • use WordNet as a source of candidates for inclusion in VerbNet
  • use syntactic contexts of these verbs in Propbank
  • candidates are filtered based on the grammatical patterns and

the relationship between those patterns and known members of VerbNet classes

  • 707 lemmas suggested, 849 senses
  • 208 lemmas, 255 senses integrated into the suggested classes
  • experiment done on version 1.5 of VerbNet

54

slide-56
SLIDE 56

Extending VerbNet with WordNet Problems due to different goals of VerbNet and WordNet:

  • verbs suggested with a preposition (e.g., chase after)
  • idiomatic expressions (e.g., shoot a line)
  • morphological variants (criminalise vs. criminalize)
  • archaic usages (e.g., coggle, aggroup)

55

slide-57
SLIDE 57

Conclusion To achieve the detailed level of representation required for natural language applications we need resources capable of providing a rich semantic representation tied to syntax. VerbNet:

  • broad-coverage, general purpose natural language resource
  • focuses on both syntax and semantics and provides a clear asso-

ciation between the two, necessary for characterizing verbs

56

slide-58
SLIDE 58

Thank you!

57