DS-to-PS conversion Fei Xia University of Washington July 29, 2011 - PowerPoint PPT Presentation

DS-to-PS conversion Fei Xia University of Washington July 29, 2011 1

Main steps in building the treebank • DS treebank: – Tokenization – Morphological analysis, voice, etc. – POS tagging – DS • Propbank: adding Predicate-argument info • Automatic DS-to-PS conversion • Some manual check to ensure the conversion works well 2

Outline • Important concepts • Compatibility and consistency • Handling inconsistency 3

Important concepts • Linguistic phenomena • Representation type • Linguistic theory – Theoretical framework – Linguistic analyses • Annotation guidelines 4

Linguistic phenomena • They are what we want to present, including – General concepts: e.g., which words form a phrase? What types of phrases does a language have? – Types of relations between words or phrases (e.g., subjecthood, temporal modification) – Specific constructions (e.g., small clause) – Finer-grained distinctions (e.g., unergative vs. unaccusative) 5

Representation type • It is the type of mathematical object that is used to represent syntactic facts • Examples: DS, PS • Each representation type can decide what more specific representation devices to employ – Labels on the arcs of a tree – Use of empty nodes or coindexation between nodes 6

Linguistic theory • It explains how linguistic phenomena are represented in the chose representation type • It has two components: – Theoretical framework: it provides vocabulary and constraints in which linguistic theories can be formulated: e.g., GB, LFG, LTAG, HPSG – Linguistic analyses 7

Small clause 8

“Exceptional case - marking” analysis 9

“Raising -to- object” analysis 10

Annotation guidelines • Guideline designers need to choose the following – Linguistic phenomena to represent – Representation type – Theoretical framework – Linguistic analyses – Descriptions – Examples: sentences with DS or PS trees 11

“Exceptional case - marking” analysis 13

“Raising -to- object” analysis 14

Implicit vs. explicit information • Certain aspects of information has to be expressed explicitly in DS, but not PS, or vice versa – Head in DS – Syntactic categories of phrases in PS • Not explicitly providing info does not mean that corresponding concepts does not exist in DS/PS 15

Syntactic consistency • We assume each phrase in a PS has a special word, head word, which represents the property of the phrase. • A (DS, PS) pair is called consistent if there is a way to assign a head word to each internal node in the PS so that the resulting DS is identical to the given DS. 16

Consistent pairs 17

Inconsistent pairs 18

A real example 19

Consistency assumption 20

Definition of consistency • A DS and a PS are consistent iff there exists a flattened version of the PS that is identical to the DS. • If the input DS and the desired PS are consistent, the PS can be created by stretching the DS and adding syntactic labels. 21

Checking consistency • For each (dep, head) pair in the DS – find their location in the PS and their closest antecedent – add heads to the nodes on the path between the leaf nodes and the antecedent • The DS and the PS are consistent iff each node in the PS has exactly one head. 22

(Vinken, join) (will, join) (board, join) (29, join) (join) (join) (Vinken) (join) (board) (29) 23

wh-movement come come come come come come (who, come) come (come, think) 25

wh-movement come come | think come | think come come come (who, come) come (come, think) 26

wh-movement ?? think think ?? come come (who, come) come (come, think) (you, think) 27

Can DS and PS be inconsistent? • DS and PS can represent different aspects of the same overall pictures, and still be consistent. – Info provided in PropBank: e.g., empty subject, unaccusative – Info that is in PS only: e.g., traces • DS and PS should not choose “conflicting” analyses. – DS and PS are two images of the same underlying treebank, not two separate treebanks. – Ex: ba-construction in Chinese: verb, prep, or something else? – Ex: free relatives: empty nominal head • The inconsistency cases should be rare and well-motivated. 28

How to handle inconsistency? • Detect inconsistency in (DS, PS) pairs in the guidelines • Consult guideline designers to determine whether the inconsistency can be resolved by changing analyses • If not, introduce DS cons and ensure sufficient info is in DS for automatic conversion. 29

Two-stage conversion • DS to DS cons : by removing “inconsistency” between DS and PS. • DS cons to PS: by applying conversion rules 30

Case #1: long-distance movement DS prop: DS const: • Other examples: extraposition • Easily detectable due to non-projectivity • Create DS const by moving up the “moved element” and leaving a trace • which node is the “moved element”? The one that is apart from other nodes in the subtree. 31

Case #2: local scrambling Detectable by assuming canonical word order: k1 > k2 Need from PS/DS teams the canonical word order and what word order triggers movement 32

Case #3: small clause rule Detectable by dependency type k2s Need confirmation from IIIT that k2s is used only for small clause 33

Case 4: support verb Detectable by dependency type “pof” Need confirmation from IIIT that “pof” is used only for support verb 34

Conclusion • We define consistency between DS and PS • DS and PS can be inconsistent but such cases should be rare and well-motivated. • We will handle inconsistency with the two- stage approach 35

Conversion algorithm 36

Definition of conversion rule • A conversion rule is a (DS_pattern, PS_pattern) pair. • Ex: • Simplest case: – DS_pattern corresponds to only one dependency link – Decomposing DS becomes trivial – PS_pattern is a tree fragment (e.g., wh-movement) – Learning rules from (PS, DS) pairs is easy 37

Extracting rules 38

Rules extracted from the example 39

Input DS 40

Gluing PS segments together 42

c c c 43

DS-to-PS conversion Fei Xia University of Washington July 29, 2011 - PowerPoint PPT Presentation

DS-to-PS conversion Fei Xia University of Washington July 29, 2011 1 Main steps in building the treebank DS treebank: Tokenization Morphological analysis, voice, etc. POS tagging DS Propbank: adding Predicate-argument

5 CONVERSION FUNCTIONS Data type conversion Implicit data type Explicit data type conversion

Data$Conversion ADC$and$DAC (aka$A/D$&$D/A) 1 Embedded$System 2 Signal$Conversion$System

Welcome! Medicaid Operations Conversion Go-Live: August 1 , 2 0 1 5 2015 Medicaid Conversion 1

Conversion Plans Presented by: OPERS Employer Services 1 Agenda What is a Conversion Plan?

hadronic matter to quark matter Shock Induced Conversion Diffusion Induced Conversion Phys. Rev.

Closure conversion Expressing higher-order functions in first-order function languages Theory of

Digital Design Discussion: Numbers Binary to Decimal Conversion Decimal to Binary Conversion

PV1x Photovoltaic Energy Conversion Photovoltaic energy conversion PV1x Photovoltaic Energy

Spin-dependent muon to electron conversion and muon to positron conversion Yoshitaka Kuno

Nanostructured Materials for Light Conversion and Energy Storage Conversion and Energy Storage

IN MAXIMISING CONVERSION Simon Bell, Managing Director, Diligent Commerce ECOMMERCE CONVERSION

2D-to-3D conversion: market overview, conversion workflow and basic rules for good 3D Dr.-Ing.

Shamlan Albahar, EBaja Conversion, Team 19F14 11/7/2019 2

Conversion of agricultural biomass to fuels and value- -added added Conversion of agricultural

Nanofluidic Energy Conversion Nanofluidic Energy Conversion Xi CHEN Xi CHEN Ling Liu, Jianbing

Advance Power Conversion Co., Ltd. www.apcon.co.th Corporate Overview Advance Power Conversion

! e Picha Project ENJOY MEALS , EMPOWER LIVES picha from Myanmar 150,000 registered refugees

Protocol on SEA Chapter A6: Policies and legislation Resource Manual to Support Application of

Cyber@UC Meeting 48 Docker for easy tool demos! If Youre New! Join our Slack

L1 June 5, 2018 1 Lecture 1: Welcome to CSCI 1360E! CSCI 1360E: Foundations for Informatics and

Updates on Conversions in H Updates on Conversions in H Henso ABREU ,

Syntactic Transfer Using a Bilingual Lexicon Greg Durrett, Adam Pauls, and Dan Klein UC

Researching Regulations Regulations are law created by executive branch agencies utilizing

Mark Carroll HM Inspector of Health and Safety HSE Construction Division Why HSE investigates

Sambuz

Useful Links

Newsletter

Mail Us

DS-to-PS conversion Fei Xia University of Washington July 29, 2011 - PowerPoint PPT Presentation

DS-to-PS conversion Fei Xia University of Washington July 29, 2011 1 Main steps in building the treebank DS treebank: Tokenization Morphological analysis, voice, etc. POS tagging DS Propbank: adding Predicate-argument

5 CONVERSION FUNCTIONS Data type conversion Implicit data type Explicit data type conversion

Data$Conversion ADC$and$DAC (aka$A/D$&amp;$D/A) 1 Embedded$System 2 Signal$Conversion$System

Welcome! Medicaid Operations Conversion Go-Live: August 1 , 2 0 1 5 2015 Medicaid Conversion 1

Conversion Plans Presented by: OPERS Employer Services 1 Agenda What is a Conversion Plan?

hadronic matter to quark matter Shock Induced Conversion Diffusion Induced Conversion Phys. Rev.

Closure conversion Expressing higher-order functions in first-order function languages Theory of

Digital Design Discussion: Numbers Binary to Decimal Conversion Decimal to Binary Conversion

PV1x Photovoltaic Energy Conversion Photovoltaic energy conversion PV1x Photovoltaic Energy

Spin-dependent muon to electron conversion and muon to positron conversion Yoshitaka Kuno

Nanostructured Materials for Light Conversion and Energy Storage Conversion and Energy Storage

IN MAXIMISING CONVERSION Simon Bell, Managing Director, Diligent Commerce ECOMMERCE CONVERSION

2D-to-3D conversion: market overview, conversion workflow and basic rules for good 3D Dr.-Ing.

Shamlan Albahar, EBaja Conversion, Team 19F14 11/7/2019 2

Conversion of agricultural biomass to fuels and value- -added added Conversion of agricultural

Nanofluidic Energy Conversion Nanofluidic Energy Conversion Xi CHEN Xi CHEN Ling Liu, Jianbing

Advance Power Conversion Co., Ltd. www.apcon.co.th Corporate Overview Advance Power Conversion

! e Picha Project ENJOY MEALS , EMPOWER LIVES picha from Myanmar 150,000 registered refugees

Protocol on SEA Chapter A6: Policies and legislation Resource Manual to Support Application of

Cyber@UC Meeting 48 Docker for easy tool demos! If Youre New! Join our Slack

L1 June 5, 2018 1 Lecture 1: Welcome to CSCI 1360E! CSCI 1360E: Foundations for Informatics and

Updates on Conversions in H Updates on Conversions in H Henso ABREU ,

Syntactic Transfer Using a Bilingual Lexicon Greg Durrett, Adam Pauls, and Dan Klein UC

Researching Regulations Regulations are law created by executive branch agencies utilizing

Mark Carroll HM Inspector of Health and Safety HSE Construction Division Why HSE investigates

Sambuz

Useful Links

Newsletter

Mail Us

Data$Conversion ADC$and$DAC (aka$A/D$&$D/A) 1 Embedded$System 2 Signal$Conversion$System