Natural Language Processing

Lecture 19.1—3/17/2015 Martha Palmer


Solution 1: Rule Rewriting

The grammar rewriting approach attempts to capture local tree information by rewriting the grammar so that the rules capture the regularities we want.

It does this by splitting and merging the non-terminals in the grammar.

Example: split NPs into different classes…

Remember, we rewrote the grammar rules for CKY, and we rewrote the IOB tags.
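A minimal sketch of the split idea (the rule set and the probabilities are illustrative, not from the slides): the single NP category becomes NP_Subj and NP_Obj, so each copy can carry its own expansion probabilities.

```python
# Illustrative PCFG fragment, represented as (LHS, RHS) -> probability.
# Before the split, one NP category must average over all contexts.
before = {
    ("NP", ("PRP",)):     0.40,
    ("NP", ("DT", "NN")): 0.60,
}

# After splitting NP by grammatical role, each copy fits its own context:
# subject NPs are usually pronouns, object NPs usually are not
# (probabilities made up for illustration).
after = {
    ("NP_Subj", ("PRP",)):     0.91,
    ("NP_Subj", ("DT", "NN")): 0.09,
    ("NP_Obj",  ("PRP",)):     0.34,
    ("NP_Obj",  ("DT", "NN")): 0.66,
}
```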


Example: NPs

Our CFG rules for NPs don't condition on where in a tree the rule is applied. But we know that not all the rules occur with equal frequency in all contexts.

Consider NPs that involve pronouns vs. those that don’t.
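The claim is easy to check on a treebank. A minimal sketch, assuming NLTK and its 10% Penn Treebank sample (`nltk.download('treebank')`): count how often an NP directly under S (roughly, a subject) is a bare pronoun, versus an NP directly under VP (roughly, an object).

```python
from nltk.corpus import treebank
from nltk.tree import ParentedTree

counts = {"S": [0, 0], "VP": [0, 0]}   # parent category -> [pronoun NPs, all NPs]
for sent in treebank.parsed_sents():
    tree = ParentedTree.convert(sent)
    # strip function tags like -SBJ when matching categories
    for np in tree.subtrees(lambda t: t.label().split("-")[0] == "NP"):
        parent = np.parent()
        if parent is None:
            continue
        pcat = parent.label().split("-")[0]
        if pcat in counts:
            counts[pcat][1] += 1
            # a bare pronoun NP looks like (NP (PRP it))
            if len(np) == 1 and isinstance(np[0], ParentedTree) and np[0].label() == "PRP":
                counts[pcat][0] += 1

for pcat, (prp, total) in counts.items():
    print(f"NP under {pcat}: {prp}/{total} = {prp / total:.1%} pronouns")
```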


Other Examples

There are lots of other examples like this in any treebank.

Many occur at the part-of-speech level. Recall that many decisions made in annotation efforts are directed towards improving annotator agreement, not towards doing the right thing.

Often this involves conflating distinct classes into a larger class

TO, IN, Det, etc.


Rule Rewriting

Three approaches

• Use linguistic knowledge to directly rewrite rules by hand
  (the NP_Obj and NP_Subj approach)
• Automatically rewrite the rules using context to capture some of what we want
  (i.e., incorporate context into a context-free approach)
• Search through the space of rewrites for the grammar that maximizes the probability of the training set


Local Context Approach

Condition the rules based on their parent nodes

This splitting based on tree-context captures some of the linguistic intuitions


Parent Annotation

Now we have non-terminals NP^S and NP^VP that should capture the subject/object and pronoun/full-NP cases. That is, the rules are now:

NP^S → PRP
NP^VP → DT
VP^S → NP^VP
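A minimal sketch of the transformation, using NLTK's Tree class (annotating only phrasal nodes and leaving POS tags alone is one common variant; the slides don't prescribe it):

```python
from nltk.tree import Tree

def annotate_parents(tree, parent=None):
    """Rewrite each phrasal non-terminal X whose parent is P as X^P, in place."""
    if not isinstance(tree, Tree):
        return                                   # a leaf (word): nothing to do
    base = tree.label()                          # category before annotation
    if parent is not None and any(isinstance(c, Tree) for c in tree):
        tree.set_label(base + "^" + parent)      # phrasal node: annotate
    for child in tree:
        annotate_parents(child, base)

t = Tree.fromstring("(S (NP (PRP He)) (VP (VBD dumped) (NP (NNS sacks))))")
annotate_parents(t)
print(t)  # (S (NP^S (PRP He)) (VP^S (VBD dumped) (NP^VP (NNS sacks))))
```

Reading the grammar back off the rewritten treebank then yields rules like NP^S → PRP and NP^VP → ..., each with its own counts.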


Parent Annotation

Recall what's going on here. We're in effect rewriting the treebank, and thus rewriting the grammar, and changing the probabilities, since they're now derived from different counts…

And if we're splitting, what's happening to the counts?


Auto Rewriting

If this is such a good idea, we may as well apply a learning approach to it:

• Start with a grammar (perhaps a treebank grammar)
• Search through the space of splits/merges for the grammar that in some sense maximizes parsing performance on the training/development set


Auto Rewriting

Basic idea…

• Split every non-terminal into two new non-terminals across the entire grammar (X becomes X1 and X2)
• Duplicate all the rules of the grammar that use X, dividing the probability mass of the original rule almost equally
• Run EM to readjust the rule probabilities
• Perform a merge step to back off the splits that look like they don't really do any good
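A minimal sketch of the split step (the representation and the jitter scheme are illustrative; for brevity it splits every RHS symbol, where a real implementation would leave terminals intact):

```python
import itertools
import random

def split_grammar(rules):
    """rules: dict mapping (lhs, rhs_tuple) -> probability."""
    variants = lambda sym: [sym + "1", sym + "2"]
    new_rules = {}
    for (lhs, rhs), prob in rules.items():
        for new_lhs in variants(lhs):
            for new_rhs in itertools.product(*(variants(s) for s in rhs)):
                # divide the original mass over the 2^len(rhs) RHS variants,
                # with a tiny jitter so EM can break the X1/X2 symmetry
                p = prob / (2 ** len(rhs))
                new_rules[(new_lhs, new_rhs)] = p * random.uniform(0.99, 1.01)
    return new_rules

pcfg = {("NP", ("DT", "NN")): 0.6, ("NP", ("PRP",)): 0.4}
for rule, p in sorted(split_grammar(pcfg).items()):
    print(rule, round(p, 4))
```

EM then reassigns mass between the X1/X2 copies, and the merge step undoes splits whose two copies stayed (nearly) identical.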


Solution 2: Lexicalized Grammars

• Lexicalize the grammars with heads
• Compute the rule probabilities on these lexicalized rules
• Run Prob CKY as before
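A minimal sketch of the lexicalization step (the head table is a crude stand-in for real head-finding rules such as Collins's, and the full example sentence is an assumption based on the textbook's "dumped" example): propagate each constituent's head word up the tree and attach it to the label.

```python
from nltk.tree import Tree

# hypothetical, highly simplified head rules: which child categories can
# head each phrase type, in priority order
HEAD_CANDIDATES = {"S": ("VP",), "VP": ("VBD", "VB"),
                   "NP": ("NNS", "NN", "PRP"), "PP": ("IN",)}

def lexicalize(tree):
    """Return the head word of `tree`, rewriting labels to CAT(head) in place."""
    if isinstance(tree[0], str):                    # preterminal, e.g. (VBD dumped)
        head = tree[0]
    else:
        child_heads = [(c.label(), lexicalize(c)) for c in tree]
        wanted = HEAD_CANDIDATES.get(tree.label(), ())
        head = next((h for cat in wanted for c, h in child_heads if c == cat),
                    child_heads[0][1])              # default: first child's head
    tree.set_label(f"{tree.label()}({head})")
    return head

t = Tree.fromstring("(S (NP (NNS workers)) (VP (VBD dumped) (NP (NNS sacks)) "
                    "(PP (IN into) (NP (DT a) (NN bin)))))")
lexicalize(t)
print(t)  # the VP node comes out as VP(dumped), with children NP(sacks) PP(into)
```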


Dumped Example

[Figure: a lexicalized parse tree whose VP expands as VP(dumped) → V(dumped) NP(sacks) PP(into)]


How?

We used to have

VP → V NP PP, with probability P(rule | VP)

That's the count of this rule divided by the number of VPs in a treebank.

Now we have fully lexicalized rules...

VP(dumped) → V(dumped) NP(sacks) PP(into)

with probability

P(r | VP ^ dumped is the verb ^ sacks is the head of the NP ^ into is the head of the PP)

To get the counts for that…
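A minimal sketch of that counting for the unlexicalized case (NLTK's treebank sample again; `Tree.productions()` reads the rules off each parse):

```python
from collections import Counter
from nltk.corpus import treebank

rule_counts, lhs_counts = Counter(), Counter()
for sent in treebank.parsed_sents():
    for prod in sent.productions():
        rule_counts[prod] += 1          # count of this exact rule
        lhs_counts[prod.lhs()] += 1     # count of its left-hand side

def rule_prob(prod):
    """Maximum-likelihood estimate P(rule | LHS)."""
    return rule_counts[prod] / lhs_counts[prod.lhs()]
```

The same formula applies to the fully lexicalized rules, but almost every such rule occurs once or never, so the estimates are useless. Hence the next slide.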


Declare Independence

• When stuck, exploit independence and collect the statistics you can…
• There are a large number of ways to do this...
• Let's consider one generative story: given a rule we'll
  1. Generate the head
  2. Generate the stuff to the left of the head
  3. Generate the stuff to the right of the head


Example

So the probability of a lexicalized rule such as

VP(dumped) → V(dumped) NP(sacks) PP(into)

is the product of the probabilities of:

• "dumped" as the head
• Nothing to its left
• "sacks" as the head of the first right-side constituent
• "into" as the head of the next right-side element
• Nothing after that


Example

That is, the rule probability for VP(dumped) → V(dumped) NP(sacks) PP(into) is estimated as:
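The equation appears on the original slide as an image; reconstructed here from the factors just listed, in the $P_H$/$P_L$/$P_R$ (head, left-dependent, right-dependent) notation Jurafsky and Martin use for this model:

$$
\begin{aligned}
P\big(\text{VP(dumped)} \rightarrow \text{V(dumped) NP(sacks) PP(into)}\big) \approx{} & P_H(\text{V} \mid \text{VP}, \text{dumped}) \\
\times{} & P_L(\text{STOP} \mid \text{VP}, \text{V}, \text{dumped}) \\
\times{} & P_R(\text{NP(sacks)} \mid \text{VP}, \text{V}, \text{dumped}) \\
\times{} & P_R(\text{PP(into)} \mid \text{VP}, \text{V}, \text{dumped}) \\
\times{} & P_R(\text{STOP} \mid \text{VP}, \text{V}, \text{dumped})
\end{aligned}
$$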


Framework

That’s just one simple model

Collins Model 1

You can imagine a gazillion other assumptions that might lead to better models. You just have to make sure that you can get the counts you need, and that the model can be used/exploited efficiently during decoding.


Features

C for Case (subjective/objective)
  She visited her.

P for Person agreement (1st, 2nd, 3rd)
  I like him. You like him. He likes him.

N for Number agreement (subject/verb)
  He likes him. They like him.

G for Gender agreement
  English: reflexive pronouns (He washed himself). Romance languages: determiner/noun agreement.

T for Tense
  Auxiliaries, sentential complements, etc.; *will finished is bad.
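A minimal sketch (all names hypothetical) of how such features support agreement checking: each word carries a small feature dictionary, and two words agree when their shared features don't conflict.

```python
def agrees(feats_a, feats_b):
    """True if the two feature sets are compatible on every shared key."""
    return all(feats_a[k] == feats_b[k] for k in feats_a.keys() & feats_b.keys())

he    = {"person": 3, "number": "sg", "gender": "masc", "case": "subj"}
likes = {"person": 3, "number": "sg"}   # the -s form demands 3rd person singular
like  = {"number": "pl"}                # crude: treat the bare form as plural here

print(agrees(he, likes))   # True:  "He likes him"
print(agrees(he, like))    # False: *"He like him"
```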