EVALB, Improving CKY Parsing, Hw3 Evaluating parsers Hw3 - PowerPoint PPT Presentation

EVALB, Improving CKY Parsing, Hw3 Scott Farrar CLMA, University of Washington farrar@u.washington.edu EVALB, Improving CKY Parsing, Hw3 Evaluating parsers Hw3 Optimization: tips and tricks Scott Farrar 1. Size of the grammar CLMA, University of Washington 2. Limit rules added to chart 3. Sentence length farrar@u.washington.edu January 28, 2010 1/42

EVALB, Improving Today’s lecture CKY Parsing, Hw3 Scott Farrar CLMA, University of Washington farrar@u.washington.edu Evaluating parsers Evaluating parsers 1 Hw3 Optimization: tips and tricks Hw3 2 1. Size of the grammar 2. Limit rules added to chart 3. Sentence length Optimization: tips and tricks 3 1. Size of the grammar 2. Limit rules added to chart 3. Sentence length 2/42

EVALB, Improving Parsing: dev/train/test paradigm CKY Parsing, Hw3 Scott Farrar The Wall Street Journal (WSJ) section of the Penn CLMA, University of Washington far- Treebank (PTB), for all its faults, provides a very useful rar@u.washington.edu resource for comparing parser performance. Evaluating parsers Hw3 Optimization: tips and tricks 1. Size of the grammar 2. Limit rules added to chart 3. Sentence length 3/42

EVALB, Improving Parsing: dev/train/test paradigm CKY Parsing, Hw3 Scott Farrar The Wall Street Journal (WSJ) section of the Penn CLMA, University of Washington far- Treebank (PTB), for all its faults, provides a very useful rar@u.washington.edu resource for comparing parser performance. Evaluating parsers Hw3 In building a probabilistic parser, there are four kinds of Optimization: tips and tricks resources that are commonly used esp. in the ACL related 1. Size of the grammar literature: 2. Limit rules added to chart 3. Sentence length 3/42

EVALB, Improving Parsing: dev/train/test paradigm CKY Parsing, Hw3 Scott Farrar The Wall Street Journal (WSJ) section of the Penn CLMA, University of Washington far- Treebank (PTB), for all its faults, provides a very useful rar@u.washington.edu resource for comparing parser performance. Evaluating parsers Hw3 In building a probabilistic parser, there are four kinds of Optimization: tips and tricks resources that are commonly used esp. in the ACL related 1. Size of the grammar literature: 2. Limit rules added to chart 3. Sentence length 1 training data : large number of annotated sentences (sec. 2–21 of PTB has 39,830 sentences) 3/42

EVALB, Improving Parsing: dev/train/test paradigm CKY Parsing, Hw3 Scott Farrar The Wall Street Journal (WSJ) section of the Penn CLMA, University of Washington far- Treebank (PTB), for all its faults, provides a very useful rar@u.washington.edu resource for comparing parser performance. Evaluating parsers Hw3 In building a probabilistic parser, there are four kinds of Optimization: tips and tricks resources that are commonly used esp. in the ACL related 1. Size of the grammar literature: 2. Limit rules added to chart 3. Sentence length 1 training data : large number of annotated sentences (sec. 2–21 of PTB has 39,830 sentences) 2 development data : small number of annotated sentences used to “tweak” parser (sec. 22, of PTB) 3/42

EVALB, Improving Parsing: dev/train/test paradigm CKY Parsing, Hw3 Scott Farrar The Wall Street Journal (WSJ) section of the Penn CLMA, University of Washington far- Treebank (PTB), for all its faults, provides a very useful rar@u.washington.edu resource for comparing parser performance. Evaluating parsers Hw3 In building a probabilistic parser, there are four kinds of Optimization: tips and tricks resources that are commonly used esp. in the ACL related 1. Size of the grammar literature: 2. Limit rules added to chart 3. Sentence length 1 training data : large number of annotated sentences (sec. 2–21 of PTB has 39,830 sentences) 2 development data : small number of annotated sentences used to “tweak” parser (sec. 22, of PTB) 3 test data : small-medium number of un-annotated sentences used as input to parser (sec. 23 of PTB has 2416 sentences, ∼ 6% of training set) 3/42

EVALB, Improving Parsing: dev/train/test paradigm CKY Parsing, Hw3 Scott Farrar The Wall Street Journal (WSJ) section of the Penn CLMA, University of Washington far- Treebank (PTB), for all its faults, provides a very useful rar@u.washington.edu resource for comparing parser performance. Evaluating parsers Hw3 In building a probabilistic parser, there are four kinds of Optimization: tips and tricks resources that are commonly used esp. in the ACL related 1. Size of the grammar literature: 2. Limit rules added to chart 3. Sentence length 1 training data : large number of annotated sentences (sec. 2–21 of PTB has 39,830 sentences) 2 development data : small number of annotated sentences used to “tweak” parser (sec. 22, of PTB) 3 test data : small-medium number of un-annotated sentences used as input to parser (sec. 23 of PTB has 2416 sentences, ∼ 6% of training set) 4 gold standard : annotated version of test data, with no errors (hidden till parser is developed) 3/42

EVALB, Improving Recall our discussion first day of class CKY Parsing, Hw3 Scott Farrar CLMA, University of Washington farrar@u.washington.edu Definition Evaluating parsers Hw3 objective criterion : that which a parser tries to maximize. Optimization: tips and tricks 1. Size of the grammar 2. Limit rules added to chart 3. Sentence length 4/42

EVALB, Improving Recall our discussion first day of class CKY Parsing, Hw3 Scott Farrar CLMA, University of Washington farrar@u.washington.edu Definition Evaluating parsers Hw3 objective criterion : that which a parser tries to maximize. Optimization: tips and tricks 1. Size of the Definition grammar 2. Limit rules added to chart tree accuracy : (harsh) exact match criterion; 1 for perfect 3. Sentence length match, otherwise 0. 4/42

EVALB, Improving Recall our discussion first day of class CKY Parsing, Hw3 Scott Farrar CLMA, University of Washington farrar@u.washington.edu Definition Evaluating parsers Hw3 objective criterion : that which a parser tries to maximize. Optimization: tips and tricks 1. Size of the Definition grammar 2. Limit rules added to chart tree accuracy : (harsh) exact match criterion; 1 for perfect 3. Sentence length match, otherwise 0. Non-exact matches can be very useful for some tasks: named entity extraction, information retrieval, document clustering 4/42

EVALB, Improving PARSEVAL CKY Parsing, Hw3 Scott Farrar CLMA, University of Washington farrar@u.washington.edu Definition Evaluating parsers PARSEVAL measures : standard metrics for evaluation Hw3 using the component pieces of a parse; a way to give partial Optimization: tips credit. and tricks 1. Size of the grammar 2. Limit rules added evalb is an implementation of the PARSEVAL measures to chart 3. Sentence length The evalb program uses several PARSEVAL measures: labeled precision (LP) labeled recall (LR) F-measure cross bracketing 5/42

EVALB, Improving PARSEVAL: Labeled precision CKY Parsing, Hw3 Scott Farrar CLMA, University of Washington farrar@u.washington.edu Definition Evaluating parsers Labeled Precision (LP) : the average of how many brackets Hw3 in the resulting parse tree match those in the gold standard Optimization: tips and tricks (same span). Focusing in on specific problems can increase 1. Size of the grammar precision. Broadening your methodology can decrease 2. Limit rules added to chart 3. Sentence length precision. Labeled precision includes the node label as well. LP = # of correct constituents in candidate parse of s # of total constituents in candidate parse of s 6/42

EVALB, Improving PARSEVAL: Labeled recall CKY Parsing, Hw3 Scott Farrar CLMA, University of Washington farrar@u.washington.edu Definition Evaluating parsers Labeled Recall (LR) : the average of how many brackets in Hw3 the gold standard are in the resulting parse. Did you get Optimization: tips and tricks them all? Coverage . Focusing in on specific problems can 1. Size of the grammar decrease recall, because other problems may get ignored. 2. Limit rules added to chart 3. Sentence length Labeled recall includes the node label as well. LR = # of correct constituents in candidate parse of s # of correct constituents in reference parse of s 7/42

EVALB, Improving P, R errors CKY Parsing, Hw3 Scott Farrar CLMA, University Example of Washington farrar@u.washington.edu PP attachment error Evaluating parsers Hw3 (S (NP (A a)) (VP(B b) (PP (C c))) ) gold (S (NP (A a)) (VP(B b) ) (PP (C c))) Optimization: tips and tricks 1. Size of the grammar 2. Limit rules added to chart 3. Sentence length 8/42

EVALB, Improving P, R errors CKY Parsing, Hw3 Scott Farrar CLMA, University Example of Washington farrar@u.washington.edu PP attachment error Evaluating parsers Hw3 (S (NP (A a)) (VP(B b) (PP (C c))) ) gold (S (NP (A a)) (VP(B b) ) (PP (C c))) Optimization: tips and tricks 1. Size of the Constituents in gold: S (0 , 3), NP (0 , 1), VP (1 , 3), PP (2 , 3) grammar 2. Limit rules added to chart 3. Sentence length 8/42

EVALB, Improving CKY Parsing, Hw3 Evaluating parsers Hw3 - PowerPoint PPT Presentation

EVALB, Improving CKY Parsing, Hw3 Scott Farrar CLMA, University of Washington far- rar@u.washington.edu EVALB, Improving CKY Parsing, Hw3 Evaluating parsers Hw3 Optimization: tips and tricks Scott Farrar 1. Size of the grammar CLMA,

Introduction to Bottom-Up Parsing Shift-reduce parsing The LR parsing algorithm

CKY & Earley Parsing Ling 571 Deep Processing Techniques for NLP January 13, 2016 No Class

Basic Parsing Algorithms Chart Parsing Seminar Recent Advances in Parsing Technology WS

CSC 4181 Compiler Construction Parsing 1 1 Outline Top-down v.s. Bottom-up Top-down parsing

Lecture 10: Maps Part II: Core Commands Announcements HW3 due NOW! Announcements HW3 due

CKY Algorithm, Chomsky Normal Form Scott Farrar CLMA, University of Washington January 13, 2010

Neural CRF Parsing Greg Durre2 and Dan Klein UC Berkeley

CKY Parsing Ling 571 Deep Processing Techniques for NLP January 12, 2011 Roadmap

Robust Incremental Neural Semantic Graph Parsing Jan Buys and Phil Blunsom Dependency Parsing vs

Dependency Grammars and Parsing CMSC 473/673 UMBC Outline Review: PCFGs and CKY Dependency

Statistical Parsing Parsing context-free languages ar ltekin University of Tbingen

CS345a: Data Mining Jure Leskovec and Anand Rajaraman j Stanford University HW3 is out HW3

Lecture 9 Logistics HW2 due now HW3 due Monday HW3 due Monday Lab --- stay on

Lecture 16: The CKY parsing algorithm Kai-Wei Chang CS @ University of Virginia kw@kwchang.net

Lecture 9: The CKY parsing algorithm Julia Hockenmaier juliahmr@illinois.edu 3324 Siebel

The CKY algorithm part 1: Recognition Syntactic parsing 2018-01-17 Sara Stymne Department of

Informed Search Yingyu Liang yliang@cs.wisc.edu Computer Sciences Department University of

Probabilistic Models of Human Parsing Parser Architectures Informatics 2A: Lecture 23 2

Comparing different models of pulsar timing noise NewCompStar, Budapest, 2015 Gregory Ashton In

The mystery of simultaneous pulsar moding in the X-ray and radio bands Wim Hermsen 1,2

Maverick: Discovering Exceptional Facts from Knowledge Graphs 12/03/19 Paper published in Proc.

Exercise 2: Thresholds FLUKA Advanced Course Exercise 2: Thresholds Aim of the exercise: 1.

For Wednesday Read chapter 5, sections 1-4 Homework: Chapter 3, exercise 23. Then do

Microphone Array Processing : A Quick Update Iain McCowan Guillaume Lathoud, Darren Moore,

EVALB, Improving CKY Parsing, Hw3 Evaluating parsers Hw3 - PowerPoint PPT Presentation

EVALB, Improving CKY Parsing, Hw3 Scott Farrar CLMA, University of Washington far- rar@u.washington.edu EVALB, Improving CKY Parsing, Hw3 Evaluating parsers Hw3 Optimization: tips and tricks Scott Farrar 1. Size of the grammar CLMA,

Introduction to Bottom-Up Parsing Shift-reduce parsing The LR parsing algorithm

CKY &amp; Earley Parsing Ling 571 Deep Processing Techniques for NLP January 13, 2016 No Class

Basic Parsing Algorithms Chart Parsing Seminar Recent Advances in Parsing Technology WS

CSC 4181 Compiler Construction Parsing 1 1 Outline Top-down v.s. Bottom-up Top-down parsing

Lecture 10: Maps Part II: Core Commands Announcements HW3 due NOW! Announcements HW3 due

CKY Algorithm, Chomsky Normal Form Scott Farrar CLMA, University of Washington January 13, 2010

Neural CRF Parsing Greg Durre2 and Dan Klein UC Berkeley

CKY Parsing Ling 571 Deep Processing Techniques for NLP January 12, 2011 Roadmap

Robust Incremental Neural Semantic Graph Parsing Jan Buys and Phil Blunsom Dependency Parsing vs

Dependency Grammars and Parsing CMSC 473/673 UMBC Outline Review: PCFGs and CKY Dependency

Statistical Parsing Parsing context-free languages ar ltekin University of Tbingen

CS345a: Data Mining Jure Leskovec and Anand Rajaraman j Stanford University HW3 is out HW3

Lecture 9 Logistics HW2 due now HW3 due Monday HW3 due Monday Lab --- stay on

Lecture 16: The CKY parsing algorithm Kai-Wei Chang CS @ University of Virginia kw@kwchang.net

Lecture 9: The CKY parsing algorithm Julia Hockenmaier juliahmr@illinois.edu 3324 Siebel

The CKY algorithm part 1: Recognition Syntactic parsing 2018-01-17 Sara Stymne Department of

Informed Search Yingyu Liang yliang@cs.wisc.edu Computer Sciences Department University of

Probabilistic Models of Human Parsing Parser Architectures Informatics 2A: Lecture 23 2

Comparing different models of pulsar timing noise NewCompStar, Budapest, 2015 Gregory Ashton In

The mystery of simultaneous pulsar moding in the X-ray and radio bands Wim Hermsen 1,2

Maverick: Discovering Exceptional Facts from Knowledge Graphs 12/03/19 Paper published in Proc.

Exercise 2: Thresholds FLUKA Advanced Course Exercise 2: Thresholds Aim of the exercise: 1.

For Wednesday Read chapter 5, sections 1-4 Homework: Chapter 3, exercise 23. Then do

Microphone Array Processing : A Quick Update Iain McCowan Guillaume Lathoud, Darren Moore,

CKY & Earley Parsing Ling 571 Deep Processing Techniques for NLP January 13, 2016 No Class