Capitalization Cues Improve Dpendency Grammar Induction Valentin I. - PowerPoint PPT Presentation

Capitalization Cues Improve Dpendency Grammar Induction Valentin I. Spitkovsky with Daniel Jurafsky (Stanford University) and Hiyan Alshawi (Google Inc.) Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 1 / 10

Problem Unsupervised Learning Problem: Grammar Induction is Hard Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 2 / 10

Problem Unsupervised Learning Problem: Grammar Induction is Hard Major challenges: Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 2 / 10

Problem Unsupervised Learning Problem: Grammar Induction is Hard Major challenges: non-convex objectives Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 2 / 10

Problem Unsupervised Learning Problem: Grammar Induction is Hard Major challenges: non-convex objectives poor correlations between likelihood and accuracy Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 2 / 10

Problem Unsupervised Learning Problem: Grammar Induction is Hard Major challenges: non-convex objectives (Gimpel and Smith, 2012) poor correlations between likelihood and accuracy Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 2 / 10

Problem Unsupervised Learning Problem: Grammar Induction is Hard Major challenges: non-convex objectives (Gimpel and Smith, 2012) poor correlations between likelihood and accuracy (Pereira and Schabes, 1992; Elworthy, 1994; Merialdo, 1994; Liang and Klein, 2008; Spitkovsky et al., 2009–2011) Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 2 / 10

Problem Unsupervised Learning Problem: Grammar Induction is Hard Major challenges: non-convex objectives (Gimpel and Smith, 2012) poor correlations between likelihood and accuracy (Pereira and Schabes, 1992; Elworthy, 1994; Merialdo, 1994; Liang and Klein, 2008; Spitkovsky et al., 2009–2011) ◮ e.g., optimizers run away from supervised MLE solutions Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 2 / 10

Problem Unsupervised Learning Problem: Grammar Induction is Hard Major challenges: non-convex objectives (Gimpel and Smith, 2012) poor correlations between likelihood and accuracy (Pereira and Schabes, 1992; Elworthy, 1994; Merialdo, 1994; Liang and Klein, 2008; Spitkovsky et al., 2009–2011) ◮ e.g., optimizers run away from supervised MLE solutions (to the tune of 20 points of accuracy) Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 2 / 10

Problem Unsupervised Learning Problem: Grammar Induction is Hard Major challenges: non-convex objectives (Gimpel and Smith, 2012) poor correlations between likelihood and accuracy (Pereira and Schabes, 1992; Elworthy, 1994; Merialdo, 1994; Liang and Klein, 2008; Spitkovsky et al., 2009–2011) ◮ e.g., optimizers run away from supervised MLE solutions (to the tune of 20 points of accuracy) flaws in evaluation (Schwartz et al., 2011) Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 2 / 10

Problem Unsupervised Learning Problem: Grammar Induction is Hard Major challenges: non-convex objectives (Gimpel and Smith, 2012) poor correlations between likelihood and accuracy (Pereira and Schabes, 1992; Elworthy, 1994; Merialdo, 1994; Liang and Klein, 2008; Spitkovsky et al., 2009–2011) ◮ e.g., optimizers run away from supervised MLE solutions (to the tune of 20 points of accuracy) flaws in evaluation (Schwartz et al., 2011) Partial solutions: Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 2 / 10

Problem Unsupervised Learning Problem: Grammar Induction is Hard Major challenges: non-convex objectives (Gimpel and Smith, 2012) poor correlations between likelihood and accuracy (Pereira and Schabes, 1992; Elworthy, 1994; Merialdo, 1994; Liang and Klein, 2008; Spitkovsky et al., 2009–2011) ◮ e.g., optimizers run away from supervised MLE solutions (to the tune of 20 points of accuracy) flaws in evaluation (Schwartz et al., 2011) Partial solutions: train on more / better data (Mareˇ cek and Zabokrtsk´ y, 2012) Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 2 / 10

Problem Unsupervised Learning Problem: Grammar Induction is Hard Major challenges: non-convex objectives (Gimpel and Smith, 2012) poor correlations between likelihood and accuracy (Pereira and Schabes, 1992; Elworthy, 1994; Merialdo, 1994; Liang and Klein, 2008; Spitkovsky et al., 2009–2011) ◮ e.g., optimizers run away from supervised MLE solutions (to the tune of 20 points of accuracy) flaws in evaluation (Schwartz et al., 2011) Partial solutions: train on more / better data (Mareˇ cek and Zabokrtsk´ y, 2012) test many data sets / languages (fight noise with CLT) Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 2 / 10

Problem Unsupervised Learning Problem: Grammar Induction is Hard Major challenges: non-convex objectives (Gimpel and Smith, 2012) poor correlations between likelihood and accuracy (Pereira and Schabes, 1992; Elworthy, 1994; Merialdo, 1994; Liang and Klein, 2008; Spitkovsky et al., 2009–2011) ◮ e.g., optimizers run away from supervised MLE solutions (to the tune of 20 points of accuracy) flaws in evaluation (Schwartz et al., 2011) Partial solutions: train on more / better data (Mareˇ cek and Zabokrtsk´ y, 2012) test many data sets / languages (fight noise with CLT) employ less ad-hoc initializers (“eat your own dog food”) Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 2 / 10

Problem Unsupervised Learning Problem: Grammar Induction is Hard Major challenges: non-convex objectives (Gimpel and Smith, 2012) poor correlations between likelihood and accuracy (Pereira and Schabes, 1992; Elworthy, 1994; Merialdo, 1994; Liang and Klein, 2008; Spitkovsky et al., 2009–2011) ◮ e.g., optimizers run away from supervised MLE solutions (to the tune of 20 points of accuracy) flaws in evaluation (Schwartz et al., 2011) Partial solutions: train on more / better data (Mareˇ cek and Zabokrtsk´ y, 2012) test many data sets / languages (fight noise with CLT) employ less ad-hoc initializers (“eat your own dog food”) constrain search space (structure is underdetermined) Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 2 / 10

Idea New Cue Idea: Use Capitalization as Parsing Cues Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 3 / 10

Idea New Cue Idea: Use Capitalization as Parsing Cues Partial bracketing constraints: (Pereira and Schabes, 1992) Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 3 / 10

Idea New Cue Idea: Use Capitalization as Parsing Cues Partial bracketing constraints: (Pereira and Schabes, 1992) semantic annotations (Naseem and Barzilay, 2011) punctuation marks (Ponvert et al., 2010) web markup (Spitkovsky et al., 2010) Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 3 / 10

Idea New Cue Idea: Use Capitalization as Parsing Cues Partial bracketing constraints: (Pereira and Schabes, 1992) semantic annotations (Naseem and Barzilay, 2011) punctuation marks (Ponvert et al., 2010) web markup (Spitkovsky et al., 2010) ... defined over raw text (no POS tags). Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 3 / 10

Example Very WSJ Example: (no punctuation, etc. cues) Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 4 / 10

Example Very WSJ Example: (no punctuation, etc. cues) [ NP Jay Stevens ] of [ NP Dean Witter ] actually cut his per-share earnings estimate to [ NP $9 ] from [ NP $9.50 ] for [ NP 1989 ] and to [ NP $9.50 ] from [ NP $10.35 ] in [ NP 1990 ] because he decided sales would be even weaker than he had expected. Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 4 / 10

Example Still WSJ Example: (less WSJ-ish) Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 5 / 10

Example Still WSJ Example: (less WSJ-ish) [ NP Jurors ] in [ NP U.S. District Court ] in [ NP Miami ] cleared [ NP Harold Hershhenson ] , a former executive vice president; [ NP John Pagones ] , a former vice president; and [ NP Stephen Vadas ] and [ NP Dean Ciporkin ] , who had been engineers with [ NP Cordis ] . Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 5 / 10

Analysis English Analysis: (English PTB) Mostly noun phrases (96%): Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 6 / 10

Analysis English Analysis: (English PTB) Mostly noun phrases (96%): Apple II World War I Mayor William H. Hudnut III International Business Machines Corp. Alexandria, Va Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 6 / 10

Analysis English Analysis: (English PTB) Mostly noun phrases (96%): Apple II World War I Mayor William H. Hudnut III International Business Machines Corp. Alexandria, Va Some proper adjectives (5%); Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 6 / 10

Capitalization Cues Improve Dpendency Grammar Induction Valentin I. - PowerPoint PPT Presentation

Capitalization Cues Improve Dpendency Grammar Induction Valentin I. Spitkovsky with Daniel Jurafsky (Stanford University) and Hiyan Alshawi (Google Inc.) Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 1 / 10

capitalization ATI TEAS ENGLISH AND LANGUAGE USAGE capitalization Capitalization questions

Reinforcement Learning of Reinforcement Learning of Affordance Cues Affordance Cues Final

2020 Capital Budget Application Technical Conference Capitalization Process Presentation to

OUTLINE CAPITALIZATION OF COLLECTIVE KNOWLEDGE: Knowledge management and Knowledge

Overview of the Final Repair and Capitalization Regulations Sorting Out the Confusion and the Myths

The Capitalization of Consumer Financing into Durable Goods Prices Bronson Argyle Taylor Nadauld

DISC- Improv to Improve DISC- Improv to Improve DISC- Improv to Improve DISC- Improv to Improve

Acoustic Cues Used by Learners of English Danica Reid Phonological Processing Lab Simon Fraser

Auditory Perception and Audition Addition of audio cues to VR environments Primary goal is to

Motion Cues for Illustration of Skeletal Motion Capture Data Simon Bouvier-Zappa Victor

Depth Perception Deep Blue See April 5, 2020 PSYCH 4041 / 6014 Overview Cue Theory

Inferring 3D Cues from a Single Image Wei- -Cheng Su Cheng Su Wei Motivation 2 Human can

Perceptive Context Awareness of the User -- Visual Conversation Cues: Interfaces (kiosks, agents,

Syntactic cues alone in Adjective learning Michael Clauss and Jeremy Hartman 13 November 2015

CORPORATE PRESENTATION June 2018 TSX: VII.TO 7G CORPORATE PROFILE 7G Capitalization & Key

January 2017 TSX: YGR TSX: YGR Corporate Snapshot Capitalization Reserves and Locations (2)

Nonverbal Communication Cues in SOVB Bench Players An Independent Study by Emily Carr The

A Learning College Closing the Loop w/ ePortfolio & Assessment Associate Dean Bret Eynon &

$1 BILLION 150 YEARS COMPLETED PROJECTS COMBINED PROPERTY EXPERIENCE $2 BILLION+ 100 PERCENT

Nicholsons Shopping Centre Community Planning Weekend 22 to 26 March 2019 Report Back

Blended Learning in Elementary: How do I Start? Presented by Claire Cummings Third Grade

The Center for the Study of Languages and Cultures Giving an Effective Academic Presentation

Use of Electromagnetic Foraging Cues By Conophagous Insects Tracy Zahradnik 1 Stephan Takcs 1 ,

S.K.H. St. Marys Church Mok Hing Yiu College Lee Chun Yu, Yuen Wing Ho 1 Band 2

Capitalization Cues Improve Dpendency Grammar Induction Valentin I. - PowerPoint PPT Presentation

Capitalization Cues Improve Dpendency Grammar Induction Valentin I. Spitkovsky with Daniel Jurafsky (Stanford University) and Hiyan Alshawi (Google Inc.) Spitkovsky et al. (Stanford & Google) Capitalization WILS (2012-06-07) 1 / 10

capitalization ATI TEAS ENGLISH AND LANGUAGE USAGE capitalization Capitalization questions

Reinforcement Learning of Reinforcement Learning of Affordance Cues Affordance Cues Final

2020 Capital Budget Application Technical Conference Capitalization Process Presentation to

OUTLINE CAPITALIZATION OF COLLECTIVE KNOWLEDGE: Knowledge management and Knowledge

Overview of the Final Repair and Capitalization Regulations Sorting Out the Confusion and the Myths

The Capitalization of Consumer Financing into Durable Goods Prices Bronson Argyle Taylor Nadauld

DISC- Improv to Improve DISC- Improv to Improve DISC- Improv to Improve DISC- Improv to Improve

Acoustic Cues Used by Learners of English Danica Reid Phonological Processing Lab Simon Fraser

Auditory Perception and Audition Addition of audio cues to VR environments Primary goal is to

Motion Cues for Illustration of Skeletal Motion Capture Data Simon Bouvier-Zappa Victor

Depth Perception Deep Blue See April 5, 2020 PSYCH 4041 / 6014 Overview Cue Theory

Inferring 3D Cues from a Single Image Wei- -Cheng Su Cheng Su Wei Motivation 2 Human can

Perceptive Context Awareness of the User -- Visual Conversation Cues: Interfaces (kiosks, agents,

Syntactic cues alone in Adjective learning Michael Clauss and Jeremy Hartman 13 November 2015

CORPORATE PRESENTATION June 2018 TSX: VII.TO 7G CORPORATE PROFILE 7G Capitalization &amp; Key

January 2017 TSX: YGR TSX: YGR Corporate Snapshot Capitalization Reserves and Locations (2)

Nonverbal Communication Cues in SOVB Bench Players An Independent Study by Emily Carr The

A Learning College Closing the Loop w/ ePortfolio &amp; Assessment Associate Dean Bret Eynon &amp;

$1 BILLION 150 YEARS COMPLETED PROJECTS COMBINED PROPERTY EXPERIENCE $2 BILLION+ 100 PERCENT

Nicholsons Shopping Centre Community Planning Weekend 22 to 26 March 2019 Report Back

Blended Learning in Elementary: How do I Start? Presented by Claire Cummings Third Grade

The Center for the Study of Languages and Cultures Giving an Effective Academic Presentation

Use of Electromagnetic Foraging Cues By Conophagous Insects Tracy Zahradnik 1 Stephan Takcs 1 ,

S.K.H. St. Marys Church Mok Hing Yiu College Lee Chun Yu, Yuen Wing Ho 1 Band 2

CORPORATE PRESENTATION June 2018 TSX: VII.TO 7G CORPORATE PROFILE 7G Capitalization & Key

A Learning College Closing the Loop w/ ePortfolio & Assessment Associate Dean Bret Eynon &