BroadSem: Induction of Broad-Coverage Semantic Parsers
Ivan Titov

Natural language processing (NLP)
Machine reading, machine translation, information retrieval
The key bottleneck: the lack of accurate methods for producing meaning representations of texts and reasoning with these representations
Lansky dropped his studies at RCM, but eventually graduated from Trinity. Lansky left Australia to study the piano at the Royal College of Music.
- 1. Where did Lansky get his diploma?
- 2. Where did he live?
- 3. What does he do?
….
Machine reading
Lansky left Australia to study the piano at the Royal College of Music.
Frame-semantic parsing
Lansky left Australia to study the piano at the Royal College of Music.
[Diagram: "study" evokes the EDUCATION frame, with Lansky as Student, the piano as Subject, and the Royal College of Music as Institution]
Frame-semantic parsing
Semantic frames and semantic roles:
Lansky left Australia to study the piano at the Royal College of Music.
[Diagram: "left" evokes the DEPARTING frame (Agent: Lansky, Source: Australia, Purpose: to study …); "study" evokes the EDUCATION frame (Student: Lansky, Subject: the piano, Institution: the Royal College of Music)]
Frame-semantic parsing
} Intuitively, a frame-semantic parser extracts knowledge from text into a relational database: frames are tables, roles are attributes
DEPARTING (semantic frame; columns are semantic roles)
  Object | Source    | Purpose
  Lansky | Australia | to study …
  …      | …         | …

EDUCATION (semantic frame; columns are semantic roles)
  Student | Institution            | Subject
  Lansky  | Royal College of Music | piano
  …       | …                      | …
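The "frames as database tables" view can be made concrete with a small sketch. The table layout and the `query` helper are illustrative assumptions, not part of any parser; the role fillers come from the example above.

```python
# Minimal sketch: semantic frames stored as relational tables.
# Frame = table name, roles = attributes, one row per frame instance.
database = {
    "DEPARTING": [
        {"Object": "Lansky", "Source": "Australia", "Purpose": "to study ..."},
    ],
    "EDUCATION": [
        {"Student": "Lansky", "Institution": "Royal College of Music", "Subject": "piano"},
    ],
}

def query(frame, **conditions):
    """Return rows of `frame` matching all role=value conditions."""
    return [row for row in database.get(frame, [])
            if all(row.get(role) == value for role, value in conditions.items())]

rows = query("EDUCATION", Student="Lansky")
```

Answering "Where did Lansky study the piano?" then reduces to a lookup of the Institution attribute in the matching EDUCATION row.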
} Motivation: why we need unsupervised feature-rich models and learning for inference
} Framework: reconstruction error minimization for semantics
} Special case: inferring missing arguments
} Conclusions
Outline
Modern semantic parsers
Modern frame-semantic parsers rely on supervised learning:
text collection annotated by linguists → learning algorithm → parser ready to be applied to new texts
Challenge #1: It is impossible to annotate enough data to estimate an effective broad-coverage semantic parser, especially across languages and domains.
Lansky left Australia to study the piano at the Royal College of Music. Lansky dropped his studies at RCM, but eventually graduated from Trinity.
- 1. Where did Lansky get his diploma?
….
Machine reading
Lansky left Australia to study the piano at the Royal College of Music. ….
[Parser output: the question "Where did Lansky get his diploma?" is parsed with the GET frame (Agent, Place, Object); "Lansky dropped his studies at RCM, but eventually graduated from Trinity." is parsed with EDUCATION (Student, Institution, Manner) and MOVEMENT (Agent, Object), with two of the assignments marked WRONG]
Output of a state-of-the-art parser
CMU's SEMAFOR [Das et al., 2012], trained on 100,000 sentences (FrameNet)
The parser's output does not let us answer even this simple question
Representative of the "Head", at least for the training data
Lansky left Australia to study the piano at the Royal College of Music. ….
[Figure: annotation as imposed by linguists. Both clauses of "Lansky dropped his studies at RCM, but eventually graduated from Trinity." evoke the EDUCATION frame: EDUCATION (Student, Institution, Time) and EDUCATION (Student, Institution)]
- 1. Where did Lansky get his diploma?
"Correct" semantics as imposed by linguists
Trinity or RCM?
Challenge #2: Representations defined by linguists are not appropriate for reasoning (i.e., inference).
} The challenges motivated research in unsupervised role / frame induction:
} Role induction [Swier and Stevenson '04; Grenager and Manning '06; Lang and Lapata '10, '11, '14; Titov and Klementiev '12; Garg and Henderson '12; Fürstenau and Rambow '12; …]
} Frame induction [Titov and Klementiev '11; O'Connor '12; Modi et al. '12; Materna '12; Lorenzo and Cerisara '12; Kawahara et al. '13; Cheung et al. '13; Chambers et al. '14; …]
Unsupervised role and frame induction
} The models rely on very restricted sets of features
  } not very effective in the semi-supervised set-up, and not very appropriate for languages with freer word order than English
} … over-rely on syntax
  } not going to induce, e.g., "X sent Y = Y is a shipment from X"
} … use language-specific priors
  } a substantial drop in performance if no adaptation
} … are not (quite) appropriate for inference
  } not only are there no inference models, but opposites and antonyms (e.g., increase + decrease) are typically grouped together; induced granularity is often problematic; …
In contrast to supervised methods for frame-semantic parsing / semantic role labeling
Unsupervised role and frame induction
Do not impose the notion of semantics; induce it from unannotated data in such a way that it is useful for reasoning
} Motivation: why we need unsupervised feature-rich models and learning for inference
} Framework: reconstruction error minimization for semantics
} Special case: inferring missing arguments
} Conclusions
Outline
Idea: estimating the model
[Diagram: Text(s) → Encoding → Semantic representations → Reconstruction → Left-out facts]
Instead of using annotated data, induce representations beneficial for inferring left-out facts
Semantic representations are not observable in the data – they need to be induced
Idea: estimating the model
[Diagram, continued: the reconstruction step is an inference model based on tensor factorization – ideas from statistical relational learning, e.g., [Yilmaz et al., '11]; similar to a relational database]
Idea: estimating the model
[Diagram, continued: the encoding step is the semantic parser – an expressive 'feature-rich' model, with ideas from supervised parsing, e.g., [Das et al., '10; Titov et al., '09]]
Inference model and semantic parser are jointly estimated from unannotated data
Lansky left Australia to study the piano at the Royal College of Music. ….
[Figure: induced semantics. For "Lansky dropped his studies at RCM, but eventually graduated from Trinity.": "graduated" → GRADUATION (Student, Institution, Time), "dropped" → DROP_OUT (Student, Institution) – each distinguished from EDUCATION]
- 1. Where did Lansky get his diploma? → Trinity
When learning for reasoning
The learning objective can ensure that the representations are informative for reasoning
When learning for reasoning
The induced representations, combined with the inference component, also answer:
- 2. Where did he live? → Australia and the United Kingdom
- 3. What does he do? → He is a pianist (??)
Inference component can support 'reading between the lines'
} Motivation: why we need unsupervised feature-rich models and learning for inference
} Framework: reconstruction error minimization for semantics
} Special case: inferring missing arguments
} Conclusions
Outline
Consider a frame realization:
The police charged the demonstrators with their batons
Assault (frame); Perpetrator, Victim, Instrument (roles)
a = (a_1, . . . , a_n) – arguments (police, the demonstrators, their batons): observable
r = (r_1, . . . , r_n) – roles (Perpetrator, Victim, Instrument): latent
f – frame (Assault): latent
For simplicity: focus on frame and role labeling (no identification + one frame per sentence)
[Titov and Khoddam, '14]
Feature-rich models of semantic frames
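As a sketch, the frame realization (a, r, f) above can be written down directly. The class and field names are illustrative, not from the paper; they just mirror the observable/latent split.

```python
from dataclasses import dataclass
from typing import Tuple

@dataclass
class FrameRealization:
    arguments: Tuple[str, ...]  # a = (a_1, ..., a_n): observable
    roles: Tuple[str, ...]      # r = (r_1, ..., r_n): latent, one role per argument
    frame: str                  # f: latent

x = FrameRealization(
    arguments=("police", "the demonstrators", "their batons"),
    roles=("Perpetrator", "Victim", "Instrument"),
    frame="Assault",
)
assert len(x.arguments) == len(x.roles)  # roles and arguments are aligned
```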
How can we define a feature-rich model for unsupervised induction of roles and frames?
Consider a frame realization:
The police charged the demonstrators with their batons
Assault (frame); Perpetrator, Victim, Instrument (roles)
[Diagram: x – feature representation of "The police charged …"; Encoding = semantic role prediction, a feature-rich model p(r, f | x, w) (any existing supervised role labeler would do); Hidden – Assault(Agent: police, Patient: demonstrator, Instrument: baton); Reconstruction = the "argument prediction" model p(a_i | a_−i, r, f, θ), e.g., predicting the argument "demonstrator"]
Hypothesis: semantic roles and frames are the latent representation which helps to reconstruct arguments
Argument reconstruction
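The encoding side can be sketched as a log-linear (softmax) model over candidate (roles, frame) structures, as in a standard supervised role labeler. The feature values, weights, and the flat enumeration of structures are toy assumptions for illustration.

```python
import math

def q_structure(x, w):
    """q(r, f | x, w): log-linear (softmax) distribution over candidate
    (roles, frame) structures, given feature vector x and weight matrix w."""
    scores = [sum(wi * xi for wi, xi in zip(row, x)) for row in w]
    m = max(scores)                       # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]

# Toy weights for 3 candidate structures over 4 features (illustrative numbers)
w = [[0.5, -0.2, 0.1, 0.0],
     [0.1, 0.3, -0.4, 0.2],
     [-0.3, 0.1, 0.2, 0.4]]
x = [1.0, 0.0, 1.0, 1.0]   # feature representation of "The police charged ..."
q = q_structure(x, w)
```

In the actual model the structures are scored with rich features of the sentence, but the softmax form is the same.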
What do the components look like, and how do we estimate them jointly?
The police charged the demonstrators with their batons
Assault (frame); Perpetrator, Victim, Instrument (roles)
[Diagram: encoding q(r, f | x, w); hidden Assault(Agent: police, Patient: demonstrator, Instrument: baton); reconstruction p(a_i | a_−i, r, f, θ)]
Argument reconstruction
Consider a frame realization:
The police charged the demonstrators with their batons
[Diagram: encoding q(r, f | x, w) – a (structured) linear model; hidden Assault(Agent: police, Patient: demonstrator, Instrument: baton); reconstruction p(a_i | a_−i, r, f, θ) – tensor factorization]
Argument reconstruction
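The reconstruction side can be sketched as a bilinear, tensor-factorization-style scorer: each candidate filler embedding a is scored against a context vector u (summarizing the other arguments, the role, and the frame) through a factor matrix C, and the scores are softmax-normalized. Embedding sizes and all numeric values are illustrative assumptions.

```python
import math

def bilinear_score(a_emb, C, u):
    """Bilinear score a^T C u for one candidate argument embedding."""
    Cu = [sum(C[i][j] * u[j] for j in range(len(u))) for i in range(len(C))]
    return sum(a_emb[i] * Cu[i] for i in range(len(a_emb)))

def p_argument(arg_embs, C, u):
    """p(a_i | a_-i, r, f): softmax over candidate fillers of bilinear scores."""
    scores = [bilinear_score(a, C, u) for a in arg_embs]
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]

# Toy 2-dimensional embeddings for 3 candidate arguments (illustrative)
arg_embs = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
C = [[0.8, -0.1], [0.2, 0.6]]   # factor matrix for one (frame, role) pair
u = [1.0, -0.5]                 # context vector summarizing a_-i, r, f
p = p_argument(arg_embs, C, u)
```

Sharing the factorized parameters across frames and roles is what lets the model generalize like a relational-database completion method.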
} For every structure, we aim to optimize the expectation of the argument prediction quality given roles and frames:

$$\sum_{i=1}^{N} \sum_{r,f} q(r, f \mid x, w) \, \log p(a_i \mid a_{-i}, r, f, C, u)$$
Joint learning
Training can be quite efficient as all models are linear (or bilinear)
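The objective above can be evaluated numerically for a toy example: a posterior q over two candidate (roles, frame) structures and per-argument reconstruction probabilities for N = 2 arguments. All the distributions here are made-up illustrative numbers, and the structures are enumerated explicitly only because the toy space is tiny.

```python
import math

# Toy posterior q(r, f | x, w) over two candidate (roles, frame) structures
q = {("Perpetrator-Victim", "Assault"): 0.7,
     ("Agent-Theme", "Filling"): 0.3}

# Toy reconstruction probabilities p(a_i | a_-i, r, f) for N = 2 arguments
p = {("Perpetrator-Victim", "Assault"): [0.6, 0.5],
     ("Agent-Theme", "Filling"): [0.2, 0.1]}

# Objective: sum_i sum_{r,f} q(r, f | x, w) * log p(a_i | a_-i, r, f)
objective = sum(q[rf] * math.log(p_i)
                for rf in q
                for p_i in p[rf])
```

Raising the objective pushes q toward structures under which the arguments are easy to reconstruct, which is exactly the "learning for reasoning" pressure described above.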
Results
} Inducing semantic roles relying on syntactic annotation
} Discovering relations between named entities
[TACL '16] [NAACL '15]
In both cases, our method substantially outperforms previous techniques (generative / clustering baselines), even the ones which relied on language-specific priors