CS11-747 Neural Networks for NLP Neural Semantic Parsing Pengcheng - PowerPoint PPT Presentation

CS11-747 Neural Networks for NLP Neural Semantic Parsing Pengcheng Yin pcyin@cs.cmu.edu Language Technologies Institute Carnegie Mellon University [Some contents are adapted from talks by Graham Neubig]

The Semantic Parsing Task Motivation how to represent the meaning of the sentence? Task Parsing natural language utterances into formal meaning representations (MRs) Meaning Representation Natural Language Utterance Show me flights from Pittsburgh lambda $0 e (and (flight $0) to Seattle (from $0 san_Francisco:ci) (to $0 seattle:ci))

The Semantic Parsing Task Task-specific Meaning Representations designed for a specific task (e.g., question answering) General-purpose Meaning Representations capture the semantics of natural language Task-Specific General-Purpose Meaning Representations Meaning Representations The boy wants to go Show me flights from Pittsburgh to Seattle lambda $0 e (and (flight $0) (want-01 (from $0 san_Francisco:ci) :arg0 (b / boy) (to $0 seattle:ci)) :arg1 (g / go-01)) Abstract Meaning Representation (AMR) Task-specific Logical Form Example: Smart Personal Agent Example: AMR, Combinatory Categorical Grammar (CCG) Question Answering Systems

Workflow of a (Task-specific) Semantic Parser User’s Natural Language Query Parsing to Meaning Representation lambda $0 e (and (flight $0) Show me flights from Pittsburgh to Seattle (from $0 san_Francisco:ci) (to $0 seattle:ci)) Query Execution Execution Results (Answer) 1. AS 119 2. AA 3544 -> AS 1101 3. … Build natural language interfaces to computers

Task-specific Semantic Parsing: Datasets • Domain-specific Meaning Representations and Languages – GEO Query, ATIS, JOBS – WikiSQL, Spider – IFTTT • General-purpose Programming Languages – HearthStone – Django – C O N A L A

GEO Query, ATIS, JOBS • ATIS 5410 queries about flight booking • GEO Query 880 queries about US geographical information • JOBS 640 queries to a job database GEO Query ATIS JOBS which state has the most rivers Show me flights from what microsoft jobs do not running through it? Pittsburgh to Seattle require a bscs? argmax $0 lambda $0 e answer( (state:t $0) (and (flight $0) company(J,’microsoft’), (count $1 (and (from $0 pittsburgh:ci) job(J), (river:t $1) (to $0 seattle:ci)) not((req deg(J,’bscs’)))) (loc:t $1 $0))) Prolog-style Program Lambda Calculus Logical Form Lambda Calculus Logical Form

WikiSQL • 80654 examples of Table, Question and Answer • Context a small database table extracted from a Wikipedia article • Target a SQL query [Zhong et al. , 2017]

IFTTT Dataset • Over 70K user-generated task completion snippets crawled from ifttt.com • Wide variety of topics: home automation, productivity, etc. • Domain-Specific Language: IF-THIS-THEN-THAT structure, much simpler grammar IFTTT Natural Language Query and Meaning Representation Autosave your Instagram photos to Dropbox IF Instagram.AnyNewPhotoByYou THEN Dropbox.AddFileFromURL Domain-Specific Programming Language https://ifttt.com/applets/1p-autosave- your-instagram-photos-to-dropbox [Quirk et al. , 2015]

HearthStone (HS) Card Dataset • Description: properties/fields of an HearthStone card • Target code: implementation as a Python class from HearthBreaker Intent (Card Property) <name> Divine Favor </name> <cost> 3 </cost> <desc> Draw cards until you have as many in hand as your opponent </desc> Target Code (Python class) [Ling et al. , 2016]

Django Annotation Dataset • Description: manually annotated descriptions for 10K lines of code • Target code: one liners • Covers basic usage of Python like variable definition, function calling, string manipulation and exception handling Intent call the function _generator, join the result into a string, return the result Target [Oda et al. , 2015]

The C O N A L A Code Generation Dataset − 2,379 training and 500 test examples Get a list of words `words` of a file 'myfile' − Manually annotated, high quality natural words = open('myfile').read().split() language queries − Code is highly expressive and compositional Copy the content of file 'file.txt' to file 'file2.txt' − Also ship with 600K extra mined examples! shutil.copy('file.txt’, 'file2.txt') Check if all elements in list `mylist` are the same len(set(mylist)) == 1 Create a key `key` if it does not exist in dict `dic` and append element `value` to value dic.setdefault(key, []).append(value) conala-corpus.github.io [Yin et al. , 2018]

Learning Paradigms Supervised Learning Utterances with Labeled Meaning Representation Weakly-supervised Learning Utterances with Query Execution Results Semi-supervised Learning Learning with Labeled and Unlabeled Utterances

Learning Paradigm 1: Supervised Learning User’s Natural Language Query Parsing to Meaning Representation lambda $0 e (and (flight $0) Show me flights from Pittsburgh to Seattle (from $0 san_Francisco:ci) (to $0 seattle:ci)) Train a neural semantic parser with source natural language query and target meaning representations

Sequence-to-Sequence Learning with Attention lambda $0 e ( and ) Task-Specific Meaning Representations . . . . . Show me flights from Pittsburgh to Seattle lambda $0 e (and (flight $0) (from $0 san_Francisco:ci) (to $0 seattle:ci)) Task specific logical form flight from Pittsburgh to Seattle • Treat the target meaning representation as a sequence of surface tokens • Reduce the task as another sequence-to-sequence learning problem [Jia and Liang, 2016; Dong and Lapata, 2016]

Sequence-to-Sequence Learning with Attention • Meaning Representations (e.g., a database query) have strong underlying structures! • Issue Using vanilla seq2seq models ignore the rich structures of meaning representations Task-Specific Meaning Representations Show me flights from Pittsburgh to Seattle lambda $0 e (and (flight $0) (from $0 san_Francisco:ci) Tree-structured Representation (to $0 seattle:ci)) Task specific logical form [Jia and Liang, 2016; Dong and Lapata, 2016]

Structure-aware Decoding for Semantic Parsing • Motivation utilize the rich syntactic structure of target meaning representations • Seq2Tree Generate from top-down using hierarchical sequence-to-sequence model • Sequence-to-tree Decoding Process lambda – Each level of a parse tree is a sequence of terminals and non- $0 e and terminals – Use a LSTM decoder to generate the > from sequence – For each non-terminal node, expand $0 dallas:ci departure_time 1600:ti it using the LSTM decoder $0 Show me flight from Dallas departing after 16:00 [Dong and Lapata, 2016]

Structure-aware Decoding (Cont’d) • Coarse-to-Fine Decoding decode a coarse sketch of the target logical form first and then decode the full logical form conditioned on both the input query and the sketch • Explicitly model the coarse global structure of the logical form, and use it to guide the parsing process [Dong and Lapata, 2018]

Grammar/Syntax-driven Semantic Parsing • Previously introduced methods only added structured components to the decoding model • Meaning representations (e.g., Python) have strong underlying syntax • How can we explicitly model the underlying syntax/grammar of the target meaning representations in the decoding process? Abstract Syntax Tree Python Abstract Grammar Expr expr ⟼ Name | Call Call Call ⟼ expr[ func ] expr*[ args ] keyword*[ keywords ] expr[func] keyword*[keywords] expr*[args] If ⟼ expr[ test ] stmt*[ body ] stmt*[ orelse ] Name erpr For ⟼ expr[ target ] expr*[ iter ] stmt*[ body ] keyword stmt*[ orelse ] .... str(sorted) Name FunctionDef ⟼ identifier[ name ] expr*[ iter ] str(my_list) stmt*[ body ] stmt*[ orelse ] sorted(my_list, reverse=True) [Yin and Neubig, 2017; Rabinovich et al. , 2017]

Grammar/Syntax-driven Semantic Parsing • Key idea: use the grammar of the target meaning representation (Python AST) as prior knowledge in a neural sequence-to-sequence model sort my_list in descending order (() Input Intent % & ' : a seq2seq model with prior syntactic information Expr (") Generated AST Call expr[func] keyword*[keywords] expr*[args] Name erpr keyword .... str(sorted) Name str(my_list) Deterministic transformation (using Python astor library) Surface Code ($) sorted(my_list, reverse=True) [Yin and Neubig, 2017; Rabinovich et al. , 2017]

Grammar/Syntax-driven Semantic Parsing • Factorize the generation story of an AST into sequential application of actions {" # } : – ApplyRule[r] : apply a production rule % to the frontier node in the derivation – GenToken[v] : append a token & (e.g., variable names, string literals) to a terminal Generated by a Derivation AST Action Sequence recurrent neural decoder " ' root ⟼ Expr root Expr Expr ⟼ expr[Value] " ) expr[Value] expr ⟼ Call " * Call Call ⟼ expr[func] expr*[args] " + keyword*[keywords] expr[func] expr*[args] keyword*[keywords] " , expr ⟼ Name " '+ keyword* ⟼ keyword " 0 expr* ⟼ expr Name erpr keyword .... .... " - Name ⟼ str " '1 expr ⟼ Name str(sorted) str Name " . str(my_list) GenToken[sorted] " '' Name ⟼ str " / GenToken[</n>] " ') GenToken[my_list] sorted(my_list, reverse=True) 2 3 ApplyRule " '* GenToken[</n>] 2 3 GenToken

CS11-747 Neural Networks for NLP Neural Semantic Parsing Pengcheng - PowerPoint PPT Presentation

CS11-747 Neural Networks for NLP Neural Semantic Parsing Pengcheng Yin pcyin@cs.cmu.edu Language Technologies Institute Carnegie Mellon University [Some contents are adapted from talks by Graham Neubig] The Semantic Parsing Task Motivation

Recurrent Neural Networks Graham Neubig Site https://phontron.com/class/nn4nlp2017/ NLP and

Robust Incremental Neural Semantic Graph Parsing Jan Buys and Phil Blunsom Dependency Parsing vs

Introduction to Bottom-Up Parsing Shift-reduce parsing The LR parsing algorithm

CS11-747 Neural Networks for NLP Generate Trees Incrementally Graham Neubig gneubig@cs.cmu.edu

Neural Semantic Parsing Graham Neubig Site https://phontron.com/class/nn4nlp2018/ Tree

Neural Semantic Parsing Graham Neubig Site https://phontron.com/class/nn4nlp2017/ Tree

Recurrent Neural Networks Graham Neubig Site https://phontron.com/class/nn4nlp2020/ NLP and

Debugging Neural Networks for NLP Graham Neubig Site https://phontron.com/class/nn4nlp2020/ In

Debugging Neural Networks for NLP Graham Neubig Site https://phontron.com/class/nn4nlp2019/ In

CSC 4181 Compiler Construction Parsing 1 1 Outline Top-down v.s. Bottom-up Top-down parsing

Policy Shaping and Generalized Update Equations for Semantic Parsing from Denotations 1

Semantics for Semantic Parsing Mark Steedman ( with Mike Lewis, Siva Reddy, and Mirella Lapata) 26

Transition-based Parsing with Neural Nets Graham Neubig Site

Type-driven Incremental Semantic Parsing with Polymorphism Kai Zhao and Liang Huang City

Basic Parsing Algorithms Chart Parsing Seminar Recent Advances in Parsing Technology WS

Application: Semantic Role Labeling CS 6956: Deep Learning for NLP Overview What is semantic

Multithreading Checkout Multithreading project from SVN Joe Armstrong, Programming in

A Self-Replication Algorithm to Flexibly Match Join Point Traces Paul Leger and ric Tanter

Concurrent Revisions A novel determinis2c concurrency model Daan

Stanford CS193p Developing Applications for iOS Fall 2017-18 CS193p Fall 2017-18 Today Emoji

Overcome challenges in online exams Fan Tang Meng Integrated Information Technology Services

STM32 Ecosystem workshop T.O.M.A.S Team 2 Now it is a right time for some activities again

All in one Reporting with RStudio (slides) Presentation December 2016 CITATIONS READS 0

Welcome Clients of Mariner Wealth Advisors Cybersecurity Education Series Password Management