An Extended GHKM Algorithm for Inducing λ-SCFG
Peng Li, Yang Liu and Maosong Sun
THUNLP & CSS, Tsinghua University, China
Outline
- Background
- Rule extraction algorithm
- Modeling
- Experiments
- Conclusion
Semantic Parsing
- Semantic parsing: mapping a natural language sentence into its computer-executable meaning representation

NL: Every boy likes a star
MR: ∀x.(boy(x) → ∃y(human(y) ⋀ pop(y) ⋀ like(x, y)))
Related Work
- Hand-built systems (e.g., Woods et al., 1972; Warren & Pereira, 1982)
- Learning for semantic parsing
  - Supervised methods (e.g., Wong & Mooney, 2007; Lu et al., 2008)
  - Semi-supervised methods (e.g., Kate & Mooney, 2007)
  - Unsupervised methods (e.g., Poon & Domingos, 2009 & 2010; Goldwasser et al., 2011)
Supervised Methods
- Inductive logic programming based methods (e.g., Zelle & Mooney, 1996; Tang & Mooney, 2001)
- String kernel based methods (e.g., Kate & Mooney, 2006)
- Grammar based methods
  - PCFG (e.g., Ge & Mooney, 2005)
  - SCFG (e.g., Wong & Mooney, 2006 & 2007)
  - CCG (e.g., Zettlemoyer & Collins, 2005 & 2007; Kwiatkowski et al., 2010 & 2011)
- Hybrid tree (e.g., Lu et al., 2008)
- Tree transducer (Jones et al., 2012)
Context Free Grammar (CFG)
- A formal grammar in which every production rule has a single nonterminal on its left-hand side and a string of terminals and nonterminals on its right-hand side, e.g.

X → Every X

(left-hand side: the nonterminal X; right-hand side: the terminal "Every" followed by the nonterminal X)
Context Free Grammar (CFG)
- Derivation example

CFG rules:
r1: S → X
r2: X → Every X
r3: X → X1 X2
r4: X → boy
r5: X → X a star
r6: X → likes

Derivation:
S ⇒ X (r1)
  ⇒ Every X (r2)
  ⇒ Every X1 X2 (r3)
  ⇒ Every boy X2 (r4)
  ⇒ Every boy X2 a star (r5)
  ⇒ Every boy likes a star (r6)
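The derivation above can be sketched in a few lines of code: each step rewrites the leftmost matching nonterminal with one production. This is a minimal illustrative sketch, not part of the original slides; the rule encoding is my own.

```python
# Minimal sketch of a leftmost CFG derivation, reproducing the example
# derivation "S => ... => Every boy likes a star" shown above.

RULES = {
    "r1": ("S", ["X"]),
    "r2": ("X", ["Every", "X"]),
    "r3": ("X", ["X", "X"]),
    "r4": ("X", ["boy"]),
    "r5": ("X", ["X", "a", "star"]),
    "r6": ("X", ["likes"]),
}

def apply_rule(sent, rule_name):
    """Rewrite the leftmost occurrence of the rule's left-hand side."""
    lhs, rhs = RULES[rule_name]
    for i, sym in enumerate(sent):
        if sym == lhs:
            return sent[:i] + rhs + sent[i + 1:]
    raise ValueError(f"no occurrence of {lhs}")

sent = ["S"]
for r in ["r1", "r2", "r3", "r4", "r5", "r6"]:
    sent = apply_rule(sent, r)
print(" ".join(sent))  # Every boy likes a star
```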
Synchronous Context Free Grammar (SCFG)

X → < Every X1, 每个 X1 >

The left-hand side is one nonterminal; the rule has two right-hand sides, and the linked nonterminal X1 is rewritten synchronously on both sides.
Synchronous Context Free Grammar (SCFG)
- Two strings can be generated synchronously

S → < X, X >
X → < Every X, 每个 X >
X → < Every X1 X2, 每个 X1 X2 >
..........
X → < Every boy likes a star, 每个 男孩 都 喜欢 一个 明星 >

How can SCFG be used to handle logical forms?
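The synchronous generation above can be sketched as one derivation tree that emits both strings in lockstep. The rules below are my own toy reconstruction of the slide's English/Chinese example, not the paper's grammar.

```python
# Sketch of SCFG synchronous rewriting: linked nonterminal slots ("#1",
# "#2") are expanded together on both right-hand sides, so one derivation
# tree yields an (English, Chinese) string pair.

RULES = {
    "every": (["Every", "#1"],        ["每个", "#1"]),
    "pair":  (["#1", "#2"],           ["#1", "都", "#2"]),
    "boy":   (["boy"],                ["男孩"]),
    "likes": (["likes", "a", "star"], ["喜欢", "一个", "明星"]),
}

def generate(node):
    """node = (rule name, [child nodes]); returns (source, target) strings."""
    rule, children = node
    src_rhs, tgt_rhs = RULES[rule]
    subs = [generate(c) for c in children]

    def fill(rhs, side):
        out = []
        for sym in rhs:
            if sym.startswith("#"):
                out.append(subs[int(sym[1:]) - 1][side])  # linked slot
            else:
                out.append(sym)                            # terminal
        return " ".join(out)

    return fill(src_rhs, 0), fill(tgt_rhs, 1)

tree = ("every", [("pair", [("boy", []), ("likes", [])])])
src, tgt = generate(tree)
print(src)  # Every boy likes a star
print(tgt)  # 每个 男孩 都 喜欢 一个 明星
```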
λ-calculus
- A formal system in mathematical logic for expressing computation by way of variable binding and substitution
- λ-expression: λx.λy.borders(y, x)
- β-conversion: bound variable substitution
  λx.λy.borders(y, x)(texas) = λy.borders(y, texas)
- α-conversion: bound variable renaming
  λx.λy.borders(y, x) = λz.λy.borders(y, z)
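The two conversions can be mimicked directly with closures: β-conversion is just function application, and α-conversion is the fact that the bound variable's name does not matter. The `borders` denotation below is a hypothetical stand-in, not real GeoQuery data.

```python
# β- and α-conversion illustrated with Python closures.

def borders(y, x):
    """Toy denotation: true only for one hard-coded state pair."""
    return (y, x) in {("oklahoma", "texas")}

# λx.λy.borders(y, x)
expr = lambda x: (lambda y: borders(y, x))

# β-conversion: λx.λy.borders(y, x)(texas) = λy.borders(y, texas)
partial = expr("texas")
print(partial("oklahoma"))  # True
print(partial("utah"))      # False

# α-conversion: renaming the bound variable gives the same function
expr_renamed = lambda z: (lambda y: borders(y, z))
print(expr_renamed("texas")("oklahoma"))  # True
```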
λ-SCFG: SCFG + λ-calculus (Wong & Mooney, 2007)
- Reduces the semantic parsing problem to an SCFG parsing problem
- Uses λ-calculus to handle semantics-specific phenomena
- Rule example:
  X → < Every X1, λf.∀x.(f(x)) ⊲X1 >
λ-SCFG: SCFG + λ-calculus

NL: Every boy likes a star

Rules:
r1: S → < X1, X1 >
r2: X → < Every X1, λf.∀x.(f(x)) ⊲X1 >
r3: X → < X1 X2, λf.λg.λx. f(x) → g(x) ⊲X1 ⊲X2 >
r4: X → < boy, λx.boy(x) >
r5: X → < X1, λf.λx.∃y(f(x, y)) ⊲X1 >
r6: X → < X1 a star, λf.λx.λy. human(y) ⋀ pop(y) ⋀ f(x, y) ⊲X1 >
r7: X → < likes, λx.λy.like(x, y) >

Derivation:
< S1, S1 >
⇒ < X2, X2 > (r1)
⇒ < Every X3, λf.∀x.(f(x)) ⊲X3 > (r2)
⇒ < Every X4 X5, λf.λg.∀x.(f(x) → g(x)) ⊲X4 ⊲X5 > (r3)
⇒ < Every boy X5, λg.∀x.(boy(x) → g(x)) ⊲X5 > (r4)
⇒ < Every boy X6, λf.∀x.(boy(x) → ∃y(f(x, y))) ⊲X6 > (r5)
⇒ < Every boy X7 a star, λf.∀x.(boy(x) → ∃y(human(y) ⋀ pop(y) ⋀ f(x, y))) ⊲X7 > (r6)
⇒ < Every boy likes a star, ∀x.(boy(x) → ∃y(human(y) ⋀ pop(y) ⋀ like(x, y))) > (r7)
GHKM (Galley et al., 2004)
- The GHKM algorithm extracts STSG rules from aligned tree-string pairs
Overview

[Pipeline figure] Training pairs such as
NL: Every boy likes a star
MR: ∀x.(boy(x) → ∃y(human(y) ⋀ pop(y) ⋀ like(x, y)))
are fed to the GHKM Rule Extractor, which produces λ-SCFG rules such as
X → < Every X1, λf.∀x.(f(x)) ⊲X1 >
X → < boy, λx.boy(x) >
Parameter estimation over these rules yields the Semantic Parser.
Rule Extraction Algorithm
- Outline
  1. Building training examples
     a. Transforming logical forms to trees
     b. Aligning trees with sentences
  2. Identifying frontier nodes
  3. Extracting minimal rules
  4. Extracting composed rules
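Step 2, frontier-node identification, is the heart of the GHKM-style extraction: a tree node is a frontier node when the sentence positions aligned to leaves inside its subtree do not overlap positions aligned to leaves outside it. A toy sketch follows; the tiny tree, leaf encoding, and alignments are invented for illustration, not the paper's representation.

```python
# Toy frontier-node check on a nested-tuple tree: (label, child, child, ...).

def subtree_leaves(tree):
    """Yield leaf labels of a nested-tuple tree."""
    label, *children = tree
    if not children:
        yield label
    for child in children:
        yield from subtree_leaves(child)

def is_frontier(node, root, align):
    """align maps a leaf label to the sentence positions it aligns to."""
    inside_leaves = set(subtree_leaves(node))
    inside = {p for leaf in inside_leaves for p in align.get(leaf, ())}
    outside_leaves = set(subtree_leaves(root)) - inside_leaves
    outside = {p for leaf in outside_leaves for p in align.get(leaf, ())}
    # Frontier: aligned to something, and no overlap with outside words.
    return bool(inside) and not (inside & outside)

# MR tree reduced to two leaves for illustration.
tree = ("implies", ("boy",), ("like",))
clean = {"boy": (1,), "like": (2,)}      # one-to-one alignment
noisy = {"boy": (1, 2), "like": (2,)}    # "boy" also aligned to word 2

print(is_frontier(("boy",), tree, clean))  # True
print(is_frontier(("boy",), tree, noisy))  # False
```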
Building Training Examples

NL: Every boy likes a star
MR: ∀x.(boy(x) → ∃y(human(y) ⋀ pop(y) ⋀ like(x, y)))

[Figure slides: the logical form is transformed into a tree, and the tree is aligned with the sentence, step by step]
Identifying Frontier Nodes

[Figure slides: frontier nodes are marked on the aligned tree-string pair]
Extracting Minimal Rules

A minimal rule is extracted at each frontier node of the tree:
∀x:   X → < Every X1, λf.∀x.(f(x)) ⊲X1 >
→:    X → < X1 X2, λf.λg.λx. f(x) → g(x) ⊲X1 ⊲X2 >
boy:  X → < boy, λx.boy(x) >
∃y:   X → < X1, λf.λx.∃y(f(x, y)) ⊲X1 >
⋀:    X → < X1 a star, λf.λx.λy. human(y) ⋀ pop(y) ⋀ f(x, y) ⊲X1 >
like: X → < likes, λx.λy.like(x, y) >
Composed Rule Extraction

  X → < X1 X2, λf.λg.λx. f(x) → g(x) ⊲X1 ⊲X2 >
+ X → < boy, λx.boy(x) >
+ X → < X1, λf.λx.∃y(f(x, y)) ⊲X1 >
= X → < boy X1, λf.λx.boy(x) → ∃y(f(x, y)) ⊲X1 >
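Rule composition can be sketched with closures: substituting a child rule into a parent's slot concatenates the source sides and β-applies the parent's semantic function to the child's. The representation and toy denotations below are my own illustration, not the paper's implementation.

```python
# Rough sketch of composing two λ-SCFG rules, with ⊲ applications
# modeled as ordinary Python function calls.

# parent: X -> < X1 X2, λf.λg.λx. f(x) → g(x) ⊲X1 ⊲X2 >
parent_src = ["X1", "X2"]
parent_sem = lambda f: lambda g: lambda x: (not f(x)) or g(x)  # f(x) → g(x)

# child substituted into slot X1: X -> < boy, λx.boy(x) >
boys = {"john"}                       # toy denotation of boy(x)
child_src = ["boy"]
child_sem = lambda x: x in boys

# composed rule: X -> < boy X2, λg.λx. boy(x) → g(x) >
composed_src = child_src + ["X2"]     # splice source sides
composed_sem = parent_sem(child_sem)  # β-conversion does the substitution

likes_star = lambda x: x == "john"    # toy denotation for the remaining slot
print(" ".join(composed_src))         # boy X2
print(composed_sem(likes_star)("john"))  # True
```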
Modeling
- Log-linear model + MERT training
- Target:
ê = e( argmax_{D s.t. s(D) ≡ s} w(D) )

w(D) = [ ∏_{r ∈ D} ∏_{i=1}^{3} h_i(r)^{λ_i} ] × h_4(D)^{λ_4} × h_5(D)^{λ_5}

h1(X → <s, e>) = p(e | s)
h2(X → <s, e>) = p_lex(s | e)
h3(X → <s, e>) = p_lex(e | s)
h4(D) = p_s(e(D))
h5(D) = exp(|D|)

where s(D) and e(D) are the source and target strings generated by derivation D.
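The log-linear derivation score above is a product of weighted features: each rule contributes h1..h3 raised to their weights, and h4, h5 apply once per derivation. A small sketch follows; all feature values and weights are invented for illustration.

```python
import math

def score(derivation, lam, h4, h5):
    """w(D) = [prod_{r in D} prod_{i=1..3} h_i(r)^{λ_i}] * h4^{λ4} * h5^{λ5}."""
    w = 1.0
    for h1, h2, h3 in derivation:  # per-rule feature values
        w *= h1 ** lam[0] * h2 ** lam[1] * h3 ** lam[2]
    return w * h4 ** lam[3] * h5 ** lam[4]

# Two-rule derivation with made-up translation/lexical probabilities.
D = [(0.5, 0.4, 0.3), (0.8, 0.6, 0.7)]
lam = [1.0, 0.5, 0.5, 0.3, -0.1]
w = score(D, lam, h4=0.2, h5=math.exp(len(D)))  # h5 = exp(|D|)
print(w)
```

At decoding time, the parser would compute this score for every derivation of the input sentence and keep the one with the highest w(D).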
Experiments
- Dataset: GEOQUERY
  - 880 English questions with corresponding Prolog logical forms

Query: Which rivers run through the states bordering Texas?
Semantic parsing: answer(traverse(next_to(stateid('texas'))))
Answer: Arkansas, Canadian, Cimarron, Gila, Mississippi, Rio Grande …
(Kate & Wong, ACL 2010 Tutorial)
Experiments
- Evaluation metrics

precision = |C| / |G|
recall = |C| / |T|
F-measure = 2 · precision · recall / (precision + recall)
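The metrics above, computed on invented counts (C = correct outputs, G = generated outputs, T = test examples; the numbers are illustrative, not experimental results):

```python
def prf(correct, produced, total):
    """precision = |C|/|G|, recall = |C|/|T|, F = harmonic mean of the two."""
    precision = correct / produced
    recall = correct / total
    f = 2 * precision * recall / (precision + recall)
    return precision, recall, f

p, r, f = prf(correct=80, produced=90, total=100)
print(round(p, 3), round(r, 3), round(f, 3))
```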
Experiments

System                     P     R     F
-- Independent Test Set --
Z&C 2005                   96.3  79.3  87.0
Z&C 2007                   95.5  83.2  88.9
Kwiatkowski et al. (2010)  94.1  85.0  89.3
-- Cross Validation Results --
Kate et al. (2005)         89.0  54.1  67.3
Wong and Mooney (2006)     87.2  74.8  80.5
Kate and Mooney (2006)     93.3  71.7  81.1
Lu et al. (2008)           89.3  81.5  85.2
Ge and Mooney (2005)       95.5  77.2  85.4
Wong and Mooney (2007)     92.0  86.6  89.2
this work                  93.0  87.6  90.2
Experiments
- F-measure for different languages

System                     en    ge    el    th
Wong and Mooney (2006)     77.7  74.9  78.6  75.0
Lu et al. (2008)           81.0  68.5  74.6  76.7
Kwiatkowski et al. (2010)  82.1  75.0  73.7  66.4
Jones et al. (2012)        79.3  74.6  75.4  78.2
this work                  84.2  74.6  79.4  76.7

* en - English, ge - German, el - Greek, th - Thai
Advantages
- Feasible to extract rules with varying granularities in a principled way
- Alleviating the data sparseness problem
  - The widely used dataset only has 880 training examples
- Treating atomic logical form tokens as tree nodes instead of context free grammar (CFG) productions
  - Robust to the non-isomorphism between NL sentences and logical forms
Conclusion
- We have presented an extended GHKM algorithm for inducing λ-SCFG and achieved state-of-the-art performance
- Future work
  - Better alignment model
  - Investigate tree binarization to further improve rule coverage
  - Use EM or Monte Carlo methods to better estimate λ-SCFG rule probabilities