Policy Shaping and Generalized Update Equations for Semantic Parsing from Denotations
1
Policy Shaping and Generalized Update Equations for Semantic - - PowerPoint PPT Presentation
Policy Shaping and Generalized Update Equations for Semantic Parsing from Denotations 1 Semantic Parsing with Execution Text Environment Semantic Parsing Meaning Representation Execution Denotation (Answer) 2 Semantic Parsing with
1
2
Select Nation Where Points is Max
Index Name Nation Points Games Pts/game 1 Karen Andrew England 44 5 8.8 2 Daniella Waterman England 40 5 8 3 Christelle Le Duff France 33 5 6.6 4 Charlotte Barras England 30 5 6 5 Naomi Thomas Wales 25 5 5
3
Select Nation Where Points is Max
Index Name Nation Points Games Pts/game 1 Karen Andrew England 44 5 8.8 2 Daniella Waterman England 40 5 8 3 Christelle Le Duff France 33 5 6.6 4 Charlotte Barras England 30 5 6 5 Naomi Thomas Wales 25 5 5 4
○ x: “What nation scored the most points?”
○ neural models ⇒ score(x, y): encode x, encode y, and produce scores
○ Beamseach: argmax score(x, y)
○ Find approximated gold meaning representations ○ Reinforcement learning algorithms
5
Index Name Nation Points Games Pts/game 1 Karen Andrew England 44 5 8.8 2 Daniella Waterman England 40 5 8 3 Christelle Le Duff France 33 5 6.6 4 Charlotte Barras England 30 5 6 5 Naomi Thomas Wales 25 5 5
For Training
6
7
8
9
10
1 2
11
1
12
13
1 1
14
15
16
1 2
⇐ Shaping affects here
17
Most violated program generated according to reward augment inference Maximum Reward Program
18
19
2
20
22
23
24
25
26
1 2
27
1 2
⇐ Shaping affects here directly ⇐ Shaping affects here indirectly
28
Intensity Competing Distribution Dev Performance w/o shaping Maximum Marginal Likelihood (MML) Maximum Marginal Likelihood (MML) 32.4 Maximum Margin Reward (MMR) Maximum Margin Reward (MMR) 40.7 Maximum Margin Reward (MMR) Maximum Marginal Likelihood (MML) 41.9
29
30
31
32
33
34