Globally Coherent Text Generation with Neural Checklist Models
- Chloe ́ Kiddon, Luke Zettlemoyer, Yejin Choi
Computer Science & Engineering University of Washington
- Presenter: Webber Lee
Globally Coherent Text Generation with Neural Checklist Models - - PowerPoint PPT Presentation
Globally Coherent Text Generation with Neural Checklist Models Chloe Kiddon, Luke Zettlemoyer, Yejin Choi Computer Science & Engineering University of Washington Presenter: Webber Lee March 29, 2018 Outline
Computer Science & Engineering University of Washington
GRU language model New agenda item reference model Used agenda item reference model Generate
3-way classifier Update checklist ht-1 xt g Et ht
at ht ft at-1
gru: content from Gated Recurrent Unit (GRU)
new: encoding from new agenda item reference model
used: encoding from previously used item model
gru, ft new, ft used] is interpolation weights learned by a three-
– predicts which agenda item is being referred to – stores those predictions for use during generation
– initialized to all zero at t = 1
– replicate L-dimensional vector by k times (i.e., RL à RL x k) – element-wise multiplication
– gradient norm: 0.5; uniformly on [-0.35, 0.35] – beam search size: 10 – learning rate: 0.1 – temperature hyper-parameters (beta, gamma)
– hidden state size
– batch size