Adversarial Learning for Neural Dialogue Generation
Li, Jiwei, Will Monroe, Tianlin Shi, Alan Ritter, and Dan Jurafsky EMNLP’17 Presented by Yiren Wang (CS546, Spring 2018)
Main Contributions
Goal: End-to-end neural
(Sutskever et al., 2014; Jean et al., 2014)
Approximated by likelihood ratio
Classification score
Baseline value to reduce the variance of the estimate while keeping it unbiased
Policy updates in the parameter space
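The labels above annotate a likelihood-ratio (REINFORCE-style) policy gradient: the discriminator's classification score acts as the reward, and a baseline is subtracted to lower the variance of the estimate. The following is only a minimal sketch under stated assumptions: the policy is a toy one whose per-step logits are passed in directly, the gradient is taken with respect to those logits rather than real model parameters, and the names `softmax`, `reinforce_gradient`, `reward`, and `baseline` are illustrative and do not come from the slides.

```python
import numpy as np

def softmax(logits):
    """Row-wise softmax with the usual max-shift for numerical stability."""
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def reinforce_gradient(logits, tokens, reward, baseline):
    """Likelihood-ratio gradient estimate with respect to the logits.

    logits:   (T, V) array, one row of vocabulary logits per generated step
    tokens:   length-T sequence of sampled token ids
    reward:   scalar score for the whole response (assumed: discriminator score)
    baseline: scalar subtracted from the reward to reduce variance
    """
    probs = softmax(logits)
    one_hot = np.zeros_like(probs)
    one_hot[np.arange(len(tokens)), tokens] = 1.0
    # d log p(y_t) / d logits_t for a softmax policy is one_hot - probs
    grad_log_prob = one_hot - probs
    # every step is weighted by the same (reward - baseline) factor
    return (reward - baseline) * grad_log_prob

# Toy usage: 2 steps, vocabulary of 3, uniform policy
grad = reinforce_gradient(np.zeros((2, 3)), [0, 2], reward=0.9, baseline=0.5)
```

In this sketch a response scored above the baseline pushes the logits of its sampled tokens up, one scored below pushes them down, and a response scored exactly at the baseline produces no update.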
(negative reward)
(negative reward)
(neutral reward) (negative reward)
Average reward
Average reward
score for each partial sequence
Partially-generated sequence
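One way to read these two labels: rather than a single score for the finished response, a score is produced for each partially generated sequence, so that each token can receive its own reward. The sketch below only illustrates that idea. `partial_sequence_rewards` is a hypothetical helper, and `toy_score` is a made-up stand-in for a discriminator (it simply measures the fraction of tokens outside a small "dull" set); neither is taken from the slides.

```python
def partial_sequence_rewards(tokens, score_fn):
    """Score every prefix tokens[:1], tokens[:2], ..., tokens[:T]."""
    return [score_fn(tokens[:t]) for t in range(1, len(tokens) + 1)]

# Hypothetical stand-in for a discriminator applied to a partial response
DULL = {"don't", "know"}

def toy_score(prefix):
    """Fraction of tokens in the prefix that are not in the DULL set."""
    return sum(tok not in DULL for tok in prefix) / len(prefix)

rewards = partial_sequence_rewards(["i", "don't", "know"], toy_score)
```

With per-prefix scores, the first token of a dull reply can be treated differently from the tokens that make it dull, instead of all tokens sharing one sequence-level reward.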
“having a teacher intervene and force it to generate true responses”
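The quoted idea can be sketched as an extra update in which the generator is trained directly on the human response. The example below is a rough illustration under assumptions: the generator is again reduced to a toy array of per-step logits, and `teacher_forcing_loss` is a hypothetical name for the mean negative log-likelihood of the human tokens, which is one common way to implement such an update and is not confirmed by the slides.

```python
import numpy as np

def teacher_forcing_loss(logits, human_tokens):
    """Mean negative log-likelihood of the human response.

    logits:       (T, V) array of per-step vocabulary logits (toy generator)
    human_tokens: length-T sequence of token ids from the true response
    """
    z = logits - logits.max(axis=-1, keepdims=True)
    # log-softmax computed from the shifted logits
    log_probs = z - np.log(np.exp(z).sum(axis=-1, keepdims=True))
    picked = log_probs[np.arange(len(human_tokens)), human_tokens]
    return -picked.mean()

# Toy usage: a uniform policy over 4 tokens has loss log(4) on any response
uniform_loss = teacher_forcing_loss(np.zeros((2, 4)), [1, 3])
```

Minimizing this loss raises the probability of the true response regardless of what the discriminator currently says, which gives the generator a training signal even when its own samples all receive low scores.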