Improving Neural Language Modeling via Adversarial Training
Dilin Wang*, Chengyue Gong*, Qiang Liu (* equal contribution)
Department of Computer Science The University of Texas at Austin
Dilin Wang*, Chengyue Gong*, Qiang Liu Adversarial Softmax 1 / 8
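$$p(x_t \mid x_{1:t-1}; \theta, w) = \frac{\exp(w_{x_t}^\top h_t)}{\sum_{\ell=1}^{|V|} \exp(w_\ell^\top h_t)}$$
$$\max_{\theta, w} \sum_t \log p(x_t \mid x_{1:t-1}; \theta, w)$$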
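As a rough illustration of the softmax maximum-likelihood objective on this slide, the sketch below evaluates the summed log-probability of the observed words with NumPy. The function name `softmax_log_likelihood` and the array layout (`W` holding one output embedding per row, `H` holding one hidden state per time step) are assumptions made for this example and do not come from the slides.

```python
import numpy as np

def softmax_log_likelihood(W, H, targets):
    """Sum over t of log p(x_t | h_t) for a softmax output layer.

    W: (V, d) output word embeddings w_l (hypothetical layout)
    H: (T, d) hidden states h_t
    targets: (T,) integer word ids x_t
    """
    logits = (H @ W.T).astype(float)                      # [t, l] = w_l^T h_t
    logits = logits - logits.max(axis=1, keepdims=True)   # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return log_probs[np.arange(len(targets)), targets].sum()
```

Training would maximize this quantity over the network parameters and the embeddings; the sketch only evaluates it for fixed arrays.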
[Figure: AWD-LSTM training and validation curves]
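$$\max_{\theta, w} \min_{\{\delta_t\}} \sum_t \log \frac{\exp((w_{x_t} + \delta_t)^\top h_t)}{\exp((w_{x_t} + \delta_t)^\top h_t) + \sum_{j \neq x_t} \exp(w_j^\top h_t)}$$
$$\delta_t^* = \arg\min_{\|\delta_t\| \le \epsilon} (w_{x_t} + \delta_t)^\top h_t = -\epsilon \, \frac{h_t}{\|h_t\|}$$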
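Because the worst-case perturbation of the target word's embedding within a norm ball of radius ǫ has a closed form (it points opposite to the hidden state), the adversarial objective can be evaluated without an inner optimization loop: the target logit is simply lowered by ǫ times the norm of the hidden state. The following is a minimal sketch under that reading, not the authors' implementation; the function name `adversarial_log_likelihood`, the argument `eps`, and the array layout are assumptions for illustration.

```python
import numpy as np

def adversarial_log_likelihood(W, H, targets, eps):
    """Log-likelihood with the target embedding adversarially perturbed.

    For each step t, the perturbation -eps * h_t / ||h_t|| is applied to
    the target embedding, so the target logit becomes
    w_{x_t}^T h_t - eps * ||h_t||. Other logits are left unchanged.
    W: (V, d), H: (T, d), targets: (T,) integer ids (hypothetical layout).
    """
    idx = np.arange(len(targets))
    logits = (H @ W.T).astype(float)                      # [t, j] = w_j^T h_t
    logits[idx, targets] -= eps * np.linalg.norm(H, axis=1)
    logits = logits - logits.max(axis=1, keepdims=True)   # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return log_probs[idx, targets].sum()
```

With `eps = 0` this reduces to the ordinary softmax log-likelihood; with `eps > 0` the value is strictly lower whenever the hidden states are nonzero, which is the quantity the outer maximization would then push up.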
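$$\min_{\|\delta_i\| \le \epsilon} (w_i + \delta_i)^\top h = w_i^\top h - \epsilon \|h\|$$
$$w_i^\top h - \epsilon \|h\| > \max_{j \neq i} w_j^\top h \;\Longrightarrow\; \min_{j \neq i} \|w_j - w_i\| > \epsilon$$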
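One way to see why the margin condition forces the embeddings apart is a short Cauchy-Schwarz argument. The derivation below is a sketch that assumes h is nonzero and that word i still has the largest logit after its worst-case perturbation; the symbols follow the slide.

```latex
\begin{align*}
w_i^\top h - \epsilon \|h\| > w_j^\top h
  &\;\Longrightarrow\; (w_i - w_j)^\top h > \epsilon \|h\| \\
(w_i - w_j)^\top h \le \|w_i - w_j\| \, \|h\|
  &\quad \text{(Cauchy-Schwarz)} \\
\|w_i - w_j\| \, \|h\| > \epsilon \|h\|
  &\;\Longrightarrow\; \|w_i - w_j\| > \epsilon
  \quad \text{for every } j \neq i .
\end{align*}
```

In words: if a word is still predicted correctly under the adversarial perturbation, every other word embedding must lie more than ǫ away from it, which is the diversity-promoting effect.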
(Merity et al., 2017)
Vaswani et al., 2017
1. A closed-form solution that is easy to implement
2. Diversity promotion
3. Strong empirical results