Translating Neuralese
Jacob Andreas, Anca Dragan, and Dan Klein
Translating Neuralese Jacob Andreas, Anca Dragan, and Dan Klein - - PowerPoint PPT Presentation
Translating Neuralese Jacob Andreas, Anca Dragan, and Dan Klein Learning to Communicate [Wagner et al. 03, Sukhbaatar et al. 16, Foerster et al. 16] 2 Learning to Communicate 3 Neuralese 1.0 2.3 -0.3 0.4 -1.2 1.1 4 Translating
Jacob Andreas, Anca Dragan, and Dan Klein
Learning to Communicate
2 [Wagner et al. 03, Sukhbaatar et al. 16, Foerster et al. 16]
Learning to Communicate
3
Neuralese
4
1.0 2.3
Translating neuralese
1.0 2.3
all clear
5
autonomous systems
Translating neuralese
1.0 2.3
all clear
[Lazaridou et al. 16] 6
Outline
Natural language & neuralese Statistical machine translation Semantic machine translation Implementation details Evaluation
7
Outline
Natural language & neuralese Statistical machine translation Semantic machine translation Implementation details Evaluation
8
Outline
Natural language & neuralese Statistical machine translation Semantic machine translation Implementation details Evaluation
9
Outline
Natural language & neuralese Statistical machine translation Semantic machine translation Implementation details Evaluation
10 10
Outline
Natural language & neuralese Statistical machine translation Semantic machine translation Implementation details Evaluation
11
Outline
Natural language & neuralese Statistical machine translation Semantic machine translation Implementation details Evaluation
12
A statistical MT problem
13
a a
a
1.0 2.3
all clear
[e.g. Koehn 10]
A statistical MT problem
14
How do we induce a translation model?
A statistical MT problem
15
a a
a
a
a
Strategy mismatch
16
ζ(s) = 1 Γ(s) ∞ 1 ex − 1xs dx x
Strategy mismatch
17
not sure
ζ(s) = 1 Γ(s) ∞ 1 ex − 1xs dx x
Strategy mismatch
18
not sure dunno
Strategy mismatch
19
not sure dunno yes
Strategy mismatch
20
not sure dunno yes yes no yes
Strategy mismatch
21
not sure yes
Stat MT criterion doesn’t capture meaning
22
moving (0,3)→(1,4) In the intersection
Outline
Natural language & neuralese Statistical machine translation Semantic machine translation Implementation details Evaluation
23
A “semantic MT” problem
24
I’m going north
The meaning of an utterance is given by its truth conditions
[Davidson 67]
A “semantic MT” problem
25
I’m going north
The meaning of an utterance is given by its truth conditions
[Davidson 67]
A “semantic MT” problem
26
(loc (goal blue) north) I’m going north
The meaning of an utterance is given by its truth conditions
A “semantic MT” problem
27
0.4 0.2 0.001
I’m going north
The meaning of an utterance is given by its truth conditions the distribution over states in which it is uttered
[Beltagy et al. 14]
A “semantic MT” problem
28
0.4 0.2 0.001
I’m going north
The meaning of an utterance is given by its truth conditions the distribution over states in which it is uttered
[Frank et al. 09, A & Klein 16]
Representing meaning
29
The meaning of an utterance is given by its truth conditions the distribution over states in which it is uttered
Representing meaning
30
The meaning of an utterance is given by its truth conditions the distribution over states in which it is uttered
This distribution is well-defined even if the “utterance” is a vector rather than a sequence of tokens.
Translating with meaning
31
1.0 2.3
Translating with meaning
32
1.0 2.3
In the intersection
Translating with meaning
33
1.0 2.3
I’m going north
Translating with meaning
34
1.0 2.3
I’m going north
p( | ) p( | )
a
Translating with meaning
35
1.0 2.3
I’m going north
a
Interlingua!
36
source text target text
Translation criterion
a
37
Translation criterion
a
38
Translation criterion
a
39
Translation criterion
a
40
Computing representations
argmin
a KL( || )
a
41
Computing representations: sparsity
argmin
a KL( || )
a
p( | )
a
p( | )
42
Computing representations: smoothing
argmin
a KL( || )
a
actions & messages agent policy
43
argmin
a KL( || )
a
actions & messages agent policy agent model
44
Computing representations: smoothing
argmin
a KL( || )
a
actions & messages human
45
Computing representations: smoothing
argmin
a KL( || )
a
actions & messages human policy human model
46
Computing representations: smoothing
argmin
a KL( || )
a
0.10 0.05 0.13 0.08 0.01 0.22
a
47
Computing representations: smoothing
Computing KL
argmin
a KL( || )
a
48
Computing KL
argmin
a KL( || )
a
49
p
Computing KL: sampling
argmin
a KL( || )
a
50
i i i i
Finding translations
argmin
a KL( || )
a
51
Finding translations: brute force
argmin
a KL( || )
a
going north crossing the intersection I’m done after you 0.5 2.3 0.2 9.7
52
argmin
a KL( || )
a
going north crossing the intersection I’m done after you 0.5 2.3 0.2 9.7
53
Finding translations: brute force
Finding translations
a
54
Outline
Natural language & neuralese Statistical machine translation Semantic machine translation Implementation details Evaluation
55
Referring expression games
56
1.0 2.3
with black face
Evaluation: translator-in-the-loop
57
1.0 2.3
with black face
58
1.0 2.3
with black face
Evaluation: translator-in-the-loop
Experiment: color references
59
Experiment: color references
0.50 1.00 Neuralese → Neuralese English → English* 0.83
60
0.50 1.00 Neuralese → English* English → Neuralese Neuralese → Neuralese English → English* Statistical MT 0.83
0.72 0.70
61
Experiment: color references
0.50 1.00 Neuralese → English* English → Neuralese Neuralese → Neuralese English → English* Statistical MT Semantic MT
0.72 0.70 0.86 0.73
0.83
62
Experiment: color references
magenta, hot, rose magenta, hot, violet
pinkish, grey, dull
63
Experiment: color references
magenta, hot, rose magenta, hot, violet
pinkish, grey, dull
64
Experiment: color references
magenta, hot, rose magenta, hot, violet
pinkish, grey, dull
65
Experiment: color references
magenta, hot, rose magenta, hot, violet
pinkish, grey, dull
66
Experiment: color references
magenta, hot, rose magenta, hot, violet
pinkish, grey, dull
67
Experiment: color references
magenta, hot, rose magenta, hot, violet
pinkish, grey, dull
68
Experiment: color references
Experiment: image references
50 95 Neuralese → English* English → Neuralese Neuralese → Neuralese English → English* Statistical MT Semantic MT 77
72 70 86 73
69
large bird, black wings, black crown
large bird, black wings, black crown small brown, light brown, dark brown
70
Experiment: image references
Experiment: driving game
1.35 1.93 Neuralese ↔ English* Neuralese → Neuralese Statistical MT Semantic MT
1.49 1.54
71
un-language-like things (e.g. RNN states)
without logical forms if we have access to world states
Conclusions
72
non-language-like things (e.g. RNN states)
without logical forms if we have access to world states
Conclusions
73
non-language-like things (e.g. RNN states)
without logical forms if we have access to world states
Conclusions
74
What about compositionality?
Jacob Andreas and Dan Klein
75
1.0 2.3
Thank you!