Overview RSA TUNA Learned RSA Experiments Conclusion
Learning in the Rational Speech Acts Model
Christopher Potts
Stanford Linguistics
Paper: http://arxiv.org/abs/1510.06807
Will Monroe
Bayesian pragmatic models
design Machine learning
experiences in training
Goal: combine the best aspects of both to achieve broader coverage and novel pragmatic insights
1. The Rational Speech Acts (RSA) model
2. TUNA
3. Learned RSA
4. Experiments
Chat80:
    for w ∈ Words
        for X ∈ Categories
            for d ∈ Domain
                yield ‘X → w : d’

Candidate rules (observation counts at left):

    3  N → dog : dog
    1  V → dog : dogv
       N → dog : cat
    2  N → cat : cat
       N → cat : dog
       V → jump : dog
    1  V → jump : jump

Observed items:

    N dog : dog
    N dog : dog
    V dog : dogv
    V jump : jump
    N cat : cat
    N cat : cat
    N dog : dog
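The enumeration and counting above can be sketched in a few lines of Python. This is a minimal illustration, not the method of any cited system: the word, category and domain inventories are assumptions read off the example rules, and the names `candidate_rules`, `observed` and `lexicon` are hypothetical.

```python
from collections import Counter
from itertools import product

# Inventories read off the example rules (assumed, not given explicitly).
words = ["dog", "cat", "jump"]
categories = ["N", "V"]
domain = ["dog", "dogv", "cat", "jump"]

def candidate_rules(words, categories, domain):
    """Yield every (category, word, denotation) triple, i.e. 'X → w : d'."""
    for w, x, d in product(words, categories, domain):
        yield (x, w, d)

# Observed items from the example: three N dog:dog, one V dog:dogv,
# one V jump:jump, two N cat:cat.
observed = (
    [("N", "dog", "dog")] * 3
    + [("V", "dog", "dogv")]
    + [("V", "jump", "jump")]
    + [("N", "cat", "cat")] * 2
)

# Attach a count to every candidate rule; unobserved rules get 0.
counts = Counter(observed)
lexicon = {rule: counts[rule] for rule in candidate_rules(words, categories, domain)}

for (x, w, d), n in sorted(lexicon.items(), key=lambda item: -item[1]):
    if n:
        print(f"{n}  {x} → {w} : {d}")
```

The full cross-product here has 24 candidate rules, of which only four are ever observed; the counts are what a learner could use to down-weight the rest.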
Zettlemoyer & Collins 2005:
Logical form: arg max(λx.state(x) ∧ borders(x, texas), λx.size(x))

Rules: Input Trigger | Rules: Output Category | Categories produced from logical form
constant c | NP : c | NP : texas
arity one predicate p1 | N : λx.p1(x) | N : λx.state(x)
arity one predicate p1 | S\NP : λx.p1(x) | S\NP : λx.state(x)
arity two predicate p2 | (S\NP)/NP : λx.λy.p2(y, x) | (S\NP)/NP : λx.λy.borders(y, x)
arity two predicate p2 | (S\NP)/NP : λx.λy.p2(x, y) | (S\NP)/NP : λx.λy.borders(x, y)
arity one predicate p1 | N/N : λg.λx.p1(x) ∧ g(x) | N/N : λg.λx.state(x) ∧ g(x)
literal with arity two predicate p2 and constant second argument c | N/N : λg.λx.p2(x, c) ∧ g(x) | N/N : λg.λx.borders(x, texas) ∧ g(x)
arity two predicate p2 | (N\N)/NP : λx.λg.λy.p2(x, y) ∧ g(x) | (N\N)/NP : λg.λx.λy.borders(x, y) ∧ g(x)
an arg max / min with second argument arity one function f | NP/N : λg. arg max / min(g, λx.f(x)) | NP/N : λg. arg max(g, λx.size(x))
an arity one numeric-ranged function f | S/NP : λx.f(x) | S/NP : λx.size(x)
Cooperative principle: Make your contribution as is required, when it is required, by the conversation in which you are engaged.
Speaker S saying U to listener L conversationally implicates q iff
1. S and L mutually, publicly presume that S is cooperative.
2. To maintain (1) given U, it must be supposed that S thinks q.
3. S thinks that both S and L mutually, publicly presume that L is willing and able to work out that (2) holds.

Ann: What city does Paul live in?
Bob: Hmm . . . he lives in California.
(A) Assume Bob is cooperative.
(B) Bob supplied less information than required; clash with (A).
(C) Assume Bob does not know which city Paul lives in.
(D) Then Bob’s answer is optimal given his evidence.
l1(w | msg, Lex) ∝ s1(msg | w, Lex)P(w)
s1(msg | w, Lex) ∝ exp λ (log l0(w | msg, Lex) − C(msg))
l0(w | msg, Lex) ∝ Lex(msg, w)P(w)
l1(w | msg, Lex) = pragmatic speaker × state prior
s1(msg | w, Lex) = literal listener − message costs
l0(w | msg, Lex) = lexicon × state prior
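The three definitions above translate fairly directly into matrix operations. The following is a minimal NumPy sketch under stated assumptions: the lexicon is a 0/1 matrix with one row per message and one column per world, every message is true in at least one world, every world has at least one true message, and λ is positive. The function names and the two-message example are hypothetical and not taken from the slides.

```python
import numpy as np

def normalize(matrix, axis):
    """Rescale so that entries sum to 1 along the given axis."""
    return matrix / matrix.sum(axis=axis, keepdims=True)

def rsa_listener_first(lex, prior, costs, lam=1.0):
    """lex: (messages x worlds) truth values; prior: P(w); costs: C(msg)."""
    # l0(w | msg, Lex) ∝ Lex(msg, w) P(w): one row per message.
    l0 = normalize(lex * prior, axis=1)
    # s1(msg | w, Lex) ∝ exp λ (log l0(w | msg, Lex) − C(msg)): one row per world.
    with np.errstate(divide="ignore"):  # log(0) = -inf is intended for false messages
        utility = lam * (np.log(l0) - costs[:, None])
    s1 = normalize(np.exp(utility).T, axis=1)
    # l1(w | msg, Lex) ∝ s1(msg | w, Lex) P(w): one row per message.
    l1 = normalize(s1.T * prior, axis=1)
    return l0, s1, l1

# Hypothetical two-message, two-world example (not from the slides).
messages = ["some", "all"]
worlds = ["some-but-not-all", "all"]
lex = np.array([[1.0, 1.0],   # "some" is true in both worlds
                [0.0, 1.0]])  # "all" is true only in the "all" world
prior = np.array([0.5, 0.5])
costs = np.zeros(2)

l0, s1, l1 = rsa_listener_first(lex, prior, costs)
print(dict(zip(worlds, l1[0])))  # pragmatic listener's beliefs after hearing "some"
```

With a uniform prior, zero costs and λ = 1, the literal listener splits "some" evenly between the two worlds, while the pragmatic listener assigns 0.75 to the some-but-not-all world.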
Bergen, Levy, Goodman, ‘Pragmatic reasoning through semantic inference’
Potts, Lassiter, Levy, Frank, ‘Embedded implicatures as pragmatic inferences under compositional lexical uncertainty’
Kao, Wu, Bergen, Goodman, ‘Nonliteral understanding of number words’
Potts & Levy, ‘Negotiating lexical uncertainty and speaker expertise with disjunction’
s1(msg | w, Lex) ∝ exp λ (log l1(w | msg, Lex) − C(msg))
l1(w | msg, Lex) ∝ s0(msg | w, Lex)P(w)
s0(msg | w, Lex) ∝ exp λ (log Lex(msg, w) − C(msg))
s1(msg | w, Lex) = pragmatic listener − message costs
l1(w | msg, Lex) = literal speaker × state prior
s0(msg | w, Lex) = lexicon − message costs
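The speaker-first recursion above differs from the listener-first one only in where it starts. A minimal NumPy sketch follows, under the same assumptions as any small matrix implementation: a 0/1 lexicon with messages as rows and worlds as columns, at least one true message per world, and positive λ. The function names and the toy lexicon are hypothetical.

```python
import numpy as np

def normalize(matrix, axis):
    """Rescale so that entries sum to 1 along the given axis."""
    return matrix / matrix.sum(axis=axis, keepdims=True)

def soft_speaker(scores, costs, lam):
    """Map (messages x worlds) scores to a (worlds x messages) speaker."""
    with np.errstate(divide="ignore"):  # log(0) = -inf rules out false messages
        utility = lam * (np.log(scores) - costs[:, None])
    return normalize(np.exp(utility).T, axis=1)

def rsa_speaker_first(lex, prior, costs, lam=1.0):
    # s0(msg | w, Lex) ∝ exp λ (log Lex(msg, w) − C(msg))
    s0 = soft_speaker(lex, costs, lam)
    # l1(w | msg, Lex) ∝ s0(msg | w, Lex) P(w): one row per message.
    l1 = normalize(s0.T * prior, axis=1)
    # s1(msg | w, Lex) ∝ exp λ (log l1(w | msg, Lex) − C(msg))
    s1 = soft_speaker(l1, costs, lam)
    return s0, l1, s1

# Hypothetical toy lexicon: rows are messages, columns are worlds.
lex = np.array([[1.0, 1.0],   # message 0 is true in both worlds
                [0.0, 1.0]])  # message 1 is true only in world 1
prior = np.array([0.5, 0.5])
costs = np.zeros(2)

s0, l1, s1 = rsa_speaker_first(lex, prior, costs)
print(s1)  # row w gives the pragmatic speaker's distribution over messages
```

In this toy case the literal speaker in world 1 is indifferent between the two messages, and the pragmatic speaker shifts to 0.75 for the more specific message.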
speaker rationality
chances to learn from data
colour | size | type | x-dimension | y-dimension
green | small | fan | 1 | 1
green | small | sofa | 1 | 2
red | large | fan | 1 | 3
red | large | sofa | 2 | 1
blue | large | fan | 2 | 2
blue | large | sofa | 3 | 1
blue | small | fan | 3 | 3

Utterance: “blue fan small”
Utterance attributes: [colour:blue]; [size:small]; [type:fan]
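One way to make the furniture trial concrete is to store each entity as a dictionary of attribute values and ask which entities satisfy all of the utterance's attributes. This is only an illustrative sketch: the dictionary representation and the helper name `referents` are assumptions, not the TUNA corpus format.

```python
# The seven entities of the furniture trial shown above.
furniture_trial = [
    {"colour": "green", "size": "small", "type": "fan",  "x-dimension": 1, "y-dimension": 1},
    {"colour": "green", "size": "small", "type": "sofa", "x-dimension": 1, "y-dimension": 2},
    {"colour": "red",   "size": "large", "type": "fan",  "x-dimension": 1, "y-dimension": 3},
    {"colour": "red",   "size": "large", "type": "sofa", "x-dimension": 2, "y-dimension": 1},
    {"colour": "blue",  "size": "large", "type": "fan",  "x-dimension": 2, "y-dimension": 2},
    {"colour": "blue",  "size": "large", "type": "sofa", "x-dimension": 3, "y-dimension": 1},
    {"colour": "blue",  "size": "small", "type": "fan",  "x-dimension": 3, "y-dimension": 3},
]

def referents(attributes, entities):
    """Return the entities for which every attribute:value pair holds."""
    return [e for e in entities
            if all(e.get(key) == value for key, value in attributes.items())]

# Attributes annotated for the utterance "blue fan small".
utterance = {"colour": "blue", "size": "small", "type": "fan"}

matches = referents(utterance, furniture_trial)
print(matches)
print(len(referents({"colour": "blue", "type": "fan"}, furniture_trial)))
```

Dropping [size:small] leaves two blue fans, so in this trial all three attributes are needed to pick out a single entity.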
age | hairColour | hasBeard | hasGlasses | hasHair | hasShirt | hasSuit | hasTie | type | x-dimension | y-dimension
old | light | 1 | 0 | 0 | 1 | 0 | 0 | person | 1 | 1
young | dark | 0 | 0 | 1 | 1 | 0 | 0 | person | 1 | 2
young | dark | 1 | 0 | 1 | 1 | 0 | 1 | person | 2 | 1
young | dark | 1 | 0 | 1 | 0 | 1 | 1 | person | 2 | 2
young | dark | 0 | 0 | 1 | 0 | 1 | 1 | person | 3 | 1
young | dark | 1 | 0 | 1 | 1 | 0 | 0 | person | 3 | 2
young | dark | 0 | 0 | 1 | 0 | 1 | 1 | person | 3 | 3

Utterance: The bald man with a beard
Utterance attributes: [hasBeard:1]; [hasHair:0]; [type:person]
Multiset Dice:

Dice(a, b) = 2 · Σ_{x∈D} min(μ_a(x), μ_b(x)) / (|a| + |b|)

where D is the set of attributes, μ_a(x) is the number of times x occurs in multiset a, and |a| is the size of a.

Multiset intersection cardinality    Multiset union cardinality

[a b c a]   actual: [a b c a]   = 1
[a b c a]   actual: [a b c …
[a b c]     actual: [a b c a]   = .86
…           actual: [a b c a]   = .4
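The multiset Dice score can be computed with Python's `collections.Counter`, whose `&` operator takes element-wise minimum counts. This is a small sketch, assuming the score is twice the multiset intersection size divided by the summed sizes of the two multisets, which is consistent with the 1 and .86 examples; the function name is hypothetical, the empty-input convention is an assumption, and the single-attribute prediction in the last call is an illustrative input, not one recovered from the slide.

```python
from collections import Counter

def multiset_dice(predicted, actual):
    """Dice coefficient between two attribute multisets (lists with repeats)."""
    p, a = Counter(predicted), Counter(actual)
    # Counter & Counter keeps the minimum count of each shared element.
    overlap = sum((p & a).values())
    total = sum(p.values()) + sum(a.values())
    # Assumption: two empty multisets count as a perfect match.
    return 2 * overlap / total if total else 1.0

actual = ["a", "b", "c", "a"]
print(multiset_dice(["a", "b", "c", "a"], actual))       # identical multisets
print(round(multiset_dice(["a", "b", "c"], actual), 2))  # one attribute missing
print(multiset_dice(["a"], actual))                      # hypothetical one-attribute prediction
```

Because counts matter, a prediction that mentions "a" once is penalized against a reference that mentions it twice, which a set-based Dice would not capture.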
Target: colour:blue size:small type:fan x-dimension:3 y-dimension:3
Utterance attributes: [colour:blue] [size:small] [type:fan]

Features:
    colour:blue ∧ [colour:blue]
    colour:blue ∧ [size:small]
    colour:blue ∧ [type:fan]
    . . .

Generation features:
    color
    type + color
    color + ¬size
    attribute-count = 3
    type ≫ color ≫ size
    type ≫ orientation ≫ color ≫ size
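A rough sketch of how such features might be generated for one target and utterance is given below. It covers only the cross-product features, the single-attribute and attribute-pair features, and the attribute-count feature; the ordering features (those written with ≫) are left out because their exact definition is not recoverable from the slide. Function names, the feature-string format and the attribute inventory are assumptions for illustration.

```python
from itertools import product

def cross_product_features(target, utterance):
    """Pair every attribute of the target with every attribute in the utterance."""
    return {f"{tk}:{tv} ∧ [{uk}:{uv}]"
            for (tk, tv), (uk, uv) in product(target.items(), utterance)}

def generation_features(utterance, inventory):
    """Features of the utterance alone: which attribute types it mentions."""
    used = {key for key, _ in utterance}
    feats = set(used)                                  # e.g. "colour"
    feats.add(f"attribute-count = {len(utterance)}")   # e.g. "attribute-count = 3"
    for a, b in product(inventory, repeat=2):
        if a in used and b in used and a < b:
            feats.add(f"{a} + {b}")                    # both attributes mentioned
        elif a in used and b not in used:
            feats.add(f"{a} + ¬{b}")                   # a mentioned, b omitted
    return feats

target = {"colour": "blue", "size": "small", "type": "fan",
          "x-dimension": 3, "y-dimension": 3}
utterance = [("colour", "blue"), ("size", "small"), ("type", "fan")]
inventory = ["colour", "size", "type", "x-dimension", "y-dimension"]

cross = cross_product_features(target, utterance)
gen = generation_features(utterance, inventory)
print(len(cross), sorted(gen))
```

With five target attributes and three utterance attributes, the cross-product yields fifteen features, which is how a learner can recover word meanings without a hand-built lexicon.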
[Figure: learned speaker model. Candidate messages (“beard”, “guy with the beard”, “guy with glasses”, ...) are scored from features ϕ and weights θ, with a term of the form exp[θᵀϕ(t, m)]. For the observed message “guy with the beard”, training uses the gradient ∂/∂θ log S1(m | t, θ).]
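To illustrate the kind of computation involved, here is a minimal NumPy sketch of a log-linear speaker S0(m | t, θ) ∝ exp(θᵀϕ(t, m)) and the gradient of its log-probability for an observed message. It deliberately stops at S0: the gradient of log S1 additionally passes through the listener step and is not shown. The feature matrix below is random and purely hypothetical, and the function names are assumptions.

```python
import numpy as np

def s0(theta, phi):
    """Log-linear speaker for a fixed target t.

    phi has one row of features ϕ(t, m) per candidate message m.
    """
    scores = phi @ theta
    scores = scores - scores.max()   # subtract the max for numerical stability
    probs = np.exp(scores)
    return probs / probs.sum()

def grad_log_s0(theta, phi, m):
    """∂/∂θ log S0(m | t, θ): observed features minus expected features."""
    return phi[m] - s0(theta, phi) @ phi

# Hypothetical setup: 3 candidate messages, 4 features each.
rng = np.random.default_rng(0)
phi = rng.normal(size=(3, 4))
theta = rng.normal(size=4)
observed = 1  # index of the message the speaker actually produced

# One step of gradient ascent on the log-likelihood of the observed message.
before = np.log(s0(theta, phi)[observed])
theta_new = theta + 0.1 * grad_log_s0(theta, phi, observed)
after = np.log(s0(theta_new, phi)[observed])
print(before, after)
```

The gradient has the familiar "observed minus expected features" form of log-linear models, so a small ascent step raises the probability of the observed message.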
Goal | Features
Avoid hand-built lexicon | Cross-product features
Learn quirks of production | Features like color
Learn attribute hierarchies | Attribute-pair features like color + ¬size
Learn message costs | Length features and others

Cognitive and linguistic insights combined with learning
Train
Test

RSA | Learned S0 | Learned S1
.08 .25 | .03 .00 | .10 .11
.08 .25 | .22 .10 | .16 .13
.17 .00 | .03 .00 | .11 .07
.08 .25 | .03 .04 | .08 .17
.17 .00 | .22 .01 | .18 .08
.08 .25 | .22 .74 | .12 .19
.17 .00 | .03 .00 | .10 .11
.17 .00 | .22 .10 | .16 .11
Mean Dice (bar chart, axis 0.0 to 1.0)

Model | furniture | people
RSA s1 | 0.522 | 0.254
Learned S0 | 0.812 | 0.73
Learned S1 | 0.788 | 0.764
[Figure: bar chart of underproductions of the attributes [type:person] and [hasBeard:true] for RSA s1, Learned S0 and Learned S1; axis ticks from 50 to 350. Lower is better.]
language generation synthesized with RSA
context-dependent disambiguation in one process