Multi-Hop RC, HotpotQA & GNNs
Select, Answer and Explain: Interpretable Multi-hop Reading Comprehension over Multiple Documents – Tu et al., AAAI 2020 Presented By: Lovish Madaan
Multi-Hop RC, HotpotQA & GNNs Select, Answer and Explain: - - PowerPoint PPT Presentation
Multi-Hop RC, HotpotQA & GNNs Select, Answer and Explain: Interpretable Multi-hop Reading Comprehension over Multiple Documents Tu et al., AAAI 2020 Presented By: Lovish Madaan References HotpotQA - Peng Qi (Stanford) GNNs -
Select, Answer and Explain: Interpretable Multi-hop Reading Comprehension over Multiple Documents – Tu et al., AAAI 2020 Presented By: Lovish Madaan
In which city was Facebook first launched? Cambridge, Massachusetts. This is because Mark Zuckerberg and his business partners launched it from his Harvard dormitory [1], and Harvard is located in Cambridge, Massachusetts [2].
[1] https://en.wikipedia.org/wiki/Mark_Zuckerberg [2] https://en.wikipedia.org/wiki/Harvard_University
In which city was Facebook first launched? Cambridge, Massachusetts. This is because Mark Zuckerberg and his business partners launched it from his Harvard dormitory [1], and Harvard is located in Cambridge, Massachusetts [2].
[1] https://en.wikipedia.org/wiki/Mark_Zuckerberg [2] https://en.wikipedia.org/wiki/Harvard_University
Sorry, folks from Google!
In which city was Facebook first launched? Cambridge, Massachusetts. This is because Mark Zuckerberg and his business partners launched it from his Harvard dormitory [1], and Harvard is located in Cambridge, Massachusetts [2].
[1] https://en.wikipedia.org/wiki/Mark_Zuckerberg [2] https://en.wikipedia.org/wiki/Harvard_University
Multi-hop reasoning
In which city was Facebook first launched? Cambridge, Massachusetts. This is because Mark Zuckerberg and his business partners launched it from his Harvard dormitory [1], and Harvard is located in Cambridge, Massachusetts [2].
[1] https://en.wikipedia.org/wiki/Mark_Zuckerberg [2] https://en.wikipedia.org/wiki/Harvard_University
Multi-hop reasoning Text-based, diverse
In which city was Facebook first launched? Cambridge, Massachusetts. This is because Mark Zuckerberg and his business partners launched it from his Harvard dormitory [1], and Harvard is located in Cambridge, Massachusetts [2].
[1] https://en.wikipedia.org/wiki/Mark_Zuckerberg [2] https://en.wikipedia.org/wiki/Harvard_University
Multi-hop reasoning Explainability Text-based, diverse
Multi-hop reasoning Explainability Text-based, diverse Comparison Questions HotpotQA
TriviaQA, etc) When was Chris Martin born?
When was the lead singer of Coldplay born?
(Rajpurkar et al., 2016; Joshi et al., 2017; Dunn et al., 2017)
Sup fact 1 Sup fact 2 Answer Answer
graph and realize multi-hop reasoning
with [CLS] embeddings as features
interactions
X – matrix of [CLS] embeddings of question/document pairs
Output is the matrix of modified [CLS] embeddings having contextual information
𝑘
0 𝑗𝑔 𝑇 𝐸𝑗 ≤ 𝑇(𝐸
𝑘)
𝑜
𝑘=0,𝑘≠𝑗
𝑗
𝑚𝑗,𝑘log(𝑄 𝐸𝑗, 𝐸
𝑘 ) + (1 − 𝑚𝑗,𝑘)log(1 − 𝑄 𝐸𝑗, 𝐸 𝑘 )
𝑜 Ι 𝑄 𝐸𝑗, 𝐸 𝑘 > 0.5 - Relevance score for each document
BERT 𝐼𝑗 ∈ ℝ𝑀 × 𝑒
2-Layer MLP (𝑔
𝑡𝑞𝑏𝑜)
𝑍 ∈ ℝ𝑀 × 2
2-layer MLP (𝑔
𝑏𝑢𝑢)
[0/1 Label] Weighted Representation
phrases (can be different) present in the question (Type 2)
entity / noun phrase (Type 3)
11 Jure Leskovec, Stanford University
Images T ext/Speech
average of neighbor’s previous layer embeddings
Tutorial on Graph Representation Learning, AAAI 2019 19
Initial “layer 0” embeddings are equal to node features kth layer embedding
non-linearity (e.g., ReLU or tanh) previous layer embedding of v
h0
v = xv
hk
v = σ
@ W k X
u2 N (v)
hk− 1
u
|
v
1 A , 8 k > 0
Tutorial on Graph Representation Learning, AAAI 2019 60
Sum over all neighbors (and the node itself) Non-linearity
v = σ(
X
u2 N (v)[ { v}
↵ v,uW khk− 1
u
Tutorial on Graph Representation Learning, AAAI 2019 20
u
Tutorial on Graph Representation Learning, AAAI 2019 21
trainable matrices (i.e., what we learn)
h0
v = xv
hk
v = σ
@Wk X
u2N(v)
hk− 1
u
| N(v)| + Bkhk− 1
v
1 A , 8k 2 {1, . . . , K} zv = hK
v
Tutorial on Graph Representation Learning, AAAI 2019 24
L =
X
v2 V
yv log(σ(z>
v ✓)) + (1 − yv) log(1 − σ(z> v ✓))
embedding classification weights node class label
Tutorial on Graph Representation Learning, AAAI 2019 25
u
Tutorial on Graph Representation Learning, AAAI 2019 26
Tutorial on Graph Representation Learning, AAAI 2019 27
Question - “Were Scott Derrickson and Ed Wood of the same nationality?”
retrieved documents [Rajas]
Questions?