Neurally-Guided Structure Inference
Sidi Lu*, Jiayuan Mao*, Josh Tenenbaum, and Jiajun Wu (* indicates equal contributions) http://ngsi.csail.mit.edu
Neurally-Guided Structure Inference http://ngsi.csail.mit.edu Sidi - - PowerPoint PPT Presentation
Neurally-Guided Structure Inference http://ngsi.csail.mit.edu Sidi Lu*, Jiayuan Mao*, Josh Tenenbaum, and Jiajun Wu (* indicates equal contributions) Structure Inference Structure Inference Data [Kemp et al. 2008] Structure Inference Data
Sidi Lu*, Jiayuan Mao*, Josh Tenenbaum, and Jiajun Wu (* indicates equal contributions) http://ngsi.csail.mit.edu
[Kemp et al. 2008]
[Kemp et al. 2008]
[Kemp et al. 2008]
[LeCun et al. 1998]
[Kemp et al. 2008]
[LeCun et al. 1998] [Chen et al. 2016]
Structure Inference
MG+G
GG+G
BG+G
CG+G
M(GMT+G)+G
(MG+G)(GMT+G)+G
(BG+G)(GBT+G)+G
(exp(GG+G)◦G)+G
[Grosse et al. 2012]
Input Matrix 20×3 −1.01 −0.76 1.36 0.48 −1.64 −0.50 ⋮ ⋮ ⋮ −0.24 0.89 −1.12 [Grosse et al. 2012]
Structure !
Input Matrix 20×3 −1.01 −0.76 1.36 0.48 −1.64 −0.50 ⋮ ⋮ ⋮ −0.24 0.89 −1.12 [Grosse et al. 2012]
Structure !
Input Matrix 20×3 −1.01 −0.76 1.36 0.48 −1.64 −0.50 ⋮ ⋮ ⋮ −0.24 0.89 −1.12 [Grosse et al. 2012]
Structure !
Input Matrix 20×3 −1.01 −0.76 1.36 0.48 −1.64 −0.50 ⋮ ⋮ ⋮ −0.24 0.89 −1.12
Cluster
[Grosse et al. 2012]
Structure !
Input Matrix 20×3 −1.01 −0.76 1.36 0.48 −1.64 −0.50 ⋮ ⋮ ⋮ −0.24 0.89 −1.12
Cluster
0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 Cluster Label 20×5 [Grosse et al. 2012]
Structure !
Input Matrix 20×3 −1.01 −0.76 1.36 0.48 −1.64 −0.50 ⋮ ⋮ ⋮ −0.24 0.89 −1.12
Cluster
0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 −1.32 +0.04 +2.01 −0.23 −2.33 −0.30 +0.08 −0.68 +0.95 +0.52 −1.27 −0.54 −0.77 +1.40 −1.66 Cluster Label 20×5 Cluster Center 5×3 [Grosse et al. 2012]
Structure !
Input Matrix 20×3 −1.01 −0.76 1.36 0.48 −1.64 −0.50 ⋮ ⋮ ⋮ −0.24 0.89 −1.12
Cluster
0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 −1.32 +0.04 +2.01 −0.23 −2.33 −0.30 +0.08 −0.68 +0.95 +0.52 −1.27 −0.54 −0.77 +1.40 −1.66 + −0.24 −0.61 −0.04 0.76 0.01 −04 ⋮ ⋮ ⋮ −0.09 0.90 −1.84 Cluster Label 20×5 Cluster Center 5×3 Cluster Noise 20×3
Structure 1! + !
[Grosse et al. 2012]
Structure !
Input Matrix 20×3 −1.01 −0.76 1.36 0.48 −1.64 −0.50 ⋮ ⋮ ⋮ −0.24 0.89 −1.12
Cluster
0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 −1.32 +0.04 +2.01 −0.23 −2.33 −0.30 +0.08 −0.68 +0.95 +0.52 −1.27 −0.54 −0.77 +1.40 −1.66 + −0.24 −0.61 −0.04 0.76 0.01 −04 ⋮ ⋮ ⋮ −0.09 0.90 −1.84 Cluster Label 20×5 Cluster Center 5×3 Cluster Noise 20×3
Structure 1! + !
[Grosse et al. 2012]
Structure !
Input Matrix 20×3 −1.01 −0.76 1.36 0.48 −1.64 −0.50 ⋮ ⋮ ⋮ −0.24 0.89 −1.12
Cluster
0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 −1.32 +0.04 +2.01 −0.23 −2.33 −0.30 +0.08 −0.68 +0.95 +0.52 −1.27 −0.54 −0.77 +1.40 −1.66 + −0.24 −0.61 −0.04 0.76 0.01 −04 ⋮ ⋮ ⋮ −0.09 0.90 −1.84 Cluster Label 20×5 Cluster Center 5×3 Cluster Noise 20×3
Structure 1! + !
[Grosse et al. 2012]
Structure !
Input Matrix 20×3 −1.01 −0.76 1.36 0.48 −1.64 −0.50 ⋮ ⋮ ⋮ −0.24 0.89 −1.12
Cluster
0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 −1.32 +0.04 +2.01 −0.23 −2.33 −0.30 +0.08 −0.68 +0.95 +0.52 −1.27 −0.54 −0.77 +1.40 −1.66 + −0.24 −0.61 −0.04 0.76 0.01 −04 ⋮ ⋮ ⋮ −0.09 0.90 −1.84 Cluster Label 20×5 Cluster Center 5×3 Cluster Noise 20×3
Structure 1! + ! LowRank
[Grosse et al. 2012]
Structure !
Input Matrix 20×3 −1.01 −0.76 1.36 0.48 −1.64 −0.50 ⋮ ⋮ ⋮ −0.24 0.89 −1.12
Cluster
0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 −1.32 +0.04 +2.01 −0.23 −2.33 −0.30 +0.08 −0.68 +0.95 +0.52 −1.27 −0.54 −0.77 +1.40 −1.66 + −0.24 −0.61 −0.04 0.76 0.01 −04 ⋮ ⋮ ⋮ −0.09 0.90 −1.84 Cluster Label 20×5 Cluster Center 5×3 Cluster Noise 20×3
Structure 1! + ! LowRank
Cluster Center 5×2 @(2×3) Cluster Noise 20×3 0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 −1.32 +0.04 +2.01 −0.23 +0.08 −0.68 +0.95 +0.52 −0.77 +1.40 1 −0.22 1 −1.30 + −0.24 −0.61 −0.04 0.76 0.01 −04 ⋮ ⋮ ⋮ −0.09 0.90 −1.84 Cluster Label 20×5 [Grosse et al. 2012]
Structure !
Input Matrix 20×3 −1.01 −0.76 1.36 0.48 −1.64 −0.50 ⋮ ⋮ ⋮ −0.24 0.89 −1.12
Cluster
0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 −1.32 +0.04 +2.01 −0.23 −2.33 −0.30 +0.08 −0.68 +0.95 +0.52 −1.27 −0.54 −0.77 +1.40 −1.66 + −0.24 −0.61 −0.04 0.76 0.01 −04 ⋮ ⋮ ⋮ −0.09 0.90 −1.84 Cluster Label 20×5 Cluster Center 5×3 Cluster Noise 20×3
Structure 1! + ! LowRank
Cluster Label 20×5 Cluster Center 5×2 @(2×3) Cluster Noise 20×3
Structure 1(!! + !) + !
0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 −1.32 +0.04 +2.01 −0.23 +0.08 −0.68 +0.95 +0.52 −0.77 +1.40 1 −0.22 1 −1.30 + −0.24 −0.61 −0.04 0.76 0.01 −04 ⋮ ⋮ ⋮ −0.09 0.90 −1.84 [Grosse et al. 2012]
0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 −1.32 +0.04 +2.01 −0.23 −2.33 −0.30 +0.08 −0.68 +0.95 +0.52 −1.27 −0.54 −0.77 +1.40 −1.66 + −0.24 −0.61 −0.04 0.76 0.01 −04 ⋮ ⋮ ⋮ −0.09 0.90 −1.84 Cluster Label 20×5 Cluster Center 5×3 Cluster Noise 20×3
Cluster Structure 0
Input Matrix 20×3 −1.01 −0.76 1.36 0.48 −1.64 −0.50 ⋮ ⋮ ⋮ −0.24 0.89 −1.12
Structure 10 + 0
Cluster Label 20×5 Cluster Center 5×2 @(2×3) Cluster Noise 20×3
LowRank Structure 1(00 + 0) + 0
0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 −1.32 +0.04 +2.01 −0.23 +0.08 −0.68 +0.95 +0.52 −0.77 +1.40 1 −0.22 1 −1.30 + −0.24 −0.61 −0.04 0.76 0.01 −04 ⋮ ⋮ ⋮ −0.09 0.90 −1.84 [Grosse et al. 2012]
MG+G
GG+G
BG+G
CG+G
M(GMT+G)+G
(MG+G)(GMT+G)+G
(BG+G)(GBT+G)+G
(exp(GG+G)◦G)+G
MG+G
GG+G
BG+G
CG+G
M(GMT+G)+G
(MG+G)(GMT+G)+G
(BG+G)(GBT+G)+G
(exp(GG+G)◦G)+G
G
Structure !
Input Matrix 20×3 −1.01 −0.76 1.36 0.48 −1.64 −0.50 ⋮ ⋮ ⋮ −0.24 0.89 −1.12
GG + G MG + G
……
0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 −1.32 +0.04 +2.01 −0.23 −2.33 −0.30 +0.08 −0.68 +0.95 +0.52 −1.27 −0.54 −0.77 +1.40 −1.66 + 12345 Cluster Label 20×5 Cluster Center 5×3
Structure 6! + !
M(exp(G) ∘ G) + G M(GG+G) + G M(GTM+G) + G
……
Cluster Center 5×2 @(2×3)
Structure 6(!! + !) + !
0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 −1.32 +0.04 +2.01 −0.23 +0.08 −0.68 +0.95 +0.52 −0.77 +1.40 1 −0.22 1 −1.30 + 12345
G
Structure !
Input Matrix 20×3 −1.01 −0.76 1.36 0.48 −1.64 −0.50 ⋮ ⋮ ⋮ −0.24 0.89 −1.12
GG + G MG + G
……
0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 −1.32 +0.04 +2.01 −0.23 −2.33 −0.30 +0.08 −0.68 +0.95 +0.52 −1.27 −0.54 −0.77 +1.40 −1.66 + 12345 Cluster Label 20×5 Cluster Center 5×3
Structure 6! + !
M(exp(G) ∘ G) + G M(GG+G) + G M(GTM+G) + G
……
Cluster Center 5×2 @(2×3)
Structure 6(!! + !) + !
0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 −1.32 +0.04 +2.01 −0.23 +0.08 −0.68 +0.95 +0.52 −0.77 +1.40 1 −0.22 1 −1.30 + 12345
G
Structure !
Input Matrix 20×3 −1.01 −0.76 1.36 0.48 −1.64 −0.50 ⋮ ⋮ ⋮ −0.24 0.89 −1.12
GG + G MG + G
……
0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 −1.32 +0.04 +2.01 −0.23 −2.33 −0.30 +0.08 −0.68 +0.95 +0.52 −1.27 −0.54 −0.77 +1.40 −1.66 + 12345 Cluster Label 20×5 Cluster Center 5×3
Structure 6! + !
M(GG+G) + G
Cluster Center 5×2 @(2×3)
Structure 6(!! + !) + !
0 0 0 0 1 0 0 0 1 0 ⋮ 0 0 1 0 0 −1.32 +0.04 +2.01 −0.23 +0.08 −0.68 +0.95 +0.52 −0.77 +1.40 1 −0.22 1 −1.30 + 12345
−1.32 +0.04 +2.01 −0.23 −2.33 −0.30 +0.08 −0.68 +0.95 +0.52 −1.27 −0.54 −0.77 +1.40 −1.66
Neural Net Prediction: GG+G
Train
Train Test