Low-loss connection of weight vectors: distribution-based approaches
Ivan Anokhin, Dmitry Yarotsky ICML 2020
1 / 28
Low-loss connection of weight vectors: distribution-based approaches - - PowerPoint PPT Presentation
Low-loss connection of weight vectors: distribution-based approaches Ivan Anokhin, Dmitry Yarotsky ICML 2020 1 / 28 Introduction How much connectedness is there in the bottom of a neural networks loss function? Connection task: Given two
1 / 28
2 / 28
3 / 28
4 / 28
5 / 28
1 For each i, ψi(t = 0) = θA
2 For each t, ψ(t) ∼ p
6 / 28
7 / 28
8 / 28
9 / 28
X, Y 0.5X + 0.5Y X, Y cos( /4)X + sin( /4)Y
10 / 28
11 / 28
12 / 28
13 / 28
14 / 28
15 / 28
16 / 28
17 / 28
18 / 28
19 / 28
4
5
W A
6
W A
7
W A
8
W B
1
W B
2
W B
3
W AB
4
20 / 28
2
3
2
W A
3
W A
4
W A
1
W A
3
W A
4
W B
1
W AB
2
W A
4
W B
1
W B
2
W AB
3
W B
1
W B
2
W B
3
W B
4
21 / 28
22 / 28
23 / 28
24 / 28
0.0 0.2 0.4 0.6 0.8 1.0 t 8.0 8.5 9.0 9.5 10.0 10.5 11.0 11.5 12.0 test error (%)
Linear + WA Arc + WA
25 / 28
26 / 28
1 2 3 4 5 6 7 Number of models in ensemble 68 69 70 71 72 73 Accuracy (%)
Ind WA(14) WA(13) WA(12) WA(10) WA(6) 27 / 28
28 / 28