Meta-Learning with Shared Amortized Variational Inference
Ekaterina Iakovleva Jakob Verbeek Karteek Alahari
ICML | 2020
Thirty-seventh International Conference
- n Machine Learning
Inria Facebook Inria
Meta-Learning with Shared Amortized Variational Inference Ekaterina - - PowerPoint PPT Presentation
Meta-Learning with Shared Amortized Variational Inference Ekaterina Iakovleva Jakob Verbeek Karteek Alahari Inria Facebook Inria ICML | 2020 Thirty-seventh International Conference on Machine Learning Standard classification task pipeline
Thirty-seventh International Conference
Inria Facebook Inria
2
Meta test data
3
Schmidhuber 1999, Ravi & Larochelle ICLRβ17
4
! , π§",$ ! ) ",$%& ',(
!
!
*,(
5
6
[1] β Snell et al. NeurIPSβ17, [2] β Vinyals et al. NeurIPSβ16
6
[1] β Snell et al. NeurIPSβ17, [2] β Vinyals et al. NeurIPSβ16, [3] β Finn et al. ICMLβ17, [4] β Ravi & Larochelle ICLRβ17
6
[1] β Snell et al. NeurIPSβ17, [2] β Vinyals et al. NeurIPSβ16, [3] β Finn et al. ICMLβ17, [4] β Ravi & Larochelle ICLRβ17, [5] β Garnelo et al. ICMLβ18, [6] β Gordon et al. ICLRβ19
+
7
+
+
+
+
+
!%& +
7
+
+
+
+
+
!%& +
7
+
+
+
+
+
!%& +
7
!~π, π₯! πΈ!, π :
!%& +
)%& *
.
! +
! , π₯- !
8
[1] β Gordon et al. ICLRβ19
9
[1] β Kingma & Welling ICLRβ14
9
[1] β Kingma & Welling ICLRβ14 Reconstruction loss
9
[1] β Kingma & Welling ICLRβ14 Regularization
9
[1] β Kingma & Welling ICLRβ14, [2] β Higgins et al. ICLRβ17
! +
! , πΈ!, π β 1
.
! +
! , π₯- ! ,
!~π, π₯! πΈ!, π .
9
[1] β Kingma & Welling ICLRβ14, [2] β Higgins et al. ICLRβ17
10
10
10
1 produces embeddings of the input images π¦.
! +
! , π₯! = softmax π₯!π 1 +
!
! , π$ !} of a
! for each class π:
! πΈ!, π = πͺ π$ ! , diag(π$ !) ,
$
3#
$
& ' β"%& '
1(π¦",$ ! )
11
VERSA β Gordon et al. ICLRβ19
1.
1 is shared with an auxiliary classification task across all meta-train classes.
12
TADAM β Oreshkin et al. NeurIPSβ18
5).
! "%& '
! "%& *
5).
5
"%& '
! + π5
13
(b) π! = 0.5
14
estimated / true
15
16
VERSA β Gordon et al. ICLRβ19
16
VERSA β Gordon et al. ICLRβ19
16
VERSA β Gordon et al. ICLRβ19
TADAM β Oreshkin et al. NeurIPSβ18
17
π½: cosine scaling, AT: auxiliary co-training, TEN: task embedding network Additional ablations can be found in the paper.
π½: cosine scaling, AT: auxiliary co-training, TEN: task embedding network Additional ablations can be found in the paper.
17
TADAM β Oreshkin et al. NeurIPSβ18
β : Transductive methods.
18
β : Transductive methods.
18
β : Transductive methods.
18
19
Thirty-seventh International Conference