Invariant-equivariant representation learning for multi-class data - - PowerPoint PPT Presentation

invariant equivariant representation learning for multi
SMART_READER_LITE
LIVE PREVIEW

Invariant-equivariant representation learning for multi-class data - - PowerPoint PPT Presentation

Invariant-equivariant representation learning for multi-class data Ilya Feige Faculty Invariant-equivariant representation learning High-level introduction 2 Separating content from style This work is about disentangling


slide-1
SLIDE 1

Invariant-equivariant representation learning for multi-class data

Ilya Feige Faculty

slide-2
SLIDE 2

High-level introduction

Invariant-equivariant representation learning

2

slide-3
SLIDE 3

What? Want to represent the class and the data instance separately This work is about disentangling representations. We present a novel approach to an old problem.

Separating content from style

3

slide-4
SLIDE 4

What? Want to represent the class and the data instance separately This work is about disentangling representations. We present a novel approach to an old problem.

Separating content from style

4

class, r

<latexit sha1_base64="Ys2D6lj3c1+uto6IwD1WQZF5vj0=">AB+nicbVDLSsNAFJ3UV42vqEs3g6XgQkpSBV0W3bisYB/QhjKZTtuhk0mYuSmW2D9xJSiIW/ElX/jNM1CWw9cOJxz78y9J4gF1+C631ZhbX1jc6u4be/s7u0fOIdHTR0lirIGjUSk2gHRTHDJGsBsHasGAkDwVrB+HbutyZMaR7JB5jGzA/JUPIBpwSM1HOcLrBHSKkgWp/jGVY9p+RW3Ax4lXg5KaEc9Z7z1e1HNAmZhOyVjufG4KdEAaeCzexyN9EsJnRMhqxjqCQh036arT7DZaP08SBSpiTgTLV/TaQk1HoaBqYzJDSy95c/M/rJDC49lMu4wSYpIuPBonAEOF5DrjPFaMgpoYQqrhZFtMRUYSCScs2KXjLN6+SZrXiXVSq95el2k2eRxGdoFN0hjx0hWroDtVRA1E0Qc/oFb1ZT9aL9W59LFoLVj5zjP7A+vwBm+KThw=</latexit>
slide-5
SLIDE 5

What? Want to represent the class and the data instance separately This work is about disentangling representations. We present a novel approach to an old problem.

Separating content from style

5

datapoint, (r, v)

<latexit sha1_base64="e3aF6vstTuwkI9LmeOhMNzmeTyM=">ACAnicbVDLSgNBEJz1GeNr1YvgZTAEIoSwGwU9Br14jGAekCxhdjKbDJmdXWZ6g2GJz/Fk6AgXv0LT/6Nk8dBEwsaiqpurv8WHANjvNtrayurW9sZray2zu7e/v2wWFdR4mirEYjEamTzQTXLIacBCsGStGQl+whj+4mfiNIVOaR/IeRjHzQtKTPOCUgJE69nEb2AOkXQIkjriEIh7jgioOzp2zik5U+Bl4s5JDs1R7dhf7W5Ek5BJoIJo3XKdGLyUKOBUsHE23040iwkdkB5rGSpJyLSXTl8Y47xRujiIlCkJeKpmf02kJNR6FPqmMyTQ14veRPzPayUQXHkpl3ECTNLZoiARGCI8yQN3uWIUxMgQhU3x2LaJ4pQMKlTQru4s/LpF4uel8t1FrnI9zyODTtApKiAXaIKukVEMUPaJn9IrerCfrxXq3PmatK9Z85gj9gfX5AwGTlng=</latexit>

class, r

<latexit sha1_base64="Ys2D6lj3c1+uto6IwD1WQZF5vj0=">AB+nicbVDLSsNAFJ3UV42vqEs3g6XgQkpSBV0W3bisYB/QhjKZTtuhk0mYuSmW2D9xJSiIW/ElX/jNM1CWw9cOJxz78y9J4gF1+C631ZhbX1jc6u4be/s7u0fOIdHTR0lirIGjUSk2gHRTHDJGsBsHasGAkDwVrB+HbutyZMaR7JB5jGzA/JUPIBpwSM1HOcLrBHSKkgWp/jGVY9p+RW3Ax4lXg5KaEc9Z7z1e1HNAmZhOyVjufG4KdEAaeCzexyN9EsJnRMhqxjqCQh036arT7DZaP08SBSpiTgTLV/TaQk1HoaBqYzJDSy95c/M/rJDC49lMu4wSYpIuPBonAEOF5DrjPFaMgpoYQqrhZFtMRUYSCScs2KXjLN6+SZrXiXVSq95el2k2eRxGdoFN0hjx0hWroDtVRA1E0Qc/oFb1ZT9aL9W59LFoLVj5zjP7A+vwBm+KThw=</latexit>
slide-6
SLIDE 6

What? Want to represent the class and the data instance separately Why?

  • Classification
  • Interpretability
  • Object detection
  • Topic modelling
  • Style transfer
  • Face swap

This work is about disentangling representations. We present a novel approach to an old problem.

Separating content from style

6

datapoint, (r, v)

<latexit sha1_base64="e3aF6vstTuwkI9LmeOhMNzmeTyM=">ACAnicbVDLSgNBEJz1GeNr1YvgZTAEIoSwGwU9Br14jGAekCxhdjKbDJmdXWZ6g2GJz/Fk6AgXv0LT/6Nk8dBEwsaiqpurv8WHANjvNtrayurW9sZray2zu7e/v2wWFdR4mirEYjEamTzQTXLIacBCsGStGQl+whj+4mfiNIVOaR/IeRjHzQtKTPOCUgJE69nEb2AOkXQIkjriEIh7jgioOzp2zik5U+Bl4s5JDs1R7dhf7W5Ek5BJoIJo3XKdGLyUKOBUsHE23040iwkdkB5rGSpJyLSXTl8Y47xRujiIlCkJeKpmf02kJNR6FPqmMyTQ14veRPzPayUQXHkpl3ECTNLZoiARGCI8yQN3uWIUxMgQhU3x2LaJ4pQMKlTQru4s/LpF4uel8t1FrnI9zyODTtApKiAXaIKukVEMUPaJn9IrerCfrxXq3PmatK9Z85gj9gfX5AwGTlng=</latexit>

class, r

<latexit sha1_base64="Ys2D6lj3c1+uto6IwD1WQZF5vj0=">AB+nicbVDLSsNAFJ3UV42vqEs3g6XgQkpSBV0W3bisYB/QhjKZTtuhk0mYuSmW2D9xJSiIW/ElX/jNM1CWw9cOJxz78y9J4gF1+C631ZhbX1jc6u4be/s7u0fOIdHTR0lirIGjUSk2gHRTHDJGsBsHasGAkDwVrB+HbutyZMaR7JB5jGzA/JUPIBpwSM1HOcLrBHSKkgWp/jGVY9p+RW3Ax4lXg5KaEc9Z7z1e1HNAmZhOyVjufG4KdEAaeCzexyN9EsJnRMhqxjqCQh036arT7DZaP08SBSpiTgTLV/TaQk1HoaBqYzJDSy95c/M/rJDC49lMu4wSYpIuPBonAEOF5DrjPFaMgpoYQqrhZFtMRUYSCScs2KXjLN6+SZrXiXVSq95el2k2eRxGdoFN0hjx0hWroDtVRA1E0Qc/oFb1ZT9aL9W59LFoLVj5zjP7A+vwBm+KThw=</latexit>
slide-7
SLIDE 7

What? Want to represent the class and the data instance separately Why?

  • Classification
  • Interpretability
  • Object detection
  • Topic modelling
  • Style transfer
  • Face swap

What else? This is not a new topic…

  • Tenenbaum & Freeman. (2000)
  • Reed et al. (2014)
  • Cheung et al. (2014)
  • Zhu et al. (2014)
  • Radford et al. (2016)
  • Chen et al. (2016)
  • Makhzani et al. (2016)
  • Siddharth et al. (2017)

This work is about disentangling representations. We present a novel approach to an old problem.

Separating content from style

7

datapoint, (r, v)

<latexit sha1_base64="e3aF6vstTuwkI9LmeOhMNzmeTyM=">ACAnicbVDLSgNBEJz1GeNr1YvgZTAEIoSwGwU9Br14jGAekCxhdjKbDJmdXWZ6g2GJz/Fk6AgXv0LT/6Nk8dBEwsaiqpurv8WHANjvNtrayurW9sZray2zu7e/v2wWFdR4mirEYjEamTzQTXLIacBCsGStGQl+whj+4mfiNIVOaR/IeRjHzQtKTPOCUgJE69nEb2AOkXQIkjriEIh7jgioOzp2zik5U+Bl4s5JDs1R7dhf7W5Ek5BJoIJo3XKdGLyUKOBUsHE23040iwkdkB5rGSpJyLSXTl8Y47xRujiIlCkJeKpmf02kJNR6FPqmMyTQ14veRPzPayUQXHkpl3ECTNLZoiARGCI8yQN3uWIUxMgQhU3x2LaJ4pQMKlTQru4s/LpF4uel8t1FrnI9zyODTtApKiAXaIKukVEMUPaJn9IrerCfrxXq3PmatK9Z85gj9gfX5AwGTlng=</latexit>

class, r

<latexit sha1_base64="Ys2D6lj3c1+uto6IwD1WQZF5vj0=">AB+nicbVDLSsNAFJ3UV42vqEs3g6XgQkpSBV0W3bisYB/QhjKZTtuhk0mYuSmW2D9xJSiIW/ElX/jNM1CWw9cOJxz78y9J4gF1+C631ZhbX1jc6u4be/s7u0fOIdHTR0lirIGjUSk2gHRTHDJGsBsHasGAkDwVrB+HbutyZMaR7JB5jGzA/JUPIBpwSM1HOcLrBHSKkgWp/jGVY9p+RW3Ax4lXg5KaEc9Z7z1e1HNAmZhOyVjufG4KdEAaeCzexyN9EsJnRMhqxjqCQh036arT7DZaP08SBSpiTgTLV/TaQk1HoaBqYzJDSy95c/M/rJDC49lMu4wSYpIuPBonAEOF5DrjPFaMgpoYQqrhZFtMRUYSCScs2KXjLN6+SZrXiXVSq95el2k2eRxGdoFN0hjx0hWroDtVRA1E0Qc/oFb1ZT9aL9W59LFoLVj5zjP7A+vwBm+KThw=</latexit>
slide-8
SLIDE 8

The main idea

Inferring the class latent using strategic data routing Invariant (class) latent is deterministically calculated from “complementary” same-class examples:

8

{x1, x2, . . . , xm} − → ry

<latexit sha1_base64="/R7rH7pCczoJ5nIklzQdq4MNOA=">ACFHicbZDNSgMxFIUz/lv/qi7dBIsgImWmCroU3bhUsLbQmQ6ZNG2DmWRI7mjL0Hdw5aO4EhTErRtXvo1pOwtPRD4OPdebu6JEsENuO63MzM7N7+wuLRcWFldW98obm7dGpVqyqpUCaXrETFMcMmqwEGweqIZiSPBatHdxbBeu2facCVvoJ+wICYdyducErBWDzws17TO+w1K4e+aCkwFmN/gH2hZEfzTheI1uoB67AfFktu2R0JT4OXQwnlugqLX35L0TRmEqgxjQ8N4EgIxo4FWxQ2PNTwxJC70iHNSxKEjMTZKOjBnjPOi3cVto+CXjkFn5NZCQ2ph9HtjMm0DWTtaH5X62RQvs0yLhMUmCSjhe1U4FB4WFCuMU1oyD6FgjV3H4W0y7RhILNsWBT8CZvnobStk7Kleuj0tn53keS2gH7aJ95KETdIYu0RWqIoe0TN6RW/Ok/PivDsf49YZJ5/ZRn/kfP4AZ1qeZQ=</latexit>
slide-9
SLIDE 9

The main idea

Inferring the class latent using strategic data routing Invariant (class) latent is deterministically calculated from “complementary” same-class examples: Equivariant (instance) latent is stochastically inferred from both class and datapoint information:

9

{x1, x2, . . . , xm} − → ry

<latexit sha1_base64="/R7rH7pCczoJ5nIklzQdq4MNOA=">ACFHicbZDNSgMxFIUz/lv/qi7dBIsgImWmCroU3bhUsLbQmQ6ZNG2DmWRI7mjL0Hdw5aO4EhTErRtXvo1pOwtPRD4OPdebu6JEsENuO63MzM7N7+wuLRcWFldW98obm7dGpVqyqpUCaXrETFMcMmqwEGweqIZiSPBatHdxbBeu2facCVvoJ+wICYdyducErBWDzws17TO+w1K4e+aCkwFmN/gH2hZEfzTheI1uoB67AfFktu2R0JT4OXQwnlugqLX35L0TRmEqgxjQ8N4EgIxo4FWxQ2PNTwxJC70iHNSxKEjMTZKOjBnjPOi3cVto+CXjkFn5NZCQ2ph9HtjMm0DWTtaH5X62RQvs0yLhMUmCSjhe1U4FB4WFCuMU1oyD6FgjV3H4W0y7RhILNsWBT8CZvnobStk7Kleuj0tn53keS2gH7aJ95KETdIYu0RWqIoe0TN6RW/Ok/PivDsf49YZJ5/ZRn/kfP4AZ1qeZQ=</latexit>

{ry, x} − → v

<latexit sha1_base64="Olmoz2YQ8D4Apf2fJNJUrUJz1dU=">ACBnicbVDLSsNAFJ34rPEVdamLwVJwISWpgi6LblxWsA9oQphMp+3QyUyYmVRL6MaVn+JKUBC3/oMr/8Zpm4W2HrhwOde7r0nShV2nW/raXldW19cKGvbm1vbPr7O03lEglJnUsmJCtCnCKCd1TUjrUQSFEeMNKPB9cRvDolUVPA7PUpIEKMep12KkTZS6Bz5mQxHp/DBH0OfCd6TtNfXSEpxD4ehU3TL7hRwkXg5KYIctdD58jsCpzHhGjOkVNtzEx1kSGqKGRnbJT9VJEF4gHqkbShHMVFBNn1jDEtG6cCukKa4hlPV/jWRoVipURyZzhjpvpr3JuJ/XjvV3csgozxJNeF4tqibMqgFnGQCO1QSrNnIEIQlNcdC3EcSYW2Ss0K3vzPi6RKXtn5crtebF6ledRAIfgGJwAD1yAKrgBNVAHGDyCZ/AK3qwn68V6tz5mrUtWPnMA/sD6/AHqLJi7</latexit>
slide-10
SLIDE 10

The main idea

Inferring the class latent using strategic data routing Invariant (class) latent is deterministically calculated from “complementary” same-class examples: Equivariant (instance) latent is stochastically inferred from both class and datapoint information:

10

{x1, x2, . . . , xm} − → ry

<latexit sha1_base64="/R7rH7pCczoJ5nIklzQdq4MNOA=">ACFHicbZDNSgMxFIUz/lv/qi7dBIsgImWmCroU3bhUsLbQmQ6ZNG2DmWRI7mjL0Hdw5aO4EhTErRtXvo1pOwtPRD4OPdebu6JEsENuO63MzM7N7+wuLRcWFldW98obm7dGpVqyqpUCaXrETFMcMmqwEGweqIZiSPBatHdxbBeu2facCVvoJ+wICYdyducErBWDzws17TO+w1K4e+aCkwFmN/gH2hZEfzTheI1uoB67AfFktu2R0JT4OXQwnlugqLX35L0TRmEqgxjQ8N4EgIxo4FWxQ2PNTwxJC70iHNSxKEjMTZKOjBnjPOi3cVto+CXjkFn5NZCQ2ph9HtjMm0DWTtaH5X62RQvs0yLhMUmCSjhe1U4FB4WFCuMU1oyD6FgjV3H4W0y7RhILNsWBT8CZvnobStk7Kleuj0tn53keS2gH7aJ95KETdIYu0RWqIoe0TN6RW/Ok/PivDsf49YZJ5/ZRn/kfP4AZ1qeZQ=</latexit>

{ry, x} − → v

<latexit sha1_base64="Olmoz2YQ8D4Apf2fJNJUrUJz1dU=">ACBnicbVDLSsNAFJ34rPEVdamLwVJwISWpgi6LblxWsA9oQphMp+3QyUyYmVRL6MaVn+JKUBC3/oMr/8Zpm4W2HrhwOde7r0nShV2nW/raXldW19cKGvbm1vbPr7O03lEglJnUsmJCtCnCKCd1TUjrUQSFEeMNKPB9cRvDolUVPA7PUpIEKMep12KkTZS6Bz5mQxHp/DBH0OfCd6TtNfXSEpxD4ehU3TL7hRwkXg5KYIctdD58jsCpzHhGjOkVNtzEx1kSGqKGRnbJT9VJEF4gHqkbShHMVFBNn1jDEtG6cCukKa4hlPV/jWRoVipURyZzhjpvpr3JuJ/XjvV3csgozxJNeF4tqibMqgFnGQCO1QSrNnIEIQlNcdC3EcSYW2Ss0K3vzPi6RKXtn5crtebF6ledRAIfgGJwAD1yAKrgBNVAHGDyCZ/AK3qwn68V6tz5mrUtWPnMA/sD6/AHqLJi7</latexit>

Inspired by GQNs (Eslami et al., 2018)

slide-11
SLIDE 11

Some detail

Invariant-equivariant representation learning

11

slide-12
SLIDE 12

Flash through the math

12

p

  • {xn, yn}N

n=1

  • =

N

Y

n=1

Z dvn drn pθ(xn|rn, vn) δ

  • rn − ryn
  • p(vn) p(yn)
<latexit sha1_base64="SAWHOZK7nEwQjRzP80yN8d4t0gU=">ACf3icbVFNb9QwEHVSPsrytcCRy8CqIgvbVKQWg6VKrhwQkVi20rJXIc765Vx7HsSdVyM/gl3Hit3DB2U0laBnJ8tN7b2Y848wo6TCOfwXh1q3bd+5u3+vdf/Dw0eP+k6cnrqwsFxNeqtKeZcwJbWYoEQlzowVrMiUOM3OP7b6YWwTpb6K6MmBVsoeVcoaeSvs/DM3kIqI1XKZ6BKtUA23SWh8mzbfPrTQEOjqkI6DGlvmVAFRqhPyidY8gt5sbTEpxKZBFvhZ8B9tW9J5hm54LhWzdy9O7Nq19q+aqAZio85nI8O0P4jH8TrgJkg6MCBdHKf9nzQveVUIjVwx56ZJbHBWM4uSK9H0dmjlhGH8nC3E1EPNCuFm9Xp/Dex4Jod5af3xc63Z3l8ZNSucWxWZdxYMl+61pL/06YVzg9mtdSmQqH5ptG8UoAltJ8BubSCo1p5wLiV/rHAl8wyjv7Len4LyfWZb4KTvXHydrz35d3g6EO3j23ynLwkEUnIPjkin8gxmRBOfgcvgtfBmzAIX4XjMN5Yw6DLeUb+ifD9H7wLvGc=</latexit>

Generative model

ryn = 1 m

m

X

i=1

fθinv(xi)

<latexit sha1_base64="MUL4qZ1jRwrLFQUvrAVhFVT6XuU=">ACInicbVDLSgMxFM34rPVdekmWATdlBkV7EYounGpYFXo1CGT3rHBJDMkd8QyzK+48lNcCQriSvBjTGsXvg4EDufcy805cSaFRd9/9yYmp6ZnZitz1fmFxaXl2srquU1zw6HNU5may5hZkEJDGwVKuMwMBVLuIhvjob+xS0YK1J9hoMuopda5EIztBJUa1pomIQ6ZIe0DAxjAcqtLmKCnEQlFeKJlERYh+QRSHCHRZC35bl1t2V2I5qdb/hj0D/kmBM6mSMk6j2FvZSnivQyCWzthP4GXYLZlBwCWV1M8wtZIzfsGvoOKqZAtstRhFLumUHk1S45GOlKr3zYKpqwdqNhNKoZ9+9sbiv95nRyTZtflynIEzb8OJbmkmNJhX7QnDHCUA0cYN8J9lvI+c02ha7XqWgh+Z/5LzncawW5j53Sv3joc91Eh62SDbJGA7JMWOSYnpE04uSeP5Jm8eA/ek/fqvX2NTnjnTXyA97HJ+adpHo=</latexit>

Inference model

qφ(v|ry, x) = N

  • µφ(ry, x), σ2

φ(ry, x)I

  • <latexit sha1_base64="+dvATW3nqowsdkwjBpECq3PAOUw=">ACPXicbVBLS8NAGNz4rPEV9ehlsRQqSEmqoBdB9KIXqWit0NSw2W7bpbtJ3N2IJeaHefJHePLkSVAQr17dtBF8DSwM/Px7Td+xKhUtv1ojI1PTE5NF2bM2bn5hUVraflchrHApI5DFoLH0nCaEDqipGLiJBEPcZafj9g8xvXBMhaRicqUFEWhx1A9qhGCktedbplZe4UY+m5etb4Q02btbhLnQ5Uj2MWHKcuj7tl0ef6VGmQ3oStrlKFcvq7l+lMXPatoV+wh4F/i5KQIctQ868FthzjmJFCYISmbjh2pVoKEopiR1Cy5sSQRwn3UJU1NA8SJbCXD61NY0kobdkKhX6DgUDW/TSISzngvk5md8nfXib+5zVj1dlpJTSIYkUCPFrUiRlUIcyqhG0qCFZsoAnCgurPQtxDAmGlCzd1C87vm/+S82rF2axUT7aKe/t5HwWwCtZAGThgG+yBQ1ADdYDBHXgCL+DVuDejTfjfRQdM/KZFfADxscn1lOuSg=</latexit>

Llab = Eq(v|ry,x) log p(x|ry, v) − DKL ⇥ q(v|ry, x)

  • p(v)

⇤ + log p(y)

<latexit sha1_base64="hOfV6MjiysYcQuF4cfmSB5y4cM=">ACeXicbVFb9MwFHbCbYRbgce9HCiVWhVsiFtL0jTAmJPQyJbpOaKLJdt7PmXLBPqkZpfgM/bk/8EJ54wUk7LhtHsvX5+8757HPMciUN+v53x71x89btOxt3vXv3Hzx81Hn85NhkheZixDOV6VNGjVAyFSOUqMRprgVNmBIn7Pxdo5/MhTYyS79gmYsobNUTiWnaKm48y1MKJ5xqrDOg5RLBSlNXeW2gFxqoPdVx97c+XOi63FoMaQpXNIO8vYAkNBfMBeK/ht837S5tPhzaXydkY/lS3xLdrMV8dY68V5em5SDudP2h3wZcB8EadMk6juLORTjJeJGIFLmixowDP8eoholV6L2emFhRE75OZ2JsYUpTYSJqnZyNfQsM4Fpu1KEVrW+6uiokxZcJsZtOguao15P+0cYHTvaiSaV6gSPnqomhADNovgEmUguOqrSAci3tY4GfU052s/y7BSCqz1fB8fbw2BnuP35TXf/YD2PDbJnpM+Ccgu2ScfyREZEU5+OJvOC6fn/HSfuX35SrVdY1T8k/4e78AnI7vYQ=</latexit>

Lunlab = Eq(y|x) h Eq(v|ry,x) log p(x|ry, v) − DKL ⇥ q(v|ry, x)

  • p(v)

⇤i − DKL ⇥ q(y|x)

  • p(y)

<latexit sha1_base64="54+Aip3J5p/ua8uIn2MC3OPjeo=">ACzHichVFdaxQxFM2MX3Va7VYfQkuhS3oMlMFfSmUqlCwSAW3LewMQyab3YZmMmNyZ90hO6/+QZ989ZeYyY7SbQUvJyc+5SW5WCq4hDH96/p279+4/2HgYbG49erzd23lypotKUTaihSjURUY0E1yEXAQ7KJUjOSZYOfZ1btWP58zpXkhv0BdsiQnM8mnBKwVNr7FecELikR5qRJY2ALMJUJGuCAxw4LcvMhyY1Xwf1crHX4PiIz8Z4XZkvVq/wE4WxQyXgwVeYsfN94KX+G+P9396fDyxqVnrdK3aMUu3WYv56py4jsl/XdrbrRvUnUHa64fD0AW+DaIO9FEXp2nvRzwpaJUzCVQrcdRWEJiAJOBWuC3bjSrCT0iszY2EJcqYT4bR4F3LTPC0UHZJwI4NrlUYkmtd5nNbB+kb2ot+S9tXMH0bWK4LCtgkq4aTSuBocDtZPGEK0ZB1BYQqri9LKaXRBEKdv6B/YXo5ptvg7P9YfRquP/5df/wqPuPDfQMPUcDFKE36BAdo1M0QtQ79qT3zVv4n3zwjd+sUn2vq3mK1sL/huXVdzO</latexit>

Objective

slide-13
SLIDE 13

Flash through the math

13

p

  • {xn, yn}N

n=1

  • =

N

Y

n=1

Z dvn drn pθ(xn|rn, vn) δ

  • rn − ryn
  • p(vn) p(yn)
<latexit sha1_base64="SAWHOZK7nEwQjRzP80yN8d4t0gU=">ACf3icbVFNb9QwEHVSPsrytcCRy8CqIgvbVKQWg6VKrhwQkVi20rJXIc765Vx7HsSdVyM/gl3Hit3DB2U0laBnJ8tN7b2Y848wo6TCOfwXh1q3bd+5u3+vdf/Dw0eP+k6cnrqwsFxNeqtKeZcwJbWYoEQlzowVrMiUOM3OP7b6YWwTpb6K6MmBVsoeVcoaeSvs/DM3kIqI1XKZ6BKtUA23SWh8mzbfPrTQEOjqkI6DGlvmVAFRqhPyidY8gt5sbTEpxKZBFvhZ8B9tW9J5hm54LhWzdy9O7Nq19q+aqAZio85nI8O0P4jH8TrgJkg6MCBdHKf9nzQveVUIjVwx56ZJbHBWM4uSK9H0dmjlhGH8nC3E1EPNCuFm9Xp/Dex4Jod5af3xc63Z3l8ZNSucWxWZdxYMl+61pL/06YVzg9mtdSmQqH5ptG8UoAltJ8BubSCo1p5wLiV/rHAl8wyjv7Len4LyfWZb4KTvXHydrz35d3g6EO3j23ynLwkEUnIPjkin8gxmRBOfgcvgtfBmzAIX4XjMN5Yw6DLeUb+ifD9H7wLvGc=</latexit>

Generative model

ryn = 1 m

m

X

i=1

fθinv(xi)

<latexit sha1_base64="MUL4qZ1jRwrLFQUvrAVhFVT6XuU=">ACInicbVDLSgMxFM34rPVdekmWATdlBkV7EYounGpYFXo1CGT3rHBJDMkd8QyzK+48lNcCQriSvBjTGsXvg4EDufcy805cSaFRd9/9yYmp6ZnZitz1fmFxaXl2srquU1zw6HNU5may5hZkEJDGwVKuMwMBVLuIhvjob+xS0YK1J9hoMuopda5EIztBJUa1pomIQ6ZIe0DAxjAcqtLmKCnEQlFeKJlERYh+QRSHCHRZC35bl1t2V2I5qdb/hj0D/kmBM6mSMk6j2FvZSnivQyCWzthP4GXYLZlBwCWV1M8wtZIzfsGvoOKqZAtstRhFLumUHk1S45GOlKr3zYKpqwdqNhNKoZ9+9sbiv95nRyTZtflynIEzb8OJbmkmNJhX7QnDHCUA0cYN8J9lvI+c02ha7XqWgh+Z/5LzncawW5j53Sv3joc91Eh62SDbJGA7JMWOSYnpE04uSeP5Jm8eA/ek/fqvX2NTnjnTXyA97HJ+adpHo=</latexit>

Inference model

qφ(v|ry, x) = N

  • µφ(ry, x), σ2

φ(ry, x)I

  • <latexit sha1_base64="+dvATW3nqowsdkwjBpECq3PAOUw=">ACPXicbVBLS8NAGNz4rPEV9ehlsRQqSEmqoBdB9KIXqWit0NSw2W7bpbtJ3N2IJeaHefJHePLkSVAQr17dtBF8DSwM/Px7Td+xKhUtv1ojI1PTE5NF2bM2bn5hUVraflchrHApI5DFoLH0nCaEDqipGLiJBEPcZafj9g8xvXBMhaRicqUFEWhx1A9qhGCktedbplZe4UY+m5etb4Q02btbhLnQ5Uj2MWHKcuj7tl0ef6VGmQ3oStrlKFcvq7l+lMXPatoV+wh4F/i5KQIctQ868FthzjmJFCYISmbjh2pVoKEopiR1Cy5sSQRwn3UJU1NA8SJbCXD61NY0kobdkKhX6DgUDW/TSISzngvk5md8nfXib+5zVj1dlpJTSIYkUCPFrUiRlUIcyqhG0qCFZsoAnCgurPQtxDAmGlCzd1C87vm/+S82rF2axUT7aKe/t5HwWwCtZAGThgG+yBQ1ADdYDBHXgCL+DVuDejTfjfRQdM/KZFfADxscn1lOuSg=</latexit>

Llab = Eq(v|ry,x) log p(x|ry, v) − DKL ⇥ q(v|ry, x)

  • p(v)

⇤ + log p(y)

<latexit sha1_base64="hOfV6MjiysYcQuF4cfmSB5y4cM=">ACeXicbVFb9MwFHbCbYRbgce9HCiVWhVsiFtL0jTAmJPQyJbpOaKLJdt7PmXLBPqkZpfgM/bk/8EJ54wUk7LhtHsvX5+8757HPMciUN+v53x71x89btOxt3vXv3Hzx81Hn85NhkheZixDOV6VNGjVAyFSOUqMRprgVNmBIn7Pxdo5/MhTYyS79gmYsobNUTiWnaKm48y1MKJ5xqrDOg5RLBSlNXeW2gFxqoPdVx97c+XOi63FoMaQpXNIO8vYAkNBfMBeK/ht837S5tPhzaXydkY/lS3xLdrMV8dY68V5em5SDudP2h3wZcB8EadMk6juLORTjJeJGIFLmixowDP8eoholV6L2emFhRE75OZ2JsYUpTYSJqnZyNfQsM4Fpu1KEVrW+6uiokxZcJsZtOguao15P+0cYHTvaiSaV6gSPnqomhADNovgEmUguOqrSAci3tY4GfU052s/y7BSCqz1fB8fbw2BnuP35TXf/YD2PDbJnpM+Ccgu2ScfyREZEU5+OJvOC6fn/HSfuX35SrVdY1T8k/4e78AnI7vYQ=</latexit>

Lunlab = Eq(y|x) h Eq(v|ry,x) log p(x|ry, v) − DKL ⇥ q(v|ry, x)

  • p(v)

⇤i − DKL ⇥ q(y|x)

  • p(y)

<latexit sha1_base64="54+Aip3J5p/ua8uIn2MC3OPjeo=">ACzHichVFdaxQxFM2MX3Va7VYfQkuhS3oMlMFfSmUqlCwSAW3LewMQyab3YZmMmNyZ90hO6/+QZ989ZeYyY7SbQUvJyc+5SW5WCq4hDH96/p279+4/2HgYbG49erzd23lypotKUTaihSjURUY0E1yEXAQ7KJUjOSZYOfZ1btWP58zpXkhv0BdsiQnM8mnBKwVNr7FecELikR5qRJY2ALMJUJGuCAxw4LcvMhyY1Xwf1crHX4PiIz8Z4XZkvVq/wE4WxQyXgwVeYsfN94KX+G+P9396fDyxqVnrdK3aMUu3WYv56py4jsl/XdrbrRvUnUHa64fD0AW+DaIO9FEXp2nvRzwpaJUzCVQrcdRWEJiAJOBWuC3bjSrCT0iszY2EJcqYT4bR4F3LTPC0UHZJwI4NrlUYkmtd5nNbB+kb2ot+S9tXMH0bWK4LCtgkq4aTSuBocDtZPGEK0ZB1BYQqri9LKaXRBEKdv6B/YXo5ptvg7P9YfRquP/5df/wqPuPDfQMPUcDFKE36BAdo1M0QtQ79qT3zVv4n3zwjd+sUn2vq3mK1sL/huXVdzO</latexit>

Objective Generative model with 2 latent variables

slide-14
SLIDE 14

Flash through the math

14

p

  • {xn, yn}N

n=1

  • =

N

Y

n=1

Z dvn drn pθ(xn|rn, vn) δ

  • rn − ryn
  • p(vn) p(yn)
<latexit sha1_base64="SAWHOZK7nEwQjRzP80yN8d4t0gU=">ACf3icbVFNb9QwEHVSPsrytcCRy8CqIgvbVKQWg6VKrhwQkVi20rJXIc765Vx7HsSdVyM/gl3Hit3DB2U0laBnJ8tN7b2Y848wo6TCOfwXh1q3bd+5u3+vdf/Dw0eP+k6cnrqwsFxNeqtKeZcwJbWYoEQlzowVrMiUOM3OP7b6YWwTpb6K6MmBVsoeVcoaeSvs/DM3kIqI1XKZ6BKtUA23SWh8mzbfPrTQEOjqkI6DGlvmVAFRqhPyidY8gt5sbTEpxKZBFvhZ8B9tW9J5hm54LhWzdy9O7Nq19q+aqAZio85nI8O0P4jH8TrgJkg6MCBdHKf9nzQveVUIjVwx56ZJbHBWM4uSK9H0dmjlhGH8nC3E1EPNCuFm9Xp/Dex4Jod5af3xc63Z3l8ZNSucWxWZdxYMl+61pL/06YVzg9mtdSmQqH5ptG8UoAltJ8BubSCo1p5wLiV/rHAl8wyjv7Len4LyfWZb4KTvXHydrz35d3g6EO3j23ynLwkEUnIPjkin8gxmRBOfgcvgtfBmzAIX4XjMN5Yw6DLeUb+ifD9H7wLvGc=</latexit>

Generative model

ryn = 1 m

m

X

i=1

fθinv(xi)

<latexit sha1_base64="MUL4qZ1jRwrLFQUvrAVhFVT6XuU=">ACInicbVDLSgMxFM34rPVdekmWATdlBkV7EYounGpYFXo1CGT3rHBJDMkd8QyzK+48lNcCQriSvBjTGsXvg4EDufcy805cSaFRd9/9yYmp6ZnZitz1fmFxaXl2srquU1zw6HNU5may5hZkEJDGwVKuMwMBVLuIhvjob+xS0YK1J9hoMuopda5EIztBJUa1pomIQ6ZIe0DAxjAcqtLmKCnEQlFeKJlERYh+QRSHCHRZC35bl1t2V2I5qdb/hj0D/kmBM6mSMk6j2FvZSnivQyCWzthP4GXYLZlBwCWV1M8wtZIzfsGvoOKqZAtstRhFLumUHk1S45GOlKr3zYKpqwdqNhNKoZ9+9sbiv95nRyTZtflynIEzb8OJbmkmNJhX7QnDHCUA0cYN8J9lvI+c02ha7XqWgh+Z/5LzncawW5j53Sv3joc91Eh62SDbJGA7JMWOSYnpE04uSeP5Jm8eA/ek/fqvX2NTnjnTXyA97HJ+adpHo=</latexit>

Inference model

qφ(v|ry, x) = N

  • µφ(ry, x), σ2

φ(ry, x)I

  • <latexit sha1_base64="+dvATW3nqowsdkwjBpECq3PAOUw=">ACPXicbVBLS8NAGNz4rPEV9ehlsRQqSEmqoBdB9KIXqWit0NSw2W7bpbtJ3N2IJeaHefJHePLkSVAQr17dtBF8DSwM/Px7Td+xKhUtv1ojI1PTE5NF2bM2bn5hUVraflchrHApI5DFoLH0nCaEDqipGLiJBEPcZafj9g8xvXBMhaRicqUFEWhx1A9qhGCktedbplZe4UY+m5etb4Q02btbhLnQ5Uj2MWHKcuj7tl0ef6VGmQ3oStrlKFcvq7l+lMXPatoV+wh4F/i5KQIctQ868FthzjmJFCYISmbjh2pVoKEopiR1Cy5sSQRwn3UJU1NA8SJbCXD61NY0kobdkKhX6DgUDW/TSISzngvk5md8nfXib+5zVj1dlpJTSIYkUCPFrUiRlUIcyqhG0qCFZsoAnCgurPQtxDAmGlCzd1C87vm/+S82rF2axUT7aKe/t5HwWwCtZAGThgG+yBQ1ADdYDBHXgCL+DVuDejTfjfRQdM/KZFfADxscn1lOuSg=</latexit>

Llab = Eq(v|ry,x) log p(x|ry, v) − DKL ⇥ q(v|ry, x)

  • p(v)

⇤ + log p(y)

<latexit sha1_base64="hOfV6MjiysYcQuF4cfmSB5y4cM=">ACeXicbVFb9MwFHbCbYRbgce9HCiVWhVsiFtL0jTAmJPQyJbpOaKLJdt7PmXLBPqkZpfgM/bk/8EJ54wUk7LhtHsvX5+8757HPMciUN+v53x71x89btOxt3vXv3Hzx81Hn85NhkheZixDOV6VNGjVAyFSOUqMRprgVNmBIn7Pxdo5/MhTYyS79gmYsobNUTiWnaKm48y1MKJ5xqrDOg5RLBSlNXeW2gFxqoPdVx97c+XOi63FoMaQpXNIO8vYAkNBfMBeK/ht837S5tPhzaXydkY/lS3xLdrMV8dY68V5em5SDudP2h3wZcB8EadMk6juLORTjJeJGIFLmixowDP8eoholV6L2emFhRE75OZ2JsYUpTYSJqnZyNfQsM4Fpu1KEVrW+6uiokxZcJsZtOguao15P+0cYHTvaiSaV6gSPnqomhADNovgEmUguOqrSAci3tY4GfU052s/y7BSCqz1fB8fbw2BnuP35TXf/YD2PDbJnpM+Ccgu2ScfyREZEU5+OJvOC6fn/HSfuX35SrVdY1T8k/4e78AnI7vYQ=</latexit>

Lunlab = Eq(y|x) h Eq(v|ry,x) log p(x|ry, v) − DKL ⇥ q(v|ry, x)

  • p(v)

⇤i − DKL ⇥ q(y|x)

  • p(y)

<latexit sha1_base64="54+Aip3J5p/ua8uIn2MC3OPjeo=">ACzHichVFdaxQxFM2MX3Va7VYfQkuhS3oMlMFfSmUqlCwSAW3LewMQyab3YZmMmNyZ90hO6/+QZ989ZeYyY7SbQUvJyc+5SW5WCq4hDH96/p279+4/2HgYbG49erzd23lypotKUTaihSjURUY0E1yEXAQ7KJUjOSZYOfZ1btWP58zpXkhv0BdsiQnM8mnBKwVNr7FecELikR5qRJY2ALMJUJGuCAxw4LcvMhyY1Xwf1crHX4PiIz8Z4XZkvVq/wE4WxQyXgwVeYsfN94KX+G+P9396fDyxqVnrdK3aMUu3WYv56py4jsl/XdrbrRvUnUHa64fD0AW+DaIO9FEXp2nvRzwpaJUzCVQrcdRWEJiAJOBWuC3bjSrCT0iszY2EJcqYT4bR4F3LTPC0UHZJwI4NrlUYkmtd5nNbB+kb2ot+S9tXMH0bWK4LCtgkq4aTSuBocDtZPGEK0ZB1BYQqri9LKaXRBEKdv6B/YXo5ptvg7P9YfRquP/5df/wqPuPDfQMPUcDFKE36BAdo1M0QtQ79qT3zVv4n3zwjd+sUn2vq3mK1sL/huXVdzO</latexit>

Objective Generative model with 2 latent variables Deterministic, from same- class complementary data

slide-15
SLIDE 15

Flash through the math

15

p

  • {xn, yn}N

n=1

  • =

N

Y

n=1

Z dvn drn pθ(xn|rn, vn) δ

  • rn − ryn
  • p(vn) p(yn)
<latexit sha1_base64="SAWHOZK7nEwQjRzP80yN8d4t0gU=">ACf3icbVFNb9QwEHVSPsrytcCRy8CqIgvbVKQWg6VKrhwQkVi20rJXIc765Vx7HsSdVyM/gl3Hit3DB2U0laBnJ8tN7b2Y848wo6TCOfwXh1q3bd+5u3+vdf/Dw0eP+k6cnrqwsFxNeqtKeZcwJbWYoEQlzowVrMiUOM3OP7b6YWwTpb6K6MmBVsoeVcoaeSvs/DM3kIqI1XKZ6BKtUA23SWh8mzbfPrTQEOjqkI6DGlvmVAFRqhPyidY8gt5sbTEpxKZBFvhZ8B9tW9J5hm54LhWzdy9O7Nq19q+aqAZio85nI8O0P4jH8TrgJkg6MCBdHKf9nzQveVUIjVwx56ZJbHBWM4uSK9H0dmjlhGH8nC3E1EPNCuFm9Xp/Dex4Jod5af3xc63Z3l8ZNSucWxWZdxYMl+61pL/06YVzg9mtdSmQqH5ptG8UoAltJ8BubSCo1p5wLiV/rHAl8wyjv7Len4LyfWZb4KTvXHydrz35d3g6EO3j23ynLwkEUnIPjkin8gxmRBOfgcvgtfBmzAIX4XjMN5Yw6DLeUb+ifD9H7wLvGc=</latexit>

Generative model

ryn = 1 m

m

X

i=1

fθinv(xi)

<latexit sha1_base64="MUL4qZ1jRwrLFQUvrAVhFVT6XuU=">ACInicbVDLSgMxFM34rPVdekmWATdlBkV7EYounGpYFXo1CGT3rHBJDMkd8QyzK+48lNcCQriSvBjTGsXvg4EDufcy805cSaFRd9/9yYmp6ZnZitz1fmFxaXl2srquU1zw6HNU5may5hZkEJDGwVKuMwMBVLuIhvjob+xS0YK1J9hoMuopda5EIztBJUa1pomIQ6ZIe0DAxjAcqtLmKCnEQlFeKJlERYh+QRSHCHRZC35bl1t2V2I5qdb/hj0D/kmBM6mSMk6j2FvZSnivQyCWzthP4GXYLZlBwCWV1M8wtZIzfsGvoOKqZAtstRhFLumUHk1S45GOlKr3zYKpqwdqNhNKoZ9+9sbiv95nRyTZtflynIEzb8OJbmkmNJhX7QnDHCUA0cYN8J9lvI+c02ha7XqWgh+Z/5LzncawW5j53Sv3joc91Eh62SDbJGA7JMWOSYnpE04uSeP5Jm8eA/ek/fqvX2NTnjnTXyA97HJ+adpHo=</latexit>

Inference model

qφ(v|ry, x) = N

  • µφ(ry, x), σ2

φ(ry, x)I

  • <latexit sha1_base64="+dvATW3nqowsdkwjBpECq3PAOUw=">ACPXicbVBLS8NAGNz4rPEV9ehlsRQqSEmqoBdB9KIXqWit0NSw2W7bpbtJ3N2IJeaHefJHePLkSVAQr17dtBF8DSwM/Px7Td+xKhUtv1ojI1PTE5NF2bM2bn5hUVraflchrHApI5DFoLH0nCaEDqipGLiJBEPcZafj9g8xvXBMhaRicqUFEWhx1A9qhGCktedbplZe4UY+m5etb4Q02btbhLnQ5Uj2MWHKcuj7tl0ef6VGmQ3oStrlKFcvq7l+lMXPatoV+wh4F/i5KQIctQ868FthzjmJFCYISmbjh2pVoKEopiR1Cy5sSQRwn3UJU1NA8SJbCXD61NY0kobdkKhX6DgUDW/TSISzngvk5md8nfXib+5zVj1dlpJTSIYkUCPFrUiRlUIcyqhG0qCFZsoAnCgurPQtxDAmGlCzd1C87vm/+S82rF2axUT7aKe/t5HwWwCtZAGThgG+yBQ1ADdYDBHXgCL+DVuDejTfjfRQdM/KZFfADxscn1lOuSg=</latexit>

Llab = Eq(v|ry,x) log p(x|ry, v) − DKL ⇥ q(v|ry, x)

  • p(v)

⇤ + log p(y)

<latexit sha1_base64="hOfV6MjiysYcQuF4cfmSB5y4cM=">ACeXicbVFb9MwFHbCbYRbgce9HCiVWhVsiFtL0jTAmJPQyJbpOaKLJdt7PmXLBPqkZpfgM/bk/8EJ54wUk7LhtHsvX5+8757HPMciUN+v53x71x89btOxt3vXv3Hzx81Hn85NhkheZixDOV6VNGjVAyFSOUqMRprgVNmBIn7Pxdo5/MhTYyS79gmYsobNUTiWnaKm48y1MKJ5xqrDOg5RLBSlNXeW2gFxqoPdVx97c+XOi63FoMaQpXNIO8vYAkNBfMBeK/ht837S5tPhzaXydkY/lS3xLdrMV8dY68V5em5SDudP2h3wZcB8EadMk6juLORTjJeJGIFLmixowDP8eoholV6L2emFhRE75OZ2JsYUpTYSJqnZyNfQsM4Fpu1KEVrW+6uiokxZcJsZtOguao15P+0cYHTvaiSaV6gSPnqomhADNovgEmUguOqrSAci3tY4GfU052s/y7BSCqz1fB8fbw2BnuP35TXf/YD2PDbJnpM+Ccgu2ScfyREZEU5+OJvOC6fn/HSfuX35SrVdY1T8k/4e78AnI7vYQ=</latexit>

Lunlab = Eq(y|x) h Eq(v|ry,x) log p(x|ry, v) − DKL ⇥ q(v|ry, x)

  • p(v)

⇤i − DKL ⇥ q(y|x)

  • p(y)

<latexit sha1_base64="54+Aip3J5p/ua8uIn2MC3OPjeo=">ACzHichVFdaxQxFM2MX3Va7VYfQkuhS3oMlMFfSmUqlCwSAW3LewMQyab3YZmMmNyZ90hO6/+QZ989ZeYyY7SbQUvJyc+5SW5WCq4hDH96/p279+4/2HgYbG49erzd23lypotKUTaihSjURUY0E1yEXAQ7KJUjOSZYOfZ1btWP58zpXkhv0BdsiQnM8mnBKwVNr7FecELikR5qRJY2ALMJUJGuCAxw4LcvMhyY1Xwf1crHX4PiIz8Z4XZkvVq/wE4WxQyXgwVeYsfN94KX+G+P9396fDyxqVnrdK3aMUu3WYv56py4jsl/XdrbrRvUnUHa64fD0AW+DaIO9FEXp2nvRzwpaJUzCVQrcdRWEJiAJOBWuC3bjSrCT0iszY2EJcqYT4bR4F3LTPC0UHZJwI4NrlUYkmtd5nNbB+kb2ot+S9tXMH0bWK4LCtgkq4aTSuBocDtZPGEK0ZB1BYQqri9LKaXRBEKdv6B/YXo5ptvg7P9YfRquP/5df/wqPuPDfQMPUcDFKE36BAdo1M0QtQ79qT3zVv4n3zwjd+sUn2vq3mK1sL/huXVdzO</latexit>

Objective Generative model with 2 latent variables Deterministic, from same- class complementary data Standard VAE

slide-16
SLIDE 16

Flash through the math

16

p

  • {xn, yn}N

n=1

  • =

N

Y

n=1

Z dvn drn pθ(xn|rn, vn) δ

  • rn − ryn
  • p(vn) p(yn)
<latexit sha1_base64="SAWHOZK7nEwQjRzP80yN8d4t0gU=">ACf3icbVFNb9QwEHVSPsrytcCRy8CqIgvbVKQWg6VKrhwQkVi20rJXIc765Vx7HsSdVyM/gl3Hit3DB2U0laBnJ8tN7b2Y848wo6TCOfwXh1q3bd+5u3+vdf/Dw0eP+k6cnrqwsFxNeqtKeZcwJbWYoEQlzowVrMiUOM3OP7b6YWwTpb6K6MmBVsoeVcoaeSvs/DM3kIqI1XKZ6BKtUA23SWh8mzbfPrTQEOjqkI6DGlvmVAFRqhPyidY8gt5sbTEpxKZBFvhZ8B9tW9J5hm54LhWzdy9O7Nq19q+aqAZio85nI8O0P4jH8TrgJkg6MCBdHKf9nzQveVUIjVwx56ZJbHBWM4uSK9H0dmjlhGH8nC3E1EPNCuFm9Xp/Dex4Jod5af3xc63Z3l8ZNSucWxWZdxYMl+61pL/06YVzg9mtdSmQqH5ptG8UoAltJ8BubSCo1p5wLiV/rHAl8wyjv7Len4LyfWZb4KTvXHydrz35d3g6EO3j23ynLwkEUnIPjkin8gxmRBOfgcvgtfBmzAIX4XjMN5Yw6DLeUb+ifD9H7wLvGc=</latexit>

Generative model

ryn = 1 m

m

X

i=1

fθinv(xi)

<latexit sha1_base64="MUL4qZ1jRwrLFQUvrAVhFVT6XuU=">ACInicbVDLSgMxFM34rPVdekmWATdlBkV7EYounGpYFXo1CGT3rHBJDMkd8QyzK+48lNcCQriSvBjTGsXvg4EDufcy805cSaFRd9/9yYmp6ZnZitz1fmFxaXl2srquU1zw6HNU5may5hZkEJDGwVKuMwMBVLuIhvjob+xS0YK1J9hoMuopda5EIztBJUa1pomIQ6ZIe0DAxjAcqtLmKCnEQlFeKJlERYh+QRSHCHRZC35bl1t2V2I5qdb/hj0D/kmBM6mSMk6j2FvZSnivQyCWzthP4GXYLZlBwCWV1M8wtZIzfsGvoOKqZAtstRhFLumUHk1S45GOlKr3zYKpqwdqNhNKoZ9+9sbiv95nRyTZtflynIEzb8OJbmkmNJhX7QnDHCUA0cYN8J9lvI+c02ha7XqWgh+Z/5LzncawW5j53Sv3joc91Eh62SDbJGA7JMWOSYnpE04uSeP5Jm8eA/ek/fqvX2NTnjnTXyA97HJ+adpHo=</latexit>

Inference model

qφ(v|ry, x) = N

  • µφ(ry, x), σ2

φ(ry, x)I

  • <latexit sha1_base64="+dvATW3nqowsdkwjBpECq3PAOUw=">ACPXicbVBLS8NAGNz4rPEV9ehlsRQqSEmqoBdB9KIXqWit0NSw2W7bpbtJ3N2IJeaHefJHePLkSVAQr17dtBF8DSwM/Px7Td+xKhUtv1ojI1PTE5NF2bM2bn5hUVraflchrHApI5DFoLH0nCaEDqipGLiJBEPcZafj9g8xvXBMhaRicqUFEWhx1A9qhGCktedbplZe4UY+m5etb4Q02btbhLnQ5Uj2MWHKcuj7tl0ef6VGmQ3oStrlKFcvq7l+lMXPatoV+wh4F/i5KQIctQ868FthzjmJFCYISmbjh2pVoKEopiR1Cy5sSQRwn3UJU1NA8SJbCXD61NY0kobdkKhX6DgUDW/TSISzngvk5md8nfXib+5zVj1dlpJTSIYkUCPFrUiRlUIcyqhG0qCFZsoAnCgurPQtxDAmGlCzd1C87vm/+S82rF2axUT7aKe/t5HwWwCtZAGThgG+yBQ1ADdYDBHXgCL+DVuDejTfjfRQdM/KZFfADxscn1lOuSg=</latexit>

Llab = Eq(v|ry,x) log p(x|ry, v) − DKL ⇥ q(v|ry, x)

  • p(v)

⇤ + log p(y)

<latexit sha1_base64="hOfV6MjiysYcQuF4cfmSB5y4cM=">ACeXicbVFb9MwFHbCbYRbgce9HCiVWhVsiFtL0jTAmJPQyJbpOaKLJdt7PmXLBPqkZpfgM/bk/8EJ54wUk7LhtHsvX5+8757HPMciUN+v53x71x89btOxt3vXv3Hzx81Hn85NhkheZixDOV6VNGjVAyFSOUqMRprgVNmBIn7Pxdo5/MhTYyS79gmYsobNUTiWnaKm48y1MKJ5xqrDOg5RLBSlNXeW2gFxqoPdVx97c+XOi63FoMaQpXNIO8vYAkNBfMBeK/ht837S5tPhzaXydkY/lS3xLdrMV8dY68V5em5SDudP2h3wZcB8EadMk6juLORTjJeJGIFLmixowDP8eoholV6L2emFhRE75OZ2JsYUpTYSJqnZyNfQsM4Fpu1KEVrW+6uiokxZcJsZtOguao15P+0cYHTvaiSaV6gSPnqomhADNovgEmUguOqrSAci3tY4GfU052s/y7BSCqz1fB8fbw2BnuP35TXf/YD2PDbJnpM+Ccgu2ScfyREZEU5+OJvOC6fn/HSfuX35SrVdY1T8k/4e78AnI7vYQ=</latexit>

Lunlab = Eq(y|x) h Eq(v|ry,x) log p(x|ry, v) − DKL ⇥ q(v|ry, x)

  • p(v)

⇤i − DKL ⇥ q(y|x)

  • p(y)

<latexit sha1_base64="54+Aip3J5p/ua8uIn2MC3OPjeo=">ACzHichVFdaxQxFM2MX3Va7VYfQkuhS3oMlMFfSmUqlCwSAW3LewMQyab3YZmMmNyZ90hO6/+QZ989ZeYyY7SbQUvJyc+5SW5WCq4hDH96/p279+4/2HgYbG49erzd23lypotKUTaihSjURUY0E1yEXAQ7KJUjOSZYOfZ1btWP58zpXkhv0BdsiQnM8mnBKwVNr7FecELikR5qRJY2ALMJUJGuCAxw4LcvMhyY1Xwf1crHX4PiIz8Z4XZkvVq/wE4WxQyXgwVeYsfN94KX+G+P9396fDyxqVnrdK3aMUu3WYv56py4jsl/XdrbrRvUnUHa64fD0AW+DaIO9FEXp2nvRzwpaJUzCVQrcdRWEJiAJOBWuC3bjSrCT0iszY2EJcqYT4bR4F3LTPC0UHZJwI4NrlUYkmtd5nNbB+kb2ot+S9tXMH0bWK4LCtgkq4aTSuBocDtZPGEK0ZB1BYQqri9LKaXRBEKdv6B/YXo5ptvg7P9YfRquP/5df/wqPuPDfQMPUcDFKE36BAdo1M0QtQ79qT3zVv4n3zwjd+sUn2vq3mK1sL/huXVdzO</latexit>

Objective Generative model with 2 latent variables Deterministic, from same- class complementary data Standard VAE Standard ELBOs

slide-17
SLIDE 17

Inference in pictures

17

slide-18
SLIDE 18

Results

Invariant-equivariant representation learning

18

slide-19
SLIDE 19

Inferred latent space on MNIST

19

Latent spaces disentangle The invariant latent learns to separate the classes The equivariant latent learns to ignore class information

slide-20
SLIDE 20

Generations from various latent configurations (MNIST)

20

Samples from each class Equivariant interpolations, for multiple invariant latents Invariant steps, for multiple equivariant latents

slide-21
SLIDE 21

Generations from various latent configurations (SVHN)

21

Samples from each class Equivariant interpolations, for multiple invariant latents Invariant steps, for multiple equivariant latents

slide-22
SLIDE 22

Classification

Semi supervised (Using label-inference distribution)

22

Fully supervised (Using 0-parameter distance to nearest )

ry

<latexit sha1_base64="HBR01F8pV72qJ7S0Eq0QpicXOZA=">AB63icbVBNS8NAEJ3Urxq/qh69LJaCp5JUQY9FLx4r2lpoQ9lsN+3S3U3Y3Qgh9Cd4EhTEq7/Ik/GbZuDtj4YeLw3w8y8MOFMG8/7dkpr6xubW+Vtd2d3b/+gcnjU0XGqCG2TmMeqG2JNOZO0bZjhtJsoikXI6WM4uZn5j09UaRbLB5MlNB4JFnECDZWuleDbFCpenVvDrRK/IJUoUBrUPnqD2OSCioN4Vjrnu8lJsixMoxwOnVr/VTBJMJHtGepRILqoN8fusU1awyRFGsbEmD5qr7ayLHQutMhLZTYDPWy95M/M/rpSa6CnImk9RQSRaLopQjE6PZ42jIFCWGZ5Zgopg9FpExVpgYG49rU/CXf14lnUbdP6837i6qzesijzKcwCmcgQ+X0IRbaEbCIzgGV7hzRHOi/PufCxaS04xcwx/4Hz+ANb4jhE=</latexit>

Benchmark is always the equivalent network, +2 dropout layers, trained to classify the labelled data

slide-23
SLIDE 23

Outlook

Invariant-equivariant representation learning

23

slide-24
SLIDE 24

Outlook

Advantages

  • f this approach
  • A. Easy to implement
  • B. Very little tuning needed

C.Reasonably intuitive D.Performs similarly to comparable approaches At Faculty, we are implementing this technique into our generic

  • bject detection pipeline to extract invariant object representations

24

Disadvantages

  • A. Requires some labels
  • B. Does not achieve state-of-the-art
slide-25
SLIDE 25

info@faculty.ai +44 20 3637 9415 54 Welbeck St, Marylebone, 
 London W1G 9XS, UK
 
 Follow us:

If you’re interested in finding out more about this work, or Faculty in general, please get in touch!

Thank you

25

ilya@faculty.ai Director of AI