Understanding Geometry of Encoder-Decoder CNNs (E-D CNNs) Jong - - PowerPoint PPT Presentation

understanding geometry of encoder decoder cnns
SMART_READER_LITE
LIVE PREVIEW

Understanding Geometry of Encoder-Decoder CNNs (E-D CNNs) Jong - - PowerPoint PPT Presentation

Understanding Geometry of Encoder-Decoder CNNs (E-D CNNs) Jong Chul Ye & Woon Kyoung Sung BISPL - BioImaging, Signal Processing and Learning Lab. Dept. Bio & Brain Engineering Dept. of Mathematical Sciences KAIST, Korea E-D CNN


slide-1
SLIDE 1

Understanding Geometry of Encoder-Decoder CNNs

(E-D CNNs)

Jong Chul Ye & Woon Kyoung Sung

BISPL - BioImaging, Signal Processing and Learning Lab.

  • Dept. Bio & Brain Engineering
  • Dept. of Mathematical Sciences

KAIST, Korea

slide-2
SLIDE 2

CNN

E-D CNN for Inverse Problems

slide-3
SLIDE 3

CNN

E-D CNN for Inverse Problems

slide-4
SLIDE 4

CNN Successful applications to various inverse problems

E-D CNN for Inverse Problems

slide-5
SLIDE 5

Why Same Architecture Works for Different Inverse Problems ?

slide-6
SLIDE 6

Classical Methods for Inverse Problems

Synthesis basis Analysis basis coefficients

Step 1: Signal Representation

x = X

i

hbi, xi˜ bi

<latexit sha1_base64="1xOFabKTay95z7c6ZUXCSJWXZAE=">ACE3icbVDLSsNAFJ3UV62vqEs3g0UQkZKoBuh4MZlC/YBTQiTyaQdOpmEmYm0hIKf4MZfceNCEbdu3Pk3TpMutPXAhTPn3Mvce/yEUaks69soLS2vrK6V1ysbm1vbO+buXlvGqcCkhWMWi6PJGUk5aipFuIgiKfEY6/vBm6nfuiZA05ndqnBA3Qn1OQ4qR0pJnoyuHZlGHoXQYj3GYG+R0/hyBHFy1GUBbnomVWrZuWAi8SekSqYoeGZX04Q4zQiXGpOzZVqLcDAlFMSOTipNKkiA8RH3S05SjiEg3y2+awCOtBDCMhS6uYK7+nshQJOU48nVnhNRAzntT8T+vl6rwys0oT1JFOC4+ClMGVQynAcGACoIVG2uCsKB6V4gHSCsdIwVHYI9f/IiaZ/V7POa1byo1psPRxlcAOwTGwSWog1vQAC2AwSN4Bq/gzXgyXox346NoLRmzCPfBHxifP6mndg=</latexit>
slide-7
SLIDE 7
  • Eg. Compressed

Sensing

Classical Methods for Inverse Problems

Step 2: Basis Search by Optimization

x = X

i

˜ bihbi, xi

<latexit sha1_base64="dRyoeK2luEuIb3X8ywuvlvflFPU=">ACEnicbZBNS8MwGMdTX+d8q3r0EhyCgoxWBb0IQy8eJ7gXWEtJ03QLS9OSpLJR9hm8+FW8eFDEqydvfhvTrgfd/EPgl/zPCTP308Ylcqyvo2FxaXldXKWnV9Y3Nr29zZbcs4FZi0cMxi0fWRJIxy0lJUMdJNBEGRz0jH97k9c4DEZLG/F6NE+JGqM9pSDFS2vLM49GVI9PIo9BRlAUE+jkyxPus4BM4ckRx8yaVbcKwXmwS6iBUk3P/HKCGKcR4QozJGXPthLlZkgoihmZVJ1UkgThIeqTnkaOIiLdrFhpAg+1E8AwFvpwBQv390SGIinHka87I6QGcraWm/VeqkKL92M8iRVhOPpQ2HKoIphng8MqCBYsbEGhAXVf4V4gATCSqdY1SHYsyvPQ/u0bp/VrbvzWuO6jKMC9sEBOAI2uANcAuaoAUweATP4BW8GU/Gi/FufExbF4xyZg/8kfH5AyCNnR8=</latexit>
slide-8
SLIDE 8

Why do They Look so Different ? Any Link between Them ?

slide-9
SLIDE 9

Our Theoretical Findings

y = X

i

hbi(x), xi˜ bi(x)

<latexit sha1_base64="DaFmbtzayW3V2tBvW3rbADydJY=">ACGXicbZDLSsNAFIYnXmu9RV26GSxCBSmJCroRim5cVrAXaEKYTCbt0MkzEykIfQ13Pgqblwo4lJXvo3TNoK2/jDw851zOHN+P2FUKsv6MhYWl5ZXVktr5fWNza1tc2e3JeNUYNLEMYtFx0eSMpJU1HFSCcRBEU+I21/cD2ut+JkDTmdypLiBuhHqchxUhp5JlWdunINPIohA5DvMcI9D1aHR4dw6EjpsBRlAU/3DMrVs2aCM4buzAVUKjhmR9OEOM0IlxhqTs2lai3BwJRTEjo7KTSpIgPEA90tWo4hIN59cNoKHmgQwjIV+XMEJ/T2Ro0jKLPJ1Z4RUX87WxvC/WjdV4YWbU56kinA8XRSmDKoYjmOCARUEK5Zpg7Cg+q8Q95FAWOkwyzoEe/bkedM6qdmnNev2rFK/KuIogX1wAKrABuegDm5AzQBg/gCbyAV+PReDbejPdp64JRzOyBPzI+vwEaXJ8Y</latexit>
slide-10
SLIDE 10

Our Theoretical Findings

y = X

i

hbi(x), xi˜ bi(x)

<latexit sha1_base64="DaFmbtzayW3V2tBvW3rbADydJY=">ACGXicbZDLSsNAFIYnXmu9RV26GSxCBSmJCroRim5cVrAXaEKYTCbt0MkzEykIfQ13Pgqblwo4lJXvo3TNoK2/jDw851zOHN+P2FUKsv6MhYWl5ZXVktr5fWNza1tc2e3JeNUYNLEMYtFx0eSMpJU1HFSCcRBEU+I21/cD2ut+JkDTmdypLiBuhHqchxUhp5JlWdunINPIohA5DvMcI9D1aHR4dw6EjpsBRlAU/3DMrVs2aCM4buzAVUKjhmR9OEOM0IlxhqTs2lai3BwJRTEjo7KTSpIgPEA90tWo4hIN59cNoKHmgQwjIV+XMEJ/T2Ro0jKLPJ1Z4RUX87WxvC/WjdV4YWbU56kinA8XRSmDKoYjmOCARUEK5Zpg7Cg+q8Q95FAWOkwyzoEe/bkedM6qdmnNev2rFK/KuIogX1wAKrABuegDm5AzQBg/gCbyAV+PReDbejPdp64JRzOyBPzI+vwEaXJ8Y</latexit>
slide-11
SLIDE 11

Our Theoretical Findings

y = X

i

hbi(x), xi˜ bi(x)

<latexit sha1_base64="DaFmbtzayW3V2tBvW3rbADydJY=">ACGXicbZDLSsNAFIYnXmu9RV26GSxCBSmJCroRim5cVrAXaEKYTCbt0MkzEykIfQ13Pgqblwo4lJXvo3TNoK2/jDw851zOHN+P2FUKsv6MhYWl5ZXVktr5fWNza1tc2e3JeNUYNLEMYtFx0eSMpJU1HFSCcRBEU+I21/cD2ut+JkDTmdypLiBuhHqchxUhp5JlWdunINPIohA5DvMcI9D1aHR4dw6EjpsBRlAU/3DMrVs2aCM4buzAVUKjhmR9OEOM0IlxhqTs2lai3BwJRTEjo7KTSpIgPEA90tWo4hIN59cNoKHmgQwjIV+XMEJ/T2Ro0jKLPJ1Z4RUX87WxvC/WjdV4YWbU56kinA8XRSmDKoYjmOCARUEK5Zpg7Cg+q8Q95FAWOkwyzoEe/bkedM6qdmnNev2rFK/KuIogX1wAKrABuegDm5AzQBg/gCbyAV+PReDbejPdp64JRzOyBPzI+vwEaXJ8Y</latexit>
slide-12
SLIDE 12

Our Theoretical Findings

analysis basis

y = X

i

hbi(x), xi˜ bi(x)

<latexit sha1_base64="DaFmbtzayW3V2tBvW3rbADydJY=">ACGXicbZDLSsNAFIYnXmu9RV26GSxCBSmJCroRim5cVrAXaEKYTCbt0MkzEykIfQ13Pgqblwo4lJXvo3TNoK2/jDw851zOHN+P2FUKsv6MhYWl5ZXVktr5fWNza1tc2e3JeNUYNLEMYtFx0eSMpJU1HFSCcRBEU+I21/cD2ut+JkDTmdypLiBuhHqchxUhp5JlWdunINPIohA5DvMcI9D1aHR4dw6EjpsBRlAU/3DMrVs2aCM4buzAVUKjhmR9OEOM0IlxhqTs2lai3BwJRTEjo7KTSpIgPEA90tWo4hIN59cNoKHmgQwjIV+XMEJ/T2Ro0jKLPJ1Z4RUX87WxvC/WjdV4YWbU56kinA8XRSmDKoYjmOCARUEK5Zpg7Cg+q8Q95FAWOkwyzoEe/bkedM6qdmnNev2rFK/KuIogX1wAKrABuegDm5AzQBg/gCbyAV+PReDbejPdp64JRzOyBPzI+vwEaXJ8Y</latexit>

Encoder

slide-13
SLIDE 13

Our Theoretical Findings

analysis basis synthesis basis

y = X

i

hbi(x), xi˜ bi(x)

<latexit sha1_base64="DaFmbtzayW3V2tBvW3rbADydJY=">ACGXicbZDLSsNAFIYnXmu9RV26GSxCBSmJCroRim5cVrAXaEKYTCbt0MkzEykIfQ13Pgqblwo4lJXvo3TNoK2/jDw851zOHN+P2FUKsv6MhYWl5ZXVktr5fWNza1tc2e3JeNUYNLEMYtFx0eSMpJU1HFSCcRBEU+I21/cD2ut+JkDTmdypLiBuhHqchxUhp5JlWdunINPIohA5DvMcI9D1aHR4dw6EjpsBRlAU/3DMrVs2aCM4buzAVUKjhmR9OEOM0IlxhqTs2lai3BwJRTEjo7KTSpIgPEA90tWo4hIN59cNoKHmgQwjIV+XMEJ/T2Ro0jKLPJ1Z4RUX87WxvC/WjdV4YWbU56kinA8XRSmDKoYjmOCARUEK5Zpg7Cg+q8Q95FAWOkwyzoEe/bkedM6qdmnNev2rFK/KuIogX1wAKrABuegDm5AzQBg/gCbyAV+PReDbejPdp64JRzOyBPzI+vwEaXJ8Y</latexit>

Encoder Decoder

slide-14
SLIDE 14

Linear E-D CNN

pooling un-pooling Learned filters

y = ˜ BB>x = X

i

hx, bii˜ bi

<latexit sha1_base64="bo3reUJLRgiLys4OrWvNpVArY=">ACJ3icbVBNSwMxEM36bf2qevQSLIHKbsq6KUievGoaK3QrSWbzrah2eySzIpl8d948a94EVREj/4T03YP2joQePePCbzgkQKg675UxMTk3PzM7NFxYWl5ZXiqtr1yZONYcqj2WsbwJmQAoFVRQo4SbRwKJAQi3onvb12h1oI2J1hb0EGhFrKxEKztBSzeJRr+KjkC2gJ/Tk1sc4ofe0Qn2TRk1BfclUWwK936FBv9XDNndYqlksuWV3UHQceDkokbzOm8VXvxXzNAKFXDJj6p6bYCNjGgWX8FDwUwMJ413WhrqFikVgGtngzge6ZkWDWNtn0I6YH87MhYZ04sCOxkx7JhRrU/+p9VTDA8bmVBJiqD4cFGYSox7YdGW0IDR9mzgHEt7F8p7zDNONpoCzYEb/TkcXC9W/b2yu7Ffun4Mo9jmyQTbJNPHJAjskZOSdVwskjeSZv5N15cl6cD+dzODrh5J518qec7x9ZY6R3</latexit>
slide-15
SLIDE 15

Linear E-D CNN w/ Skipped Connection

more redundant expression

Learned filters

y = ˜ BB>x = X

i

hx, bii˜ bi

<latexit sha1_base64="bo3reUJLRgiLys4OrWvNpVArY=">ACJ3icbVBNSwMxEM36bf2qevQSLIHKbsq6KUievGoaK3QrSWbzrah2eySzIpl8d948a94EVREj/4T03YP2joQePePCbzgkQKg675UxMTk3PzM7NFxYWl5ZXiqtr1yZONYcqj2WsbwJmQAoFVRQo4SbRwKJAQi3onvb12h1oI2J1hb0EGhFrKxEKztBSzeJRr+KjkC2gJ/Tk1sc4ofe0Qn2TRk1BfclUWwK936FBv9XDNndYqlksuWV3UHQceDkokbzOm8VXvxXzNAKFXDJj6p6bYCNjGgWX8FDwUwMJ413WhrqFikVgGtngzge6ZkWDWNtn0I6YH87MhYZ04sCOxkx7JhRrU/+p9VTDA8bmVBJiqD4cFGYSox7YdGW0IDR9mzgHEt7F8p7zDNONpoCzYEb/TkcXC9W/b2yu7Ffun4Mo9jmyQTbJNPHJAjskZOSdVwskjeSZv5N15cl6cD+dzODrh5J518qec7x9ZY6R3</latexit>
slide-16
SLIDE 16

Deep Convolutional Framelets

x = ˜ BB>x = X

i

hx, bii˜ bi

<latexit sha1_base64="9EuOyjKGC2x9hAgBpajvIdywLlA=">ACJ3icbVBNSwMxEM36bf2qevQSLIHKbsq6EWRevGoaK3QXZdsOq2h2eySzEpL6b/x4l/xIqiIHv0npu0etDoQePePCbzolQKg676UxMTk3PzM7NFxYWl5ZXiqtr1ybJNIcqT2SibyJmQAoFVRQo4SbVwOJIQi1qnw702j1oIxJ1hd0Ugpi1lGgKztBSYfG4c+SjkA2gFVq59TFJaYceUd9kcSioL5lqSaCdHRoNWj1qc4elwmLJLbvDon+Bl4MSyes8L74jYRnMSjkhlT9wUgx7TKLiEfsHPDKSMt1kL6hYqFoMJesM7+3TLMg3aTLR9CumQ/enosdiYbhzZyZjhnRnXBuR/Wj3D5mHQEyrNEBQfLWpmkmJCB6HRhtDAUXYtYFwL+1fK75hmHG20BRuCN37yX3C9W/b2yu7FfunkMo9jmyQTbJNPHJATsgZOSdVwskDeSKv5M15dJ6d+djNDrh5J518qucr29XoqR2</latexit>

Perfect reconstruction

Ye et al, SIAM J. Imaging Science, 2018

Frame conditions

w skipped connection w/o skipped connection

slide-17
SLIDE 17

Role of ReLUs? Generator for Multiple Expressions y = ˜ B(x)B(x)>x = X

i

hx, bi(x)i˜ bi(x)

<latexit sha1_base64="T/1m1u26m8O8vLHErH3u6EKQhAM=">ACM3icbVDLSgMxFM3Ud31VXboJFkFByowKuhGKbsSVotVCpw6Z9LYNZjJDckdaSv/JjT/iQhAXirj1H0zbWaj1QsLhPEjuCRMpDLrui5ObmJyanpmdy8vLC4tF1ZWr02cag4VHstYV0NmQAoFRQoZpoYFEo4Sa8OxnoN/egjYjVFXYTqEespURTcIaWCgpn3SMfhWwAPd7qbA+vWx/jhHboEfVNGgWC+pKplgTa2aFhIAY2X4+YLDpig0LRLbnDoePAy0CRZHMeFJ78RszTCBRyYypeW6C9R7TKLiEft5PDSM37EW1CxULAJT7w137tNyzRoM9b2KRD9meixyJjulFonRHDtvmrDcj/tFqKzcN6T6gkRVB89FAzlRjOiQNoQGjrJrAeNa2L9S3macbQ1520J3t+Vx8H1bsnbK7kX+8XyZVbHLFknG2SLeOSAlMkpOScVwskDeSZv5N15dF6dD+dzZM05WaN/Brn6xvQsKgT</latexit>

Σl(x) =      σ1 · · · σ2 · · · . . . . . . ... . . . · · · σml     

<latexit sha1_base64="1HS4n8UkvGQcnzeL2YdPrnXeg=">ACkXicbVHLSgMxFM2M7/qunQTLIpuyowK6kIpuhHcKFoVmjpkMrdtMkMSUYsQ/H73Hn35i2g4/qhQvnNfuYkzwY0Ng/Pn5qemZ2bX6gsLi2vrFbX1u9NmsGTZaKVD/G1IDgCpqWwGPmQYqYwEP8fPFUH94AW14qu5sP4O2pF3FO5xR6io+kZueVfSJ7H7uodPMYmhy1URS2o1fx1gYoZqFOIdHDgnLEmtGQWEjJlxwv6kWCEvZfQNkt+MSwom+5btChkJNxU8rVKVK0F9WBk+C8IS1BDpV1H1XeSpCyXoCwT1JhWGS2XVBtORMwqJDcQEbZM+1Cy0FJZh2MbroAG87JsGdVDtXFo/YnxUFlcb0Zewy3X49M6kNyf+0Vm47x+2Cqy3oNh4UCcX2KZ4+D04RqYFX0HKNPc7YpZj2rKrPvEijtCOPnkv+B+vx4e1IObw1rjvDzHPNpEW2gXhegINdAlukZNxLwV79A79c78Df/Eb/hlru+VNRvol/lXn1/Lv9k=</latexit>

Input dependent {0,1} matrix

  • -> Input adaptivity
slide-18
SLIDE 18

Input Space Partitioning for Multiple Expressions

slide-19
SLIDE 19

Expressivity of E-D CNN

# of representation

# of network elements

slide-20
SLIDE 20

Expressivity of E-D CNN

# of representation

# of network elements # of channel

slide-21
SLIDE 21

Expressivity of E-D CNN

# of representation

# of network elements # of channel Network depth

slide-22
SLIDE 22

Expressivity of E-D CNN

# of representation

# of network elements # of channel Network depth Skipped connection

slide-23
SLIDE 23

Lipschitz Continuity

K = max

p

Kp, Kp = k ˜ B(zp)B(zp)>k2

<latexit sha1_base64="zV0QFc8bcwR20HLOVcDQeQMOtmY=">ACIHicbZDLSgMxFIYz9V5voy7dBItQcpMFeqmUOpGcKNgbaFTh0wmtaGZmZicEWv1Udz4Km5cKI7fRrTy0KtBxI+/v8ckvMHUnANjvNpZamZ2bn5heyi0vLK6v2vq5TlJFWY0mIlGNgGgmeMxqwEGwhlSMRIFg9aB7OPDr10xpnsRn0JOsFZHLmLc5JWAk3y4dl72I3PgSH/ty17tKSTgXMbenQdchAxX87e+3BndFx4k0rvzi76dcwrOsPAkuGPIoXGd+PaHFyY0jVgMVBCtm64jodUnCjgV7D7rpZpJQrvkjUNxiRiutUfLniPt40S4naizIkBD9WfE30Sad2LAtMZEejov95A/M9rptA+aPV5LFNgMR091E4FhgQP0sIhV4yC6BkgVHzV0w7RBEKJtOsCcH9u/IknBcL7l7BOd3PVarjObRJtpCeSiEqgI3SCaoiB/SEXtCr9Wg9W2/W+6g1Y41nNtCvsr6+AcwXoYQ=</latexit>

z1

<latexit sha1_base64="Ob3+IEXFhF5uWyRIGKNYQ89lNRY=">AB6nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lU0GPRi8eK9gPaUDbTbt0swm7E6G/gQvHhTx6i/y5r9x2+agrQ8GHu/NMDMvSKQw6LrfTmFldW19o7hZ2tre2d0r7x80TZxqxhslrFuB9RwKRvoEDJ24nmNAokbwWjm6nfeuTaiFg94DjhfkQHSoSCUbTS/VP65UrbtWdgSwTLycVyFHvlb+6/ZilEVfIJDWm47kJ+hnVKJjk1I3NTyhbEQHvGOpohE3fjY7dUJOrNInYaxtKSQz9fdERiNjxlFgOyOKQ7PoTcX/vE6K4ZWfCZWkyBWbLwpTSTAm079JX2jOUI4toUwLeythQ6opQ5tOyYbgLb68TJpnVe+86t5dVGrXeRxFOIJjOAUPLqEGt1CHBjAYwDO8wpsjnRfn3fmYtxacfOYQ/sD5/AEPZo2k</latexit>

zp

<latexit sha1_base64="Q3WIlMLDjf+qfP58xUVUuKL5KD4=">AB6nicbVBNS8NAEJ3Ur1q/qh69LBbBU0lU0GPRi8eK9gPaUDbSbt0swm7G6G/gQvHhTx6i/y5r9x2+agrQ8GHu/NMDMvSATXxnW/ncLK6tr6RnGztLW9s7tX3j9o6jhVDBsFrFqB1Sj4BIbhuB7UQhjQKBrWB0M/Vbj6g0j+WDGSfoR3QgecgZNVa6f+olvXLFrbozkGXi5aQCOeq98le3H7M0QmYoFp3PDcxfkaV4UzgpNRNSaUjegAO5ZKGqH2s9mpE3JilT4JY2VLGjJTf09kNJ6HAW2M6JmqBe9qfif10lNeOVnXCapQcnmi8JUEBOT6d+kzxUyI8aWUKa4vZWwIVWUGZtOyYbgLb68TJpnVe+86t5dVGrXeRxFOIJjOAUPLqEGt1CHBjAYwDO8wpsjnBfn3fmYtxacfOYQ/sD5/AFu4o3j</latexit>

Related to the generalizability Dependent on the Local Lipschitz

slide-24
SLIDE 24

Benign Optimization Landscape

full-rank condition

Independent features

Nguyen, et al, ICML, 2018

slide-25
SLIDE 25

Benign Optimization Landscape

full-rank condition

Independent features Independent features full-rank condition

Nguyen, et al, ICML, 2018 This paper

slide-26
SLIDE 26

Summary

  • Deep learning is a novel signal representation using

combinatorial framelets

  • ReLUs generate multiple linear representation by partitioning

the input space

  • Local Lipschitz controls the global Liptschiz continuity
  • Skipped connection improves the optimization landscape

Poster #99: 06:30 -- 09:00 PM @ Pacific Ballroom