model driven deep learning
play

Model-driven Deep Learning Jian Sun ( ) Xi'an Jiaotong University - PowerPoint PPT Presentation

Model-driven Deep Learning Jian Sun ( ) Xi'an Jiaotong University Email : jiansun@mail.xjtu.edu.cn Home page : http://jiansun.gr.xjtu.edu.cn April, 2019 Outline Introduction Background: Image analysis / deep neural networks


  1. Model-driven Deep Learning Jian Sun ( 孙剑 ) Xi'an Jiaotong University Email : jiansun@mail.xjtu.edu.cn Home page : http://jiansun.gr.xjtu.edu.cn April, 2019

  2. Outline ⚫ Introduction – Background: Image analysis / deep neural networks – Motivation ⚫ Model-driven Deep Learning Approach – Learning Markov Random Field Model for Image Restoration – Deep ADMM-Net for Fast Compressive Sensing MRI – Deep Fusion-Net for Multi-Atlas MR Image Segmentation ⚫ Recent Progress – Learning proximal operators – Multimodal medical image synthesis – Learning Graph CNNs for 3D shape analysis – Learning to Optimize ⚫ Discussion & Conclusion

  3. Backgrounds--Image Processing & Analysis ⚫ Restoration & Reconstruction Image Degradation : noises, motion blur, k-space sampling, etc. Physical imaging model Restoration & Reconstruction ? Inverse Problems

  4. Backgrounds--Image Processing & Analysis ⚫ Segmentation & Recognition Semantic Segmentation Lesion (Pulmonary nodule) localization and classification

  5. Backgrounds--Models ⚫ Conventional Models: Signal processing approaches – Wavelets – Image Filtering

  6. Backgrounds--Models ⚫ Conventional Models: Energy model and its optimization – Energy Model with Regularization x * = argmin D ( x , y ; w ) + R ( w ) x – Dictionary Learning Applications: Image Restoration / Segmentation / Classification / MRI / Lesion detection

  7. Backgrounds--Models ⚫ Conventional Models: statistical models Evidence lower bound (ELBO) Expectation-maximization (EM) Variational Inference Variational expectation-maximization

  8. Backgrounds--Deep Neural Networks ⚫ Deep Convolutional Neural Network CNN [ Krizhevsky A, et al., 2012]

  9. Backgrounds--Deep Neural Networks ⚫ LSTM: A [Hochreiter & Schmidhuber,1997] ⚫ GAN Generator Discriminator true/fake [Ian Goodfellow et al., 2014]

  10. Conventional Model Vs. Deep NNs Conventional Models Deep Neural Networks ( Optimization / statistics / energy model… ) ( CNN / LSTM / GAN…. ) Pros: Pros: ⚫ Easy to incorporate domain ⚫ An universal regressor knowledge ⚫ Efficiency ⚫ Rely on less training data ⚫ Effectiveness ⚫ Good generalization ability Cons: Cons: ⚫ Rely on large training set ⚫ Maybe not optimal for specific ⚫ Relatively fixed structure task ⚫ Hardly incorporate domain ⚫ Parameter tuning knowledge

  11. Model-driven Deep Learning Model ⚫ Formulations? Task-specific training data – Energy model – Statistical model Deep learning – Image priors ⚫ Parameters? – Hyperparameters – Statistical model parameters ⚫ Strategies? – Gradient updates in optimization – Actions in control Why model-driven? Explainable ML; Prior knowledge; Traditional model-based approach

  12. Model-driven Deep Learning ⚫ Optimization-driven DL – Sparse coding optimization [Karol Gregor, et al, ICML 2010; P. Sprechmann, et al, PAMI 2015, etc.] – Gradient descent, ADMM, proximal operators, etc [J. Sun, et al., CVPR 2011; Y. Yang, J. Sun et al., NIPS 2016; Tim. Meinhardt, et al., ICCV 2017, etc.] ⚫ Statistical model-driven DL – MRF, CRF [S. Zheng, et al., ICCV 2015; J. Sun, et. al., IEEE TIP 2013, etc.] – Variational inference [J. Marino, et al., ICLR 2018; etc ] – EM [D. P. Kingma, ICLR 2014; Greff, Klaus, et al., NIPS 2017, etc] ……

  13. Outline ⚫ Introduction – Background: Image analysis / deep neural networks – Motivation ⚫ Model-driven Deep Learning Approach – Learning Markov Random Field Model for Image Restoration – Deep ADMM-Net for Fast Compressive Sensing MRI – Deep Fusion-Net for Multi-Atlas MR Image Segmentation ⚫ Recent Progress – Learning proximal operators – Multimodal medical image synthesis – Learning Graph CNNs for 3D shape analysis – Learning to Optimize ⚫ Discussion & Conclusion

  14. Example ⚫ Non-local Range MRF [ J. Sun, M. Tappen, CVPR 2011 ]  A novel Markov random field model  Discriminative parameter learning

  15. Example ⚫ Non-local Range MRF [ J. Sun, M. Tappen, CVPR 2011 ]  A novel Markov random field model  Discriminative parameter learning Non-local Range MRF

  16. Example ⚫ Non-local Range MRF [ J. Sun, M. Tappen, CVPR 2011 ]  A novel Markov random field model  Discriminative parameter learning Non-local Range MRF

  17. Example ⚫ Non-local Range MRF [ J. Sun, M. Tappen, CVPR 2011 ]  A novel Markov random field model  Discriminative parameter learning Non-local Range MRF

  18. Example ⚫ Non-local Range MRF [ J. Sun, M. Tappen, CVPR 2011 ]  A novel Markov random field model  Discriminative parameter learning Non-local Range MRF unfolding

  19. Non-local Range Markov Random Field Model ⚫ Gradients of loss function w.r.t. model parameters KEY IDEA: Similar to a Neural Network with K layers – General framework to compute gradient of the parameter Back-propagation:

  20. Outline ⚫ Introduction – Background: Image analysis / deep neural networks – Motivation ⚫ Model-driven Deep Learning Approach – Learning Markov Random Field Model for Image Restoration – Deep ADMM-Net for Fast Compressive Sensing MRI – Deep Fusion-Net for Multi-Atlas MR Image Segmentation ⚫ Recent Progress – Learning proximal operators – Multimodal medical image synthesis – Learning Graph CNNs for 3D shape analysis – Learning to Optimize ⚫ Discussion & Conclusion

  21. Deep ADMM-Net for Compressive Sensing MRI Image Reconstruction ◆ Less sampling and fast reconstruction ? Reconstruction ◆ Compressive sensing : A dominant approach in fast MRI reconstruction [1] Michael Lustig,David L. Donoho,Compressed Sensing MRI, IEEE SIGNAL PROCESSING MAGAZINE, 2008.

  22. Deep ADMM-Net for Compressive Sensing A basic compressive sensing (CS) model: A : measurement matrix, A = PF ( P : Sampling matrix; F : Fourier transform) D l : filter matrix corresponding to convolution operation : regularization term, e.g., l 0 , l 1 norm : regularization term l l

  23. Deep ADMM-Net for Compressive Sensing ADMM (Alternating Direction Method of Multipliers) Augmented Lagrangian function: ADMM iterations: [Y Yang, J Sun, et al., NIPS 2016]

  24. Deep ADMM-Net for Compressive Sensing Data Flow Graph (DFG) for ADMM C ( n ) = D l x ( n ) Unfolding to stage n in DFG

  25. Deep ADMM-Net for Compressive Sensing ⚫ Deep ADMM-Net: Reconstruction layer (X (n) ): Convolution layer (C (n) ): Nonlinear transform layer (Z (n) ): Multiplier updating layer (M (n) ):

  26. Deep ADMM-Net for Compressive Sensing ⚫ Network training: Gradient computation by backpropagation Parameter optimization: L-BFGS

  27. Deep ADMM-Net for Compressive Sensing ⚫ Training Data Generation Sampling in k-space … … ground truth Observe ved data ⚫ Training loss

  28. Deep ADMM-Net for Compressive Sensing

  29. Deep ADMM-Net for Compressive Sensing ⚫ Extensions of ADMM-Net ( [IEEE PAMI, 2018] ) – More flexible network structure

  30. Deep ADMM-Net for Compressive Sensing ADMM-Net-v2 … … stage n …

  31. Deep ADMM-Net for Compressive Sensing

  32. Deep ADMM-Net for Compressive Sensing

  33. Deep ADMM-Net for Compressive Sensing Our results : ground truth :

  34. Deep ADMM-Net for Compressive Sensing Applications to more general compressive imaging: Bottleneck Fast inversion: • Partial Fourier matrix • Random matrix with orthogonal rows • Structurally random matrix

  35. Deep ADMM-Net for Compressive Sensing Natural image compressive sensing

  36. Outline ⚫ Introduction – Background: Image analysis / deep neural networks – Motivation ⚫ Model-driven Deep Learning Approach – Learning Markov Random Field Model for Image Restoration – Deep ADMM-Net for Fast Compressive Sensing MRI – Deep Fusion-Net for Multi-Atlas MR Image Segmentation ⚫ Recent Progress – Learning proximal operators – Multimodal medical image synthesis – Learning Graph CNNs for 3D shape analysis – Learning to Optimize ⚫ Discussion & Conclusion

  37. Deep Fusion Net for MR Image Segmentation Introduction ⚫ Background : Multi-atlas segmentation has been one of the most widely-used and successful medical image segmentation techniques in the past decade. Registration Atlas Selection Target Image ? Atlases weighted voting Label Fusion statistical theory … … Image Label Iglesias, J.E., et. al: Multi-atlas segmentation of biomedical images: a survey. (Med. Image Anal. 2015 )

  38. Deep Fusion Net for MR Image Segmentation Non-local patch-based label fusion (NL-PLF) model ⚫ Label fusion: w pq Fusion weight: 1. Intensity (Coupe et al., 2011) 2. Intensity + spatial context (Wang et al., 2014) Hand-crafted 3. Intensity + gradient + contextual (Bai et al., features 2015) [1] Coupe, P., et al. Patch-based segmentation using expert priors: Application to hippocampus and ventricle segmentation. (NeuroImage 2011) [2] Wang Z, et al. Geodesic patch-based segmentation. (MICCAI 2014) [3] Bai, W., et al. Multi-atlas segmentation with augmented features for cardiac MR images. (Med. Image Anal. 2015)

  39. Deep Fusion Net for MR Image Segmentation Deep Fusion Net • Deep Fusion Net ( MICCAI 2016 ) : An end-to-end learnable deep architecture for NL-PLF concatenating feature extraction and non-local patch-based label fusion F ( T ; q ) F ( X 1 ; q ) CNN layers for Atlas X 1 feature extraction F ( X 2 ; q ) Atlas X 2 Deep features Feature extraction [H. R. Yang, J. Sun, et al., MICCAI 2016, Medical Image Analysis, 2018]

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend