Model-driven Deep Learning Jian Sun ( ) Xi'an Jiaotong University - PowerPoint PPT Presentation

Model-driven Deep Learning Jian Sun ( 孙剑 ) Xi'an Jiaotong University Email : jiansun@mail.xjtu.edu.cn Home page : http://jiansun.gr.xjtu.edu.cn April, 2019

Outline ⚫ Introduction – Background: Image analysis / deep neural networks – Motivation ⚫ Model-driven Deep Learning Approach – Learning Markov Random Field Model for Image Restoration – Deep ADMM-Net for Fast Compressive Sensing MRI – Deep Fusion-Net for Multi-Atlas MR Image Segmentation ⚫ Recent Progress – Learning proximal operators – Multimodal medical image synthesis – Learning Graph CNNs for 3D shape analysis – Learning to Optimize ⚫ Discussion & Conclusion

Backgrounds--Image Processing & Analysis ⚫ Restoration & Reconstruction Image Degradation : noises, motion blur, k-space sampling, etc. Physical imaging model Restoration & Reconstruction ？ Inverse Problems

Backgrounds--Image Processing & Analysis ⚫ Segmentation & Recognition Semantic Segmentation Lesion (Pulmonary nodule) localization and classification

Backgrounds--Models ⚫ Conventional Models: Signal processing approaches – Wavelets – Image Filtering

Backgrounds--Models ⚫ Conventional Models: Energy model and its optimization – Energy Model with Regularization x * = argmin D ( x , y ; w ) + R ( w ) x – Dictionary Learning Applications: Image Restoration / Segmentation / Classification / MRI / Lesion detection

Backgrounds--Models ⚫ Conventional Models: statistical models Evidence lower bound (ELBO) Expectation-maximization (EM) Variational Inference Variational expectation-maximization

Backgrounds--Deep Neural Networks ⚫ Deep Convolutional Neural Network CNN [ Krizhevsky A, et al., 2012]

Backgrounds--Deep Neural Networks ⚫ LSTM: A [Hochreiter & Schmidhuber,1997] ⚫ GAN Generator Discriminator true/fake [Ian Goodfellow et al., 2014]

Conventional Model Vs. Deep NNs Conventional Models Deep Neural Networks ( Optimization / statistics / energy model… ) ( CNN / LSTM / GAN…. ) Pros: Pros: ⚫ Easy to incorporate domain ⚫ An universal regressor knowledge ⚫ Efficiency ⚫ Rely on less training data ⚫ Effectiveness ⚫ Good generalization ability Cons: Cons: ⚫ Rely on large training set ⚫ Maybe not optimal for specific ⚫ Relatively fixed structure task ⚫ Hardly incorporate domain ⚫ Parameter tuning knowledge

Model-driven Deep Learning Model ⚫ Formulations? Task-specific training data – Energy model – Statistical model Deep learning – Image priors ⚫ Parameters? – Hyperparameters – Statistical model parameters ⚫ Strategies? – Gradient updates in optimization – Actions in control Why model-driven? Explainable ML; Prior knowledge; Traditional model-based approach

Model-driven Deep Learning ⚫ Optimization-driven DL – Sparse coding optimization [Karol Gregor, et al, ICML 2010; P. Sprechmann, et al, PAMI 2015, etc.] – Gradient descent, ADMM, proximal operators, etc [J. Sun, et al., CVPR 2011; Y. Yang, J. Sun et al., NIPS 2016; Tim. Meinhardt, et al., ICCV 2017, etc.] ⚫ Statistical model-driven DL – MRF, CRF [S. Zheng, et al., ICCV 2015; J. Sun, et. al., IEEE TIP 2013, etc.] – Variational inference [J. Marino, et al., ICLR 2018; etc ] – EM [D. P. Kingma, ICLR 2014; Greff, Klaus, et al., NIPS 2017, etc] ……

Example ⚫ Non-local Range MRF [ J. Sun, M. Tappen, CVPR 2011 ]  A novel Markov random field model  Discriminative parameter learning

Example ⚫ Non-local Range MRF [ J. Sun, M. Tappen, CVPR 2011 ]  A novel Markov random field model  Discriminative parameter learning Non-local Range MRF

Example ⚫ Non-local Range MRF [ J. Sun, M. Tappen, CVPR 2011 ]  A novel Markov random field model  Discriminative parameter learning Non-local Range MRF unfolding

Non-local Range Markov Random Field Model ⚫ Gradients of loss function w.r.t. model parameters KEY IDEA: Similar to a Neural Network with K layers – General framework to compute gradient of the parameter Back-propagation:

Deep ADMM-Net for Compressive Sensing MRI Image Reconstruction ◆ Less sampling and fast reconstruction ? Reconstruction ◆ Compressive sensing ： A dominant approach in fast MRI reconstruction [1] Michael Lustig,David L. Donoho,Compressed Sensing MRI, IEEE SIGNAL PROCESSING MAGAZINE, 2008.

Deep ADMM-Net for Compressive Sensing A basic compressive sensing (CS) model: A : measurement matrix, A = PF ( P : Sampling matrix; F : Fourier transform) D l : filter matrix corresponding to convolution operation : regularization term, e.g., l 0 , l 1 norm : regularization term l l

Deep ADMM-Net for Compressive Sensing ADMM (Alternating Direction Method of Multipliers) Augmented Lagrangian function: ADMM iterations: [Y Yang, J Sun, et al., NIPS 2016]

Deep ADMM-Net for Compressive Sensing Data Flow Graph (DFG) for ADMM C ( n ) = D l x ( n ) Unfolding to stage n in DFG

Deep ADMM-Net for Compressive Sensing ⚫ Deep ADMM-Net: Reconstruction layer (X (n) ): Convolution layer (C (n) ): Nonlinear transform layer (Z (n) ): Multiplier updating layer (M (n) ):

Deep ADMM-Net for Compressive Sensing ⚫ Network training: Gradient computation by backpropagation Parameter optimization: L-BFGS

Deep ADMM-Net for Compressive Sensing ⚫ Training Data Generation Sampling in k-space … … ground truth Observe ved data ⚫ Training loss

Deep ADMM-Net for Compressive Sensing

Deep ADMM-Net for Compressive Sensing ⚫ Extensions of ADMM-Net ( [IEEE PAMI, 2018] ) – More flexible network structure

Deep ADMM-Net for Compressive Sensing ADMM-Net-v2 … … stage n …

Deep ADMM-Net for Compressive Sensing

Deep ADMM-Net for Compressive Sensing Our results ： ground truth ：

Deep ADMM-Net for Compressive Sensing Applications to more general compressive imaging: Bottleneck Fast inversion: • Partial Fourier matrix • Random matrix with orthogonal rows • Structurally random matrix

Deep ADMM-Net for Compressive Sensing Natural image compressive sensing

Deep Fusion Net for MR Image Segmentation Introduction ⚫ Background : Multi-atlas segmentation has been one of the most widely-used and successful medical image segmentation techniques in the past decade. Registration Atlas Selection Target Image ？ Atlases weighted voting Label Fusion statistical theory … … Image Label Iglesias, J.E., et. al: Multi-atlas segmentation of biomedical images: a survey. (Med. Image Anal. 2015 )

Deep Fusion Net for MR Image Segmentation Non-local patch-based label fusion (NL-PLF) model ⚫ Label fusion: w pq Fusion weight: 1. Intensity (Coupe et al., 2011) 2. Intensity + spatial context (Wang et al., 2014) Hand-crafted 3. Intensity + gradient + contextual (Bai et al., features 2015) [1] Coupe, P., et al. Patch-based segmentation using expert priors: Application to hippocampus and ventricle segmentation. (NeuroImage 2011) [2] Wang Z, et al. Geodesic patch-based segmentation. (MICCAI 2014) [3] Bai, W., et al. Multi-atlas segmentation with augmented features for cardiac MR images. (Med. Image Anal. 2015)

Deep Fusion Net for MR Image Segmentation Deep Fusion Net • Deep Fusion Net ( MICCAI 2016 ) : An end-to-end learnable deep architecture for NL-PLF concatenating feature extraction and non-local patch-based label fusion F ( T ; q ) F ( X 1 ; q ) CNN layers for Atlas X 1 feature extraction F ( X 2 ; q ) Atlas X 2 Deep features Feature extraction [H. R. Yang, J. Sun, et al., MICCAI 2016, Medical Image Analysis, 2018]

Model-driven Deep Learning Jian Sun ( ) Xi'an Jiaotong University - PowerPoint PPT Presentation

Model-driven Deep Learning Jian Sun ( ) Xi'an Jiaotong University Email : jiansun@mail.xjtu.edu.cn Home page : http://jiansun.gr.xjtu.edu.cn April, 2019 Outline Introduction Background: Image analysis / deep neural networks

Hao Su July 6, 2017 Outline Overview of 3D deep learning 3D deep learning algorithms

All You Want To Know About CNNs Yukun Zhu Deep Learning Deep Learning Image from

Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and

Priority-Driven Scheduling of Periodic Tasks Priority-driven vs. clock-driven scheduling:

Relational Deep Learning: A Deep Latent Variable Model for Link Prediction Hao Wang, Xingjian

False fasting is driven by pride False fasting is driven by pride False fasting is

AGN deep multiwavelength AGN deep multiwavelength AGN deep multiwavelength surveys: surveys:

Deep Learning: Theory and Practice Deep Learning - Practical 02-04-2020 Considerations

Presentation about Deep Learning --- Zhongwu xie Contents 1.Brief introduction of Deep learning.

Deep Learning on GPUs March 2016 What is Deep Learning? GPUs and DL AGENDA DL in practice

Deep learning Deep reinforcement learning Hamid Beigy Sharif university of technology December

Differen'able Func'onal Programming Noel Welsh @noelwelsh underscore Goals Deep learning

DSC 102 Systems for Scalable Analytics Arun Kumar Topic 6: Deep Learning Systems 1 Outline

Deep Reinforcement Learning and Complex Environments Raia Hadsell End-to-end Deep Learning

ACCELERATE DEEP LEARNING WITH NVIDIA'S DEEP LEARNING PLATFORM | STEPHEN JONES | GTC16 DEEP

Deep learning for natural language processing A short primer on deep learning Benoit Favre <

Nitrite bioactivation by Globin X in zebrafish blood and effects on heart regeneration Paola

Breast Cancer Risk categories: Objectives 1. Breast cancer risk 2. CVD risk or benefit 3. Venous

Disclosure Statement Honorarium & Consultation Medtronic Inc. Biosense Webster Inc.

CLIA and Point of Care Testing Serafina Brea, MBEE, MLS(ASCP) CM Clinical Laboratory Scientist

Data & Safety Monitoring Board October 30, 2012 Valentin Fuster, MD PhD FREEDOM Trial Main

Computational Methods and Software for Bioelectric Field Problems Christopher R. Johnson

Xavier Pennec Asclepios team, INRIA Sophia-Antipolis Mediterrane, France with V. Arsigny,

HFES Public Outreach Webinar Series The Real Reasons You Want Sit/Stand Workstations in Your

Model-driven Deep Learning Jian Sun ( ) Xi'an Jiaotong University - PowerPoint PPT Presentation

Model-driven Deep Learning Jian Sun ( ) Xi'an Jiaotong University Email : jiansun@mail.xjtu.edu.cn Home page : http://jiansun.gr.xjtu.edu.cn April, 2019 Outline Introduction Background: Image analysis / deep neural networks

Hao Su July 6, 2017 Outline Overview of 3D deep learning 3D deep learning algorithms

All You Want To Know About CNNs Yukun Zhu Deep Learning Deep Learning Image from

Deep Neural Networks and Deep Reinforcement Learning Deep Learning, Goodfellow, Bengio and

Priority-Driven Scheduling of Periodic Tasks Priority-driven vs. clock-driven scheduling:

Relational Deep Learning: A Deep Latent Variable Model for Link Prediction Hao Wang, Xingjian

False fasting is driven by pride False fasting is driven by pride False fasting is

AGN deep multiwavelength AGN deep multiwavelength AGN deep multiwavelength surveys: surveys:

Deep Learning: Theory and Practice Deep Learning - Practical 02-04-2020 Considerations

Presentation about Deep Learning --- Zhongwu xie Contents 1.Brief introduction of Deep learning.

Deep Learning on GPUs March 2016 What is Deep Learning? GPUs and DL AGENDA DL in practice

Deep learning Deep reinforcement learning Hamid Beigy Sharif university of technology December

Differen'able Func'onal Programming Noel Welsh @noelwelsh underscore Goals Deep learning

DSC 102 Systems for Scalable Analytics Arun Kumar Topic 6: Deep Learning Systems 1 Outline

Deep Reinforcement Learning and Complex Environments Raia Hadsell End-to-end Deep Learning

ACCELERATE DEEP LEARNING WITH NVIDIA'S DEEP LEARNING PLATFORM | STEPHEN JONES | GTC16 DEEP

Deep learning for natural language processing A short primer on deep learning Benoit Favre &lt;

Nitrite bioactivation by Globin X in zebrafish blood and effects on heart regeneration Paola

Breast Cancer Risk categories: Objectives 1. Breast cancer risk 2. CVD risk or benefit 3. Venous

Disclosure Statement Honorarium &amp; Consultation Medtronic Inc. Biosense Webster Inc.

CLIA and Point of Care Testing Serafina Brea, MBEE, MLS(ASCP) CM Clinical Laboratory Scientist

Data &amp; Safety Monitoring Board October 30, 2012 Valentin Fuster, MD PhD FREEDOM Trial Main

Computational Methods and Software for Bioelectric Field Problems Christopher R. Johnson

Xavier Pennec Asclepios team, INRIA Sophia-Antipolis Mediterrane, France with V. Arsigny,

HFES Public Outreach Webinar Series The Real Reasons You Want Sit/Stand Workstations in Your

Deep learning for natural language processing A short primer on deep learning Benoit Favre <

Disclosure Statement Honorarium & Consultation Medtronic Inc. Biosense Webster Inc.

Data & Safety Monitoring Board October 30, 2012 Valentin Fuster, MD PhD FREEDOM Trial Main