Multi-Input Cardiac Image Super-Resolution using Convolutional - PowerPoint PPT Presentation

Multi-Input Cardiac Image Super-Resolution using Convolutional Neural Networks Ozan Oktay, Wenjia Bai, Matthew Lee, Ricardo Guerrero, Konstantinos Kamnitsas, Jose Caballero, Antonio de Marvao, Stuart Cook, Declan O’Regan, and Daniel Rueckert 19 th International Conference on Medical Image Computing and Computer Assisted Interventions (MICCAI 2016) October 2016, Athens

Clinical Motivation SAX Cardiac MR Image Acquisition • Large slice thickness (8-10 mm) Slice I Slice III • Due to constrains on SNR, Slice II acquisition and breath-hold time Slice IV • It hampers subsequent image analysis and quantitative measurements. 2

Clinical Motivation SAX Cardiac MR Image Acquisition 4 - Chamber LAX Slice • Large slice thickness (8-10 mm) 2 - Chamber • Due to constrains on SNR, LAX Slice acquisition and breath-hold time • It hampers subsequent image analysis and quantitative measurements. • LAX image acquisitions are performed to complement SAX images 3

Low and High Resolution Images PSF kernel and Down-sample patient motion 3D HR Image Sinc Filter Sub-sampling grid Output Input Image Super Resolution Model 2D LAX Images 2D LAX Images 3D LR Image 3D LR Image 4

Related Work on Super Resolution External Example & Model Based SR I. Coupled Dictionary Learning and S. Coding [Yang et al TIP’12, Bhatia K. ISBI’14 ] LR Image HR Dictionary LR Dictionary HR Image 5

Related Work on Super Resolution External Example & Model Based SR I. Coupled Dictionary Learning and S. Coding [Yang et al TIP’12, Bhatia K. ISBI’14 ] LR Image II. Multi-Atlas Based SR Techniques [Shi et al. MICCAI’13] HR atlases HR Image 6

Related Work on Super Resolution External Example & Model Based SR I. Coupled Dictionary Learning and S. Coding [Yang et al TIP’12, Bhatia K. ISBI’14 ] LR Image II. Multi-Atlas Based SR Techniques [Shi et al. MICCAI’13] Regression III. Decision Forest based Regression Tree [Alexander et al MICCAI’14, Schulter S. CVPR’15] HR Image 7

Related Work on Super Resolution External Example & Model Based SR I. Coupled Dictionary Learning and S. Coding [Yang et al TIP’12, Bhatia K. ISBI’14 ] LR Image II. Multi-Atlas Based SR Techniques Convolution and Non-Linear Units [Shi et al. MICCAI’13] III. Decision Forest based Regression [Alexander et al MICCAI’14, Schulter S. CVPR’15] IV. Neural Network based Regression i. CNNs [Dong et al. ECCV’14, Shi et al. CVPR’16] ii. CNNs + GANs [Ledig et al Arxiv Sept’16] HR Image 8

Proposed 3D-SR Model (Single-Image) Components of the model - 3D Convolution and Deconvolution (inverse convolution) Kernels - Rectified Linear Units (ReLUs) - Regression Based Cost Function (Smooth L1-Norm) - Input (2D Stack-LR) and Output (3D-HR) Images 9

Proposed 3D-SR Model (Single-Image) Proposed improvements on SR-CNN model: I. Residual Learning • An easier regression problem to solve • Robust and faster model convergence 10

Proposed 3D-SR Model (Single-Image) Proposed improvements on SR-CNN model: II. Learning Upsampling Layers • End-to-end training of convolution and upsampling kernels 11

Proposed 3D-SR Model (Single-Image) Proposed improvements on SR-CNN model: III. Multi-Input model extension • Constrains the regression task with more input data • In cardiac imaging usually multiple image stacks are acquired. 12

Proposed 3D-SR Model (Multi-Image) - Siamese model is used to combine information from multiple stacks - The learned kernels can be easily integrated in this multi-model. 13

Method Evaluation Strategy I. Image Quality Analysis § Peak-to-Signal-Noise Ratio (PSNR) (Images from 300 Subjects) § Structural Similarity Index Measure (SSIM) [Wang et al. IEEE TIP’04] II. Subsequent Image Analysis (SR is used for pre-processing) § Cardiac Image Segmentation (Images from 18 Subjects) § Cardiac Motion Tracking (Images from 10 Subjects) III. Our method is compared against: § Linear, C-Spline, MAPM [Shi MICCAI’13], CNN [Dong TPAMI’15] 14

Image Quality Assessment Table 1: Quantitative comparison of di ff erent image upsampling methods. Method PSNR (dB) SSIM Linear 20.83 ± 1.10 .70 ± .03 CSpline 22.38 ± 1.13 .73 ± .03 MAPM 22.75 ± 1.22 .73 ± .03 sh-CNN 23.67 ± 1.18 .74 ± .02 CNN 24.12 ± 1.18 .76 ± .02 de-CNN 24.45 ± 1.20 .77 ± .02 • MAPM: Multi-Atlas Patch Match [Shi et al MICCAI’13] • sh-CNN: 4 - Layer Network without Deconvolution Layer [Dong TPAMI’15] • CNN: 7 - Layer Network without Deconvolution Layer • de-CNN: 7 - Layer Network with Deconvolution Layer 15

Image Quality Assessment Low Resolution Linear The Proposed High Resolution Input Image Interpolation Method Ground-truth Upsampling x5 Inference Time: 6-8 Seconds for image size (140x140x10) 16

Image Quality Assessment 0.79 deCNN CSpline MAPM nrCNN 24.7 Structural Similarity Index 0.78 24.4 0.77 (Dashed Lines) 24.1 0.76 PSNR (dB) 23.8 0.75 23.5 0.74 23.2 0.73 22.9 0.72 22.6 0.71 22.3 0.7 0 5 10 15 Number of Training Epochs • nr-CNN: 7 - Layer Network without Residual Learning. • de-CNN: 7 - Layer Network with Residual Learning 17

Experiments with Multi-Image Model Table 2: Image quality results obtained with three di ff erent models: single-image de-CNN, Siamese, and multi-channel (MC) CNN. Method PSNR (dB) SSIM de-CNN(SAX) 24.76 ± 0.48 .807 ± .009 Siamese(SAX/4CH) 25.13 ± 0.48 .814 ± .013 MC(SAX/4CH) 25.15 ± 0.47 .814 ± .012 MC(SAX/2/4CH) 25.26 ± 0.37 .818 ± .012 • MC (SAX/4CH): Multi-Channel input – SAX and 4 Chamber LAX Images • MC (SAX/2/4CH): Multi-Channel input – SAX and 2/4 Chamber LAX Images 18

Motion Tracking Experiments (SR is used as a preprocessing method) Surface to Surface Distance Surface to Surface Distance (Linear vs HR) 5.50 mm (Proposed vs HR) 4.73 mm 19

Motion Tracking Experiments (SR is used as a preprocessing method) 20

LV Segmentation Experiments (SR is used as a preprocessing method) Table 3: Segmentation results for di ff erent upsampling methods, CSpline ( p = . 007) and MAPM ( p = . 009). They are compared in terms of mean and Hausdor ff distances (MYO) and LV cavity volume di ff erences (w.r.t. manual annotations). Linear CSpline MAPM de-CNN High Res LV Vol Di ff (ml) 11.72 ± 6.96 10.80 ± 6.46 9.55 ± 5.42 8.24 ± 5.47 Exp (c) 9.09 ± 5.36 Mean Dist (mm) 1.49 ± 0.30 1.45 ± 0.29 1.40 ± 0.29 1.38 ± 0.28 1.38 ± 0.29 Haus Dist (mm) 7.74 ± 1.73 7.29 ± 1.63 6.83 ± 1.61 6.70 ± 1.85 6.67 ± 1.77 • Multi-Atlas patch based label fusion [Coupe NeuroImage’11] is used to segment images (20 Atlases) 21

Difference Between Trained and Fixed Deconvolution Kernels 22

Take Home Messages I. SR as a preprocessing step / Could it replace standard interpolation techniques ? II. Importance of learning upsampling filters and residual connections in SR models. III. Models could be trained with combined images and stacks acquired from different directions. IV. Future work a. Other imaging modalities or applications (DTI or MR Image Reconstruction) b. Perceptual loss function: Could it be applicable to medical images ? 23

Multi-Input Cardiac Image Super-Resolution using Convolutional Neural Networks Acknowledgments: Poster Session 1 – Cardiac Image Analysis (CARD) – PS1.40

Some Additional Slides Additional Details about the SR-CNN Model 25

Model Training Strategy I. Batch Normalization [Ioffe and Szegedy ICML’15] • Faster Model Convergence. • Reduces the dependency of model on filter coefficient initialization. II. Data Augmentation • Training data, LR-HR pairs, are generated from 3D-HR Images based on the following model [Shi et al. MICCAI’13]: • Trained with cine cardiac HR - MR images acquired from 930 healthy adult subjects. x = DBSMy + η III. Smooth L1-Norm Function • Improves the convergence when outliers are observed in training data. 26

Number of Feature Maps / Atlases Table 1: Quantitative comparison of di ff erent image upsampling methods. Exp (a) PSNR (dB) SSIM # Filters/Atlases Linear 20.83 ± 1.10 .70 ± .03 – CSpline 22.38 ± 1.13 .73 ± .03 – MAPM 22.75 ± 1.22 .73 ± .03 350 sh-CNN 23.67 ± 1.18 .74 ± .02 64,64,32,1 CNN 24.12 ± 1.18 .76 ± .02 64,64,32,16,8,4,1 de-CNN 64,64,32,16,8,4,1 24.45 ± 1.20 .77 ± .02 • MAPM: Multi-Atlas Patch Match [Shi et al MICCAI’13] • sh-CNN: 4 - Layer Network without Deconvolution Layer [Dong TPAMI’15] • CNN: 7 - Layer Network without Deconvolution Layer • de-CNN: 7 - Layer Network with Deconvolution Layer 27

SR-CNN (9-5-5) - ImageNet Low Resolution Cubic Spline SR-CNN (9-5-5) Output Input Image Interpolation Image Upsampling x4 A 3-Layer model is trained with ImageNet Dataset 28

Image Quality Assessment Low Resolution Linear SR-CNN Output High Resolution Input Image Interpolation Image Ground-truth Upsampling x5 Inference Time: 6-8 Seconds for image size (140x140x10) 29

Multi-Input Cardiac Image Super-Resolution using Convolutional - PowerPoint PPT Presentation

Multi-Input Cardiac Image Super-Resolution using Convolutional Neural Networks Ozan Oktay, Wenjia Bai, Matthew Lee, Ricardo Guerrero, Konstantinos Kamnitsas, Jose Caballero, Antonio de Marvao, Stuart Cook, Declan ORegan, and Daniel Rueckert

Chapter 4 Multiframe Super-resolution Image Reconstruction 1 Multi-frame SRIR (Video

Residuals in Deep Super-Resolution Ruofan Zhou, Fayez Lahoud , Majed EI Helou, and Sabine

UNVEILING THE SUPER ORBITAL UNVEILING THE SUPER ORBITAL UNVEILING THE SUPER-ORBITAL UNVEILING

SYNTHESIS OF SUPER SYNTHESIS OF SUPER NANOPOROUS SYNTHESIS OF SUPER SYNTHESIS OF

Super GPU & Super Kernels: Make programming of multi-GPU systems easy Michael Frumkin, May 8,

Super-Resolution via Image Recapture and Bayesian Effect Modeling Neil Toronto Oral Thesis

SIGBI Limited General Meeting 2019 Resolutions 1-6 Resolution 1 Resolution 2 Resolution 3

Patagonia Gold Plc 2009 Patagonia Gold VOTING ORDINARY SPECIAL Resolution 1 Resolution 2

File Input and Output File Input and Output 1 / 9 File input/output input function reads values

Input Input devices Text entry Positional input Input Devices 1 iPod Wheel Input Devices 2

Multi-Modal Spectral Image Super-Resolution IVRL Prime Fayez Lahoud, Ruofan Zhou, Sabine

Applications of GANs Photo-Realistic Single Image Super-Resolution Using a Generative

Applications of GANs Photo-Realistic Single Image Super-Resolution Using a Generative

Image Restoration Image Enhancement and Image Restoration both deal with improving images. Image

Tra ffi c Management as a Service | Ghent, Belgium INPUT PROCESS OUTPUT INPUT PROCESS OUTPUT

Input Input devices Text entry Positional input Input Devices 1 MacBook Wheel (The Onion) -

Identifying Hazards in the Cardiac Operating Room* Event-driven, nonlinear, unpredictable

February 22, 2017 Presenters Heather Holsey Josiah Mueller Rhya Ghose 2 Disclaimer

Why could patients with HF and T2DM benefit from SGLT2i? Subodh Verma, MD Ontario, Canada May

Renal cell cancer 9 1 0 2 High incidence in North America and s Western Europe s a l

PHLAME a TWH for firefighters: Outcomes to Out There (Lessons for taking Science to Service)

Cardiac Rehab Utilization among Medicare Fee-for-service Beneficiaries, 2013 [preliminary

Integrative human cardiovascular control Danish Cardiovascular Research Academy Ph.D. course The

Therapist Empathy 60 Miller,Taylor &West (1980) Client Drinking Outcomes Accounted for by

Sambuz

Useful Links

Newsletter

Mail Us

Multi-Input Cardiac Image Super-Resolution using Convolutional - PowerPoint PPT Presentation

Multi-Input Cardiac Image Super-Resolution using Convolutional Neural Networks Ozan Oktay, Wenjia Bai, Matthew Lee, Ricardo Guerrero, Konstantinos Kamnitsas, Jose Caballero, Antonio de Marvao, Stuart Cook, Declan ORegan, and Daniel Rueckert

Chapter 4 Multiframe Super-resolution Image Reconstruction 1 Multi-frame SRIR (Video

Residuals in Deep Super-Resolution Ruofan Zhou, Fayez Lahoud , Majed EI Helou, and Sabine

UNVEILING THE SUPER ORBITAL UNVEILING THE SUPER ORBITAL UNVEILING THE SUPER-ORBITAL UNVEILING

SYNTHESIS OF SUPER SYNTHESIS OF SUPER NANOPOROUS SYNTHESIS OF SUPER SYNTHESIS OF

Super GPU &amp; Super Kernels: Make programming of multi-GPU systems easy Michael Frumkin, May 8,

Super-Resolution via Image Recapture and Bayesian Effect Modeling Neil Toronto Oral Thesis

SIGBI Limited General Meeting 2019 Resolutions 1-6 Resolution 1 Resolution 2 Resolution 3

Patagonia Gold Plc 2009 Patagonia Gold VOTING ORDINARY SPECIAL Resolution 1 Resolution 2

File Input and Output File Input and Output 1 / 9 File input/output input function reads values

Input Input devices Text entry Positional input Input Devices 1 iPod Wheel Input Devices 2

Multi-Modal Spectral Image Super-Resolution IVRL Prime Fayez Lahoud, Ruofan Zhou, Sabine

Applications of GANs Photo-Realistic Single Image Super-Resolution Using a Generative

Applications of GANs Photo-Realistic Single Image Super-Resolution Using a Generative

Image Restoration Image Enhancement and Image Restoration both deal with improving images. Image

Tra ffi c Management as a Service | Ghent, Belgium INPUT PROCESS OUTPUT INPUT PROCESS OUTPUT

Input Input devices Text entry Positional input Input Devices 1 MacBook Wheel (The Onion) -

Identifying Hazards in the Cardiac Operating Room* Event-driven, nonlinear, unpredictable

February 22, 2017 Presenters Heather Holsey Josiah Mueller Rhya Ghose 2 Disclaimer

Why could patients with HF and T2DM benefit from SGLT2i? Subodh Verma, MD Ontario, Canada May

Renal cell cancer 9 1 0 2 High incidence in North America and s Western Europe s a l

PHLAME a TWH for firefighters: Outcomes to Out There (Lessons for taking Science to Service)

Cardiac Rehab Utilization among Medicare Fee-for-service Beneficiaries, 2013 [preliminary

Integrative human cardiovascular control Danish Cardiovascular Research Academy Ph.D. course The

Therapist Empathy 60 Miller,Taylor &amp;West (1980) Client Drinking Outcomes Accounted for by

Sambuz

Useful Links

Newsletter

Mail Us

Super GPU & Super Kernels: Make programming of multi-GPU systems easy Michael Frumkin, May 8,

Therapist Empathy 60 Miller,Taylor &West (1980) Client Drinking Outcomes Accounted for by