Sparsity Based Spectral Embedding: Application to Multi-Atlas - - PowerPoint PPT Presentation

sparsity based spectral embedding application to multi
SMART_READER_LITE
LIVE PREVIEW

Sparsity Based Spectral Embedding: Application to Multi-Atlas - - PowerPoint PPT Presentation

Sparsity Based Spectral Embedding: Application to Multi-Atlas Echocardiography Segmentation Ozan Oktay, Wenzhe Shi, Jose Caballero, Kevin Keraudren, and Daniel Rueckert Department of Compu.ng Imperial


slide-1
SLIDE 1

Sparsity Based Spectral Embedding: Application to Multi-Atlas Echocardiography Segmentation

Ozan Oktay, Wenzhe Shi, Jose Caballero, Kevin Keraudren, and Daniel Rueckert

  • Challenge on Endocardial Three-dimensional

Ultrasound Segmentation, MICCAI 2014 14th September 2014

Department ¡of ¡Compu.ng ¡ Imperial ¡College ¡London ¡

slide-2
SLIDE 2

2

Problem Definition and Literature Review

STMI’14 – September 2014

Problem: Left ventricle (LV) endocardium segmentation in 3D Echo Images

  • Motivation:

Estimation of clinical indices: (1) ejection fraction, (2) stroke volume, and (3) cardiac motion

  • The existing work in the literature:
  • 1. B-spline based active surfaces [Barbosa et al., 2013]
  • 2. Statistical-shape models [Butakoff et al., 2011]
  • 3. Edge based level-set segmentation [Rajpoot et al., 2011]
  • RV

RA LV LA

slide-3
SLIDE 3

3

Multi-Atlas Image Segmentation

STMI’14 – September 2014

§ It uses the manually labeled atlases to segment the target organ. § It does not require any training or prior estimation. § Successfully applied in i. Brain MRI Segmentation [Aljabar et al. 2009 NeuroImage]

  • ii.

Cardiac MRI Segmentation

  • [Isgum et al. 2009 TMI]

Linear and deformable registration Propagate the labels & majority voting Atlas-1 Atlas-2 Atlas-3,4,5

Target image

slide-4
SLIDE 4

4

Proposed Segmentation Framework

Image Registration Segmentation Output

STMI’14 – September 2014

Target Image Speckle Reduction Structural Representation Atlas Image Speckle Reduction Structural Representation

slide-5
SLIDE 5

5 STMI’14 – September 2014

1. Wachinger C. and Navab N.: “Entropy and Laplacian images: Structural representations for multi-modal registration” Medical Image Analysis 2012.

20 40 60 80 100 120 140 160 180 20 40 60 80 100 120 140 160 180 20 40 60 80 100 120 140 160 180 20 40 60 80 100 120 140 160 180

Patch Based Spectral Representation

§ Structural image representation § Unsupervised learning of shape and contextual information. § Laplacian Eigenmaps. § Useful for echo images since intensity data does not explicitly reveal the structural information.

Input image Patch matrix Lower dimensional embedding Manifold Structural representation

slide-6
SLIDE 6

6

Dictionary Based Spectral Representation

STMI’14 – September 2014

Target image Spectral embedding

  • f image patches

Computationally prohibitive

slide-7
SLIDE 7

7

Dictionary Based Spectral Representation

STMI’14 – September 2014

Training Images Dictionary Learning Spectral Embedding

  • f Dictionary Atoms

What is the main motivation for the sparsity and dictionary learning ?

  • § Computationally efficient due to elimination of redundancy

§ Large number of images can be mapped to the same embedding space

slide-8
SLIDE 8

8

Dictionary Based Spectral Representation

STMI’14 – September 2014

Training Images Dictionary Learning Spectral Embedding

  • f Dictionary Atoms

Sparse and Local Coding Spectral Representation Query Image Patches Mapping to the manifold space

slide-9
SLIDE 9

9

20 40 60 80 100 120 140 160 20 40 60 80 100 120 140 160

Input Image Dictionary Learning

min

C,X kY CXk2

s.t. 8i , kxik0  T0.

STMI’14 – September 2014

Dictionary Based Spectral Representation

slide-10
SLIDE 10

10

20 40 60 80 100 120 140 160 20 40 60 80 100 120 140 160

Input Image Dictionary Learning

2 4 6 8 10 12 2 4 6 8 10 12 2 4 6 8 10 12 2 4 6 8 10 12 2 4 6 8 10 12 2 4 6 8 10 12 2 4 6 8 10 12

min

C,X kY CXk2

s.t. 8i , kxik0  T0.

Dictionary Atoms Spectral Embedding

L = V>Λ V

L = D − W

Laplacian Graph Eigen decomposition

STMI’14 – September 2014

Dictionary Based Spectral Representation

slide-11
SLIDE 11

11

Sparse and Local Coding for Spectral Representation

1.

  • K. Yu et al. : “Nonlinear learning using local coordinate coding” NIPS 2009

2.

  • J. Wang et al. : “Locality-constrained linear coding for image classification” CVPR 2010

Locality constrained linear coding

min

X N

P

n=1

kyn C˜ xnk2 + λ kbn ˜ xnk2 s.t. 8n , 1>˜ xn = 1

bn = [b(n,1), . . . , b(n,M)]

STMI’14 – September 2014

20 40 60 80 100 120 140 160 20 40 60 80 100 120 140 160

Input Image Query Patch

2 4 6 8 10 12 2 4 6 8 10 12 2 4 6 8 10 12 2 4 6 8 2 4 6 8 10 12 2 4 6 8 10 12 2 4 6 8 10 12 2 4 6 8 10 12

Atom 1 Atom 2 Atom 3

= ¡ b1× ¡ + ¡b2× ¡ + ¡b3× ¡

b(n,m) = exp

  • kyn cmk2/σ
slide-12
SLIDE 12

12

20 40 60 80 100 120 140 160 20 40 60 80 100 120 140 160 20 40 60 80 100 120 140 160 20 40 60 80 100 120 140 160

Dictionary Based Spectral Representation

20 40 60 80 100 120 140 160 20 40 60 80 100 120 140 160

First Spectral Mode Second Spectral Mode Input Image

STMI’14 – September 2014

slide-13
SLIDE 13

13

Registration Strategy

. . .

Mode K Mode 1

. . .

Mode K Mode 1 Atlas Image Target Image

STMI’14 – September 2014

K

X

k=1

kSAk( T (p) ) S Tk(p)k2 + β R(p) p 2 R3 , T : R3 7! R3

§ B-spline free-form deformations. [Rueckert et al. TMI 99] § Sum of squared differences similarity measure. § Number of modes K = 4.

Single deformation field (T ) for all 3D-3D spectral image pairs T T T

(SAK) (STK)

slide-14
SLIDE 14

14

The Proposed Segmentation Framework

STMI’14 – September 2014

Training dataset (atlases) Dictionary learning Spectral embedding Speckle reduction Target image Locality constrained sparse coding Target image shape representation Multi-atlas segmentation Atlas shape representations

slide-15
SLIDE 15

15

Validation Dataset

  • 1. Training Dataset (Atlases) (15 Patients)
  • 3D+T echo scans.
  • Cross-validation is performed on the training dataset.
  • Ground-truth segmentations are available for ED and ES frames.
  • 2. Testing Dataset (15 Patients)
  • 3D+T echo scans, obtained from different view angles.
  • Only the ED and ES frames are segmented

STMI’14 – September 2014

slide-16
SLIDE 16

16

Other Image Representations

Original echo image Local phase image [2] Spectral Representation Boundary Image [1]

1. Rajpoot, K. et al.: ISBI 2009 2. Zhuang, X. et al.: ISBI 2010

Intensity and phase features

  • Encodes only the tissue

boundary information.

  • It is not sufficient for image

analysis applications Spectral representation

  • Encodes the contextual

information

  • STMI’14 – September 2014
slide-17
SLIDE 17

17

Surface to Surface Distance Errors

0.0 0.2 0.4 0.6 0.8 1.0

0.0 0.2 0.4 0.6 0.8 1.0 Mesh Surface Distance Errors (mm) Testing Dataset Training Dataset (Cross-validation) 1.5 2.0 2.5 3.0 3.5 4.0 4.5 Mean Surface Distance

Unprocessed Images Speckle Reduced Images Phase Symmetry Images Local Phase Images Spectral Representation

Testing Dataset Training Dataset (Cross-validation) 6 7 8 9 10 11 12 13 Maximum Surface Distance

slide-18
SLIDE 18

18

Estimation of Clinical Indices

0.0 0.2 0.4 0.6 0.8 1.0

0.0 0.2 0.4 0.6 0.8 1.0 Percentage Agreement with the Ground Truth Values Testing Dataset Training Dataset (Cross-validation) 0.75 0.80 0.85 0.90 0.95 1.00 Dice Coefficient Testing EF Testing SV Training EF Training SV 0.5 0.6 0.7 0.8 0.9 1.0 Ejection Fraction and Stroke Volume Correlation with Reference Values

Unprocessed Images Speckle Reduced Images Phase Symmetry Images Local Phase Images Spectral Representation

slide-19
SLIDE 19

19

Qualitative Results

Segmentation using the proposed spectral representation Segmentation using local phase image Ground-truth segmentation

Testing Dataset Training Dataset

STMI’14 – September 2014

slide-20
SLIDE 20

20

Conclusion & Acknowledgements

§ Conclusion

  • We propose a novel structural representation for

echocardiography

  • This representation enables accurate multi-atlas segmentation

§ Future work

  • The linear approximation of non-linear manifold can be improved
  • Application of the proposed feature in echocardiography strain

imaging

  • Explore application to multi modal image registration (echo / CT)
slide-21
SLIDE 21

21

  • § Acknowledgements:

Sparsity Based Spectral Embedding: Application to Multi-Atlas Echocardiography Segmentation

Ozan Oktay*, Wenzhe Shi, Jose Caballero, Kevin Keraudren, and Daniel Rueckert *E: o.oktay13@imperial.ac.uk

slide-22
SLIDE 22

22

Comparison against the state-of-the-art echocardiographic image segmentation methods

Table 1: Comparison of the proposed multi-atlas approach (A) against the state-of-the-art echocardiogaphy segmentation: active surfaces [1] and active shape model [2]. Estimated ejection fraction (EF) and end-diastolic volume (EDV ) are compared against their reference values. The correlation accuracy is reported in terms of Pearson’s coefficient (R) and Bland-Altman’s limit of agreement (BA). Mean (mm) REF BAEF (µ ± 2σ) REDV BAEDV (µ ± 2σ) # of Patients (A) 2.32±0.78 0.923

  • 0.74±6.26

0.926 12.88±35.71 15 [1]

  • 0.907
  • 2.4±23

0.971

  • 24.60±21.80

24 [2] 1.84±1.86

  • 0±19
  • 3.06±46.86

10

1. Barbosa, D., et al.: Fast and fully automatic 3-D echocardiographic segmentation using B-spline explicit active surfaces. Ultrasound in medicine and biology (2013) 2. Butakoff, C., et al: Order statistic based cardiac boundary detection in 3D+T echocardiograms. FIMH. Springer (2011)

slide-23
SLIDE 23

23

Accuracy of the Derived Clinical Indices

Table 2: This table shows the accuracy of the derived clinical indices for the training (Patient 1 to 15) and testing datasets (Patient 16 to 30). Pearson’s correlation coefficient (PCC) and Bland-Altman’s limit of agreement (µ±1.96σ) values are given for the following indices: ejection fraction, stroke volume, end- systolic volume, and end-diastolic volume. Testing dataset PCC LOA (µ ± 1.96σ) ED volume (ml) 0.926 12.81±33.77 ES volume (ml) 0.936

  • 7.77±28.27

Ejection fraction (%) 0.923 0.74±7.58 Stroke volume (ml) 0.832

  • 5.05±12.49

Training dataset PCC LOA (µ ± 1.96σ) ED volume (ml) 0.983 9.80±45.66 ES volume (ml) 0.961 11.21±56.91 Ejection fraction (%) 0.787

  • 1.07±18.52

Stroke volume (ml) 0.856

  • 1.38±27.08
slide-24
SLIDE 24

24

Method runtime

§ Results on 3 GHz machine § Speckle reduction: 3 min § Shape representation: 2.5 min § Deformable registration (5 atlases): 30 min