High-Resolution Breast Cancer Screening with Multi-View Deep - - PowerPoint PPT Presentation

high resolution breast cancer screening with multi view
SMART_READER_LITE
LIVE PREVIEW

High-Resolution Breast Cancer Screening with Multi-View Deep - - PowerPoint PPT Presentation

High-Resolution Breast Cancer Screening with Multi-View Deep Convolutional Neural Networks Krzysztof J. Geras Joint work with Kyunghyun Cho , Linda Moy , Gene Kim , Stacey Wolfson and Artie Shen . GTC 2017 W HERE DEEP LEARNING IS USEFUL amount


slide-1
SLIDE 1

High-Resolution Breast Cancer Screening with Multi-View Deep Convolutional Neural Networks

Krzysztof J. Geras

Joint work with Kyunghyun Cho, Linda Moy, Gene Kim, Stacey Wolfson and Artie Shen.

GTC 2017

slide-2
SLIDE 2

WHERE DEEP LEARNING IS USEFUL

difficulty of the task amount of data available

slide-3
SLIDE 3

WHERE DEEP LEARNING IS USEFUL

difficulty of the task amount of data available

the learning tasks for which deep learning makes a difference

natural image recognition machine translation speech recognition game playing

slide-4
SLIDE 4

Can we save the world?

slide-5
SLIDE 5

WHERE DEEP LEARNING IS USEFUL

difficulty of the task amount of data available

the learning tasks for which deep learning makes a difference

natural image recognition machine translation speech recognition game playing

medical image analysis?

slide-6
SLIDE 6

BREAST CANCER SCREENING

slide-7
SLIDE 7

BREAST CANCER SCREENING About 40 million exams performed yearly in the US.

slide-8
SLIDE 8

BREAST CANCER SCREENING About 40 million exams performed yearly in the US. About 250 thousand women are diagnosed with cancer.

slide-9
SLIDE 9

BREAST CANCER SCREENING About 40 million exams performed yearly in the US. About 250 thousand women are diagnosed with cancer. About 40 thousand die.

slide-10
SLIDE 10

BREAST CANCER SCREENING

R-MLO L-MLO R-CC L-CC

(left cranial caudal) (right cranial caudal) (left mediolateral oblique) (right mediolateral oblique)

slide-11
SLIDE 11

BREAST CANCER SCREENING

We try to mimic predictions of a radiologist.

◮ Class 0: incomplete (≈ 15%). ◮ Class 1: negative (≈ 50%). ◮ Class 2: bening findings (≈ 35%).

slide-12
SLIDE 12

BREAST CANCER SCREENING

We try to mimic predictions of a radiologist.

◮ Class 0: incomplete (≈ 15%). ◮ Class 1: negative (≈ 50%). ◮ Class 2: bening findings (≈ 35%).

Radiologists call these classes BI-RADS (short for Breast Imaging-Reporting and Data System).

slide-13
SLIDE 13

CHALLENGES (1)

slide-14
SLIDE 14

CHALLENGES (1)

You need a lot of data to do deep learning.

slide-15
SLIDE 15

CHALLENGES (1)

You need a lot of data to do deep learning. Publicly available data sets contain about 1k images.

slide-16
SLIDE 16

CHALLENGES (1)

You need a lot of data to do deep learning. Publicly available data sets contain about 1k images. We build our own data set:

◮ 23k exams, ◮ 103k images. ◮ Each image is at least 2600×2000 pixels.

slide-17
SLIDE 17

CHALLENGES (2)

slide-18
SLIDE 18

CHALLENGES (2)

slide-19
SLIDE 19

CHALLENGES (2)

slide-20
SLIDE 20

CHALLENGES (2)

slide-21
SLIDE 21

CHALLENGES (2)

slide-22
SLIDE 22

CHALLENGES (2)

slide-23
SLIDE 23

CHALLENGES (2)

slide-24
SLIDE 24

CHALLENGES (2)

High resolution necessary - computational and engineering challenge.

slide-25
SLIDE 25

CHALLENGES (3)

Multi-view data. How to integrate information?

slide-26
SLIDE 26

OUR MODEL

Classifier p(y|x) Concatenation (256×4 dim) DCN DCN DCN DCN L-CC R-CC L-MLO R-MLO

slide-27
SLIDE 27

OUR MODEL

Classifier p(y|x) Concatenation (256×4 dim) DCN DCN DCN DCN L-CC R-CC L-MLO R-MLO layer kernel size stride #maps repetition global average pooling 256 convolution 3×3 1×1 256 ×3 max pooling 2×2 2×2 128 convolution 3×3 1×1 128 × 3 max pooling 2×2 2×2 128 convolution 3×3 1×1 128 × 3 max pooling 2×2 2×2 64 convolution 3×3 1×1 64 × 2 convolution 3×3 2×2 64 max pooling 3×3 3×3 32 convolution 3×3 2×2 32 input 1

slide-28
SLIDE 28

RESULTS

0.0 0.2 0.4 0.6 0.8 1.0 false positive rate 0.0 0.2 0.4 0.6 0.8 1.0 true positive rate BI-RADS 0 BI-RADS 1 BI-RADS 2

AUC

0 vs. others: 0.609 1 vs. others: 0.717 2 vs. others: 0.728 Average: 0.685

slide-29
SLIDE 29

CONFIDENT TEST DATA

We can compute the entropy of predictions, H(y|x) = −

  • y′∈C

p(y′|x) log p(y′|x), and sort examples according to it.

slide-30
SLIDE 30

CONFIDENT TEST DATA

We can compute the entropy of predictions, H(y|x) = −

  • y′∈C

p(y′|x) log p(y′|x), and sort examples according to it. We will consider the 30% with the lowest entropy to be “confident”.

slide-31
SLIDE 31

RESULTS FOR CONFIDENT TEST DATA

0.0 0.2 0.4 0.6 0.8 1.0 false positive rate 0.0 0.2 0.4 0.6 0.8 1.0 true positive rate BI-RADS 0 BI-RADS 1 BI-RADS 2

AUC

0 vs. others: 0.636 1 vs. others: 0.816 2 vs. others: 0.844 Average: 0.765

slide-32
SLIDE 32

IMPACT OF DOWNSCALING

1/8 1/4 1/2 1 resolution fraction 0.55 0.60 0.65 0.70 0.75 0.80 AUC average AUC average AUC (confident)

slide-33
SLIDE 33

IMPACT OF THE DATA SET SIZE

1/10 1/5 1/2 1 data set size fraction 0.50 0.55 0.60 0.65 0.70 0.75 0.80 AUC average AUC average AUC (confident)

slide-34
SLIDE 34

VISUALISATION

We visualise

  • ∂H(y|x)

∂xv

(i,j)

  • ,

where H(y|x) = −

  • y′∈C

p(y′|x) log p(y′|x).

slide-35
SLIDE 35

VISUALISATION

slide-36
SLIDE 36

VISUALISATION

slide-37
SLIDE 37

CONCLUSIONS

◮ We made a first step in the direction of end-to-end breast cancer screening

with neural networks.

slide-38
SLIDE 38

CONCLUSIONS

◮ We made a first step in the direction of end-to-end breast cancer screening

with neural networks.

◮ It is much harder to learn the “incomplete” (0) class than other classes.

slide-39
SLIDE 39

CONCLUSIONS

◮ We made a first step in the direction of end-to-end breast cancer screening

with neural networks.

◮ It is much harder to learn the “incomplete” (0) class than other classes. ◮ We need to use the full resolution. A lot more effort is necessary to develop

archiectures appropriate for data of large dimensionality.

slide-40
SLIDE 40

CONCLUSIONS

◮ We made a first step in the direction of end-to-end breast cancer screening

with neural networks.

◮ It is much harder to learn the “incomplete” (0) class than other classes. ◮ We need to use the full resolution. A lot more effort is necessary to develop

archiectures appropriate for data of large dimensionality.

◮ We need more data.

slide-41
SLIDE 41

CONCLUSIONS

◮ We made a first step in the direction of end-to-end breast cancer screening

with neural networks.

◮ It is much harder to learn the “incomplete” (0) class than other classes. ◮ We need to use the full resolution. A lot more effort is necessary to develop

archiectures appropriate for data of large dimensionality.

◮ We need more data. (We are currently processing a 10 times bigger data set).

slide-42
SLIDE 42

Thank you!

High-Resolution Breast Cancer Screening with Multi-View Deep Convolutional Neural Networks

  • K. J. Geras, S. Wolfson, S. G. Kim, L. Moy, K. Cho

arXiv:1703.07047