Towards Practical Problems in Deep Learning for Radiology Image - - PowerPoint PPT Presentation

towards practical problems in deep learning for radiology
SMART_READER_LITE
LIVE PREVIEW

Towards Practical Problems in Deep Learning for Radiology Image - - PowerPoint PPT Presentation

Towards Practical Problems in Deep Learning for Radiology Image Analysis Quanzheng Li, Xiang Li, James H.Thrall Center for Clinical Data Science Department of Radiology Massachusetts General Hospital, Harvard Medical School Background


slide-1
SLIDE 1

Towards Practical Problems in Deep Learning for Radiology Image Analysis

Quanzheng Li, Xiang Li, James H.Thrall Center for Clinical Data Science Department of Radiology Massachusetts General Hospital, Harvard Medical School

slide-2
SLIDE 2

Background Challenges Solution Performance Our Vision

  • Purpose of AI in medical imaging: creating value

in the delivery of medical care and delivery of radiology services:

– increasing diagnostic certainty – decreasing time on task for radiologists – faster availability of results – reducing costs of care.

  • Interrogating image data for extracting maximum

value, with/without pre-defined model structure.

  • Accuracy, Efficiency, Robustness.
slide-3
SLIDE 3

Background Challenges Solution Performance Our Vision

  • Automatically detect (i.e. screening) the presence
  • f free air lesion regions in the lung CT images.
  • Manual

inspection

  • f

the incoming medical images can be time-consuming and lack of the efficiency in handling life-threatening cases (such as pneumothorax).

  • Certain image abnormities can be subtle human

inspector, leads to potential mistakes in handling the patients.

  • No

systematic way

  • f

using learning-based methods for fully automatic screening.

slide-4
SLIDE 4

Background Challenges Solution Performance Our Vision

  • Region with free

air in the lung.

  • Can be presented

anywhere.

  • Low HUT area in

the image.

  • Usually

associated with

  • ther conditions.
slide-5
SLIDE 5

Background Challenges Solution Performance Our Vision

Data Preprocessing and Quality Control Data Storage, Transfer, and Sharing Preparation for Analysis Analysis Postprocessing

  • PACS
  • HDFS
  • Cloud storage
  • Meta-information

processing

  • Format conversion
  • Image enhancement

/de-noising

  • Patch extraction
  • Data augmentation
  • Network definition
  • Model training and

validation

  • Model testing
  • Model application

for practice

  • Heatmap

generation

  • Diagnosis
  • Visualization

? Positive

slide-6
SLIDE 6

Background Challenges Solution Performance Our Vision

1 https://arxiv.org/abs/1512.03385

Data Preprocessing and Quality Control Data Storage, Transfer, and Sharing Preparation for Analysis Analysis Postprocessing

? Heterogeneity ? Missing / erroneous data items ? Online vs. Offline

  • PACS
  • HDFS
  • Cloud storage
  • Meta-information

processing

  • Fomat conversion
  • Image enhancement

/de-noising

  • Patch extraction
  • Data augmentation
  • Network definition
  • Model training and

validation

  • Model testing
  • Model application

for practice

  • Heatmap

generation

  • Diagnosis
  • Visualization
slide-7
SLIDE 7
  • High

image/feature heterogeneity + lack

  • f

training samples: more likely to over-fitting.

  • Data items can be missing or wrong (e.g. in

DICOM headers during the scan).

  • Most

sophisticated preprocessing (e.g. image restoration, image segmentation) techniques have to be done

  • ff-line

with group-wise information provided and/or ground-truth.

Background Challenges Solution Performance Our Vision

1 https://arxiv.org/abs/1512.03385

slide-8
SLIDE 8

Background Challenges Solution Performance Our Vision

1 https://arxiv.org/abs/1512.03385

Data Preprocessing and Quality Control Data Storage, Transfer, and Sharing Preparation for Analysis Analysis Postprocessing

? Low speed and intensive I/O for patch extraction ? Lack of training data samples ? Arbitrary parameter / model structure

  • PACS
  • HDFS
  • Cloud storage
  • Meta-information

processing

  • Fomat conversion
  • Image enhancement

/de-noising

  • Patch extraction
  • Data augmentation
  • Network definition
  • Model training and

validation

  • Model testing
  • Model application

for practice

  • Heatmap

generation

  • Diagnosis
  • Visualization
slide-9
SLIDE 9
  • Deep learning models typically run on small

image patches for increased sample size and better feature representation.

Background Challenges Solution Performance Our Vision

1 https://arxiv.org/abs/1512.03385

Pneumothorax Patches Normal Control Patches

slide-10
SLIDE 10

Background Challenges Solution Performance Our Vision

1 https://arxiv.org/abs/1512.03385

Data Preprocessing and Quality Control Data Storage, Transfer, and Sharing Preparation for Analysis Analysis Postprocessing

? Computational time: large data size + complex models ? Needs for real-time results.

  • PACS
  • HDFS
  • Cloud storage
  • Meta-information

processing

  • Fomat conversion
  • Image enhancement

/de-noising

  • Patch extraction
  • Data augmentation
  • Network definition
  • Model training and

validation

  • Model testing
  • Model application

for practice

  • Heatmap

generation

  • Diagnosis
  • Visualization
slide-11
SLIDE 11
  • More

complex models and deeper networks: Increased computational load for the system.

  • Example: >1000 layered Deep Residual Learning

network1 has been evaluated on the ImageNet 2012 dataset consists of 1000 classes, trained on 1.28 million training images.

Background Challenges Solution Performance Our Vision

1 https://arxiv.org/abs/1512.03385

slide-12
SLIDE 12
  • Large data size of most medical image types,

high performance computing becomes a crucial component for a practical and running solution.

  • Example: A typical CT image has more than 30

million voxels (512×512×120). The pneumothorax project dataset constitutes imaging data from >600 subjects.

Background Challenges Solution Performance Our Vision

slide-13
SLIDE 13

Background Challenges Solution Performance Our Vision

  • Dataset consists of 648 subjects with/without

pneumothorax, 66 of them are annotated.

  • Network trained on 31 subjects, totally 21,540

36×36 patches.

  • Training a 16-layer, VGG-like 2D CNN for lesion

detection

  • n

two classes: pneumothorax vs. normal.

  • The whole pipeline takes DICOM images as input,

generates a lesion heatmap, provides diagnosis score for the probability of pneumothorax.

slide-14
SLIDE 14

Background Challenges Solution Performance Our Vision

Data Preprocessing and Quality Control Data Storage, Transfer, and Sharing Preparation for Analysis Analysis Postprocessing

? Security ? Privacy ? Transfer speed ? Heterogeneity ? Missing / erroneous data items ? Online vs. Offline ? Low speed and intensive I/O for patch extraction ? Lack of training data samples ? Arbitrary parameter / model structure ? Computational time: large data size + complex models ? Integration into the workflow ? Real-time feedback

  • PACS
  • HDFS
  • Cloud storage
  • Meta-information

processing

  • Fomat conversion
  • Image enhancement

/de-noising

  • Patch extraction
  • Data augmentation
  • Network definition
  • Model training and

validation

  • Model testing
  • Model application

for practice

  • Heatmap

generation

  • Diagnosis
  • Visualization

√ Co-development with high radiologist involvement √ Self-paced learning √ Asynchronous I/O √ HPC platform supported by DGX1 √ CUDA implementation

slide-15
SLIDE 15

Background Challenges Solution Performance Our Vision

  • Intensive involvement of radiologist During the

training phase: addressing the data heterogeneity and under-coverage of training samples:

  • Four types of mis-classification cases identified,

3 false-positive, 1 false-negative:

– Extra small pneumothorax lesions (mainly caused by the image view). – Empyema. – Imaging artifacts (e.g. dark strips). – Irregular trachea/branches shapes.

slide-16
SLIDE 16

Background Challenges Solution Performance Our Vision

  • Self-paced learning scheme to further increase

the sample size.

spCNN: Close Looped, Multiple Rounds of Training

Distribution of prediction probability Virtual samples at round i-1 Bootstrapping Module Dataset for analyze Bootstrapping CNN k,i New CNN, at round i Original samples

apply virtual sample selection

Classification results and diagnosis

  • btain

Bootstrapping CNN 1,i New, unlabeled data

apply apply

  • btain

Classification Module

retraining retraining

Original samples

retraining retraining

slide-17
SLIDE 17

Background Challenges Solution Performance Our Vision

  • Tested 35 subjects, patch-wise accuracy: 93.9%.
  • Subject-wise accuracy is calculated by counting

the number of detected patches followed by thresholding.

0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 True positive rate False positive rate

Subject-wise ROC Curve

slide-18
SLIDE 18

Background Challenges Solution Performance Our Vision

  • Although the detection is done on each slice (i.e.

2D network), the detected lesion boundary is stable across slices.

slide-19
SLIDE 19

Background Challenges Solution Performance Our Vision

  • Detection (i.e. generating heatmap) of a single

subject takes less than 3 minutes.

  • Enabled by the computing power of DGX1: 50

times faster than single K40, 10 times faster than single P100.

  • Most

time consuming step is

  • n

the patch extraction, further I/O synchronous will help.

  • The detection speed is on the same scale of a

typical CT scan (minutes), thus enables real-time screening of the patients.

slide-20
SLIDE 20

Background Challenges Solution Performance Our Vision

  • Self-paced learning method helps in increasing

the training sample size by 23,100 patches (>100%) from 200 subjects (independent with the current annotation set).

  • Performed in a single round.
  • All the patches are consistently classified by the

10 bootstrapping networks, the accuracy is controlled by the Family-wise Error Rate (FWER)

  • f p=0.05.
  • Training of the 10 bootstrapping networks is

done within one hour.

slide-21
SLIDE 21

Background Challenges Solution Performance Our Vision

  • The

latest NVIVDIA DGX-1 provides us an unprecedented computational power to support

  • ur approach of data augmentation and fast user

interaction.

  • Such

massive data-initiated, computational- intensive solution will be the dominant trend for the medical researches and clinical practices.

  • A new way for researchers and radiologists to

apply, interpret and convert their domain knowledges (“fourth paradigm”).

slide-22
SLIDE 22

Background Challenges Solution Performance Our Vision

  • Acknowledgement:

– Radiology at MGH: Subba Digumarthy, Mannudeep Kalra – Gordon Center for Medical Imaging at MGH: Ning Guo, Kyungsang Kim, Dufan Wu, Aoxiao Zhong – Beijing International Center for Mathematical Research at PKU:

  • Prof. Bin Dong

– MGH Center for Clinical Data Science: