Diabetes Diagnostic Imaging Machine Learning Undergraduate Research - - PowerPoint PPT Presentation

diabetes diagnostic imaging
SMART_READER_LITE
LIVE PREVIEW

Diabetes Diagnostic Imaging Machine Learning Undergraduate Research - - PowerPoint PPT Presentation

Diabetes Diagnostic Imaging Machine Learning Undergraduate Research Walker Christensen & Mitch Maegaard Problem Statement Project objective Company from Inspiration Build algorithm for app China Database of tongue Ancient Chinese User


slide-1
SLIDE 1

Diabetes Diagnostic Imaging

Machine Learning Undergraduate Research Walker Christensen & Mitch Maegaard

slide-2
SLIDE 2

Problem Statement

slide-3
SLIDE 3

Project objective

Inspiration

Ancient Chinese medicine, doctors could diagnose diabetes by looking at the tongue

Company from China

Database of tongue images and personal health questions

Build algorithm for app

User can take a picture of their tongue, answer a few health-related questions, then receive a real-time diabetes diagnosis

slide-4
SLIDE 4

Understanding the problem

Step 1 Step 2 Step 3 Step 4

Can we diagnose diabetes using only the picture of a tongue? Can we diagnose the stage of diabetes with the picture of a tongue? Can we diagnose diabetes using health survey questions? Can we improve diagnostic accuracy by combining picture and survey?

slide-5
SLIDE 5

Data Introduction

slide-6
SLIDE 6

Images

➢ 517 healthy ➢ 224 diabete

slide-7
SLIDE 7

Health Survey

➢ 57 questions ➢ 164 respondents

➢ Age ➢ Gender ➢ Height ➢ Weight Demographics ➢ Are you pregnant? ➢ Do you have unexplained weight loss? ➢ Do you feel hungry/thirsty? ➢ Do you have insomnia? Questions ➢ Identification Code ➢ Diabetes Status Labels

slide-8
SLIDE 8

Machine Learning Techniques

slide-9
SLIDE 9

Image processing

➢ Images are made up of pixels (a single color)

slide-10
SLIDE 10

Image processing

➢ Each pixel has value range:

0 (black) to 255 (white)

➢ (5x5) = 25 data points

5x5 grayscale image

255 180 95 180 95 230 255 255 255 255 180 180 180 230 230 180 25 25 25

slide-11
SLIDE 11

Image processing

➢ Each pixel has value range:

0 (dark) to 255 (light)

Red, Green, Blue “channels”

➢ (5x5x3) = 75 data points

5x5 colored image

slide-12
SLIDE 12

Image processing

Balance Normalize Apply

too many pixels vs. too few pixels → 128x128 pixel images divide each point by 255 → data range {0.0, 1.0} 128x128x3 = (49,152) x (741 images) → 34.5 million data points

Algorithm

how do we utilize these numbers? → Convolutional Neural Network

slide-13
SLIDE 13

Convolutional Neural Network

(CNN, ConvNet)

slide-14
SLIDE 14

What is a Neural Network?

➢ Want to classify images as diabetic or healthy ➢ Inspired by neurons in the brain

INPUT OUTPUT COMPUTATION

slide-15
SLIDE 15

What is a Neural Network?

➢ Neurons working together create a network

49,152 per image!

slide-16
SLIDE 16

ConvNet approach

➢ “Slide” a filter over image ➢ Output is a convolved image that’s smaller than the original

Original Convolved

slide-17
SLIDE 17

ConvNet layers

INPUT CONV POOL RELU

raw pixel values of image compute dot product between weights and small connected portion in input volume downsampling operation along spatial dimensions (width and height) applies element-wise activation function

FC

(i.e. fully-connected) computes probability of being in a class

slide-18
SLIDE 18

ConvNet architecture

Edges Shapes Objects

slide-19
SLIDE 19

Transfer Learning

slide-20
SLIDE 20

What is transfer learning

Original Model Transfer Model

Learning

Source task Target task

Store knowledge gained from solving

a problem and use it to solve a similar one

slide-21
SLIDE 21

Why use transfer learning?

➢ Small dataset ➢ Similar to ImageNet

Speed Size Accuracy

slide-22
SLIDE 22

Problem 1

Can we diagnose diabetes with the picture of a tongue?

slide-23
SLIDE 23

Data preprocessing

Label Images Train Set Test Set

{ healthy = 0 : diabetic = 1 } 497 healthy, 204 diabetic (pull extra samples to create balanced dataset) 20 images of each class

slide-24
SLIDE 24

Model architecture

➢ Input image 128x128x3 ➢

VGG-19 “ImageNet” base model

➢ Fine-tune top model ○ Flatten ○ 64-unit F.C. & ReLU activation ○ Dropout 20% ○ 2-unit F.C. & sigmoid activation

Input Image Flatten FC (ReLU) 20% Dropout FC (sigmoid)

CONV 2D CONV 2D MAX POOL CONV 2D CONV 2D CONV 2D CONV 2D MAX POOL CONV 2D CONV 2D MAX POOL CONV 2D CONV 2D CONV 2D CONV 2D MAX POOL CONV 2D CONV 2D CONV 2D CONV 2D MAX POOL 128x128x3 64x64x64 32x32x128 16x16x256 8x8x512 4x4x512

healthy diabetic

slide-25
SLIDE 25

Training results

➢ 40 epoch ➢ 64 mini-batch ➢ Test accuracy: 87.5%

slide-26
SLIDE 26

Hyperparameter tuning

Input Size Epoch Mini-Batch FC 1 Dropout Accuracy

256x256x3 40 64 64 20% 82.5% 128x128x3 60 64 64 20% 82.5% 128x128x3 40 32 64 20% 87.5% 128x128x3 40 64 32 20% 86.25% 128x128x3 40 64 64 10% 85% 128x128x3 40 64 64 20% 87.5%

slide-27
SLIDE 27

Model comparisons

Model Image Size Layers Parameters Epoch Mini-Batch Train Time Accuracy

Scratch 128x128x3 21 14,731,074 30 64 300 sec. 57.5% CapsuleNet 128x128x3 9 62,256,096 10 x 224 sec. 62.5% VGG16 Transfer 128x128x3 21 131,122 30 64 80 sec. 82.5% VGG19 Transfer 128x128x3 25 524,482 40 64 105 sec. 87.5%

slide-28
SLIDE 28

Problem 2

Can we diagnose the stage of diabetes?

slide-29
SLIDE 29

Multi-class classification

5 unique stages of diabetes

○ Healthy ○ Pre-diabetes ○ Mild ○ Moderate ○ Severe

stage?

slide-30
SLIDE 30

Multi-class classification

Model Image Size Layers Parameters Epoch Mini-Batch Train Time Accuracy

Random Guess

  • 20%

Multi-Class Transfer 128x128x3 21 125,353 20 64 72 sec. 37%

slide-31
SLIDE 31

Problem 3

Can we make our results more interpretable?

slide-32
SLIDE 32

Unboxing the “black box”

Question 1 Question 2

Which layers collect specific feature information? What parts of the tongue are contributing to diabetes classifications?

Question 3

Can we find a more interpretable model?

slide-33
SLIDE 33

Global average pooling (GAP)

➢ Map to one prediction per color channel

slide-34
SLIDE 34

Grad-CAM (Gradient-weighted Class Activation Mapping)

Step 1 Step 2

Train CNN model Extract class probabilities from final convolution layer

Step 3

Multiply feature map by pooled gradients → 8x8x512

Step 4

Average the weighted feature map along channel dimension → 1x512

slide-35
SLIDE 35

Grad-CAM

slide-36
SLIDE 36

Results?

➢ Activations effectively

localize “hotspots” for

distinguishing diabetes ➢ Allows us to present distinguishable features to

health experts

slide-37
SLIDE 37

Conclusion

slide-38
SLIDE 38

Conclusion

I. Binary accuracy: 87.5% II. Multi-class accuracy: 37% III. Identified localized areas of tongue images that distinguish diabetes

slide-39
SLIDE 39

Future work

slide-40
SLIDE 40

Future work

➢ Filter survey results such that we retain a subset of most important questions ➢ Extend algorithm to include classification based off survey results ➢ Apply computer vision techniques to other areas of healthcare