advisors robert chang jeff ullman andreas paepcke all
play

Advisors: Robert Chang, Jeff Ullman, Andreas Paepcke *All - PowerPoint PPT Presentation

Apaar Sadhwani, Leo Tam, Jason Su Advisors: Robert Chang, Jeff Ullman, Andreas Paepcke *All contributors are/were affiliated with Stanford University at time of their contributions. Leo Tam now works at Nvidia. Address correspondence to Apaar


  1. Apaar Sadhwani, Leo Tam, Jason Su Advisors: Robert Chang, Jeff Ullman, Andreas Paepcke *All contributors are/were affiliated with Stanford University at time of their contributions. Leo Tam now works at Nvidia. Address correspondence to Apaar Sadhwani at apaars@stanford.edu or Jason Su at sujason@stanford.edu.

  2.  Motivation:  Affects ~100M, many in developed, ~45% of diabetics  Make process faster, assist ophthalmologist, self-help  Widespread disease, enable early diagnosis/care  Given fundus image  Rate severity of Diabetic Retinopathy  5 Classes: 0 (Normal), 1, 2, 3, 4 (Severe)  Hard classification (may solve as ordinal though)  Metric: quadratic weighted kappa, (pred – real) 2 penalty  Data from Kaggle  ~35,000 training images, ~54,000 test images  High resolution: variable, more than 2560 x 1920  Other unlabeled data from Stanford

  3.  Motivation:  Affects ~100M, many in developed, ~45% of diabetics  Make process faster, assist ophthalmologist, self-help  Widespread disease, enable early diagnosis/care  Given fundus image  Rate severity of Diabetic Retinopathy  5 Classes: 0 (Normal), 1, 2, 3, 4 (Severe)  Hard classification (may solve as ordinal though)  Metric: quadratic weighted kappa, (pred – real) 2 penalty  Data from Kaggle  ~35,000 training images, ~54,000 test images  High resolution: variable, more than 2560 x 1920  Other unlabeled data from Stanford

  4. Class 0 (normal) Class 4 (severe)

  5.  Motivation:  Affects ~100M, many in developed, ~45% of diabetics  Make process faster, assist ophthalmologist, self-help  Widespread disease, enable early diagnosis/care  Given fundus image  Rate severity of Diabetic Retinopathy  5 Classes: 0 (Normal), 1, 2, 3, 4 (Severe)  Hard classification (may solve as ordinal though)  Metric: quadratic weighted kappa, (pred – real) 2 penalty  Data from Kaggle  ~35,000 training images, ~54,000 test images  High resolution: variable, more than 2560 x 1920  Other unlabeled data from Stanford

  6.  High resolution images Image size Batch Size Atypical in vision, GPU batch size issues  224 x 224 128  Discriminative features small 2K x 2K 2  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling  Artifacts in ~40% images  Optimizing approach to QWK  Severe class imbalance class 0 dominates   Too few training examples

  7.  High resolution images 0 1 Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data  2 3  Incorrect labeling  Artifacts in ~40% images  Optimizing approach to QWK  Severe class imbalance class 0 dominates   Too few training examples 4

  8.  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling  Artifacts in ~40% images  Optimizing approach to QWK  Severe class imbalance class 0 dominates   Too few training examples Class 2

  9.  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling  Artifacts in ~40% images  Optimizing approach to QWK  Severe class imbalance class 0 dominates   Too few training examples

  10.  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling - Mentioned in problem statement  Artifacts in ~40% images - Confirmed with doctors  Optimizing approach to QWK  Severe class imbalance class 0 dominates   Too few training examples

  11.  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling  Artifacts in ~40% images  Optimizing approach to QWK  Severe class imbalance class 0 dominates   Too few training examples

  12. Penalty/Loss  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling Class 0 1 2 3 4  Artifacts in ~40% images Truth  Optimizing approach to QWK  Severe class imbalance - Hard classification non-differentiable class 0 dominates  - Backprop difficult  Too few training examples

  13. Penalty/Loss  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small Predict  Grading criteria: 1 not clear (EyePACS guidelines)  learn from data   Incorrect labeling Class 0 1 2 3 4  Artifacts in ~40% images Truth  Optimizing approach to QWK  Severe class imbalance - Hard classification non-differentiable class 0 dominates  - Backprop difficult  Too few training examples

  14. Penalty/Loss  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small Predict 2  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling Class 0 1 2 3 4  Artifacts in ~40% images Truth  Optimizing approach to QWK  Severe class imbalance - Hard classification non-differentiable class 0 dominates  - Backprop difficult  Too few training examples

  15. Penalty/Loss Predict  High resolution images 3 Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling Class 0 1 2 3 4  Artifacts in ~40% images Truth  Optimizing approach to QWK  Severe class imbalance - Hard classification non-differentiable class 0 dominates  - Backprop difficult  Too few training examples

  16. Penalty/Loss  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling Class 0 1 2 3 4  Artifacts in ~40% images Truth  Optimizing approach to QWK  Severe class imbalance - Hard classification non-differentiable class 0 dominates  - Backprop difficult  Too few training examples

  17. Penalty/Loss  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling Class 2.5 0 1 2 3 4  Artifacts in ~40% images Truth  Optimizing approach to QWK  Severe class imbalance - Squared error approximation? class 0 dominates  - Differentiable  Too few training examples

  18.  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling  Artifacts in ~40% images  Optimizing approach to QWK  Severe class imbalance class 0 dominates  - Naïve: 3 class problem, or all zeros!  Too few training examples - Learn all classes separately: 1 vs All? - Balanced while training - At test time?

  19.  High resolution images Atypical in vision, GPU batch size issues   Discriminative features small  Grading criteria: not clear (EyePACS guidelines)  learn from data   Incorrect labeling  Artifacts in ~40% images  Optimizing approach to QWK  Severe class imbalance class 0 dominates  - Big learning models take more data!  Too few training examples - Harness test set?

  20.  Literature survey:  Hand-designed features to pick each component  Clean images, small datasets  Optic disk, exudate segmentation: fail due to artifacts  SVM: poor performance

  21.  Literature survey:  Hand-designed features to pick each component  Clean images, small datasets  Optic disk, exudate segmentation: fail due to artifacts  SVM: poor performance

  22. 1. Registration, Pre-processing 2. Convolutional Neural Nets (CNNs) 3. Hybrid Architecture

  23.  Registration  Hough circles, remove outside portion  Downsize to common size (224 x 224, 1K x 1K)  Color correction  Normalization (mean, variance)

  24. Class probabilities  Network in Network architecture AvgPool  7.5M parameters MaxPool (stride2)  No FC layers, spatial average pooling 3 Conv layers (depth 1024) instead  Transfer learning (ImageNet) MaxPool (stride2) 3 Conv layers  Variable learning rates (depth 384)  Low for “ ImageNet ” layers MaxPool (stride2) 3 Conv layers  Schedule (depth 256)  Combat lack of data, over-fitting MaxPool (stride2)  Dropout, Early stopping 3 Conv layers (depth 96)  Data augmentation (flips, rotation) Input Image

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend