SLIDE 1
Apaar Sadhwani, Leo Tam, Jason Su
Advisors: Robert Chang, Jeff Ullman, Andreas Paepcke
*All contributors are/were affiliated with Stanford University at the time of their contributions. Leo Tam now works at Nvidia. Address correspondence to Apaar Sadhwani at apaars@stanford.edu or Jason Su at sujason@stanford.edu.
SLIDE 2
Motivation:
- Affects ~100M people, many in the developed world; ~45% of diabetics
- Make the process faster, assist ophthalmologists, enable self-help
- Widespread disease; enable early diagnosis and care
Given a fundus image:
- Rate the severity of Diabetic Retinopathy
- 5 classes: 0 (Normal), 1, 2, 3, 4 (Severe)
- Hard classification (may solve as ordinal though)
- Metric: quadratic weighted kappa, (pred - real)^2 penalty
Data from Kaggle:
- ~35,000 training images, ~54,000 test images
- High resolution: variable, more than 2560 x 1920
- Other unlabeled data from Stanford
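The quadratic weighted kappa metric can be sketched as follows (a minimal NumPy illustration, not the competition's reference implementation; some variants normalize the weights by (n_classes - 1)^2, which does not change the score):

```python
import numpy as np

def quadratic_weighted_kappa(y_true, y_pred, n_classes=5):
    """1 - (weighted observed disagreement / weighted chance disagreement),
    with (i - j)^2 penalty weights between true class i and predicted class j."""
    # Observed confusion matrix
    O = np.zeros((n_classes, n_classes))
    for t, p in zip(y_true, y_pred):
        O[t, p] += 1
    # Quadratic penalty: 0 on the diagonal, growing with squared distance
    idx = np.arange(n_classes)
    W = (idx[:, None] - idx[None, :]) ** 2
    # Expected matrix if the two "raters" (labels and predictions) were independent
    E = np.outer(O.sum(axis=1), O.sum(axis=0)) / O.sum()
    return 1.0 - (W * O).sum() / (W * E).sum()
```

Perfect agreement scores 1, chance-level agreement scores ~0, and systematic disagreement can go negative; the squared weights are why far-off predictions (e.g. 0 vs. 4) are penalized so heavily.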
SLIDE 4
Class 0 (normal) Class 4 (severe)
SLIDE 6 Challenges
- High resolution images: atypical in vision, causes GPU batch-size issues; discriminative features are small
- Grading criteria: not clearly specified (EyePACS guidelines); learn from data
- Incorrect labeling
- Artifacts in ~40% of images
- Optimizing for the QWK metric
- Severe class imbalance: class 0 dominates
- Too few training examples

Image size vs. GPU batch size:
  224 x 224  ->  128
  2K x 2K    ->  2
SLIDE 7
[Figure: example fundus images, panels labeled 1, 2, 3, 4]
SLIDE 8
[Figure: example fundus image, Class 2]
SLIDE 10 Incorrect labeling
- Mentioned in problem statement
- Confirmed with doctors
SLIDE 12-16 Optimizing for QWK
- Hard classification is non-differentiable
- Backprop is difficult
[Figure, built up across slides: penalty/loss vs. predicted class (1-4) for true class 1; the quadratic penalty grows as the prediction moves further from the truth]
SLIDE 17 Optimizing for QWK
- Squared error approximation?
- Differentiable
[Figure: continuous squared-error penalty over the class axis; a fractional prediction such as 2.5 is now meaningful]
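The squared-error idea can be sketched as follows (an illustrative NumPy surrogate, one plausible reading of the slide: collapse the softmax output to an expected grade, then apply the quadratic penalty, which is differentiable):

```python
import numpy as np

def expected_value_mse(probs, true_class):
    """Differentiable surrogate for the quadratic kappa penalty:
    treat the softmax output as a distribution over grades 0-4,
    take its expectation, and penalize squared distance to the label."""
    grades = np.arange(len(probs))
    pred = float((probs * grades).sum())  # e.g. 2.5 if mass is split between 2 and 3
    return (pred - true_class) ** 2
```

Unlike the argmax of a hard classifier, the expectation varies smoothly with the network's outputs, so gradients flow back through it.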
SLIDE 18 Severe class imbalance
- Naive approach: collapses to a 3-class problem, or predicts all zeros!
- Learn all classes separately: one-vs-all?
- Balanced sampling while training
- At test time?
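Balanced sampling during training can be sketched as follows (a hypothetical helper, not the project's actual data loader; label values and batch size are illustrative):

```python
import random
from collections import defaultdict

def balanced_batch(labels, batch_size, rng=random):
    """Sample a training batch with equal counts per class, so that
    class 0 cannot dominate the gradient even when it dominates the data.
    Minority classes are sampled with replacement."""
    by_class = defaultdict(list)
    for idx, y in enumerate(labels):
        by_class[y].append(idx)
    classes = sorted(by_class)
    per_class = batch_size // len(classes)
    batch = []
    for c in classes:
        batch.extend(rng.choices(by_class[c], k=per_class))
    return batch
```

The open question on the slide remains: a model trained on balanced batches sees a different class prior than the test set, which is why the experiments later move from even sampling back toward true sampling.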
SLIDE 19 Too few training examples
- Big learning models need more data!
- Harness the test set?
SLIDE 20
Literature survey:
- Hand-designed features to pick out each component
- Clean images, small datasets
- Optic disk and exudate segmentation: fail due to artifacts
- SVM: poor performance
SLIDE 22
- 1. Registration, Pre-processing
- 2. Convolutional Neural Nets (CNNs)
- 3. Hybrid Architecture
SLIDE 23 Registration
- Hough circles, remove outside portion
- Downsize to a common size (224 x 224, 1K x 1K)
- Color correction
- Normalization (mean, variance)
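The masking and normalization steps can be sketched as follows (NumPy only, an illustrative sketch: the circle parameters are assumed to come from a Hough transform, e.g. OpenCV's `HoughCircles`, which is omitted here):

```python
import numpy as np

def normalize_fundus(img, cx, cy, r):
    """Given a detected fundus circle (center cx, cy and radius r),
    zero out everything outside it and standardize the pixels inside
    to zero mean / unit variance, per channel."""
    h, w = img.shape[:2]
    yy, xx = np.ogrid[:h, :w]
    mask = (xx - cx) ** 2 + (yy - cy) ** 2 <= r ** 2
    out = np.zeros_like(img, dtype=np.float64)
    for c in range(img.shape[2]):
        vals = img[..., c][mask].astype(np.float64)
        out[..., c][mask] = (vals - vals.mean()) / (vals.std() + 1e-8)
    return out
```

Computing the statistics only inside the circle matters: the black border outside the fundus would otherwise drag the mean and variance toward zero.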
SLIDE 24 Network in Network architecture
[Diagram: Input image -> 3 conv layers (depth 96) -> MaxPool (stride 2) -> 3 conv layers (depth 256) -> MaxPool (stride 2) -> 3 conv layers (depth 384) -> MaxPool (stride 2) -> 3 conv layers (depth 1024) -> MaxPool (stride 2) -> AvgPool -> class probabilities]
- 7.5M parameters
- No FC layers; spatial average pooling instead
Transfer learning (ImageNet), variable learning rates:
- Low for "ImageNet" layers
- Learning-rate schedule
Combat lack of data and over-fitting:
- Dropout, early stopping
- Data augmentation (flips, rotation)
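The "no FC layers" choice can be sketched as follows (a NumPy illustration, not the Torch training code): with the final conv depth equal to the number of classes, spatial average pooling plus a softmax yields class probabilities directly.

```python
import numpy as np

def nin_head(feature_maps):
    """NiN-style classifier head: one feature map per class,
    spatially average-pooled into a logit, then softmaxed.
    feature_maps has shape (n_classes, H, W)."""
    logits = feature_maps.mean(axis=(1, 2))  # (C, H, W) -> (C,)
    exp = np.exp(logits - logits.max())      # numerically stable softmax
    return exp / exp.sum()
```

Replacing fully connected layers with this head removes most of the parameters a classifier would otherwise spend on the flattened feature map, which is one reason the network stays at a few million parameters.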
SLIDE 26 Network in Network architecture (reduced)
[Diagram: Input image -> 3 conv layers (depth 96) -> MaxPool (stride 2) -> 3 conv layers (depth 256) -> MaxPool (stride 2) -> 3 conv layers (depth 384) -> MaxPool (stride 2) -> 3 conv layers (depth 384, 64, 5) -> MaxPool (stride 2) -> AvgPool -> class probabilities]
- 2.2M parameters
- No FC layers; spatial average pooling instead
SLIDE 27
[Diagram, final block: Input image -> 3 conv layers (depth 384, 64, 5) -> MaxPool (stride 2) -> AvgPool -> class probabilities]
SLIDE 30 Infrastructure
- Amazon EC2 GPU nodes, StarCluster
- Single-GPU nodes for 224 x 224 (g2.2xlarge); multi-GPU nodes for 1K x 1K (g2.8xlarge)
- Python for processing; Torch library (Lua) for training
SLIDE 31 What image size to use?
- Strategize using 224 x 224 -> extend to 1024 x 1024
What loss function?
- Mean squared error (MSE)
- Negative Log Likelihood (NLL)
- Linear Combination (annealing)
Class imbalance
- Even sampling -> true sampling
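The linear combination of the two losses can be sketched as follows (an illustrative NumPy version; the actual annealing schedule and weights used in training are not specified on the slides):

```python
import numpy as np

def combined_loss(probs, true_class, alpha):
    """Anneal between NLL (stable early gradients) and the expected-value
    MSE surrogate for the kappa penalty; alpha is driven from 1 toward 0
    over the course of training."""
    nll = -np.log(probs[true_class] + 1e-12)          # cross-entropy term
    grades = np.arange(len(probs))
    mse = (float((probs * grades).sum()) - true_class) ** 2  # kappa surrogate
    return alpha * nll + (1 - alpha) * mse
```

At alpha = 1 this is plain NLL classification; at alpha = 0 it optimizes only the squared-distance surrogate that matches the QWK metric.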
SLIDE 32-36 Experiments at 224 x 224 (pre-trained layers frozen: "no learning")
[Diagram: Input image -> 3 conv layers (depth 384, 64, 5) -> MaxPool (stride 2) -> AvgPool -> class probabilities]

  Loss Function   Result
  MSE             Fails to learn
  MSE             Fails to learn
  NLL             Kappa < 0.1
  NLL             Kappa = 0.29

(The Sampling column of the original table was shown pictorially and is not recoverable.)

SLIDE 37-40 Experiments at 224 x 224 (pre-trained layers fine-tuned at 0.01x step size)

  Loss Function           Result
  NLL (top layers only)   Kappa = 0.29
  NLL                     Kappa = 0.42
  NLL                     Kappa = 0.51
  MSE                     Kappa = 0.56
SLIDE 41
SLIDE 42
SLIDE 43 Hybrid architecture
[Diagram: high-resolution input (2048 / 1024) split into 64 tiles of 256 x 256 -> Lesion Detector and Main Network -> Fuse -> class probabilities]
SLIDE 44 Lesion annotation
- Web viewer tool for lesion annotation
- Extract image patches
- Train lesion classifier
SLIDE 45
SLIDE 46
SLIDE 47
SLIDE 48
SLIDE 49 Lesion detector
- Only hemorrhages so far
- Positives: 1,866 extracted patches from 216 images/subjects
- Negatives: ~25k class-0 images
- Pre-processing/augmentation: crop a random 256 x 256 patch from the input, flips
- Pre-trained Network in Network architecture
- Accuracy: 99% on negatives, 76% on positives
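The crop-and-flip augmentation can be sketched as follows (NumPy; the 256 x 256 patch size follows the slide, the 50% flip probabilities are an assumption):

```python
import numpy as np

def augment_patch(img, size=256, rng=None):
    """Crop a random size x size patch from an (H, W, C) image and apply
    random horizontal / vertical flips, as for the hemorrhage classifier."""
    rng = rng or np.random.default_rng()
    h, w = img.shape[:2]
    y = rng.integers(0, h - size + 1)
    x = rng.integers(0, w - size + 1)
    patch = img[y:y + size, x:x + size]
    if rng.random() < 0.5:
        patch = patch[:, ::-1]  # horizontal flip
    if rng.random() < 0.5:
        patch = patch[::-1, :]  # vertical flip
    return patch
```

Flips are safe label-preserving transforms here because a hemorrhage is a hemorrhage under any mirroring; they multiply the effective number of positive patches, which are scarce (1,866 vs. ~25k negatives).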
SLIDE 54-56 Fusing the lesion detector with the main network
[Diagram: high-resolution input (2048 / 1024) split into 64 tiles of 256 x 256 -> Lesion Detector (2 conv layers) produces lesion maps (2 x 56 x 56, reduced to 2 x 31 x 31); these are concatenated with the Main Network's 64 x 31 x 31 feature maps to give 66 x 31 x 31 -> Fuse -> class probabilities]
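The fusion step can be sketched as channel-wise concatenation (NumPy; the 64/2/66-channel dimensions follow the slide, and the lesion maps are assumed already reduced to the main network's 31 x 31 spatial grid):

```python
import numpy as np

def fuse(main_feats, lesion_maps):
    """Concatenate the main network's feature maps (64 x 31 x 31) with the
    lesion detector's output maps (2 x 31 x 31) along the channel axis,
    producing the 66 x 31 x 31 input to the fusion conv layers."""
    assert main_feats.shape[1:] == lesion_maps.shape[1:], "spatial grids must match"
    return np.concatenate([main_feats, lesion_maps], axis=0)
```

Because the lesion maps enter as extra channels on the same spatial grid, the downstream conv layers can weigh "a lesion was detected here" against the main network's own evidence at each location.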
SLIDE 57
SLIDE 58-60 End-to-end training
[Diagram: high-resolution input (2048 / 1024) split into 64 tiles of 256 x 256 -> Lesion Detector and Main Network -> Fuse -> class probabilities]
- Backprop flows through the full fused architecture
SLIDE 61
- Supervised-unsupervised learning
- Distillation
- Hard-negative mining
- Other lesion detectors
- Attention CNNs
- Both eyes
- Ensemble
SLIDE 62
- 3-class problem
- True "4" problem
- Combining imaging modalities (OCT)
- Longitudinal analysis
SLIDE 63
Robert Chang, Jeff Ullman, Andreas Paepcke, Amazon