(De)Constructing Bias on Skin Lesion Datasets
- A. Bissoto¹, M. Fornaciali², E. Valle², S. Avila¹
¹RECOD Lab., IC, University of Campinas (UNICAMP) ²RECOD Lab., DCA, FEEC, University of Campinas (UNICAMP) ISIC Workshop @ CVPR 2019
(De)Constructing Bias on Skin Lesion Datasets A. Bissoto, M. - - PowerPoint PPT Presentation
(De)Constructing Bias on Skin Lesion Datasets A. Bissoto, M. Fornaciali, E. Valle, S. Avila RECOD Lab., IC, University of Campinas (UNICAMP) RECOD Lab., DCA, FEEC, University of Campinas (UNICAMP) ISIC Workshop @ CVPR 2019 RECOD
(De)Constructing Bias on Skin Lesion Datasets
¹RECOD Lab., IC, University of Campinas (UNICAMP) ²RECOD Lab., DCA, FEEC, University of Campinas (UNICAMP) ISIC Workshop @ CVPR 2019
2
h t t p : / / w w w . t
a y i f
n d
t . c
/ i n d e x . p h p / 2 1 3 / 1 2 / a n t i
a n k
s
l d
a r
i / 3
Bias
Reproduced from: “Unbiased Look at Dataset Bias”, Torralba et al. (2011) 4
Reproduced from: “An Overview of Melanoma Detection in Dermoscopy Images Using Image Processing and Machine Learning”, Mishra et al. (2016)
Confounders on Skin Lesion Datasets
5
Vignetting (dark borders) Staining Color markers Rulers
Inflate Performance Spurious Correlations Destruction Experiments
Bias
Play Down Performance Legitimate (Overlooked?) Correlations Construction Experiments
6
➔ Educational ➔ Rich Metadata ➔ Clinical and dermoscopic images for every case ➔ Clinical data (location, diameter, elevation) ➔ Metadata for dermoscopic features. ➔ Large ➔ Diverse ➔ Different sources, different devices ➔ Segmentation masks for lesion (large subset) ➔ Segmentation masks for dermoscopic features (small subset).
Datasets
Atlas of Dermoscopy ISIC Archive
7
8
Traditional
9
Traditional Only Skin
10
Traditional Only Skin Bbox
11
Traditional Only skin Bbox Bbox70
13
Destruction Experiments
14
Destruction Experiments
Destruction Experiments
Performance of machine learning with all cogent information removed on ISIC Archive: 71% AUC
15
Destruction Experiments
Performance of machine learning with all cogent information removed on ISIC Archive: 71% AUC
¹“The Melanoma Classification Benchmark”, Brinker et al. (2019) 16
Performance of 157 dermatologists¹ on ISIC Archive: 67% AUC
17
Traditional b) Grayscale Attributes c) RGB Attributes d) Traditional + Grayscale Attributes
18
Traditional Grayscale Attributes c) RGB Attributes d) Traditional + Grayscale Attributes
19
Traditional Grayscale Attributes RGB Attributes d) Traditional + Grayscale Attributes
20
Traditional Grayscale Attributes RGB Attributes Traditional + Grayscale Attributes
21
Construction Experiments
22
Machine learning results results are probably optimistic Feeding the model with relevant dermoscopic attributes is worse than feeding it with “only skin” or “bbox” sets Solving the bias problem is critical for deploying automated skin lesion analysis to the real world
Conclusions
23
24
Team
24
Acknowledgments
reasoning for complex data
25
Thanks!
ISIC Workshop @ CVPR 2019