interpreting mechanisms of prediction for skin cancer
play

Interpreting mechanisms of prediction for skin cancer diagnosis - PowerPoint PPT Presentation

Interpreting mechanisms of prediction for skin cancer diagnosis using multi-task learning D. Coppola 1 (speaker) H. K. Lee 1 , C. Guan 2 1 Bioinformatics Institute, A*STAR, Singapore 2 School of Computer Science and Engineering, NTU, Singapore


  1. Interpreting mechanisms of prediction for skin cancer diagnosis using multi-task learning D. Coppola 1 (speaker) H. K. Lee 1 , C. Guan 2 1 Bioinformatics Institute, A*STAR, Singapore 2 School of Computer Science and Engineering, NTU, Singapore Presented at the ISIC Skin Image Analysis Workshop @ CVPR 2020

  2. πŸ” Outline πŸ›‘ πŸ“‹ βš— πŸ“ 🎰 Methods Data Experiments Conclusions Introduction Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 2 using multi-task learning

  3. 🎰 Introduction 7-point checklist Melanoma method identification Identification of 7 attributes; each carries a score (0, 1 or 2) Rule-based procedures If the sum of the scores exceeds a β€’ ABCD rule certain threshold Ο„ β€’ 7-point checklist (typically 1 or 3), the method lesion is deemed a melanoma Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 3 using multi-task learning

  4. 🎰 Introduction Real-world medical application of DL is limited, despite good performance Main barrier is the opaqueness of the models Growing interest in developing methods to understand the mechanics of the models (XAI – Barredo Arrieta, 2020) Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 4 using multi-task learning

  5. 🎰 Introduction How can we examine what a DL model is learning? How to join rule-based methods with deep learning? Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 5 using multi-task learning

  6. 🎰 Introduction MTL method that learns what to share between tasks through gates Our Gates allow inspection the relationships learned by the network proposal Application to the 7-point checklist method (Argenziano, 1998) Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 6 using multi-task learning

  7. πŸ›‘ Methods – Overall System Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 7 using multi-task learning

  8. πŸ›‘ Methods – Gates Tasks should share features only when useful A β€œgate” applied to a tensor of feature maps feature tensor gate vector output tensor allows to selectively pick or suppress some features Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 8 using multi-task learning

  9. πŸ›‘ Methods – Gates Ideally a gate would be binary Not be learnable through gradient descent feature tensor gate vector output tensor Modelled as vector of continuous values in [0, 1] Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 9 using multi-task learning

  10. πŸ›‘ Methods – Gated Block The gates are always β€œopen” for the features corresponding to the task itself Features 𝐺 𝑒 Features 𝐺 π‘’βˆ— obtained are input for through conv next conv layer for π‘ˆ layer tasks Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 10 using multi-task learning

  11. Methods – Training matters Implementation of sampling strategy from Task index 𝑒 Kawahara et al. (2019) Sample index 𝑑 𝐾 𝑒 Labels for task 𝑒 Focal cross-entropy loss (Lin et al., 2017) Label index π‘˜ 𝐾 𝑒 𝑒 Weight computed by π‘₯ 𝛾 π‘˜ sampling strategy 𝑒 = ෍ 𝑒 ) 1 βˆ’ ΰ·ͺ log( ΰ·ͺ 𝑒 𝑧 𝑑,π‘˜ 𝑒 𝑒 𝐺𝑀 𝑑 π‘₯ 𝑧 𝑑,π‘˜ 𝑧 𝑑,π‘˜ 𝑒 Ground truth label 𝑧 𝑑,π‘˜ π‘˜ ΰ·ͺ Predicted label 𝑒 π‘˜ 𝑧 𝑑,π‘˜ 𝛾 Focal cross-entropy 1 βˆ’ ΰ·ͺ This loss is applied to each sample for each task 𝑒 𝑧 𝑑,π‘˜ coefficient ( 𝛾 = 2 ) Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 11 using multi-task learning

  12. πŸ“‹ Data 7pt-derm dataset Data per Labels for 8 patient tasks 1011 patient Train-val-test β€’ metadata β€’ lesion diagnosis samples split provided β€’ clinical image β€’ 7-point checklist attributes β€’ dermoscopic image β€’ labels Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 12 using multi-task learning

  13. βš— Experiments – Definition  Standard β€’ basic architecture  Binary β€’ DIAG has 5 unbalanced labels. What if they are grouped as β€œmelanoma vs all”? ο‚ͺ Gates-off β€’ what happens if no sharing is permitted? Model is always trained from scratch Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 13 using multi-task learning

  14. βš— Experiments – Performance Avg. Diagnosis experiment metric 7pt-checklist Standard has best performance (DIAG) attributes among experiments with similar accuracy 45.8 61.3  recall 45.5 57.7 setup standard precision 40.3 55.2 Closing the gates shows slight accuracy 44.3 51.4  recall 38.5 55.6 drop in performance gates-off precision 35.3 51.7 accuracy 77.2** 61.3 ο‚ͺ recall 71.0 ** 58.3 Binary has easier DIAG binary precision 70.3 ** 55.6 classification but otherwise accuracy 74.2 73.6 comparable performance Kawahara et al., recall 60.4 64.7 2019 precision 69.6 65.4 Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 14 using multi-task learning

  15. βš— Experiments – Performance Avg. Diagnosis experiment metric 7pt-checklist (DIAG) attributes Method by Kawahara et al. (2019) has accuracy 45.8 61.3  better overall performance recall 45.5 57.7 standard precision 40.3 55.2 accuracy 44.3 51.4  recall 38.5 55.6 gates-off precision 35.3 51.7 accuracy 77.2** 61.3 ο‚ͺ Possible reasons recall 71.0 ** 58.3 binary precision 70.3 ** 55.6 Use of additional data Starts from pre-trained (metadata, clinical accuracy 74.2 73.6 network on ImageNet Kawahara et al., images) in the pipeline recall 60.4 64.7 2019 precision 69.6 65.4 Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 15 using multi-task learning

  16. βš— Experiments – Application of the 7pt- checklist rule The 7-point checklist rule can be applied on the predicted attributes as an additional way of determining the diagnosis (only as β€œmelanoma vs all) β€’ Direct diagnosis : the model’s prediction of the DIAG task β€’ Inferred diagnosis : the diagnosis obtained by applying the 7-point checklist method on the predicted attributes Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 16 using multi-task learning

  17. βš— Experiments – Application of the 7pt- checklist rule GT : application of the 7-point checklist rule on the ground truth labels Using the 7pt rule, binary binary standard GT and standard have similar performance to GT when inferring melanoma A low threshold ( 𝜐 = 1 ) provides high sensitivity to melanoma but many false positives 1: melanoma; 0: otherwise Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 17 using multi-task learning

  18. βš— Experiments – Sharing Fraction Defined as the average value of the gates between task 𝑒 (taking the features) and 𝑗 (giving the features) 𝐷 𝑒 = 1 𝑒 SF 𝑗 𝐷 ෍ 𝛽 𝑗,𝑑 𝑑 Indicates the amount of sharing between two tasks at a given gated block Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 18 using multi-task learning

  19. βš— Experiments – Sharing Fraction Looking at the SF at the last gated block for experiment standard DIAG is the task that has more sharing with the other task β€’ High values with the major criteria (PN, BWV, VS) In the other rows, some values are close to 0, the model is learning to be selective Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 19 using multi-task learning

  20. πŸ“ Conclusions – Summary New framework for MTL β€’ Based on gates that learn what to features to share among tasks β€’ 7-point checklist fits MTL model design Gates allow to inspect the learned relationships between tasks β€’ Give insights on the mechanisms of the model β€’ Strategy shows selectivity in choosing which features to share Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 20 using multi-task learning

  21. πŸ“ Conclusions – Future directions Qualitative insights Performance β€’ Explore advanced metric to evaluate the sharing between tasks matters β€’ Discuss findings with practitioners β€’ Experiment with different task-specific architectures β€’ Include the metadata in the pipeline Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 21 using multi-task learning

  22. Thank you for your attention ☺ Contacts Davide Coppola (davidec@bii.a-star.edu.sg) Interpreting mechanisms of prediction for skin cancer diagnosis 15 Jun 2020 22 using multi-task learning

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend