Image-based profiling using deep learning Juan C. Caicedo Ph.D - - PowerPoint PPT Presentation

image based profiling using deep learning
SMART_READER_LITE
LIVE PREVIEW

Image-based profiling using deep learning Juan C. Caicedo Ph.D - - PowerPoint PPT Presentation

mitosis Image-based profiling using deep learning Juan C. Caicedo Ph.D Broad Institute of MIT and Harvard Images can be quantified for all kinds of phenotypes Muscle structure Patient biopsy tissue Image Mass Spec David Thomas Margaret


slide-1
SLIDE 1

Image-based profiling using deep learning

Juan C. Caicedo Ph.D Broad Institute of MIT and Harvard

mitosis

slide-2
SLIDE 2

Images can be quantified for all kinds of phenotypes

David Thomas Margaret Shipp/Scott Rodig Michael Angelo Allen Institute for Cell Science Olivier Pourquie Muscle structure 3D Muscle structure Patient biopsy tissue Control human iPS Isogenic Duchenne-like iPS Image Mass Spec

slide-3
SLIDE 3

Screen for specific phenotypes using images

Clinical trials underway for Alisertib in adults with AMKL. Wen Q, et al. (2012). Cell 150(3):575-89

DNA stain with outlines identifying the nuclei DMSO SU6656

slide-4
SLIDE 4

TKK 0.1 uM

slide-5
SLIDE 5

AZ138 - 0.01uM

slide-6
SLIDE 6

What is image-based profiling?

Caicedo J.C., Singh S., Carpenter A. "Applications of Image-Based Profiling of Perturbations". Current Opinion in Biotechnology - 2016.

slide-7
SLIDE 7

Cell Painting assay

Gustafsdottir, et al. PLOS ONE 2013 Bray, et al. Nature Protocols 2016

slide-8
SLIDE 8

Image-based Profiling

  • 1. Raw images
  • 2. Segmented images
  • 3. Single-cell feature matrices
  • 4. Population profiles of treatments
  • 5. Downstream statistical analysis

Are treatments significantly different / effective?

Caicedo, J.C., et al. "Data-analysis strategies for image-based cell profiling." Nature Methods 14.9, 2017

slide-9
SLIDE 9

Automatic Identification of Cells

Segmentation

slide-10
SLIDE 10

Neural Networks for Segmentation

Example Image Labeled objects Manual annotation

Train Model

Training Applying

Run Model

Labeled objects New image

slide-11
SLIDE 11

Neural Nets produce fewer segmentation errors

Caicedo, J.C., et al. "Evaluation of Deep Learning Strategies for Nucleus Segmentation in Fluorescence Images." BioRxiv (2019): 335216.

slide-12
SLIDE 12

More data improves performance

slide-13
SLIDE 13

Diversity of imaging studies

slide-14
SLIDE 14

Can we create a single tool to detect cell nuclei in any light microscopy image?

slide-15
SLIDE 15

Universal nucleus segmentation

65,333 experiments 3,634 teams 3 months

slide-16
SLIDE 16

Dataset

  • 37,333 annotated nuclei
  • 841 images
  • 30 biological experiments
  • 22 cell types
  • 5 image groups
slide-17
SLIDE 17

Performance of participants

0.0

0.1 0.2 0.3 0.4 0.5 0.6

1 - [ods.ai] topcoders 2 - jacobkie 3 - Deep Retina 4 - Nuclear Vision 5 - Inom Mirzaev CellProfiler reference*

Competition score Number of participant teams

10 20 30 40

Distribution of scores in second-stage evaluation

Intersection over Union Threshold

0.0 0.2 0.4 0.6 0.8

Accuracy: F1-score

1.0 1st place 2nd place 3rd place reference

Accuracy of top-3 models

0.5 0.6 0.7 0.8 0.9

a b

slide-18
SLIDE 18

Top 3 solutions

[ods.ai] topcoders jacobkie Deep Retina 1st place 2nd place 3rd place

slide-19
SLIDE 19

Performance by image type

0.0 0.2 0.4 0.6 0.8 Small fluorescent Pink and purple tissue Purple tissue Large fluorescent Grayscale tissue

Accuracy by image type

Accuracy: F1-score @ 0.7 IoU

Dataset distribution b a

1st place 2nd place 3rd place

Training Test

80.6% 0.6% 15.5% 0.9% 2.4% 67.9% 4.7% 15.1% 11.3% 0.9% reference

slide-20
SLIDE 20

More accurate and takes no time

Caicedo et al. 2019 Nature Methods. In Press.

Small fluorescent Pink and purple tissue Purple tissue Large fluorescent Grayscale tissue

slide-21
SLIDE 21
slide-22
SLIDE 22

Lung Adenocarcinoma A Cell Painting study

slide-23
SLIDE 23

Patient with metastatic breast cancer

  • She is diagnosed in 2016
  • Doctors observe tennis-ball sized tumors

in several parts of her body

  • They give her 3 months life expectancy

The patient accepts an experimental treatment called immunotherapy.

slide-24
SLIDE 24

Train the immune system to fight cancer

slide-25
SLIDE 25

The patient fully recovered

Taken from https://www.bbc.co.uk/news/health-44338276

slide-26
SLIDE 26

Why did the treatment work?

Tumor sequencing revealed 62 mutations They knew treatment for 7 of them The experiment worked only with 4 mutations

slide-27
SLIDE 27

Cell Painting LUAD dataset

Anne Carpenter Shantanu Singh Juan Caicedo Mohammad Rohban

Cell line: A549 Over-expression 8 replicates 50 million+ single cells

EGFR_WT CONTROL ARAF_WT

CTNNB1_WT

FBXW7_WT KRAS_WT KEAP1_WT MAPK7_WT RIT1_WT STK11_WT

slide-28
SLIDE 28

Morphology-based VIP

ARAF_p.S214F

slide-29
SLIDE 29

Image-based Profiling

  • 1. Raw images
  • 2. Segmented images
  • 3. Single-cell feature matrices
  • 4. Population profiles of treatments
  • 5. Downstream statistical analysis

Are treatments significantly different / effective?

Caicedo, et al. 2017 Nature Methods

slide-30
SLIDE 30

Cell Feature Extraction

Engineer measurements Define and compute useful properties

Area Shape Color distribution Nuclei size

slide-31
SLIDE 31

Cell Feature Extraction

classification

Learn features to solve a task Train a deep neural network

slide-32
SLIDE 32

Weakly supervised learning

Discover associations

Compound A Compound B Compound C Compound D

Mechanistic Class X Mechanistic Class Y

Batch 1

A1 A2 B1 B2 C1 C2

Batch 2

A3 D1 D2 B3 C3 D3

slide-33
SLIDE 33

Weakly supervised learning

softmax CNN

Main goal: Treatment-level profiling Auxiliary task: Single-cell treatment classification Caicedo, J. C., et al. "Weakly supervised learning of single-cell feature embeddings". Computer Vision and Pattern Recognition, IEEE CVPR 2018

slide-34
SLIDE 34

Deep Learning for Image-based Profiling

Caicedo et al. CVPR 2018

CellProfiler Weakly Supervised Learning

Control TP53.WT STK11.WT NFE2L2.WT MDM2.WT KRAS_p.G12V KEAP1.WT EGFR_p.T790M EGFR_p.L858R EGFR.WT

Transfer learning In all plots, x-axis is t-SNE1 and y-axis is t-SNE2 of the projected phenotypic space.

slide-35
SLIDE 35

Determining variant impact

EGFR Wild Type Control EGFR Mutant

EGFR_p.S645C

slide-36
SLIDE 36

Single-cell morphological analysis

EGFR_p.S645C

EGFR WT Control EGFR MUT

Variant impact: 66.9%

slide-37
SLIDE 37

Prioritizing variants by impact

EGFR_p.T790M, p.L858R.o EGFR_p.L858R EGFR_p.S654C EGFR_p.K754E EGFR_p.Q102H EGFR_p.R222L

Find more results at: https://broad.io/cp-luad

slide-38
SLIDE 38

Prioritizing variants by impact

Morphological Variant Impact Score (%) EGFR_p.T790M, p.L858R.o EGFR_p.L858R EGFR_p.S654C EGFR_p.K754E EGFR_p.Q102H EGFR_p.R222L

0.0% 20.0% 40.0% 60.0% 80.0%

Find more results at: https://broad.io/cp-luad

slide-39
SLIDE 39

Conclusions

Imaging is a rich source of information. Computer vision has powerful tools for image analysis. Many computer vision tasks can be fully automated. Imaging data can be connected to other sources of data.

slide-40
SLIDE 40

Gratitude

Hamdah Abbasi Jeanelle Ackerman Beth Cimini Minh Doan Allen Goodman Profiling group: Shantanu Singh Tim Becker Marzieh Haghighi Matt Smith Broad Imaging Platform Anne Carpenter Many thanks to our many biology collaborators! Recent major funding for this work provided by:

  • NIH NIGMS: R01 GM089652
  • NIH NIGMS: MIRA R35 GM122547
  • NSF CAREER: DBI 1148823
  • NSF/BBSRC: DBI 1458626