Not Just a Black Box: Interpretable Deep Learning for Genomics
Presented by: AvanA Shrikumar
1
Not Just a Black Box: Interpretable Deep Learning for Genomics - - PowerPoint PPT Presentation
Not Just a Black Box: Interpretable Deep Learning for Genomics Presented by: AvanA Shrikumar 1 With great power comes really poor interpretability Deep Power Learning Traditional machine learning Classical statistics 2
1
Deep Learning
Interpretability Power
Classical statistics Traditional machine learning
2
Deep Learning
Interpretability Power
Classical statistics Traditional machine learning Interpretable Deep Learning
2
3
3
4
Output
Yellow = inputs
5
Output
Yellow = inputs
5
Output
Yellow = inputs
5
Output
Yellow = inputs
5
Output
Yellow = inputs
5
Output
Yellow = inputs
5
Output
Yellow = inputs
5
Output
Yellow = inputs
Drawbacks
1) Computa:onal efficiency - requires one forward prop for each perturba:on
5
Output
Yellow = inputs
Drawbacks
1) Computa:onal efficiency - requires one forward prop for each perturba:on 2) Satura:on
5
i1 i2
y
1 1 2
6
i1 i2
y
1 1 2
=1 =1 =1
6
i1 i2
y
1 1 2
=1 =1 =1
6
Output
Yellow = inputs
7
Output
Yellow = inputs
7
Output
Yellow = inputs
7
Output
Yellow = inputs
Examples:
7
Output
Yellow = inputs
Examples:
deepli^, ICML 2017
Kundaje
7
8
1 1 2
i1 i2
y
When (i1 + i2) >= 1, gradient is 0
8
1 1 2
i1 i2
y
1 1 2 Reference: i1=0 & i2=0
9
i1 i2
y
1 1 2 y=0 when (i1 + i2) = 0 (reference) Reference: i1=0 & i2=0
9
i1 i2
y
1 1 2 y=0 when (i1 + i2) = 0 (reference)
At (i1 + i2) = 2, the “difference from reference” is +1, NOT 0
Reference: i1=0 & i2=0
9
i1 i2
y
1 1 2 y=0 when (i1 + i2) = 0 (reference)
At (i1 + i2) = 2, the “difference from reference” is +1, NOT 0
Reference: i1=0 & i2=0
9
i1 i2
y
DeepLIFT addresses other failure modes besides saturaAon (see paper)
Original
CIFAR10 model, class = “ship”
10
Original Reference DeepLIFT scores
CIFAR10 model, class = “ship”
10
Original Reference DeepLIFT scores
CIFAR10 model, class = “ship”
SuggesAons on how to pick a reference:
10
Original Reference DeepLIFT scores
CIFAR10 model, class = “ship”
SuggesAons on how to pick a reference:
generated by shuffling a genomic sequence
10
8->3 8->6 Guided Backprop Integrated gradients DeepLIFT
11
fer:lized egg liver cells cardiac cells blood cells
12
Cell-types are different because different genes are turned on
fer:lized egg liver cells cardiac cells blood cells
How is cell-type-specific gene expression controlled?
12
Cell-types are different because different genes are turned on
fer:lized egg liver cells cardiac cells blood cells
How is cell-type-specific gene expression controlled?
Cell-types are different because different genes are turned on
13
DNA sequence of a gene Control element
13
DNA sequence of a gene Control element ACGTGTAACTGATAATGCCGATATT Sequence contain “DNA words” that controller proteins bind to
13
DNA sequence of a gene Control element ACGTGTAACTGATAATGCCGATATT Controller proteins bind to DNA words Sequence contain “DNA words” that controller proteins bind to
13
DNA sequence of a gene Control element + controller proteins loop over…
13
DNA sequence of a gene Control element + controller proteins loop over… …and ac:vate nearby genes
14
DNA sequence of a gene Controller proteins *Stranger et al., Genet., 2011
14
DNA sequence of a gene ACGTGTAACTGATAATGCCGATATT Controller proteins Control element has “DNA words” that controller proteins bind to *Stranger et al., Genet., 2011
14
DNA sequence of a gene ACGTGTAACTGATAATGCCGATATT Controller proteins Control element has “DNA words” that controller proteins bind to *Stranger et al., Genet., 2011
14
DNA sequence of a gene ACGTGTAACTGATAATGCCGATATT Controller proteins Control element has “DNA words” that controller proteins bind to *Stranger et al., Genet., 2011
14
DNA sequence of a gene ACGTGTAACTGATAATGCCGATATT Controller proteins Control element has “DNA words” that controller proteins bind to
Many posi:ons in a control element are not essen:al for its func:on!
*Stranger et al., Genet., 2011
14
DNA sequence of a gene ACGTGTAACTGATAATGCCGATATT Controller proteins Control element has “DNA words” that controller proteins bind to
Many posi:ons in a control element are not essen:al for its func:on!
*Stranger et al., Genet., 2011
15
Experimentally measure control elements in different :ssues
15
Experimentally measure control elements in different :ssues Predict :ssue- specific ac:vity of control elements from sequence using deep learning
15
Experimentally measure control elements in different :ssues Predict :ssue- specific ac:vity of control elements from sequence using deep learning
15
Interpret the model to learn important posi:ons!
Learned pa<ern detectors Input: DNA sequence represented as ones and zeros Later layers build on pa<erns of previous layer Accessible in HSCs Output: ON (+1) vs OFF (0)
A C G T 1 1 1 1 1 1 1 1 1 1 1 1 1
ON in cell- type X ON in cell- type Y
16
Architecture:
layers + batch norm
connected layers
Peyton Greenside
Publicly available dataset profiling control element ac:vity (Corces & Buenrostro et al., 2016)
17
Peyton Greenside
Publicly available dataset profiling control element ac:vity (Corces & Buenrostro et al., 2016)
Hematopoe:c stem cell White blood cell Red blood cell
17
Peyton Greenside 18
Gata Gata Gata SPI1
Peyton Greenside 18
Importance in HSC’s Gata Gata Gata SPI1
SPI1 controller protein binding signal GATA1 controller protein binding signal
(Data unavailable) (Data unavailable)
“Is an acAve control element” signal Peyton Greenside HSC’s
18
Importance in B-cells Gata Gata Gata SPI1
SPI1 controller protein binding signal GATA1 controller protein binding signal
“Is an acAve control element” signal Peyton Greenside HSC’s B-cells
18
Importance in Erythroid Gata Gata Gata SPI1
SPI1 controller protein binding signal GATA1 controller protein binding signal “Is an acAve control element” signal Peyton Greenside HSC’s Erythroid B-cells
18
Peyton Greenside
19
20
Individual GATA pa<ern detectors mo:fs found by DeepBind (Alipanahi et al.)
Problem: High levels of redundancy, because mulAple neurons cooperate with each other
21
Individual GATA pa<ern detectors mo:fs found by DeepBind (Alipanahi et al.)
Problem: High levels of redundancy, because mulAple neurons cooperate with each other Computer vision
21
22
Insight: input-level importance scores reveal combined contribu:ons
Sequence 1 Sequence 2 Sequence 3 score score score
22
Insight: input-level importance scores reveal combined contribu:ons
22
Insight: input-level importance scores reveal combined contribu:ons
22
Insight: input-level importance scores reveal combined contribu:ons
22
Insight: input-level importance scores reveal combined contribu:ons
22
23
23
23
23
24
25
– First step of t-sne: compute condi:onal probs – βi is tuned to a<ain a desired perplexity! 25
– First step of t-sne: compute condi:onal probs – βi is tuned to a<ain a desired perplexity!
25
– First step of t-sne: compute condi:onal probs – βi is tuned to a<ain a desired perplexity!
– Use density-adapted probabili:es with clustering based on Louvain community detec:on 25
26
Peyton Greenside
– With advantages over other methods – h<ps://github.com/kundajelab/deepli^
– More details in talk at NIPS comp bio: h<ps://www.youtube.com/watch?v=fXPGVJg956E
27
Oana Ursu Amr Alexandari Daniel Kim Michael Wainberg Maryna Taranova Chris Probert Jin-Wook Lee
Chuan Sheng Foo Johnny Israeli Irene Kaplow Funding HHMI Interna:onal Student Research Fellowship Bio-X Fellowship Microso^ Women’s Fellowship NIH R01ES02500902 Peyton Greenside Anna Shcherbina Anshul Kundaje