GPU-ENABLED DIFFERENTIAL DEPENDENCY NETWORK ANALYSIS OF LARGE - - PowerPoint PPT Presentation
GPU-ENABLED DIFFERENTIAL DEPENDENCY NETWORK ANALYSIS OF LARGE - - PowerPoint PPT Presentation
GPU-ENABLED DIFFERENTIAL DEPENDENCY NETWORK ANALYSIS OF LARGE DATASETS Gil Speyer May 9, 2017 Nvidia GTC S7254 Transcriptomic Data Personome.com Biological Pathways Nature.org Differential expression analysis Class labels C1 C2
Transcriptomic Data
Personome.com
Biological Pathways
Nature.org
Differential expression analysis
C1 C2
Class labels Differential Expression GSEA
Pathway p-value P1 0.00002 P2 0.0001 … … Pk 0.01
Pathway analysis
Differential Coexpression and DDNs
Biolayout.org
EDDY: discovery of differential dependency
Gene Set DB Gene set G1, G2, … Gi = {gi,1, gi,2, …} ¡
g1 ¡ g2 ¡ g3 ¡ g4 ¡ g5 ¡
known ¡ interaction ¡ inferred ¡ dependency ¡
GDNi ¡ Known gene-gene Interactions Gj: DDN
C2-specific dependency C1-specific dependency Common dependency
Prior ¡knowledge ¡
Gene set p-value G1 0.00001 G2 0.00005 … … Gk 0.007
C1 ¡
¡
C2 ¡
¡ g1 ¡ g2 ¡ g3 ¡ g4 ¡ g5 ¡
GINi ¡
P G DC1
( )
Network Likelihoods C1 C2
…
Statistical ¡ ¡ test ¡
…
Jung ¡et ¡al., ¡NAR ¡2014 ¡ Speyer ¡et ¡al., ¡PSB ¡2016 ¡
Transcriptomic Data
EDDY + CTRP-CCLE
– Identifies pathways enriched with differential dependency between sensitive and non-sensitive cancer cell lines, as in DDNs – Discover mediators of drug sensitivity, i.e. potential targets?
Cell lines chemical sensitivity for 481 small compounds RNAseq of 935 cancer cell lines
Drug Sensitivity
10 15 0.0 0.1 0.2 0.3
auc
- ●
- ●
- ●
- ●
- ●
- ●
- 200
400 600 800 BRD−K12502280
- Sensitive (125)
Uncertain (596) NOT sensitive (126)
- BRD−K12502280
[TG−101348 −− inhibitor of Janus kinase 2]
Viability (% DMSO) Concentration (log)
Dose-Response Curve
Area Under Curve
IC50 ¡ EC50 ¡ Cell line chemical sensitivity for 481 small compounds across >900 CCLs
EDDY-CTRP Workflow
- ●
- ●
- ●
- ●
- ●
- ●
- ●
- ●
- ●
- 200
- Sensitive (209)
sensitive non-sensitive
P G DC1
( )
Network Likelihoods C1 C2
…
EDDY
Pathway DB # genes p-val Rewiring FIBRINOLYSIS B 12 0.0062 0.56 IL4 B 11 0.0074 0.67 DCC MEDIATED ATTRACTIVE SIGNALING R 13 0.0086 0.66 GRANULOCYTES B 14 0.0093 0.53 P130CAS LINKAGE TO MAPK SIGNALING FOR INTEGRINS R 15 0.0101 0.61 DSCAM INTERACTIONS R 11 0.0105 0.6 TRAF3 DEPENDENT IRF ACTIVATION R 14 0.0169 0.73 CARM1 B 13 0.0173 0.68 SPRY REGULATION OF FGF SIGNALING R 14 0.0183 0.56 THE NLRP3 INFLAMMASOME R 12 0.0185 0.72
Gene Set DB Known gene-gene Interactions
CTRP CCLE
Pathway Summary
- ~9,000 drug-pathway pairs identified
- http://biocomputing.tgen.org/software/EDDY/CTRP
Top 10 most frequent pathways associated with drug response
TG 101348, a JAK2 inhibitor
Sensitive Resistant BOTH PRIOR NEW
TG101-348
- A JAK2 inhibitor
- EPONFKB Pathway
- Essentiality mediators
JAK2, CDKN1A
TG 101348, a JAK2 inhibitor
Sensitive Resistant BOTH PRIOR NEW
Nutlin-3 Response
Non-Sensitive Sensitive BOTH PRIOR NEW RB Pathway Essentiality mediators
- CDK4, CDK1
High specificity mediators
- TP53
Nutlin-3 Oncoprint
188 resistant samples 176 sensitive samples
GPU Implementation
EDDY usage
EDDY internals
… … … …
K1 K2
Unique set of networks Independence test on each edge
… … … …
K3 assessed from Condition 1 samples assessed from Condition 2 samples
EDDY internals
Likelihood Distribution
GPU Implementation
Independence test on each edge
GPU Implementation
Unique set of networks
GPU Implementation
Distribution of network scores
GPU Verification
GPU Performance
CPU: Intel Xeon 2.3 GHz with 33 GB of RAM GPU: NVIDIA Quadro K6000 902 MHz with 12 GB DDR5
GPU Implementation
Level 1: Avg. fraction of possible edges
GPU Implementation
Level 2: Avg. number of networks
GPU Implementation
Level 3: Number of significant pathways
TCGA Pan-Cancer
PIK3CA WT Mutation PRIOR NEW
- PIK3CA 465
Mutation and 4289 Wild Type (WT)
- PI3K Activation
pathway
- PIK3CA not a node
- f significant
alteration
Single Cell Transcriptomic Data
Single Cell Transcriptomic Data Analysis
REGULATION OF THE FANCONI ANEMIA
FANCD2 is an important protein in DNA double strand break repair. It stays connected to ATR. In the mesenchymal cells, FANCD2 disconnects from this network, but now ATR rewires with ATM, which has a similar activity to ATR. This is suggestive of a switch in the type of DNA repair mechanisms and possible lack of DNA double strand repair in mesenchymal subtype
Non-Mesenchymal Mesenchymal BOTH PRIOR NEW
Future Directions and Acknowledgements
- Compute the Cure award
- Single-cell RNAseq dataset
- drop-out events
- more outliers
- eddy-gpu will be available
- n github once manuscript
is submitted (this month)
- TGen Biocomputing
- Seungchan Kim
- Juan Jose Rodriguez
- Tomas Bencomo
- TGen CTD2
- Michael Berens
- Harshil Dhruv
- Jeff Kiefer