PSIchomics Shiny application for the integrated analysis of - - PowerPoint PPT Presentation

psichomics shiny application for the integrated analysis
SMART_READER_LITE
LIVE PREVIEW

PSIchomics Shiny application for the integrated analysis of - - PowerPoint PPT Presentation

PSIchomics Shiny application for the integrated analysis of alternative splicing from large transcriptomic datasets Nuno Agostinho Nuno Morais EuroBioC 6 Dec. 2016 laboratory 2016 2 Alternative Splicing Gene Exon 1 Exon 2 Exon 3


slide-1
SLIDE 1

PSIchomics 
 Shiny application for the integrated analysis of alternative splicing from large transcriptomic datasets

Nuno Morais laboratory Nuno Agostinho 6 Dec. 2016 EuroBioC 2016

slide-2
SLIDE 2

Introduction Workflow Case Study Testing Conclusions

2

Alternative Splicing

Exon 1 Exon 2 Exon 3

Gene

slide-3
SLIDE 3

Introduction Workflow Case Study Testing Conclusions

2

Alternative Splicing

Exon 1 Exon 2 Exon 3

Gene

slide-4
SLIDE 4

Introduction Workflow Case Study Testing Conclusions

Alternative Splicing

3

Exon 1 Exon 2 Exon 3

Gene

  • Occurs in 95% of human multi-exon genes (Pan et al., 2009)
  • Involved in the control of many cellular processes (Oltean & Bates, 2014)
slide-5
SLIDE 5

Introduction Workflow Case Study Testing Conclusions

Alternative Splicing

4

Studying alternative splicing changes may allow to identify prognostic factors and therapeutic targets

  • Occurs in 95% of human multi-exon genes (Pan et al., 2009)
  • Involved in the control of many cellular processes (Oltean & Bates, 2014)
  • Alternative splicing deregulation is linked with cancer development

(Oltean & Bates, 2014)

slide-6
SLIDE 6

Introduction Workflow Case Study Testing Conclusions

RNA Sequencing

5

1 Exon 1 Exon 2 Exon 3 Extract RNA 3 Convert
 to DNA 2 Divide in fragments 4 Sequence DNA 5 Obtain reads 6 Map reads to DNA of reference Exon 2 Exon 3 Exon 1 Junction reads Exonic reads Reference DNA

slide-7
SLIDE 7

Introduction Workflow Case Study Testing Conclusions

Alternative Splicing Quantification

Alternative splicing annotation Junction read counts

6

Percent Spliced-In (PSI) = inclusion reads inclusion + exclusion reads

Distribution of PSI values

0.4 0.1 0.2 0.3 0.5 0.6 0.7 0.8 0.9 1 2 4

Median: 0.82 Variance: 0.05 Median: 0.17 Variance: 0.06 Mann–Whitney U test's p-value (FDR): 2.28e-07

ACTN1 (exon 19)

slide-8
SLIDE 8

Introduction Workflow Case Study Testing Conclusions

Quantification and Analytical Tools

  • Many programs quantify, analyse and visualise alternative

splicing data

7

MISO rMATS VAST-TOOLS SUPPA jSplice AltAnalyze SpliceSeq TIN SGSeq splicegear JunctionSeq DRIMSeq SeqGSEA spliceR DEXSeq Cufflinks FineSplice JuncBASE Splicing Compass

slide-9
SLIDE 9

Introduction Workflow Case Study Testing Conclusions

Quantification and Analytical Tools

  • Many programs quantify, analyse and visualise alternative

splicing data

  • No standard pipeline
  • Their problems range from…

8

No user-friendly interfaces in most tools Time-consuming quantification of alternative splicing No incorporation of clinical information Over-simplistic analyses or focus in the quantification step

slide-10
SLIDE 10

Introduction Workflow Case Study Testing Conclusions

PSIchomics

9

Quantify, analyse and visualise alternative splicing in cancer data Incorporate clinical information Modular architecture to easily modify and extend the program Visual and 
 command-line 
 interfaces

slide-11
SLIDE 11

Introduction Design decisions Implementation Testing Conclusions

  • Open-source, free and cross-platform functional language
  • Ideal for statistics, data manipulation and graphical

computation

  • Used by the scientific community

Choosing as the language

10

  • Web app framework using R, HTML, CSS

and JavaScript

  • Reactive programming model (Excel-like

reactivity)

slide-12
SLIDE 12

Workflow

Introduction Workflow Case Study Testing Conclusions

slide-13
SLIDE 13

Introduction Workflow Case Study Testing Conclusions

Alternative Splicing Quantification

Alternative splicing annotation Junction read counts

12

Percent Spliced-In (PSI) = inclusion reads inclusion + exclusion reads

slide-14
SLIDE 14

Introduction Workflow Case Study Testing Conclusions

! " #

13

Alternative splicing annotation

  • Human (hg19 assembly) annotation is

available

  • Custom annotation files may be created
  • Alt. Splicing

Quantification Analyses and Visualisation Data Retrieval

Clinical data Junction read counts

  • Data is downloaded in-app using

Firebrowse web API

(data from human tumours)

The Cancer Genome Atlas

slide-15
SLIDE 15

Introduction Workflow Case Study Testing Conclusions

14

!

  • Alt. Splicing

Quantification Analyses and Visualisation Data Retrieval

# "

Quantify 
 alternative splicing Retrieve TCGA data
 (optional)

Junction 
 read counts

  • Alt. splicing

annotation

provided or prepared by user

Percent Spliced-In (PSI) = inclusion reads inclusion + exclusion reads

slide-16
SLIDE 16

Introduction Workflow Case Study Testing Conclusions

15

! " #

  • Alt. Splicing

Quantification Analyses and Visualisation Data Retrieval

Differential splicing analysis Gene, RNA and protein information Principal component analysis Survival analysis

slide-17
SLIDE 17

Introduction Workflow Case Study Testing Conclusions

  • Vast array of customisable and interactive plots
  • Features tooltips, zooming and plot exporting

16

Survival curves Density plots

! " #

  • Alt. Splicing

Quantification Analyses and Visualisation Data Retrieval

slide-18
SLIDE 18

Case Study

Introduction Workflow Case Study Testing Conclusions

slide-19
SLIDE 19

18

slide-20
SLIDE 20

19

slide-21
SLIDE 21

20

slide-22
SLIDE 22

21

slide-23
SLIDE 23

22

slide-24
SLIDE 24

23

PCA: dimensionality reduction by selecting the main directions of variance

slide-25
SLIDE 25

24

slide-26
SLIDE 26

25

slide-27
SLIDE 27

26

slide-28
SLIDE 28

Testing

Introduction Workflow Case Study Testing Conclusions

$ % &

Performance
 Benchmarking Continuous and
 Unit Testing Usability
 Testing

slide-29
SLIDE 29

Introduction Workflow Case Study Testing Conclusions

Breast cancer (1093 patients) Pan-kidney cohort (889 patients) Glioma cohort (676 patients) Liver cancer (371 patients) Running time (with default settings) 30s 1m 1m 30s 2m 2m 30s 3m 3m 30s 4m 4m 30s 5m 5m 30s 6m 6m 30s 2m 20s 2m 37s 2m 34s 2m 35s 35s 1m 20s 2m 2s 2m 39s 16s 33s 22s 47s

Load data Quantify AS (skipped exon) Differential analyses (Normal vs Tumour)

  • Common tumour types were selected from TCGA
  • Average time was collected over 10 consecutive runs

28 MacBook Pro 2011:
 i7 (8 cores), HDD and 8GB RAM

$ % &

Performance
 Benchmarking Continuous and
 Unit Testing Usability
 Testing

slide-30
SLIDE 30

Conclusions

Introduction Workflow Case Study Testing Conclusions

slide-31
SLIDE 31

Introduction Workflow Case Study Testing Conclusions

Conclusions

30

6 minutes using processed data with the highest number of patients in TCGA Quantify, analyse and visualise alternative splicing in cancer data Incorporates clinical information Modular architecture to easily modify and extend the program Command-line and easy-to-use graphical interface

slide-32
SLIDE 32

Introduction Workflow Case Study Testing Conclusions

Future work

31

' ( )

  • Extend available data sources
  • Quantify alternative splicing from raw data
  • Less dependence on processed data
  • Deploy to the web
  • No installation required
  • Always up-to-date
slide-33
SLIDE 33

32

GitHub Code hosting Bioconductor Biological R packages

PSIchomics

(MIT license)

slide-34
SLIDE 34

Thanks to you all!

Nuno Morais Lab André Falcão Ana Rita Grosso