Pathway Analysis Susan Steinbusch-Coort, PhD Department of - - PowerPoint PPT Presentation

pathway analysis
SMART_READER_LITE
LIVE PREVIEW

Pathway Analysis Susan Steinbusch-Coort, PhD Department of - - PowerPoint PPT Presentation

Pathway Analysis Susan Steinbusch-Coort, PhD Department of Bioinformatics-BiGCaT, Maastricht University, The Netherlands BioSB course: Biological Network Analysis http://tinyurl.com/pl5yreh Amsterdam, 17-18 September 2015


slide-1
SLIDE 1

Pathway Analysis

Susan Steinbusch-Coort, PhD

Department of Bioinformatics-BiGCaT, Maastricht University, The Netherlands

BioSB course: Biological Network Analysis http://tinyurl.com/pl5yreh Amsterdam, 17-18 September 2015 susan.coort@maastrichtuniversity.nl

slide-2
SLIDE 2

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Outline

  • 1. What are biological pathways?
  • 2. Pathway databases
  • 3. Pathway analysis
  • 4. Software: PathVisio
  • 5. Examples
  • 6. Hands on session

2

slide-3
SLIDE 3

What are biological pathways?

slide-4
SLIDE 4

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

What are biological pathways?

Definition on Wikipedia (August 2015):

“A biological pathway is a series of interactions among molecules in a cell that leads to a certain product or a change in a cell.”

4

slide-5
SLIDE 5

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

What are biological pathways?

Definition on Wikipedia (August 2015):

“A biological pathway is a series of interactions among molecules in a cell that leads to a certain product or a change in a cell.”

5

slide-6
SLIDE 6

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

What are biological pathways?

Definition on Wikipedia (August 2015):

“A biological pathway is a series of interactions among molecules in a cell that leads to a certain product or a change in a cell.”

6

slide-7
SLIDE 7

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

What are biological pathways?

7

DNA RNA PROTEIN Genomics-25,000 genes Transcriptomics – 100,000 Transcripts Proteomics – 1,000,000 Proteins BIOCHEMICALS (METABOLITES) Metabolomics – 1,800 Biochemicals

Types of molecules:

slide-8
SLIDE 8

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

What are biological pathways?

8

Protein 1 Protein 2 Protein 1 Protein 2 Complex

Cell membrane

External stimulus

Transport in cytoplasm Metabolite A Metabolite B Enzyme Protein Gene

TF1 TF2 Activation Inhibition

Types of interactions:

Complex Assembly: Transport: Catalysis: Gene regulation:

slide-9
SLIDE 9

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Biological pathway types

  • 1. Metabolic pathways
  • 2. Gene regulation

pathways

  • 3. Signal transduction

pathways

9

http://www.genome.gov/multimedia/illustrations/Biological_Pathways.pdf

slide-10
SLIDE 10

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Metabolic pathways

  • “… series of chemical

reactions occuring within a cell”

  • Enzymes calayze the

reactions

  • Example: Glycolysis

10

http://wikipathways.org/index.php/Pathway:WP534

slide-11
SLIDE 11

Pathway databases

slide-12
SLIDE 12

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Pathway databases

  • Where can we find biological pathways?

PathGuide:

  • List of > 500 pathway and interaction related resources

– http://www.pathguide.org

  • Commonly used pathway databases:

– WikiPathways (www.wikipathways.org) – Reactome (www.reactome.org) – KEGG (www.genome.jp/kegg) – BioCyc (www.biocyc.org) – Species specic pathway databases ...

12

slide-13
SLIDE 13

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Pathway databases

  • Be aware!
  • Only 50% of the human protein coding genes are

present in biological pathways.

  • A lot of information hidden in literature and

researchers' minds.

  • Detailed functions and mechanisms for many

proteins still unknown.

13

slide-14
SLIDE 14

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Pathway databases

  • WikiPathways (www.wikipathways.org)
  • Community curated pathway databases

14

What is a wiki? A wiki is an application, typically a web application, which allows collaborative modification, extension,

  • r deletion of its content and structure.
slide-15
SLIDE 15

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Pathway databases

  • WikiPathways (www.wikipathways.org)

– everybody can contribute and share pathways – everybody can edit and curate pathways – everybody can use the pathway collection – not just diagrams but fully annotated pathways – interactive pathway viewer – integrated pathway editor – changes can be reverted easily – new findings can be added immediately

15

slide-16
SLIDE 16

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

WikiPathways

16

slide-17
SLIDE 17

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

WikiPathways

17

slide-18
SLIDE 18

Pathway analysis

slide-19
SLIDE 19

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Why pathway analysis?

19

Scanned microarrays Raw intensities Normalized intensities Gene level statistics Set of affected pathways Set of

  • verrepresentated

GO terms Set of co-regulated genes Biological Interpretation Image analysis QC Normalization Statistical analysis Pathway analysis Clustering GO analysis

slide-20
SLIDE 20

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Why pathway analysis?

20

Hmgcr Dgat1 Ldlr Mttp Soat1 Lipc Pltp Lcat

slide-21
SLIDE 21

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Why pathway analysis?

“A picture is worth a thousand words.”

  • Intuitive
  • Puts data into biological context
  • More efficient than looking up single gene

information

21

slide-22
SLIDE 22

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Why pathway analysis?

  • Involvement in pathways

– Group genes, proteins and other biological molecules – Reducing complexity – Several hundred pathways instead of thousands of genes – Analysis on functional level

  • Identify active pathways that differ between two

conditions

– Higher explanatory power than a simple list of genes

22

slide-23
SLIDE 23

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Pathway analysis methods

  • Overrepresentation analysis (ORA)
  • Functional Class Scoring (FCS)
  • Pathway Topology (PT) Based

23

Khatri, Purvesh, Marina Sirota, and Atul J. Butte. "Ten years of pathway analysis: current approaches and outstanding challenges.“ PLoS computational biology 8.2 (2012): e1002375.

slide-24
SLIDE 24

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Pathway analysis methods

  • Overrepresentation analysis:

– Define input list through criteria – Count genes in pathway for input and background list – Perform statistical test for over- or under-representation (e.g. hypergeometric test)

24

slide-25
SLIDE 25

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Pathway analysis methods

25

  • 1. Total number of genes measured in experiment = N
  • 2. Set criteria to define total number of “differential expressed genes” = R (p-value<0.05)
  • 3. Total number of genes in pathway that are measured in experiment = n
  • 4. Number of genes changed in the pathway = r

N = 35 genes R = 9 genes n = 12 genes r = 6 genes Z-Score = 2.34033 Pathway X

slide-26
SLIDE 26

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Pathway analysis methods

  • What does the Z-Score tell you?

– Z-Score > 1.96:

  • Significantly more genes are changed in the pathway compared to

the complete data set

– Z-Score = 0:

  • Distribution of changed genes in the pathway is the same as in the

complete data set

– Z-Score < -1.96

  • Significantly less genes are changed in the pathway compared to

the complete data set

26

BUT ...

slide-27
SLIDE 27

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Pathway analysis methods

  • Be aware!

– Overrepresentation analysis and functional class scoring DO NOT take pathway topology into account! – Always manually verify the pathways to make the right conclusions!

27

slide-28
SLIDE 28

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

PathVisio 3.2.0

a tool to edit and analyze biological pathways

29

www.pathvisio.org

slide-29
SLIDE 29

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

PathVisio 3.2.0

  • What can you do with PathVisio?

– Draw your pathways – Visualize your data on pathways – Find pathways regulated in your data set

  • More functionality through plugins

– Plugin repository with 16 plugins and more coming – Many different plugin developers

30

slide-30
SLIDE 30

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

PathVisio application

31

Toolbar: drawing objects, layout, view, visualization Sidepanel: Drawing, Editing, Backpage, Search, Legend Status bar: Loaded databases, data sets Pathway display area

slide-31
SLIDE 31

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

PathVisio walkthrough

32

Load identifier mapping databases Create new pathway Load dataset Create data visualization Find regulated pathways WikiPathways Export pathway image with data 1 Select pathway collection Draw pathways Visualize data on pathway 2 Visualize data 3 Pathway statistics

Share/Upload

slide-32
SLIDE 32

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Load identifier mapping databases

  • Identifier mapping datbases from

– Download bridge files from http://bridgedb.org/data/gene_database/ – Gene products (based on Ensembl) – Metabolites (based on HMDB)

  • Data -> Select Gene/Metabolite Database

33

Check status bar!

slide-33
SLIDE 33

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Identifier mapping

  • Annotation of data nodes and interactions
  • Xref → identifier + data source

34

slide-34
SLIDE 34

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Identifier mapping

How does it work in PathVisio?

35

Annotation ID Mapping

slide-35
SLIDE 35

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Data visualization

  • Load dataset

– Quantitative data – Comma separated file (.csv or .txt) – Identifier column – System code (1-2 letter code) column → datasource (optional)

36

2

slide-36
SLIDE 36

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Data visualization

  • Create visualization

– Basic – one value (e.g. logFC) – Advanced – multiple values (e.g. logFC + p.value)

  • Gradient based visualization

– logFC

  • Rule based visualization

– p.value

37

slide-37
SLIDE 37

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Data visualization

  • Visualize data on

pathway

38

slide-38
SLIDE 38

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Data visualization

  • Export high quality pathway image with data for

publications

  • Plugins:

– HTMLexport plugin – BioPAX plugin

39

slide-39
SLIDE 39

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Pathway statistics

  • Download pathway collection from WikiPathways

– http://www.wikipathways.org/index.php/Download_Pathways – 20 different species

  • Find regulated pathways

– Default method: Z-Score statistics

40

3

slide-40
SLIDE 40

Examples: Pathway Analysis

slide-41
SLIDE 41

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Time series data

42

JR Tisoncik et al. Microbiology and Molecular Biology Reviews (2012)

Acute-phase response during H5N1 virus infection.

slide-42
SLIDE 42

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Tissue expression data

43

DGJ Jennen et al. Drug Discovery Today (2010)

Comparison of expression in different cell types.

slide-43
SLIDE 43

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Multi-omics data visualization

44

I Rubio-Aliaga et al. Physiological Genomics (2011)

Visualization of transcriptomics and metabolomics data together in the pathway.

slide-44
SLIDE 44

BioSB 2015 course: Biological Network Analysis, 17 & 18 September 2015

Nutritional studies

45

M Kutmon et al. Genes & Nutrition (2015)

Cell cycle down- regulation in prostate cancer cells after vitamin D treatment

slide-45
SLIDE 45

Hands on session

http://projects.bigcat.unimaas.nl/biosb2015/material/pathway-analysis/

slide-46
SLIDE 46

Questions?