Detecting Pathological Pathways of the Chronic Fatigue Syndrome by - - PowerPoint PPT Presentation

detecting pathological pathways of the chronic fatigue
SMART_READER_LITE
LIVE PREVIEW

Detecting Pathological Pathways of the Chronic Fatigue Syndrome by - - PowerPoint PPT Presentation

Introduction Results so far Hypothesis about CFS Approach Results Detecting Pathological Pathways of the Chronic Fatigue Syndrome by the Comparison of Networks Frank Emmert-Streib 1 Earl F. Glynn 1 Christopher Seidel 1 Christoph L. Bausch 1


slide-1
SLIDE 1

Introduction Results so far Hypothesis about CFS Approach Results

Detecting Pathological Pathways of the Chronic Fatigue Syndrome by the Comparison of Networks

Frank Emmert-Streib1 Earl F. Glynn1 Christopher Seidel1 Christoph L. Bausch1 Arcady Mushegian1,2

1Stowers Institute for Medical Research 2University of Kansas School of Medicine

7th June 2006

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-2
SLIDE 2

Introduction Results so far Hypothesis about CFS Approach Results

Outline

1

Introduction Properties of CFS

2

Results so far

3

Hypothesis about CFS Pragmatic definitions

4

Approach Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

5

Results Biological processes used in our analysis Network comparison

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-3
SLIDE 3

Introduction Results so far Hypothesis about CFS Approach Results Properties of CFS

CFS has no diagnostic clinical signs or laboratory abnormalities CFS is defined by symptoms and disability It is unclear if CFS represents single disease

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-4
SLIDE 4

Introduction Results so far Hypothesis about CFS Approach Results Properties of CFS

CFS has no diagnostic clinical signs or laboratory abnormalities CFS is defined by symptoms and disability It is unclear if CFS represents single disease

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-5
SLIDE 5

Introduction Results so far Hypothesis about CFS Approach Results Properties of CFS

CFS has no diagnostic clinical signs or laboratory abnormalities CFS is defined by symptoms and disability It is unclear if CFS represents single disease

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-6
SLIDE 6

Introduction Results so far Hypothesis about CFS Approach Results

Characterize (define) CFS by clinical data + questionnaire microarray + clinical data = ⇒ (classify patients by clinical data, clustering, differentially expressed genes) heterogeneous illness & fundamental metabolic perturbations WHISTLER et al. 2003

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-7
SLIDE 7

Introduction Results so far Hypothesis about CFS Approach Results Pragmatic definitions

Hypothesis

Pathways are important rather than ’genes’ = ⇒ differentially expressed pathways, M. XIONG 2004

Questions

1

How to define pathways?

2

How to identify pathways?

3

How to compare pathways?

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-8
SLIDE 8

Introduction Results so far Hypothesis about CFS Approach Results Pragmatic definitions

Definition

A pathway (directed graph) is an interconnected group of genes (variables) that regulates a biological process

Definition

A biological process is (hierarchically) defined by GO (gene ontology) terms

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-9
SLIDE 9

Introduction Results so far Hypothesis about CFS Approach Results Pragmatic definitions

Definition

A pathway (directed graph) is an interconnected group of genes (variables) that regulates a biological process

Definition

A biological process is (hierarchically) defined by GO (gene ontology) terms

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-10
SLIDE 10

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

Used data

Clinical Data (questionnaire + blood) = ⇒ classify patients Gene Expression (peripheral blood mononuclear cells) GO database = ⇒ classify genes = ⇒ reconstruct quasi-pathways (biological subprocesses)

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-11
SLIDE 11

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

Why quasi-pathways?

Central Dogma of Molecular Biology

DNA - CHIP-chip RNA - microarray Protein - proteomics Only partial information is used (available) to reconstruct the network

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-12
SLIDE 12

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

Why quasi-pathways?

Central Dogma of Molecular Biology

DNA - CHIP-chip RNA - microarray Protein - proteomics Only partial information is used (available) to reconstruct the network

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-13
SLIDE 13

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

Assumption

Patients participating are ’fair’

Result

Two groups of patients (classification)

1

non-sick

2

sick (chronic fatigue syndrome)

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-14
SLIDE 14

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

Assumption

GO database is correct (mega experiment)

Result

N groups of genes for N different biological processes (classification)

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-15
SLIDE 15

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

GO is a hierarchical database

molecular function (7460) cellular component (1533) biological process (9384) 18377 GO terms

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-16
SLIDE 16

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

Examples of biological (sub)processes:

regulation of cell cycle, GO:0000074 DNA repair, GO:0006281 circadian rhythm, GO:0007623 endocytosis, GO:0006897 ATP metabolism, GO:0046034

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-17
SLIDE 17

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

Examples of biological (sub)processes:

regulation of cell cycle, GO:0000074, 791 DNA repair, GO:0006281, 538 circadian rhythm, GO:0007623, 44 endocytosis, GO:0006897, 225 ATP metabolism, GO:0046034, 14

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-18
SLIDE 18

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

Expected disorder in biological processes

immune cell activation , GO:0045321, 36 positive regulation of apoptosis, GO:0043065, 42 positive regulation of transcription, GO:0045941, 101 circadian rhythm, GO:0007623, 44

Expected order in biological processes

housekeeping pathways, ???

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-19
SLIDE 19

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

correlation ρAC ↑ = ⇒ edge between A and B temporal ordering = ⇒ direction

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-20
SLIDE 20

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

correlation ρAC ↑ = ⇒ edge between A and B temporal ordering = ⇒ direction

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-21
SLIDE 21

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

correlation does not imply causality: ρAC ↑ partial correlation of first order: ρAC.B ↓

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-22
SLIDE 22

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

correlation does not imply causality: ρAC ↑ partial correlation of first order: ρAC.Bi ↑ partial correlation of higher order: ρAC.{Bi} ↓ (parallel pathways)

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-23
SLIDE 23

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

correlation does not imply causality: ρAC ↑ partial correlation of first order: ρAC.Bi ↑ partial correlation of higher order: ρAC.{Bi} ↓

Example

N = 50, n = |{Bi}| = 8 N n

  • ∼ 108

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-24
SLIDE 24

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

d-separation

x

_ _ _ _ _ y|{Bi} ⇐

⇒ ρxy.{Bi} = 0 (1) VERMA et al. 1988, PEARL 1988, GEIGER et al. 1990, SPIRTES et al. 1998

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-25
SLIDE 25

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

variance σx = E[(X − µx)2] (2) covariance σxy = E[(X − µx)(Y − µy)] (3) Pearson correlation ρxy = σxy

  • σ2

xσ2 y

(4) partial Pearson correlation ρxy|z = ρxy − ρxzρyz

  • (1 − ρ2

xz)(1 − ρ2 yz)

(5)

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-26
SLIDE 26

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

Definition (Undirected dependency graph (UDG) of first order)

An UDG G of first order is an undirected, unweighted graph with N nodes (number of genes) that is obtained via the following procedure:

1

connect all nodes with an edge with each other

2

calculate the correlation between all profiles xi

3

delete all edges connecting node xi with xj if rxixj < Θc

4

calculate the partial correlation of first order for all triplets of nodes (xi, xj, xk) that have an edge between xi and xj

5

delete all edges connecting node xi with xj if rxixj|xk < Θpc similar to PC-algorithm SPIRTES et al. 1991

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-27
SLIDE 27

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

Graph Edit Distance

Minimal number of edge deletions/insertions to transform graph G1 to G2 Quasi-pathways:

compare only sick vs non-sick pathways = ⇒ same number of genes nodes are labeled (genes)

BUNKE 1997

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-28
SLIDE 28

Introduction Results so far Hypothesis about CFS Approach Results Quasi-pathway Quasi-pathway Classify patients Classify genes Inferring causality Network Comparison

5 . =

  • 0.00

0.02 0.04 0.06 0.08 0.10 0.12 0.14

graph edit distance (%)

0.000 0.005 0.010 0.015 0.020 0.025 0.030 Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-29
SLIDE 29

Introduction Results so far Hypothesis about CFS Approach Results Biological processes used in our analysis Network comparison

GO term name GO:0006471 protein amino acid ADP-ribosylation (31) GO:0007219 Notch signaling pathway (28) GO:0008360 regulation of cell shape (22) GO:0042157 lipoprotein metabolism (20) GO:0007126 meiosis (36) GO:0006958 complement activation, classical pathway (30) GO:0007222 frizzled signaling pathway (19) GO:0006633 fatty acid biosynthesis (37) GO:0043087 regulation of GTPase activity (40) GO:0042742 defense response to bacteria (32) GO:0001525 angiogenesis (45) GO:0006493 protein amino acid O-linked glycosylation (25)

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-30
SLIDE 30

Introduction Results so far Hypothesis about CFS Approach Results Biological processes used in our analysis Network comparison

GO:0006471 GO:0007219 GO:0006958 GO:0006633 GO:0008360 GO:0042157 GO:0042742 GO:0001525 GO:0043087 GO:0006493 GO:0007126 GO:0007222

0.00 0.02 0.04 0.06 0.08 0.10 0.12 0.14

GO:0006471 protein amino acid ADP-ribosylation GO:0007219 Notch signaling pathway

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-31
SLIDE 31

Introduction Results so far Hypothesis about CFS Approach Results Biological processes used in our analysis Network comparison

Summary

gene network represents biological (sub)process (pathway) comparison between normal (non-sick) and perturbed (sick)

  • rganism is reduced to the comparison between networks

representing the corresponding biological processes conceptual generalization of differentially expressed genes to ’differentially’ expressed biological processes (quasi-gene networks, M. XIONG et al. 2004) predicted pathways involved in CFS: GO:0006471 protein amino acid ADP-ribosylation GO:0007219 Notch signaling pathway

Frank Emmert-Streib Detecting pathological Pathways of the CFS

slide-32
SLIDE 32

Introduction Results so far Hypothesis about CFS Approach Results Biological processes used in our analysis Network comparison

Acknowledgments

Malcolm Cook Bioinformatics Stowers Institute for Medical Research, USA Matthias Dehmer Center for Integrative Bioinformatics Max F . Perutz Laboratories, Austria Galina V. Glazko Department of Biostatistics and Computational Biology University of Rochester, USA Daniel Thomasset Bioinformatics Stowers Institute for Medical Research, USA

Frank Emmert-Streib Detecting pathological Pathways of the CFS