Using Network Flow to Bridge the Gap Using Network Flow to Bridge - - PowerPoint PPT Presentation

using network flow to bridge the gap using network flow
SMART_READER_LITE
LIVE PREVIEW

Using Network Flow to Bridge the Gap Using Network Flow to Bridge - - PowerPoint PPT Presentation

Using Network Flow to Bridge the Gap Using Network Flow to Bridge the Gap between Genotype and Phenotype Teresa Przytycka NIH / NLM / NCBI NIH / NLM / NCBI Journal Wisla (1902) Picture from a local fare in Lublin Poland from a local


slide-1
SLIDE 1

Using Network Flow to Bridge the Gap Using Network Flow to Bridge the Gap between Genotype and Phenotype Teresa Przytycka NIH / NLM / NCBI NIH / NLM / NCBI

slide-2
SLIDE 2

Journal “Wisla” (1902) Picture from a local fare in Lublin Poland from a local fare in Lublin, Poland

slide-3
SLIDE 3

Phenotypes Genotypes

Journal “Wisla” (1902) Picture from a local fare in Lublin Poland from a local fare in Lublin, Poland

slide-4
SLIDE 4

G 1

Association studies

Genome 1 Genome 2 Genome 3 Genome n

slide-5
SLIDE 5

Genotype: effects of genotypic effects of genotypic variation:

  • change in amino acid
  • change in gene structure
  • copy number variations ….

5

slide-6
SLIDE 6

Genotype: Phenotype (e.g. disease) effects of genotypic effects of genotypic variation:

  • change in amino acid
  • change in gene structure
  • copy number variations ….

6

slide-7
SLIDE 7

G l Goals :

  • A method for system level analysis of propagation of

y y p p g such perturbation in the network

  • Prediction of “causal” mutations
  • Prediction of master regulators (network hubs)

involved in disease

  • Prediction of pathways dys-regulated in disease
slide-8
SLIDE 8

Propagation of the effects of Copy number aberrations in Glioma

Cancer Cases G i d t CNV

Gene 1 Gene 2 Gene 3

Gene expression data . . mosomes . . . chrom I t t d

Gene n

Integrated Protein-protein, protein-DNA phosphorylation network

slide-9
SLIDE 9

Copy number aberrations py

  • r/and mutations

Gene expression

slide-10
SLIDE 10

Copy number aberrations py

  • r/and mutations

Gene expression

Signature genes

slide-11
SLIDE 11

Copy number aberrations py

  • r/and mutations

Signature genes

slide-12
SLIDE 12

Copy number aberrations py

  • r/and mutations

Signature genes

slide-13
SLIDE 13

Method outline Method outline

  • 1. Selecting marker genes to be used as “phenotype”
  • 2. Genotype-phenotype association
  • 3. Uncovering information flow between genotype and

phenotype

  • 4. Inferring dys- regulated, genes, pathways, and causal

mutations

13

slide-14
SLIDE 14

Selecting “phenotype” genes

C C

Gene 1 Gene 2

Cancer Cases Gene expression data

Gene 2 Gene 3

. . . . . target genes

Gene n

slide-15
SLIDE 15

Selecting “phenotype” genes

slide-16
SLIDE 16

Selecting “phenotype” genes

Smallest set of genes so that each case is “covered” at least specified number of times

slide-17
SLIDE 17

Associations between copy number variations and gene expression of selected target genes and gene expression of selected target genes

Cancer Cases Gene expression data Cancer Cases CNV data

17

slide-18
SLIDE 18

Significant correlation between CNV and expression

Cancer Cases

expression

Gene 1 Gene 2 Gene 3

Gene expression da . . . . .

Gene n

18

slide-19
SLIDE 19

Significant correlation between CNV and expression

Cancer Cases

expression

Gene expression da target gene locus

19

slide-20
SLIDE 20

Significant correlation between CNV and expression

Cancer Cases

expression

Gene expression da target gene candidate causal genes candidate causal genes

20

slide-21
SLIDE 21

Uncovering pathways of information flow between CNV and target gene CNV and target gene

Cancer Cases Gene expression da

21

slide-22
SLIDE 22

Using expression to guide path discovery

Cancer Cases Gene expression da

22

slide-23
SLIDE 23

Translating probabilities it resistances

Cancer Cases Gene expression da

23

Resistance - set to favor most likely path -based on gene expression values

(reversely proportional to the average correlation of the expression of the adjacent genes with expression of the target gene)

slide-24
SLIDE 24

Finding subnetworks with significant current flow

Cancer Cases Gene expression da

24

Resistance - set to favor most likely path -based on gene expression values

(reversely proportional to the average correlation of the expression of the adjacent genes with expression of the target gene)

slide-25
SLIDE 25

G l Goals :

  • A method for system level analysis of propagation of

y y p p g such perturbation in the network

  • Prediction of “causal” mutations
  • Identification master regulators (network hubs)

involved in disease

  • Identification pathways dys-regulated in disease
slide-26
SLIDE 26

Putative causal variation

(with lots of additional caveats) Cancer Cases (with lots of additional caveats) Gene expression da

26

Resistance - set to favor most likely path -based on gene expression values

(reversely proportional to the average correlation of the expression of the adjacent genes with expression of the target gene)

slide-27
SLIDE 27

Causal copy number aberrations Causal copy number aberrations

27 27

slide-28
SLIDE 28

G l Goals :

  • A method for system level analysis of propagation of

y y p p g such perturbation in the network

  • Prediction of “causal” mutations
  • Prediction “master regulators” (network hubs) involved

in disease

  • Prediction pathways dys-regulated in disease
slide-29
SLIDE 29

Solve current flow for all pairs and find nodes belonging to many paths g g y p

Cancer Cases Gene expression data Cancer Cases CNV data

29

slide-30
SLIDE 30

Hubs Hubs

30

slide-31
SLIDE 31

G l Goals :

  • A method for system level analysis of propagation of

y y p p g such perturbation in the network

  • Prediction of “causal” mutations
  • Prediction of “master regulators” (network hubs)

involved in disease

  • Prediction of pathways dys-regulated in disease
slide-32
SLIDE 32

Are there common functional pathways?

Cancer Cases Gene expression dat Cancer Cases CNV data

32

slide-33
SLIDE 33

Common GO pathways

33

slide-34
SLIDE 34

G l Goals :

  • A method for system level analysis of propagation of

y y p p g such perturbation in the network

  • Prediction of “causal” mutations
  • Prediction of “master regulators” (network hubs)

involved in disease

  • Prediction of pathways dys-regulated in disease
slide-35
SLIDE 35

Design details under the hood g

  • Current flow reduces to solving a set of linear equations (Kirchhoff's

laws) Caveat: We had to solving a linear system with 20,000 variables thousands of times for permutation test required new methodology

  • Many biological interactions are directional. This can be taken care by

solving linear program with corresponding constraints - Caveat: the network is to big for solving thousands of linear programs network is to big for solving thousands of linear programs

  • Null model and p-value estimations

Kim, Wuchty, Przytycka – PloS Comp Bio 2011

Kim, Przytycki, Wuchty, Przytycka – Phys. Bio. 2011

35

slide-36
SLIDE 36

Acknowledgments

Group members: Yoo-Ah Kim DongYeon Cho Yang Huang Jan Hoinka Xiangjun Du g g Damian Wojtowicz Raheleh Salari

Stefan Wuchty (NCBI)

Collaborators:

Journal “Wisla” (1902) Picture from a local fare in Lublin, Poland

Jozef Przytycki (GWU) Stefan Wuchty (NCBI)

my great-great uncle (the “Giant”)

slide-37
SLIDE 37

37

slide-38
SLIDE 38

Acknowledgments

Group members: Collaborators: Yoo-Ah Kim DongYeon Cho

B i Oli (NIDDK) Stefan Wuchty (NCBI)

Yang Huang

Brian Oliver (NIDDK) John Malone Nicolas Mattiuzzo J ti A d (I di U i it )

Jan Hoinka Xiangjun Du g g

Justin Andrews (Indiana University) Jozef Przytycki (GWU)

Damian Wojtowicz Raheleh Salari

slide-39
SLIDE 39

39

slide-40
SLIDE 40

Impact of gene copy number on gene expression in Drosophila melanogaster expression in Drosophila melanogaster

ge (log2)

  • ld chang

ression fo

  • 1

E i ( ild t ) Exp

40

Expression (wild type) collaboration with Brian Oliver group (NIDDK)

slide-41
SLIDE 41

CNV-related perturbations propagate t h i t ti t k trough interaction network

41

Co-complex network from Artavanis-Tsakonas group (unpublished)

slide-42
SLIDE 42

Impact on copy number on gene i i li expression in glioma

CNV Chromosomes Correlation between CNV and expression

42

slide-43
SLIDE 43

Genotype: effects of genotypic effects of genotypic variation:

  • change in amino acid
  • change in gene structure
  • copy number variations ….

43

slide-44
SLIDE 44

Phenotype Genotype: effects of genotypic effects of genotypic variation:

  • change in amino acid
  • change in gene structure
  • copy number variations ….

44

slide-45
SLIDE 45

Phenotype Genotype: effects of genotypic effects of genotypic variation:

  • change in amino acid

Molecular phenotypes

  • change in gene structure
  • copy number variations ….

phenotypes

  • gene expression
  • Metabolite level

45

slide-46
SLIDE 46

Copy number variations (CNV) (gene dosage)

  • implicated in large number of human diseases (cancer, Crohn's disease,

autism)

(gene dosage)

  • 28,025 structural variants identified in 1000 genome study (2,000 changes

affecting full genes or exons)

  • Frequent type of somatic mutations in cancer
slide-47
SLIDE 47

Phenotype Genotype: Molecular phenotypes phenotypes

  • gene expression
  • Metabolite level

47