Latent Variable models for GWAs
Oliver Stegle
Machine Learning and Computational Biology Research Group Max-Planck-Institutes T¨ ubingen, Germany
September 2011
- O. Stegle
Latent variable models for GWAs T¨ ubingen 1
Latent Variable models for GWAs Oliver Stegle Machine Learning and - - PowerPoint PPT Presentation
Latent Variable models for GWAs Oliver Stegle Machine Learning and Computational Biology Research Group Max-Planck-Institutes T ubingen, Germany September 2011 O. Stegle Latent variable models for GWAs T ubingen 1 Motivation Why
Latent variable models for GWAs T¨ ubingen 1
Motivation
◮ Primary variable of
◮ Covariates ◮ Population structure
◮ Sample handling ◮ Sample history ◮ Subtle environmental
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT individuals phenotypes SNPs
y1
y2
yN
Latent variable models for GWAs T¨ ubingen 2
Motivation
◮ Primary variable of
◮ Covariates ◮ Population structure
◮ Sample handling ◮ Sample history ◮ Subtle environmental
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT individuals phenotypes SNPs
y1
y2
yN
Covariates Population
Latent variable models for GWAs T¨ ubingen 2
Motivation
◮ Primary variable of
◮ Covariates ◮ Population structure
◮ Sample handling ◮ Sample history ◮ Subtle environmental
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT individuals phenotypes SNPs
y1
y2
yN
Confounders Covariates Population
Latent variable models for GWAs T¨ ubingen 2
Outline
Latent variable models for GWAs T¨ ubingen 3
Dimension reduction and the Gaussian Process Latent Variable Model (GPLVM)
Latent variable models for GWAs T¨ ubingen 4
Dimension reduction and the Gaussian Process Latent Variable Model (GPLVM)
Latent variable models for GWAs T¨ ubingen 5
Dimension reduction and the Gaussian Process Latent Variable Model (GPLVM)
◮ H: latent factors in low-dimensional space ◮ W: weights for factors on data dimensions ◮ Ψ: noise, ψn,g ∼ N(0, σ2). ◮ Challenge: neither W nor H known!
◮ Principle component analysis (PCA) ◮ Independent component analysis (ICA) ◮ ...
Latent variable models for GWAs T¨ ubingen 6
Dimension reduction and the Gaussian Process Latent Variable Model (GPLVM)
◮ H: latent factors in low-dimensional space ◮ W: weights for factors on data dimensions ◮ Ψ: noise, ψn,g ∼ N(0, σ2). ◮ Challenge: neither W nor H known!
◮ Principle component analysis (PCA) ◮ Independent component analysis (ICA) ◮ ...
Latent variable models for GWAs T¨ ubingen 6
Dimension reduction and the Gaussian Process Latent Variable Model (GPLVM)
◮ H: latent factors in low-dimensional space ◮ W: weights for factors on data dimensions ◮ Ψ: noise, ψn,g ∼ N(0, σ2). ◮ Challenge: neither W nor H known!
◮ Principle component analysis (PCA) ◮ Independent component analysis (ICA) ◮ ...
Latent variable models for GWAs T¨ ubingen 6
Dimension reduction and the Gaussian Process Latent Variable Model (GPLVM)
◮ Covariance matrix: C = YYT ◮ Eigenvalue/Eigen vectors Cvi = λivi ◮ Projection matrix P = [v1, . . . , vK] ◮ Principle components Hn = P · Yn.
Latent variable models for GWAs T¨ ubingen 7
Dimension reduction and the Gaussian Process Latent Variable Model (GPLVM)
◮ Covariance matrix: C = YYT ◮ Eigenvalue/Eigen vectors Cvi = λivi ◮ Projection matrix P = [v1, . . . , vK] ◮ Principle components Hn = P · Yn.
Latent variable models for GWAs T¨ ubingen 7
Dimension reduction and the Gaussian Process Latent Variable Model (GPLVM)
◮ Covariance matrix: C = YYT ◮ Eigenvalue/Eigen vectors Cvi = λivi ◮ Projection matrix P = [v1, . . . , vK] ◮ Principle components Hn = P · Yn.
Latent variable models for GWAs T¨ ubingen 7
Dimension reduction and the Gaussian Process Latent Variable Model (GPLVM)
◮ Covariance matrix: C = YYT ◮ Eigenvalue/Eigen vectors Cvi = λivi ◮ Projection matrix P = [v1, . . . , vK] ◮ Principle components Hn = P · Yn.
Latent variable models for GWAs T¨ ubingen 7
Dimension reduction and the Gaussian Process Latent Variable Model (GPLVM)
◮ Covariance matrix: C = YYT ◮ Eigenvalue/Eigen vectors Cvi = λivi ◮ Projection matrix P = [v1, . . . , vK] ◮ Principle components Hn = P · Yn.
Latent variable models for GWAs T¨ ubingen 7
Dimension reduction and the Gaussian Process Latent Variable Model (GPLVM)
p(Y|H, W) =
N
N
N
N
hI
N
N
hWWT + σ2I
p(Y|H, W) =
G
N
G
N
hI
G
N y:,g
hHHT
+σ2I
[Lawrence, 2005]
Latent variable models for GWAs T¨ ubingen 8
Dimension reduction and the Gaussian Process Latent Variable Model (GPLVM)
p(Y|H, W) =
N
N
N
N
hI
N
N
hWWT + σ2I
p(Y|H, W) =
G
N
G
N
hI
G
N y:,g
hHHT
+σ2I
[Lawrence, 2005]
Latent variable models for GWAs T¨ ubingen 8
Dimension reduction and the Gaussian Process Latent Variable Model (GPLVM)
Latent variable models for GWAs T¨ ubingen 9
Dimension reduction and the Gaussian Process Latent Variable Model (GPLVM)
Latent variable models for GWAs T¨ ubingen 9
Dimension reduction and the Gaussian Process Latent Variable Model (GPLVM)
Latent variable models for GWAs T¨ ubingen 10
Dimension reduction and the Gaussian Process Latent Variable Model (GPLVM)
Latent variable models for GWAs T¨ ubingen 10
Dimension reduction and the Gaussian Process Latent Variable Model (GPLVM)
Latent variable models for GWAs T¨ ubingen 10
Modeling hidden confounders in GWAs
Latent variable models for GWAs T¨ ubingen 11
Modeling hidden confounders in GWAs
◮ Experimental procedures ◮ Gene regulation ◮ (Translation)
◮ Sample preparation ◮ Sample history
Phenome Genome
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT individuals phenotypes SNPs
y1
y2
yN
Confounders Covariates Population
Latent variable models for GWAs T¨ ubingen 12
Modeling hidden confounders in GWAs
◮ Experimental procedures ◮ Gene regulation ◮ (Translation)
◮ Sample preparation ◮ Sample history
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT SNPs
mRNA proteins
translation gene regulation transcription
alternative splicing transcription factors small RNAs
DNA
external influenes
experimental procedures environment
Latent variable models for GWAs T¨ ubingen 12
Modeling hidden confounders in GWAs
◮ Experimental procedures ◮ Gene regulation ◮ (Translation)
◮ Sample preparation ◮ Sample history
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT SNPs
mRNA proteins
translation gene regulation transcription
alternative splicing transcription factors small RNAs
DNA
external influenes
experimental procedures environment
Latent variable models for GWAs T¨ ubingen 12
Modeling hidden confounders in GWAs
◮ Experimental procedures ◮ Gene regulation ◮ (Translation)
◮ Sample preparation ◮ Sample history
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT SNPs
mRNA proteins
translation gene regulation transcription
alternative splicing transcription factors small RNAs
DNA
external influenes
experimental procedures environment
Latent variable models for GWAs T¨ ubingen 12
Modeling hidden confounders in GWAs
(a) Standard (b) VBFA accounting for hidden factors
5 10 15 eQTL LOD SLC35B4
0.01% FPR
0.01% FPR 1.3354 1.3356 1.3358 1.336 1.3362 1.3364 1.3366 1.3368 1.337 1.3372 1.3374 x 10
8
5 10 15 VBFA eQTL LOD Position in chr. 7 SLC35B4
0.01% FPR
0.01% FPR SLC35B4
Latent variable models for GWAs T¨ ubingen 13
Modeling hidden confounders in GWAs Model
Phenome Genome
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT individuals phenotypes SNPs
y1
y2
yN
Covariates Population
genetic
Latent variable models for GWAs T¨ ubingen 14
Modeling hidden confounders in GWAs Model
Phenome Genome
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT individuals phenotypes SNPs
y1
y2
yN
Confounders Covariates Population
genetic
K
Latent variable models for GWAs T¨ ubingen 14
Modeling hidden confounders in GWAs Model
Phenome Genome
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT individuals phenotypes SNPs
y1
y2
yN
Confounders Covariates Population
genetic
K
Latent variable models for GWAs T¨ ubingen 14
Modeling hidden confounders in GWAs Model
G
K
eI
G
h K
k + σ2 eI
Latent variable models for GWAs T¨ ubingen 15
Modeling hidden confounders in GWAs Model
G
K
eI
G
h K
k + σ2 eI
Latent variable models for GWAs T¨ ubingen 15
Modeling hidden confounders in GWAs Model
G
K
eI
G
h K
k + σ2 eI
Latent variable models for GWAs T¨ ubingen 15
Modeling hidden confounders in GWAs Model
Latent variable models for GWAs T¨ ubingen 16
Modeling hidden confounders in GWAs Model
Latent variable models for GWAs T¨ ubingen 16
Modeling hidden confounders in GWAs Model
Latent variable models for GWAs T¨ ubingen 16
Modeling hidden confounders in GWAs Model
Latent variable models for GWAs T¨ ubingen 16
Modeling hidden confounders in GWAs Model
ˆ H, ˆ σe, ˆ σg, ˆ σe
Latent variable models for GWAs T¨ ubingen 17
Modeling hidden confounders in GWAs Model
ˆ H, ˆ σe, ˆ σg, ˆ σe
Latent variable models for GWAs T¨ ubingen 17
Modeling hidden confounders in GWAs Applications
gene: YJL213W
ATGACCTGAAACTGGGCGGATGACGTGGAACGGTATGACCTGAAACTGGGGGACTGACGTGGAACGGTATGACCTGAAACT
Promoter
gene: YJL213W
ATGA..................ACTGGGCGGATGACGTGGAACGGTATGACCTGAAACTGGGGGACTGACGTGGAACGGTATGACCTGAAACT
Promoter
Latent variable models for GWAs T¨ ubingen 18
Modeling hidden confounders in GWAs Applications
gene: YJL213W
ATGACCTGAAACTGGGCGGATGACGTGGAACGGTATGACCTGAAACTGGGGGACTGACGTGGAACGGTATGACCTGAAACT
Promoter
gene: YJL213W
ATGA..................ACTGGGCGGATGACGTGGAACGGTATGACCTGAAACTGGGGGACTGACGTGGAACGGTATGACCTGAAACT
Promoter
(a) Y east cis eQT L s (b) Y east tr ans eQT L s
Latent variable models for GWAs T¨ ubingen 18
Modeling hidden confounders in GWAs Applications
Latent variable models for GWAs T¨ ubingen 19
Modeling hidden confounders in GWAs Applications
(f) cis V B eQT L location and strength relative to gene start (a) P robes with a V B eQT L in pooled population
Latent variable models for GWAs T¨ ubingen 19
Modeling unobserved cellular phenotypes in genetic analyses
Latent variable models for GWAs T¨ ubingen 20
Modeling unobserved cellular phenotypes in genetic analyses Model
◮ Accounting for hidden
yg,j =
genetic
vgfj
known factors
+ wgxj
hidden factors
+ ψn,g
noise
.
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT SNPs
mRNA proteins
translation gene regulation transcription
alternative splicing transcription factors small RNAs
DNA
external influenes
experimental procedures environment
Latent variable models for GWAs T¨ ubingen 21
Modeling unobserved cellular phenotypes in genetic analyses Model
◮ Accounting for hidden
yg,j =
genetic
vgfj
known factors
+ wgxj
hidden factors
+ ψn,g
noise
.
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT SNPs
mRNA proteins
translation gene regulation transcription
alternative splicing transcription factors small RNAs
DNA
external influenes
experimental procedures environment
Latent variable models for GWAs T¨ ubingen 21
Modeling unobserved cellular phenotypes in genetic analyses Model
◮ Transcription factors ◮ Pathway components
◮ Difficult and expensive.
mRNA proteins
translation transcription
DNA
gene regulation
alternative splicing small RNAs
external influenes
experimental procedures environment
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT SNPs
[Parts et al., 2011]
Latent variable models for GWAs T¨ ubingen 22
Modeling unobserved cellular phenotypes in genetic analyses Model
◮ Transcription factors ◮ Pathway components
◮ Difficult and expensive.
mRNA proteins
translation transcription
DNA
gene regulation
alternative splicing small RNAs
external influenes
experimental procedures environment
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT SNPs
[Parts et al., 2011]
Latent variable models for GWAs T¨ ubingen 22
Modeling unobserved cellular phenotypes in genetic analyses Model
◮ Transcription factors ◮ Pathway components
◮ Difficult and expensive.
mRNA proteins
translation transcription
DNA
gene regulation
alternative splicing small RNAs
external influenes
experimental procedures environment
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT SNPs
[Parts et al., 2011]
Latent variable models for GWAs T¨ ubingen 22
Modeling unobserved cellular phenotypes in genetic analyses Model
◮ W is sparse; each factor
◮ Transcription factor binding
mRNA proteins
translation transcription
DNA
gene regulation
alternative splicing small RNAs
external influenes
experimental procedures environment
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT SNPs
Latent variable models for GWAs T¨ ubingen 23
Modeling unobserved cellular phenotypes in genetic analyses Model
◮ W is sparse; each factor
◮ Transcription factor binding
gene: YJL213W TF: PHO4
ATGACCTGAAACTGGGCGGATGACGTGGAACGGTATGACCTGAAACTGGGGGACTGACGTGGAACGGTATGACCTGAAACT
Promoter
Latent variable models for GWAs T¨ ubingen 23
Modeling unobserved cellular phenotypes in genetic analyses Model
◮ W is sparse; each factor
◮ Transcription factor binding
G e n e s ( 5 4 9 3 ) PHO4 Segregants (218) Segregants (218) Known targets from Yeastract
PHO4 factor activation
Gene expression
Latent variable models for GWAs T¨ ubingen 23
Modeling unobserved cellular phenotypes in genetic analyses Model
1
Latent variable models for GWAs T¨ ubingen 24
Modeling unobserved cellular phenotypes in genetic analyses Model
1
Latent variable models for GWAs T¨ ubingen 24
Modeling unobserved cellular phenotypes in genetic analyses Model
1
Latent variable models for GWAs T¨ ubingen 24
Modeling unobserved cellular phenotypes in genetic analyses Model
Latent variable models for GWAs T¨ ubingen 25
Modeling unobserved cellular phenotypes in genetic analyses Model
Latent variable models for GWAs T¨ ubingen 25
Modeling unobserved cellular phenotypes in genetic analyses Model
Stegle et al., in preparation
Latent variable models for GWAs T¨ ubingen 26
Modeling unobserved cellular phenotypes in genetic analyses Model
Stegle et al., in preparation
Latent variable models for GWAs T¨ ubingen 26
Modeling unobserved cellular phenotypes in genetic analyses Applications
◮ TF binding affinities (Yeastract). ◮ Pathway information (KEGG).
[Smith and Kruglyak, 2008]
Latent variable models for GWAs T¨ ubingen 27
Modeling unobserved cellular phenotypes in genetic analyses Applications
mRNA proteins
translation transcription
DNA
gene regulation
alternative splicing small RNAs
external influenes
experimental procedures environment
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT SNPs
Latent variable models for GWAs T¨ ubingen 28
Modeling unobserved cellular phenotypes in genetic analyses Applications
mRNA proteins
translation transcription
DNA
gene regulation
alternative splicing small RNAs
external influenes
experimental procedures environment
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT SNPs
Latent variable models for GWAs T¨ ubingen 28
Modeling unobserved cellular phenotypes in genetic analyses Applications
mRNA proteins
translation transcription
DNA
gene regulation
alternative splicing small RNAs
external influenes
experimental procedures environment
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT SNPs
Latent variable models for GWAs T¨ ubingen 28
Modeling unobserved cellular phenotypes in genetic analyses Applications
mRNA proteins
translation transcription
DNA
gene regulation
alternative splicing small RNAs
external influenes
experimental procedures environment
ATGACCTGAAACTGGGGGACTGACGTGGAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGCAACTGGGGGACTGACGTGCAACGGT ATGACCTGAAACTGGGGGATTGACGTGGAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT ATGACCTGCAACTGGGGGATTGACGTGCAACGGT SNPs
Latent variable models for GWAs T¨ ubingen 28
Modeling unobserved cellular phenotypes in genetic analyses Applications
1 10
# of associations
Yeastract KEGG Freeform I II III IV V VI VII VIII IX X XI XII XIII XIV XV XVI
Chromosomal position
1 10 100
# of associations
Trans genes
Latent variable models for GWAs T¨ ubingen 29
Modeling unobserved cellular phenotypes in genetic analyses Applications
−1.0 −0.5 0.0 0.5 PHO4 activation 5 10 15 20 25 Segregant count Association LOD = 21.5 (PHO84 locus) BY RM
Genes (5493) PHO4 Segregants (218) Segregants (218) Segregants (218) PHO4 factor activation Growth condition Genotype Known targets from Yeastract (a) (b) (c) PHO4 factor activation
−1.0 −0.5 0.0 0.5 PHO4 activation −4 −3 −2 −1 1 2 3 4 YJL213W expression BY Glu RM Glu BY Eth RM Eth Interaction LOD = 35.0 (PHO84 locus)
Factor activation / gene expression low high Genotype Growth condition BY RM Eth Glu
Legend: Segregating loci (2956) Segregants (218)
Genes (5493)
Segregating loci (2956)
Gene expression Genotype Gene expression
Latent variable models for GWAs T¨ ubingen 30
Modeling unobserved cellular phenotypes in genetic analyses Applications
(a) YAP1-IRA2 interaction (b) PHO4-PHO84 interaction (c) MAT-YOX1 interaction
Latent variable models for GWAs T¨ ubingen 30
Modeling unobserved cellular phenotypes in genetic analyses Applications
I II III IV V VI VII VIII IX X XI XII XIII XIV XV XVI
Chromosomal position
100 500 1000 1500
# of interacting genes
OAF1 IRA2 CIN5 PHO84 HAP1 MKT1 AMN1 Environment Yeastract KEGG Freeform
Latent variable models for GWAs T¨ ubingen 30
A unifying view
Latent variable models for GWAs T¨ ubingen 31
A unifying view
confounders hidden known
1 2 3 4 samples g e n e s S
Latent variable models for GWAs T¨ ubingen 32
A unifying view
samples genes S
Latent variable models for GWAs T¨ ubingen 32
A unifying view
Latent variable models for GWAs T¨ ubingen 32
A unifying view
s a m p l e s genes S
Latent variable models for GWAs T¨ ubingen 33
A unifying view
s a m p l e s genes S
20 40 60 80 100 sampless 20 40 60 80 100 sampless
Latent variable models for GWAs T¨ ubingen 33
A unifying view
s a m p l e s genes S
20 40 60 80 100 sampless 20 40 60 80 100 sampless
10 20 30 40 50 genes 10 20 30 40 50 genes
Latent variable models for GWAs T¨ ubingen 33
A unifying view
s a m p l e s genes S
20 40 60 80 100 sampless 20 40 60 80 100 sampless
10 20 30 40 50 genes 10 20 30 40 50 genes
Latent variable models for GWAs T¨ ubingen 33
A unifying view
Latent variable models for GWAs T¨ ubingen 34
A unifying view
Latent variable models for GWAs T¨ ubingen 34
A unifying view
0.0 0.2 0.4 0.6 0.8 1.0 Recall 0.0 0.2 0.4 0.6 0.8 1.0 Precision GLasso Kronecker GLasso Ideal GLasso
Latent variable models for GWAs T¨ ubingen 35
A unifying view
0.0 0.2 0.4 0.6 0.8 1.0 Recall 0.0 0.2 0.4 0.6 0.8 1.0 Precision GLasso Kronecker GLasso Ideal GLasso
Latent variable models for GWAs T¨ ubingen 35
Summary
Latent variable models for GWAs T¨ ubingen 36
Summary
Latent variable models for GWAs T¨ ubingen 37
Summary
Latent variable models for GWAs T¨ ubingen 38