Genome-Wide Association Studies
Caitlin Collins, Thibaut Jombart
MRC Centre for Outbreak Analysis and Modelling Imperial College London
Genetic data analysis using 30-10-2014
Genome-Wide Association Studies Caitlin Collins , Thibaut Jombart - - PowerPoint PPT Presentation
Genome-Wide Association Studies Caitlin Collins , Thibaut Jombart MRC Centre for Outbreak Analysis and Modelling Imperial College London Genetic data analysis using 30-10-2014 Outline Introduction to GWAS Study design o GWAS design o
MRC Centre for Outbreak Analysis and Modelling Imperial College London
Genetic data analysis using 30-10-2014
2
3
bases per day per machine
bases per day per machine
degeneration
Genomics & GWAS 4
Genomics & GWAS 5
associated with a phenotype.
Genomics & GWAS 6
between two measured quantities that renders them statistically dependent.
variance explained by genetics
Genomics & GWAS 7
p
SNPs
n
individuals
Cases Controls
Genomics & GWAS 8
Genomics & GWAS 9
10
Study Design 11
Study Design 12
population
Study Design 13
population structure
frequencies btw. sub- populations…
ancestry
Components of PCA
Study Design 14
independent of each other
beneficial
Study Design 15
16
Testing for Association 17
squared test, ANOVA
Testing for Association 18
p
SNPs
n
individuals
Cases Controls
testing
univariate framework
SNPs may be too small
individual SNPs ≠ combined effects
Testing for Association 19
Testing for Association 20
general linear model”
𝑙 = 𝑞𝑙 𝑙! k-way interactions
interactions considered…
Testing for Association 21
𝑗 = 𝑥0 + 𝑥1𝐵𝑗 + 𝑥2𝐶𝑗 +𝒙𝟒𝑩𝒋𝑪𝒋
Testing for Association 22
LASSO penalized regression Ridge regression Neural Networks Penalized Regression Bayesian Approaches Factorial Methods Bayesian Epistasis Association Mapping Logic Trees Modified Logic Regression-Gene Expression Programming Genetic Programming for Association Studies Logic feature selection Monte Carlo Logic Regression Logic regression Supervised-PCA Sparse-PCA DAPC-based FS (snpzip) Bayesian partitioning The elastic net Bayesian Logistic Regression with Stochastic Search Variable Selection Odds-ratio- based MDR Multi-factor dimensionality reduction method Genetic programming
networks Parametric decreasing method Restricted partitioning method Combinatorial partitioning method Random forests Set association approach Non-parametric Methods
feature selection
Testing for Association 23
Testing for Association 24
Testing for Association 25
parameters
Testing for Association 26
Testing for Association 27
Discriminant axis Density of individuals Discriminant axis Density of individuals
a b c d e
Alleles Individuals
0.1 0.2 0.3 0.4 0.5 a b c d e Contribution to Discriminant Axis
Healthy (“controls”) Diseased (“cases”) Testing for Association 28 Discriminant Axis
Discriminant Axis
0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 a b c d e Contribution to Discriminant Axis
Discriminant axis Density of individuals
Testing for Association 30
0.1 0.2 0.3 0.4 0.5 a b c d e Contribution to Discriminant Axis
Hooray!
Testing for Association 31
32
33
34