An Application of Principal Components Analysis in Genetics
Samuel Morrissette April 14, 2020
Samuel Morrissette PCA in Genetics April 14, 2020 1 / 22
An Application of Principal Components Analysis in Genetics Samuel - - PowerPoint PPT Presentation
An Application of Principal Components Analysis in Genetics Samuel Morrissette April 14, 2020 Samuel Morrissette PCA in Genetics April 14, 2020 1 / 22 Background and Terminology 1 Eigenstrat Algorithm and Definitions 2 Results 3
Samuel Morrissette PCA in Genetics April 14, 2020 1 / 22
Samuel Morrissette PCA in Genetics April 14, 2020 2 / 22
Samuel Morrissette PCA in Genetics April 14, 2020 3 / 22
◮ Test for an association between certain genetic variants (alleles) and a
◮ Are frequently conducted through a case-control study.
Samuel Morrissette PCA in Genetics April 14, 2020 4 / 22
◮ Overrepresentation of a population in the case or control group can
Samuel Morrissette PCA in Genetics April 14, 2020 5 / 22
◮ Genomic control and structured association were two of the most
◮ Eigenstrat, proposed by Price et al. in 2006, has since become the
Samuel Morrissette PCA in Genetics April 14, 2020 6 / 22
Samuel Morrissette PCA in Genetics April 14, 2020 7 / 22
1 Apply PCA to random SNPs (preferably unrelated to the candidate
2 Adjust the candidate SNPs and phenotypes of the samples based on
3 Compute a test statistic using adjusted values Samuel Morrissette PCA in Genetics April 14, 2020 8 / 22
◮ May have a geographical interpretation within continents (figure below) Samuel Morrissette PCA in Genetics April 14, 2020 9 / 22
◮ Adjustment corrects for population stratification
Samuel Morrissette PCA in Genetics April 14, 2020 10 / 22
Samuel Morrissette PCA in Genetics April 14, 2020 11 / 22
1
2
3
◮ Armitage trend test statistic (uncorrected for stratification) ◮ Genomic control (corrects for stratification using a uniform inflation
Samuel Morrissette PCA in Genetics April 14, 2020 12 / 22
◮ Fewer spurious associations in non-causal SNPs. ◮ More powerful when detecting true associations at causal SNPs.
Samuel Morrissette PCA in Genetics April 14, 2020 13 / 22
Samuel Morrissette PCA in Genetics April 14, 2020 14 / 22
Samuel Morrissette PCA in Genetics April 14, 2020 15 / 22
−0.10 −0.05 0.00 0.05 0.10 −0.08 −0.04 0.00 0.04
PC1 PC2 Country
Africa France
Samuel Morrissette PCA in Genetics April 14, 2020 16 / 22
−0.10 −0.05 0.00 0.05 0.10 −0.08 −0.04 0.00 0.04
PC1 PC2 Breed
Borgou Zebu Lagunaire NDama Somba Aubrac Bazadais BlondeAquitaine BretPieNoire Charolais Gascon Limousin MaineAnjou Montbeliard Salers
Samuel Morrissette PCA in Genetics April 14, 2020 17 / 22
◮ 0.8 for Africa ◮ 0.2 for France
◮ 100 cases from Africa and 50 from France ◮ 50 controls from Africa and 100 from France Samuel Morrissette PCA in Genetics April 14, 2020 18 / 22
Samuel Morrissette PCA in Genetics April 14, 2020 19 / 22
Samuel Morrissette PCA in Genetics April 14, 2020 20 / 22
◮ Poor designs may result in a loss of power with Eigenstrat Samuel Morrissette PCA in Genetics April 14, 2020 21 / 22
Samuel Morrissette PCA in Genetics April 14, 2020 22 / 22