MENDELIAN RANDOMIZATION
Maria Carolina Borges
Research Fellow MRC Integrative Epidemiology Unit University of Bristol UK
MENDELIAN RANDOMIZATION Maria Carolina Borges Research Fellow MRC - - PowerPoint PPT Presentation
MENDELIAN RANDOMIZATION Maria Carolina Borges Research Fellow MRC Integrative Epidemiology Unit University of Bristol UK Outline Motivation Assumptions One-sample MR Two-sample MR Recent extensions MR-Base MOTIVATION
Maria Carolina Borges
Research Fellow MRC Integrative Epidemiology Unit University of Bristol UK
4
Confounding Bias
Reverse causation
6 N pair-wise associations Expected (P ≤ 0.01) Observed (P ≤ 0.01) P value for
expected 96 (non- genetic) traits 4560 45.6 (1%) 2036 (44.6%) < 0.000001 23 SNPs 253 2.5 (1%) 4 (1.6%) 0.33 96 traits x 23 SNPs 2208 22.1 (1%) 27 (1.1%) 0.29
Bone Marrow Transplant. 1991;7 Suppl 3:9-12. Lancet 1986;i:507–08
Gregor Mendel (1822–1884): “the behavior of each pair of differentiating characteristics in hybrid union is independent of the other differences between the two original plants, and, further, the hybrid produces just so many kinds of egg and pollen cells as there are possible constant combination forms...”
Inheritance of one trait is independent
Randomization
Mendelian randomization Randomized controlled trial
Random segregation of alleles Random allocation Exposed: Allele A Control: Other allele Control: No intervention Exposed: Intervention Outcomes compared between groups Outcomes compared between groups Confounders equal between groups Confounders equal between groups
Adapted from Ebrahim, Davey-Smith, 2008
Z: genetic instrument X: exposure Y: outcome U: confounder
Z: rs12345 X: LDLc Y: CHD U Z → X: 0.05 mmol/L of LDLc per T allele Z → Y: 0.03 log odds CHD per T allele X → Y: 0.03/0.05=0.6 log odds CHD per 1 mmol/L of LDLc (OR=1.82)
Instrumental variable: randomization to HMGCR variant (rs12345)
U X Y Z
IV1 IV2 IV3 Z: genetic IV X: exposure Y: outcome U: confounder
U X Y Z
IV1 Z: genetic IV X: exposure Y: outcome U: confounder
genotypic subgroups
(two-sample MR with no sample overlap)
Burgess et al., 2011; 2016
U X Y Z
IV2 Z: genetic IV X: exposure Y: outcome U: confounder
Balding, 2006
U X Y Z
U: genetic ancestry
U X Y Z
IV3 Z: genetic IV X: exposure Y: outcome U: confounder
VanderWeele et al. 2014
Phenotype Abbreviation Neurological phenotypes Alzheimer disease AD Migraine MIGR Parkinson disease PD Photic sneeze reflex PS Schizophrenia SCZ Anthropometric and social traits Beighton hypermobility BHM Breast size CUP Body mass index BMI Bone mineral density (femoral neck) FNBMD Bone mineral density (lumbar spine) LSBMD Chin dimples DIMP Educational attainment EDU Height HEIGHT Male-pattern baldness MPB Nearsightedness NST Nose size NOSE Waist–hip ratio WHR Unibrow UB Immune-related traits Any allergies ALL Asthma ATH Childhood ear infections CEI Crohn's disease CD Hypothyroidism HTHY Rheumatoid arthritis RA Tonsillectomy TS Ulcerative colitis UC Metabolic phenotypes Age at menarche AAM Age at menarche (23andMe) AAM (23) Age at voice drop AVD Coronary artery disease CAD Type 2 diabetes T2D Fasting glucose FG Low-density lipoproteins LDL High-density lipoproteins HDL Triglycerides TG Total cholesterol TC Hematopoietic traits Hemoglobin HB Mean cell hemoglobin concentration MCHC Mean red blood cell volume MCV Packed red blood cell volume PCV Red blood cell count RBC Platelet count PLT Mean platelet volume MPV
Pickrell et al., 2016; Visscher, Yang, 2016
✓ Vertical pleiotropy
(mediation) X Horizontal pleiotropy
Definition
Genotypes (Z), exposure (X), and outcome (Y) available from individuals in the same sample
Z X Y U
βZX βZY
One-sample MR
SNPs with well understood functions….
26
Burgess et al., 2011; Davies et al., 2013
IV1 IV2 IV3
Common one-sample MR estimator
Outcome Exposure Instruments (37 SNPs) Overidentification test
Instrument strength Endogeneity test
Polygenic score
SNP-exposure association SNP-outcome association Effect of exposure on outcome Underlying population
Definition
SNP-exposure and SNP-outcome association estimates from two independent samples from the same underlying population
Z X Y U
βZX βZY
One-sample MR Two-sample MR
Hartwig et al., 2017
same direction
Hartwig et al., 2017
Burgess et al., 2016 N: sample size K: number of Ivs R2: variance of X by IVs SD: standard deviation α: SNP-exposure association in SD units MAF: minor allele frequency
For each IV*:
*If multiple & independent SNPs are available, R2 can be added up to calculate F statistics
Instrument strength IV1
Explore presence of horizontal pleiotropy IV3
rs1 rs2 rs3 rs4 rs5 rs6 Causal estimate (βIV = βzy /βzx) 1
Del Greco et al., 2015; Burgess et al., 2017
Substantial heterogeneity indicates that either modelling or IV assumptions are violated
m: number of IVs
Cochran’s Q statistic
Causal estimate (βxy) Precision (1/SE) Causal estimate (βxy) Precision (1/SE)
Funnel plot symmetric: Balanced pleiotropy (IVW OK) Funnel plot asymmetric: Directional pleiotropy (IVW biased)
coefficients
Assumption: no invalid instruments
rs1 rs2 rs3 rs4 rs5 rs6 Causal estimate (βIV = βzy /βzx) 1
Del Greco et al., 2015; Burgess et al., 2017
Assumption: no invalid instruments
βzy βzx βIVW
regress βzy ~ βzx [weigths=1/seβzy^2] *** *** With intercept constraint to be zero Assumption: no invalid instruments
≠ assumptions)
IVW OK MR-Egger OK IVW biased MR-Egger OK βzy βzy βzx βzx βIVW βIVW βEgger βEgger
regress βzy βzx [aw=1/seβzy^2]
αEgger αEgger=0
αEgger → non-zero estimate is evidence for directional pleiotropy βEgger → causal effect estimate adjusted for directional pleiotropy
Bowden et al., 2015, 2016; Burgess, Thompson, 2017
exposure effects should NOT correlate with the horizontal pleiotropic effects
relatively homogeneous
SNP-outcome effects flipped accordingly)
Bowden et al., 2016; Burgess, Thompson, 2017
Bowden et al., 2016
βzy βzx βzy βzx
50% or more of the total weight is valid)
Hypothetical example – finite sample Hypothetical example – infinite sample
βzy βzx βzy βzx
Hypothetical example: Truth=MBE Hypothetical example: Truth≠MBE
Hartwig et al., 2017
Zheng et al., 2017
Multivariable MR Factorial MR Two-step MR (mediation) Bidirectional MR
Web server Web-based API GWAS database
Information on two-sample MR R package: https://github.com/MRCIEU/TwoSampleMR
from GWAS
Motivation
11.
randomized genes: a fundamental distinction between conventional and genetic epidemiology. PLoS
understanding environmental determinants of disease? Int J Epidemiol. 2003;32(1):1-22.
failures of observational epidemiology? Hum. Genet. 2008;123:15–33.
Weak instrument bias
Avoiding bias from weak instruments in Mendelian randomization studies. Int J
sample Mendelian randomization. Genet Epidemiol. 2016;40(7):597-608.
Population stratification
Exclusion restriction assumption
Pleiotropy
Detection and interpretation of shared genetic influences on 42 human traits. Nat Genet. 2016 Jul;48(7):709-17.
One-sample MR
Avoiding bias from weak instruments in Mendelian randomization studies. Int J
for Mendelian randomization. Stat Methods Med Res. 2017 Oct;26(5):2333-2355.
Two-sample MR
the downsides of a powerful, widely applicable but potentially fallible technique. Int J Epidemiol. 2016 Dec 1;45(6):1717-1726.
sample Mendelian randomization. Genet Epidemiol. 2016 Nov;40(7):597-608.
randomisation studies with summary data and a continuous outcome. Stat Med. 2015 Sep 20;34(21):2926-40.
Inference from Mendelian Randomization Analyses with Multiple Genetic Variants.
estimation and bias detection through Egger regression. Int J Epidemiol. 2015;44(2):512-25.
Egger method. Eur J Epidemiol. 2017;32(5):377-389.
analyses using MR-Egger regression: the role of the I2 statistic. Int J Epidemiol. 2016;45(6):1961- 1974.
Using a Weighted Median Estimator. Genet Epidemiol. 2016;40(4):304-14.
via the zero modal pleiotropy assumption. Int J Epidemiol. 2017;46(6):1985-1998.
MR extensions
Smith GD. Recent Developments in Mendelian Randomization Studies. Curr Epidemiol Rep. 2017;4(4):330-345.
MR-base
across the human phenome. eLife 2018. doi: https://doi.org/10.7554/eLife.34408