Causal inference with missing values
Effect of tranexamic acid on mortality for head trauma patient
Julie Josse, (INRIA XPOP - X) - Imke Mayer 22 January, 2019
Statistic seminar Nice 1
Causal inference with missing values Effect of tranexamic acid on - - PowerPoint PPT Presentation
Causal inference with missing values Effect of tranexamic acid on mortality for head trauma patient Julie Josse, (INRIA XPOP - X) - Imke Mayer 22 January, 2019 Statistic seminar Nice 1 Research activities Dimensionality reduction methods
Statistic seminar Nice 1
2
3
4
Center Accident Age Sex Weight Height BMI BP SBP 1 Beaujon Fall 54 m 85 NR NR 180 110 2 Lille Other 33 m 80 1.8 24.69 130 62 3 Pitie Salpetriere Gun 26 m NR NR NR 131 62 4 Beaujon AVP moto 63 m 80 1.8 24.69 145 89 6 Pitie Salpetriere AVP bicycle 33 m 75 NR NR 104 86 7 Pitie Salpetriere AVP pedestrian 30 w NR NR NR 107 66 9 HEGP White weapon 16 m 98 1.92 26.58 118 54 10 Toulon White weapon 20 m NR NR NR 124 73 ................... SpO2 Temperature Lactates Hb Glasgow Transfusion ........... 1 97 35.6 <NA> 12.7 12 yes 2 100 36.5 4.8 11.1 15 no 3 100 36 3.9 11.4 3 no 4 100 36.7 1.66 13 15 yes 6 100 36 NM 14.4 15 no 7 100 36.6 NM 14.3 15 yes 9 100 37.5 13 15.9 15 yes 10 100 36.9 NM 13.7 15 no
5
25 50 75 100 AIS.face AIS.tete Choc.hemorragique Trauma.cranien Glasgow Anomalie.pupillaire IOT.SMUR FC Mydriase Glasgow.initial ACR.1 Catecholamines PAS PAD Temps.en.rea SpO2 Hb DC.en.rea Plaquettes Traitement.antiagregants Traitement.anticoagulant TP.pourcentage PAS.min Glasgow.moteur.initial FC.max PAD.min Ventilation.FiO2 SpO2.min Fibrinogene.1 LATA KTV.poses.avant.TDM Dose.NAD.depart Temps.depart.scanner.ou.bloc Derniere.PAS.avant.depart Derniere.PAD.avant.depart Lactates Temps.lieux.hop Glasgow.moteur PaO2 pCO2 ARDS Couple Alcool EER FC.SMUR PAS.SMUR PAD.SMUR DTC.IP.max PIC Osmotherapie DVE Craniectomie.decompressive Diplome.plus.eleve.ou.niveau Lactates.H2.1 DTC.IP.max.24h.HTIC HTIC Lactates.H2 Glasgow.sortie Lactates.prehosp Mannitol.SSH Hypothermie.therapeutique Cause.du.DC Delai.DC Temps.arrivee.pose.PIC Temperature.min Regression.mydriase.sous.osmotherapie Temps.arrivee.pose.DVE
Percentage variable
null.data na.data nr.data nf.data imp.data
Percentage of missing values
6
7
8
8
d
n→∞ N(0, VDM),
P(Wi=0) + Var(Yi(1)) P(Wi=1) . 9
1 n
n
d
n→∞ N(0, VOLS). And VDM = VOLS + β(0) + β(1))2 A. 10
Died P(Outcome | Treatment) Treated 1 1 FALSE 2225 340 0.867 0.133 TRUE 436 168 0.722 0.278
Standardized mean differences between treated and control.
AIS.externe DTC.IP.max AIS.tete PaO2 Temps.lieux.hop SpO2 SpO2.min FC.max Plaquettes pCO2 AIS.face PAD Glasgow.moteur.initial FC Glasgow.initial Dose.NAD.depart PAD.min AIS.thorax PAS PAS.min Lactates AIS.membres.bassin AIS.abdo.pelvien Fibrinogene.1 Hb TP.pourcentage 0.00 0.25 0.50 0.75 1.00
Absolute Mean Differences Sample
Covariate Balance
11
0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00
pscore scaled as.factor(treatment)
1
Propensity Score before Weighting
0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00
pscore scaled as.factor(treatment)
1
Propensity Score after Weighting
12
13
14
14
n
15
1 1+e−xT α
n
i=1
ˆ τ = ¯ X(β(1) − β(0)) + [term for ε] +
n
n
ˆ γ(1)(Xi)WiXi − ¯ X
n
n
ˆ γ(0)(Xi)(1 − Wi)Xi − ¯ X
X(β(1) − β(0)) + Wi(Yi − µ(1)(Xi)) e(Xi) − (1 − Wi)(Yi − µ(0)(Xi)) 1 − e(Xi)
16
n
17
n
17
18
Litterature : Schaefer (2002) ; Little & Rubin (2002) ; Gelman & Meng (2004) ; Kim & Shao (2013) ; Carpenter & Kenward (2013) ; van Buuren (2015)
19
0.22 -0.52 0.67 1.46 1.11 0.63 1.56 1.10 2.00 1.00
0.04 -0.34 1.24 0.89 1.05 0.69 1.50 1.15 1.67 1.33
X
1 2 3
1 2 3 x1 x2
^ μ
2 = tr(AA⊤) :
µ
2 : rank (µ) ≤ k
′
p×k
′
p×k
20
NA -0.77
0.22 -0.52 0.67 1.46 NA 0.63 1.56 1.10 2.00 1.00
0.04 -0.34 1.24 0.89 1.05 0.69 1.50 1.15 1.67 1.33
X
1 2 3
1 2 3 x1 x2
^ μ
2 = tr(AA⊤) :
µ
2 : rank (µ) ≤ k
′
p×k
′
p×k
20
µ
2 : rank (µ) ≤ k
µ
2 : rank (µ) ≤ k
21
1 2 3
1 2 3 x1 x2
x1 x2
0.0 -0.01 1.5 NA 2.0 1.98
22
1 2 3
1 2 3 x1 x2
x1 x2
0.0 -0.01 1.5 NA 2.0 1.98 x1 x2
0.0 -0.01 1.5 0.00 2.0 1.98
22
1 2 3
1 2 3 x1 x2
x1 x2
0.0 -0.01 1.5 NA 2.0 1.98 x1 x2
0.0 -0.01 1.5 0.00 2.0 1.98 x1 x2
0.15 -0.18 1.00 0.57 2.27 1.67
22
1 2 3
1 2 3 x1 x2
x1 x2
0.0 -0.01 1.5 NA 2.0 1.98 x1 x2
0.0 -0.01 1.5 0.00 2.0 1.98 x1 x2
0.15 -0.18 1.00 0.57 2.27 1.67
22
1 2 3
1 2 3 x1 x2
x1 x2
0.0 -0.01 1.5 NA 2.0 1.98 x1 x2
0.0 -0.01 1.5 0.00 2.0 1.98 x1 x2
0.15 -0.18 1.00 0.57 2.27 1.67 x1 x2
0.0 -0.01 1.5 0.57 2.0 1.98
22
x1 x2
0.0 -0.01 1.5 NA 2.0 1.98 x1 x2
0.0 -0.01 1.5 0.57 2.0 1.98 x1 x2
0.0 -0.01 1.5 0.57 2.0 1.98
1 2 3
1 2 3 x1 x2
22
x1 x2
0.0 -0.01 1.5 NA 2.0 1.98 x1 x2
0.0 -0.01 1.5 0.57 2.0 1.98 x1 x2
0.09 -0.11 1.20 0.90 2.18 1.78 x1 x2
0.0 -0.01 1.5 0.90 2.0 1.98
1 2 3
1 2 3 x1 x2
22
x1 x2
0.0 -0.01 1.5 NA 2.0 1.98 x1 x2
0.0 -0.01 1.5 0.00 2.0 1.98 x1 x2
0.15 -0.18 1.00 0.57 2.27 1.67 x1 x2
0.0 -0.01 1.5 0.57 2.0 1.98
1 2 3
1 2 3 x1 x2
22
x1 x2
0.0 -0.01 1.5 NA 2.0 1.98 x1 x2
0.0 -0.01 1.5 1.46 2.0 1.98
1 2 3
1 2 3 x1 x2
22
q=1 dquqv
′
q
q=1 (dq − λ)+uqv
′
q arg minµ
2 + λµ∗
& Donoho (2014), J. & Wager (2015), J. & Sardy (2014), etc.
iid
(Udell & Townsend, 2017) 23
age weight size alcohol sex snore tobacco NA 100 190 NA M yes no 70 96 186 1-2 gl/d M NA <=1 NA 104 194 No W no NA 62 68 165 1-2 gl/d M no <=1 age weight size alcohol sex snore tobacco 51 100 190 1-2 gl/d M yes no 70 96 186 1-2 gl/d M no <=1 48 104 194 No W no <=1 62 68 165 1-2 gl/d M no <=1 51 100 190 0.2 0.7 0.1 1 0 0 1 1 0 0 70 96 186 0 1 0 1 0 0.8 0.2 0 1 0 48 104 194 1 0 0 0 1 1 0 0.1 0.8 0.1 62 68 165 0 1 0 1 0 1 0 0 1 0 NA 100 190 NA NA NA 1 0 0 1 1 0 0 70 96 186 0 1 0 1 0 NA NA 0 1 0 NA 104 194 1 0 0 0 1 1 0 NA NA NA 62 68 165 0 1 0 1 0 1 0 0 1 0
(J., Husson, Robin & Balasu., 2018, Imputation of mixed data with multilevel SVD. JCGS)
24
−j = (X t 1, X t j−1, X t−1 j+1 , X t−1 p
j is obtained by
j
−j
j
−j
25
Feat1 Feat2 Feat3 Feat4 Feat5... C1 1 1 1 1 1 C2 1 1 1 1 1 C3 2 2 2 2 2 C4 2 2 2 2 2 C5 3 3 3 3 3 C6 3 3 3 3 3 C7 4 4 4 4 4 C8 4 4 4 4 4 C9 5 5 5 5 5 C10 5 5 5 5 5 C11 6 6 6 6 6 C12 6 6 6 6 6 C13 7 7 7 7 7 C14 7 7 7 7 7 Igor 8 NA NA 8 8 Frank 8 NA NA 8 8 Bertrand 9 NA NA 9 9 Alex 9 NA NA 9 9 Yohann 10 NA NA 10 10 Jean 10 NA NA 10 10
Feat1 Feat2 Feat3 Feat4 Feat5 1 1.0 1.00 1 1 1 1.0 1.00 1 1 2 2.0 2.00 2 2 2 2.0 2.00 2 2 3 3.0 3.00 3 3 3 3.0 3.00 3 3 4 4.0 4.00 4 4 4 4.0 4.00 4 4 5 5.0 5.00 5 5 5 5.0 5.00 5 5 6 6.0 6.00 6 6 6 6.0 6.00 6 6 7 7.0 7.00 7 7 7 7.0 7.00 7 7 8 6.87 6.87 8 8 8 6.87 6.87 8 8 9 6.87 6.87 9 9 9 6.87 6.87 9 9 10 6.87 6.87 10 10 10 6.87 6.87 10 10
Feat1 Feat2 Feat3 Feat4 Feat5 1 1 1 1 1 1 1 1 1 1 2 2 2 2 2 2 2 2 2 2 3 3 3 3 3 3 3 3 3 3 4 4 4 4 4 4 4 4 4 4 5 5 5 5 5 5 5 5 5 5 6 6 6 6 6 6 6 6 6 6 7 7 7 7 7 7 7 7 7 7 8 8 8 8 8 8 8 8 8 8 9 9 9 9 9 9 9 9 9 9 10 10 10 10 10 10 10 10 10 10
26
j=1 βjxij)
j=1 βjxij)
i.i.d. Np(µ, Σ)
n
27
28
y
y
y
y M
mis
29
30
(Kallus, Mao and Udell (2018)) .
i α + τTi + εi.
31
32
32
n
i=1
ˆ e(Xi) − (1−Wi)Yi 1−ˆ e(Xi)
32
MF.glm FAMD.grf Inf.grf MF.grf −.saem −5 5
ATE (in %) Imputation.set Imputation.method
Inf − MF
IPW ATE estimation
MF.glm FAMD.grf Inf.grf MF.grf −.saem −5 5
ATE (in %) Imputation.set Imputation.method
Inf − MF
DR ATE estimation
(y-axis : imputation . ps estimation), (x-axis : ATE estimation with bootstrap CI) We compute the mortality rate in the treated group and the mortality rate in the control group (after covariate balancing). The value obtained corresponds to the difference in percentage points between mortality rates in treatment and control. 33
Low-rank estimation with missing non at random data.
34
35
35
0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00 pscore scaled as.factor(treatment) 1
Propensity Score before Weighting
0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00 pscore scaled as.factor(treatment) 1
Propensity Score after Weighting
0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00 pscore scaled as.factor(treatment) 1
Propensity Score before Weighting
0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00 pscore scaled as.factor(treatment) 1
Propensity Score after Weighting
0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00 pscore scaled as.factor(treatment) 1
Propensity Score before Weighting
0.00 0.25 0.50 0.75 1.00 0.00 0.25 0.50 0.75 1.00 pscore scaled as.factor(treatment) 1
Propensity Score after Weighting
36
Anomalie.pupillaire:<NA> Plaquettes:<NA> ACR.1:<NA> FC:<NA> Glasgow.initial:<NA> Mydriase:<NA> Traitement.antiagregants:<NA> Hb:<NA> IOT.SMUR:<NA> Traitement.anticoagulant:<NA> Mannitol.SSH:<NA> Glasgow.moteur.initial:<NA> Traitement.anticoagulant Alcool Craniectomie.decompressive SpO2.min:<NA> Traitement.antiagregants TP.pourcentage:<NA> Hypothermie.therapeutique PAS:<NA> DTC.IP.max:<NA> SpO2:<NA> Alcool:<NA> PAD.min:<NA> PAS.min:<NA> Fibrinogene.1:<NA> PIC DVE Ventilation.FiO2:<NA> FC.max:<NA> Trauma.cranien AIS.externe ACR.1 Mannitol.SSH Temps.lieux.hop:<NA> Lactates:<NA> pCO2:<NA> PaO2:<NA> Mydriase Anomalie.pupillaire Osmotherapie AIS.tete DTC.IP.max PaO2 Temps.lieux.hop Catecholamines SpO2 SpO2.min FC.max IOT.SMUR Plaquettes AIS.face pCO2 Choc.hemorragique KTV.poses.avant.TDM PAD FC Glasgow.initial Glasgow.moteur.initial Ventilation.FiO2 Dose.NAD.depart AIS.thorax PAD.min PAS PAS.min Lactates AIS.membres.bassin AIS.abdo.pelvien Fibrinogene.1 Hb TP.pourcentage fitted[, c("pscore")] 0.0 0.5 1.0 1.5
Absolute Mean Differences
Covariate Balance
Ventilation.FiO2_4 Ventilation.FiO2_3 Traitement.anticoagulant_1 Regression.mydriase.sous.osmotherapie_1 Mannitol.SSH_SSH Craniectomie.decompressive_1 Alcool Traitement.antiagregants_1 Hypothermie.therapeutique_1 Osmotherapie_Mannitol PIC_1 DVE_1 Anomalie.pupillaire_Anisocorie (unilatérale) Trauma.cranien_1 Mydriase_Anisocorie (unilatérale) Mannitol.SSH_Rien AIS.externe ACR.1_1 Regression.mydriase.sous.osmotherapie_0 Mannitol.SSH_Mannitol Ventilation.FiO2_2 Regression.mydriase.sous.osmotherapie_Not tested Mydriase_Anomalie pupillaire Anomalie.pupillaire_Mydriase (bilatérale) Osmotherapie_SSH Mydriase_Non Mannitol.SSH_No mydriase Ventilation.FiO2_5 Anomalie.pupillaire_Non Osmotherapie_Rien DTC.IP.max AIS.tete PaO2 Temps.lieux.hop SpO2 Catecholamines_1 Ventilation.FiO2_1 SpO2.min FC.max IOT.SMUR_1 Plaquettes pCO2 AIS.face Choc.hemorragique_1 KTV.poses.avant.TDM_1 PAD Glasgow.moteur.initial FC Glasgow.initial Dose.NAD.depart PAD.min AIS.thorax PAS PAS.min Lactates AIS.membres.bassin AIS.abdo.pelvien Fibrinogene.1 Hb TP.pourcentage fitted$pscore 0.0 0.5 1.0 1.5
Absolute Mean Differences
Covariate Balance
Dim.7 Dim.4 Dim.6 Dim.5 Dim.3 Dim.2 Dim.1 fitted$pscore 0.0 0.5 1.0 1.5
Absolute Mean Differences
Covariate Balance
37
0.00 0.25 0.50 0.75 1.00 0.0 0.2 0.4 0.6 0.8 pscore scaled as.factor(treatment) 1
Propensity Score before Weighting
0.00 0.25 0.50 0.75 1.00 0.0 0.2 0.4 0.6 0.8 pscore scaled as.factor(treatment) 1
Propensity Score after Weighting
0.00 0.25 0.50 0.75 1.00 0.0 0.2 0.4 0.6 0.8 pscore scaled as.factor(treatment) 1
Propensity Score before Weighting
0.00 0.25 0.50 0.75 1.00 0.0 0.2 0.4 0.6 0.8 pscore scaled as.factor(treatment) 1
Propensity Score after Weighting
38
Ventilation.FiO2_4 Ventilation.FiO2_3 Traitement.anticoagulant_1 Regression.mydriase.sous.osmotherapie_1 Mannitol.SSH_SSH Craniectomie.decompressive_1 Alcool Traitement.antiagregants_1 Hypothermie.therapeutique_1 Osmotherapie_Mannitol PIC_1 DVE_1 Anomalie.pupillaire_Anisocorie (unilatérale) Trauma.cranien_1 Mydriase_Anisocorie (unilatérale) Mannitol.SSH_Rien AIS.externe ACR.1_1 Regression.mydriase.sous.osmotherapie_0 Mannitol.SSH_Mannitol Ventilation.FiO2_2 Regression.mydriase.sous.osmotherapie_Not tested Mydriase_Anomalie pupillaire Anomalie.pupillaire_Mydriase (bilatérale) Osmotherapie_SSH Mydriase_Non Mannitol.SSH_No mydriase Ventilation.FiO2_5 Anomalie.pupillaire_Non Osmotherapie_Rien DTC.IP.max AIS.tete PaO2 Temps.lieux.hop SpO2 Catecholamines_1 Ventilation.FiO2_1 SpO2.min FC.max IOT.SMUR_1 Plaquettes pCO2 AIS.face Choc.hemorragique_1 KTV.poses.avant.TDM_1 PAD Glasgow.moteur.initial FC Glasgow.initial Dose.NAD.depart PAD.min AIS.thorax PAS PAS.min Lactates AIS.membres.bassin AIS.abdo.pelvien Fibrinogene.1 Hb TP.pourcentage fitted$pscore 0.0 0.5 1.0 1.5 Absolute Mean Differences Sample Unadjusted Adjusted
Covariate Balance
Dim.7 Dim.4 Dim.6 Dim.5 Dim.3 Dim.2 Dim.1 fitted$pscore 0.0 0.5 1.0 1.5 Absolute Mean Differences Sample Unadjusted Adjusted
Covariate Balance
39
10 20 30 −0.06 −0.04 −0.02 0.00 0.02
ate density
DR ATE with PS via logistic regression
10 20 30 40 −0.04 −0.02 0.00 0.02
ate density
DR ATE with PS via regression tree (grf default)
40
5 10 15 20 25 −0.06 −0.03 0.00 0.03 0.06
ate density
IPW ATE with PS via logistic regression
10 20 30 40 0.000 0.025 0.050 0.075
ate density
IPW ATE with PS via generalized random forest
41