Two Challenges β’ Some possible explanations for the discrepancy in those results are: entire/target β¨ population β¨ π * 1. Transportability π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ There is a mismatch between the study π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ population π and the general clinical π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ population π * regarding ethnicity, race, π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ and income (covariates named E ). π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ οΏ½ 7
Two Challenges β’ Some possible explanations for the discrepancy in those results are: study/source β¨ entire/target β¨ population β¨ population β¨ π π * 1. Transportability π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ There is a mismatch between the study π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ population π and the general clinical π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ population π * regarding ethnicity, race, π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ and income (covariates named E ). π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ οΏ½ 7
Two Challenges β’ Some possible explanations for the discrepancy in those results are: study/source β¨ entire/target β¨ population β¨ population β¨ π π * 1. Transportability π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ There is a mismatch between the study π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ population π and the general clinical π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ population π * regarding ethnicity, race, π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ and income (covariates named E ). π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ P *( e ) β P ( e ) οΏ½ 7
Two Challenges β’ Some possible explanations for the discrepancy in those results are: study/source β¨ entire/target β¨ population β¨ population β¨ π π * π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ οΏ½ 8
Two Challenges β’ Some possible explanations for the discrepancy in those results are: study/source β¨ entire/target β¨ population β¨ population β¨ π π * 2. Selection Bias π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ FDA's studies sampled from a π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ distinct population by excluding π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ youths with elevated baseline risk for suicide (B) from their cohorts. π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ οΏ½ 8
Two Challenges β’ Some possible explanations for the discrepancy in those results are: study/source β¨ entire/target β¨ population β¨ population β¨ π π * 2. Selection Bias π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ FDA's studies sampled from a π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ distinct population by excluding π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ youths with elevated baseline risk for suicide (B) from their cohorts. π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ sampled individuals β¨ ( S=1 ) π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ οΏ½ 8
Two Challenges β’ Some possible explanations for the discrepancy in those results are: study/source β¨ entire/target β¨ population β¨ population β¨ π π * 2. Selection Bias π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ FDA's studies sampled from a π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ distinct population by excluding π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ youths with elevated baseline risk for suicide (B) from their cohorts. π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ sampled individuals β¨ ( S=1 ) π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ π₯ P ( y , b , e | do ( x ), S = 1) β P ( y , b , e | do ( x )) P ( x , y , b , e | S = 1) β P ( x , y , b , e ) οΏ½ 8
Formalizing the Problem B E X Y οΏ½ 9
Formalizing the Problem We use indicator named T to mark variables with di ff erences B E T between domains π and π *. X Y οΏ½ 9
Formalizing the Problem We use indicator named T to mark variables with di ff erences B E T between domains π and π *. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. X Y S οΏ½ 9
Formalizing the Problem We use indicator named T to mark variables with di ff erences B E T between domains π and π *. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. X Y S (called selection β¨ diagram D ) οΏ½ 9
Formalizing the Problem We use indicator named T to mark variables with di ff erences B E T between domains π and π *. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. X Y S In this example, the causal e ff ect can be estimated by recalibrating (called selection β¨ the experimental findings using observations from the target domain diagram D ) P *( y | do ( x )) = β P ( y | do ( x ), b , e , S = 1) P *( b , e ) b , e οΏ½ 9
Formalizing the Problem We use indicator named T to mark variables with di ff erences B E T between domains π and π *. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. X Y S In this example, the causal e ff ect can be estimated by recalibrating (called selection β¨ the experimental findings using observations from the target domain diagram D ) P *( y | do ( x )) = β P ( y | do ( x ), b , e , S = 1) P *( b , e ) b , e causal e ff ect β¨ in target domain οΏ½ 9
Formalizing the Problem We use indicator named T to mark variables with di ff erences B E T between domains π and π *. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. X Y S In this example, the causal e ff ect can be estimated by recalibrating (called selection β¨ the experimental findings using observations from the target domain diagram D ) P *( y | do ( x )) = β P ( y | do ( x ), b , e , S = 1) P *( b , e ) b , e causal e ff ect β¨ experimental data from the β¨ in target domain source under selection bias οΏ½ 9
Formalizing the Problem We use indicator named T to mark variables with di ff erences B E T between domains π and π *. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. X Y S In this example, the causal e ff ect can be estimated by recalibrating (called selection β¨ the experimental findings using observations from the target domain diagram D ) P *( y | do ( x )) = β Observations from β¨ P ( y | do ( x ), b , e , S = 1) P *( b , e ) the target domain b , e causal e ff ect β¨ experimental data from the β¨ in target domain source under selection bias οΏ½ 9
Problem Statement οΏ½ 10
Problem Statement B E T X Y S Selection Diagram D οΏ½ 10
Problem Statement B E T X Y S Selection Diagram D P ( v | do ( x ), S = 1) Selection-biased Exp. β¨ Distribution P 1 from π οΏ½ 10
Problem Statement B E T X Y S Selection Diagram D P ( v | do ( x ), S = 1) Selection-biased Exp. β¨ Distribution P 1 from π P *( w ) Covariate Distribution β¨ P 2 from π * οΏ½ 10
β¨ Problem Statement B E T X Y S Selection Diagram D Is there a function f such that β¨ P ( v | do ( x ), S = 1) P *( y | do ( x )) = f ( P 1 , P 2 ) Selection-biased Exp. β¨ Distribution P 1 from π P *( w ) Covariate Distribution β¨ P 2 from π * οΏ½ 10
β¨ Problem Statement B E T X Y S Selection Diagram D Is there a function f such that β¨ P ( v | do ( x ), S = 1) yes ( ) / no f P *( y | do ( x )) = f ( P 1 , P 2 ) π βΉ Selection-biased Exp. β¨ Distribution P 1 from π P *( w ) Covariate Distribution β¨ P 2 from π * οΏ½ 10
Related Work οΏ½ 11
Related Work confounding type of input selection bias transportability complete οΏ½ 11
Related Work confounding type of input selection bias transportability complete Backdoor Criterion [Pearl β93] β¨ obs. β Extended Backdoor [Pearl and Paz β10] οΏ½ 11
Related Work confounding type of input selection bias transportability complete Backdoor Criterion [Pearl β93] β¨ obs. β Extended Backdoor [Pearl and Paz β10] Adjustment Criterion β¨ obs. β β [Shpitser et al. β10; Perkovic et al. β15,β18] οΏ½ 11
Related Work confounding type of input selection bias transportability complete Backdoor Criterion [Pearl β93] β¨ obs. β Extended Backdoor [Pearl and Paz β10] Adjustment Criterion β¨ obs. β β [Shpitser et al. β10; Perkovic et al. β15,β18] Selection Backdoor β¨ obs. β β [Bareinboim, Tian and Pearl β14] οΏ½ 11
Related Work confounding type of input selection bias transportability complete Backdoor Criterion [Pearl β93] β¨ obs. β Extended Backdoor [Pearl and Paz β10] Adjustment Criterion β¨ obs. β β [Shpitser et al. β10; Perkovic et al. β15,β18] Selection Backdoor β¨ obs. β β [Bareinboim, Tian and Pearl β14] Generalized Adjustment Criterion β¨ obs. β β β [Correa, Tian and Bareinboim β18] οΏ½ 11
Related Work confounding type of input selection bias transportability complete Backdoor Criterion [Pearl β93] β¨ obs. β Extended Backdoor [Pearl and Paz β10] Adjustment Criterion β¨ obs. β β [Shpitser et al. β10; Perkovic et al. β15,β18] Selection Backdoor β¨ obs. β β [Bareinboim, Tian and Pearl β14] Generalized Adjustment Criterion β¨ obs. β β β [Correa, Tian and Bareinboim β18] st-Adjustment Criterion β¨ β exp. β β β [Correa, Tian and Bareinboim β19] οΏ½ 11
Solution: Covariate st -Adjustment β’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. οΏ½ 12
Solution: Covariate st -Adjustment β’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. P *( y | do ( x )) οΏ½ 12
Solution: Covariate st -Adjustment β’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. P *( y | do ( x )) unbiased target β¨ e ff ect in π * οΏ½ 12
Solution: Covariate st -Adjustment β’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. = β P *( y | do ( x )) P ( y | do ( x ), z , S = 1) P *( z ) unbiased target β¨ z e ff ect in π * οΏ½ 12
Solution: Covariate st -Adjustment β’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. = β P *( y | do ( x )) P ( y | do ( x ), z , S = 1) P *( z ) unbiased target β¨ z experiment results β¨ e ff ect in π * in source domain π οΏ½ 12
Solution: Covariate st -Adjustment β’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. = β P *( y | do ( x )) P ( y | do ( x ), z , S = 1) P *( z ) unbiased target β¨ z experiment results β¨ observations from β¨ e ff ect in π * in source domain π the target domain π * οΏ½ 12
Solution: Covariate st -Adjustment β’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. = β P *( y | do ( x )) P ( y | do ( x ), z , S = 1) P *( z ) unbiased target β¨ z experiment results β¨ observations from β¨ e ff ect in π * in source domain π the target domain π * β’ Questions: οΏ½ 12
Solution: Covariate st -Adjustment β’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. = β P *( y | do ( x )) P ( y | do ( x ), z , S = 1) P *( z ) unbiased target β¨ z experiment results β¨ observations from β¨ e ff ect in π * in source domain π the target domain π * β’ Questions: 1. How to determine if st-adjustment holds for a set of covariates Z ? οΏ½ 12
Solution: Covariate st -Adjustment β’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. = β P *( y | do ( x )) P ( y | do ( x ), z , S = 1) P *( z ) unbiased target β¨ z experiment results β¨ observations from β¨ e ff ect in π * in source domain π the target domain π * β’ Questions: 1. How to determine if st-adjustment holds for a set of covariates Z ? 2. How to find admissible covariate sets? οΏ½ 12
Challenge I. Covariate Admissibility οΏ½ 13
Challenge I. Covariate Admissibility β’ In general, adjusting for some variables that are a ff ected by the treatment could introduce more bias, instead of controlling for the current, existent ones. οΏ½ 13
Challenge I. Covariate Admissibility β’ In general, adjusting for some variables that are a ff ected by the treatment could introduce more bias, instead of controlling for the current, existent ones. β’ In our setting, in particular, special attention needs to be paid to these covariates ( a ff ected by the treatment) that are correlated with the outcome given pre-treatment covariates. οΏ½ 13
Challenge I. Covariate Admissibility β’ In general, adjusting for some variables that are a ff ected by the treatment could introduce more bias, instead of controlling for the current, existent ones. β’ In our setting, in particular, special attention needs to be paid to these covariates ( a ff ected by the treatment) that are correlated with the outcome given pre-treatment covariates. β’ Letβs call this set Z p . οΏ½ 13
Challenge I. Covariate Admissibility β’ In general, adjusting for some variables that are a ff ected by the treatment could introduce more bias, instead of controlling for the current, existent ones. β’ In our setting, in particular, special attention needs to be paid to these covariates ( a ff ected by the treatment) that are correlated with the outcome given pre-treatment covariates. T β’ Letβs call this set Z p . Z 1 X Y Z 2 Z 3 S οΏ½ 13
Challenge I. Covariate Admissibility β’ In general, adjusting for some variables that are a ff ected by the treatment could introduce more bias, instead of controlling for the current, existent ones. β’ In our setting, in particular, special attention needs to be paid to these covariates ( a ff ected by the treatment) that are correlated with the outcome given pre-treatment covariates. T β’ Letβs call this set Z p . Z 1 X Y β’ For example if adjusting for Z = {Z 1 , Z 2 , Z 3 } in this model Z 2 Z 3 Z p = {Z 3 }. S οΏ½ 13
Main Result I: β¨ Complete Graphical Condition οΏ½ 14
Main Result I: β¨ Complete Graphical Condition A set of covariates Z is admissible for st-adjustment in D relative to treatment X and outcome Y if: οΏ½ 14
Main Result I: β¨ Complete Graphical Condition A set of covariates Z is admissible for st-adjustment in D relative to treatment X and outcome Y if: (i) Variables in Z p are independent of the treatment given all other covariates, and οΏ½ 14
Main Result I: β¨ Complete Graphical Condition A set of covariates Z is admissible for st-adjustment in D relative to treatment X and outcome Y if: (i) Variables in Z p are independent of the treatment given all other covariates, and (ii) The outcome Y is independent of all the transportability ( T ) and selection bias nodes ( S ) given the covariates Z and the treatment X . οΏ½ 14
Main Result I: β¨ Complete Graphical Condition A set of covariates Z is admissible for st-adjustment in D relative to treatment X and outcome Y if: (i) Variables in Z p are independent of the treatment given all other covariates, and (ii) The outcome Y is independent of all the transportability ( T ) and selection bias nodes ( S ) given the covariates Z and the treatment X . Thm. The causal e ff ect P * (y | do(x)) is identifiable by st-adjustment on a set Z with D if and only if the conditions above hold for Z relative to X and Y . οΏ½ 14
Understanding the criterion οΏ½ 15
Understanding the criterion T Task: Compute P * (y | do(x)) Z 1 X Y Z 2 Z 3 S οΏ½ 15
Understanding the criterion T T Task: Compute P * (y | do(x)) Z 1 Z 1 β’ The outcome Y is a ff ected by di ff erences in the distribution of Z 1 X X Y Y Z 2 Z 2 Z 3 Z 3 between the source and target domains. S S οΏ½ 15
Understanding the criterion T T T Task: Compute P * (y | do(x)) Z 1 Z 1 Z 1 β’ The outcome Y is a ff ected by di ff erences in the distribution of Z 1 X X X Y Y Y Z 2 Z 2 Z 2 Z 3 Z 3 Z 3 between the source and target domains. S S S β’ The variable Z 3 a ff ects the likelihood of units being sampled. οΏ½ 15
Understanding the criterion T T T Task: Compute P * (y | do(x)) Z 1 Z 1 Z 1 β’ The outcome Y is a ff ected by di ff erences in the distribution of Z 1 X X X Y Y Y Z 2 Z 2 Z 2 Z 3 Z 3 Z 3 between the source and target domains. S S S β’ The variable Z 3 a ff ects the likelihood of units being sampled. T Z 1 β’ If we adjust for Z 3 to control for selection bias, we introduce spurious correlation. Hence, we should also control for Z 2 . X Y Z 2 Z 3 S οΏ½ 15
Understanding the criterion T T T Task: Compute P * (y | do(x)) Z 1 Z 1 Z 1 β’ The outcome Y is a ff ected by di ff erences in the distribution of Z 1 X X X Y Y Y Z 2 Z 2 Z 2 Z 3 Z 3 Z 3 between the source and target domains. S S S β’ The variable Z 3 a ff ects the likelihood of units being sampled. T T Z 1 Z 1 β’ If we adjust for Z 3 to control for selection bias, we introduce spurious correlation. Hence, we should also control for Z 2 . X X Y Y Z 2 Z 2 Z 3 Z 3 S S οΏ½ 15
Getting the intuition behind the rules β¨ Example T Z 1 X Y Z 2 Z 3 S οΏ½ 16
Getting the intuition behind the rules β¨ Example By making Z ={ Z 1 , Z 2 , Z 3 }, we can verify the st-adjustment conditions, T i.e.: Z 1 X Y Z 2 Z 3 S οΏ½ 16
Getting the intuition behind the rules β¨ Example By making Z ={ Z 1 , Z 2 , Z 3 }, we can verify the st-adjustment conditions, T i.e.: Z 1 (i) The variable in Z p ={Z 3 } is independent of X given the other X Y covariates {Z 1 , Z 2 } . Z 2 Z 3 S οΏ½ 16
Getting the intuition behind the rules β¨ Example By making Z ={ Z 1 , Z 2 , Z 3 }, we can verify the st-adjustment conditions, T i.e.: Z 1 (i) The variable in Z p ={Z 3 } is independent of X given the other X Y covariates {Z 1 , Z 2 } . Z 2 Z 3 (ii) The outcome Y is independent of S and T given Z . S οΏ½ 16
Getting the intuition behind the rules β¨ Example By making Z ={ Z 1 , Z 2 , Z 3 }, we can verify the st-adjustment conditions, T i.e.: Z 1 (i) The variable in Z p ={Z 3 } is independent of X given the other X Y covariates {Z 1 , Z 2 } . Z 2 Z 3 (ii) The outcome Y is independent of S and T given Z . S Hence, the st-adjustment is guaranteed to hold, i.e.: P *( y | do ( x )) = β P ( y | do ( x ), z 1 , z 2 , z 3 , S = 1) P *( z 1 , z 2 , z 3 ) z 1 , z 2 , z 3 οΏ½ 16
Getting the intuition behind the rules β¨ Example By making Z ={ Z 1 , Z 2 , Z 3 }, we can verify the st-adjustment conditions, T i.e.: Z 1 (i) The variable in Z p ={Z 3 } is independent of X given the other X Y covariates {Z 1 , Z 2 } . Z 2 Z 3 (ii) The outcome Y is independent of S and T given Z . S Hence, the st-adjustment is guaranteed to hold, i.e.: P *( y | do ( x )) = β P ( y | do ( x ), z 1 , z 2 , z 3 , S = 1) P *( z 1 , z 2 , z 3 ) z 1 , z 2 , z 3 measurements from β¨ causal e ff ect β¨ experimental data from the β¨ the target domain in target domain source under selection bias οΏ½ 16
Challenge II. β¨ Searching for Admissible Sets οΏ½ 17
Challenge II. β¨ Searching for Admissible Sets β’ Given a candidate set Z, we have a condition to determine if it is admissible or not. οΏ½ 17
Challenge II. β¨ Searching for Admissible Sets β’ Given a candidate set Z, we have a condition to determine if it is admissible or not. β’ The natural question that follows is how to find an admissible set without resorting to trial and error. There could be exponentially many candidates (and even valid ones!). οΏ½ 17
Challenge II. β¨ Searching for Admissible Sets β’ Given a candidate set Z, we have a condition to determine if it is admissible or not. β’ The natural question that follows is how to find an admissible set without resorting to trial and error. There could be exponentially many candidates (and even valid ones!). β’ How to determine the existence of at least one admissible set? οΏ½ 17
Challenge II. β¨ Searching for Admissible Sets β’ Given a candidate set Z, we have a condition to determine if it is admissible or not. β’ The natural question that follows is how to find an admissible set without resorting to trial and error. There could be exponentially many candidates (and even valid ones!). β’ How to determine the existence of at least one admissible set? β’ There are sets that could be preferred among other admissible ones due to certain properties (e.g., cost, variance). οΏ½ 17
Main Result II: Listing Algorithm οΏ½ 18
Main Result II: Listing Algorithm B E T X Y S Selection Diagram D οΏ½ 18
Main Result II: Listing Algorithm B E T X Y S Selection Diagram D P ( v | do ( x ), S = 1) Selection-biased Exp. β¨ Distribution from π οΏ½ 18
Main Result II: Listing Algorithm B E T X Y S Selection Diagram D P ( v | do ( x ), S = 1) Selection-biased Exp. β¨ Distribution from π Set W of covariates β¨ measurable in π * οΏ½ 18
Main Result II: Listing Algorithm B E T X Y S Selection Diagram D What are all the admissible sets satisfying st-adjustment? P ( v | do ( x ), S = 1) Selection-biased Exp. β¨ Distribution from π Set W of covariates β¨ measurable in π * οΏ½ 18
β¨ Main Result II: Listing Algorithm B E T X Y S List of of sets β¨ Selection Diagram D What are all the admissible sets Z 1 , Z 2 , β¦ β W such that for β¨ satisfying st-adjustment? P ( v | do ( x ), S = 1) each Z i : P *( y | do ( x )) = β P ( y | do ( x ), z i , S = 1) P *( z i ) Selection-biased Exp. β¨ z i Distribution from π Set W of covariates β¨ measurable in π * οΏ½ 18
Recommend
More recommend