adjustment criteria for generalizing experimental findings

Adjustment Criteria for Generalizing Experimental Findings Juan D. - PowerPoint PPT Presentation

Adjustment Criteria for Generalizing Experimental Findings Juan D. Correa , Jin Tian and Elias Bareinboim Long Beach, CA 1 Causal Effects and Experiments 2 Causal Effects and Experiments Science is about understanding the laws


  1. Two Challenges β€’ Some possible explanations for the discrepancy in those results are: entire/target 
 population 
 𝛒 * 1. Transportability πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ There is a mismatch between the study πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ population 𝛒 and the general clinical πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ population 𝛒 * regarding ethnicity, race, πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ and income (covariates named E ). πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ οΏ½ 7

  2. Two Challenges β€’ Some possible explanations for the discrepancy in those results are: study/source 
 entire/target 
 population 
 population 
 𝛒 𝛒 * 1. Transportability πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ There is a mismatch between the study πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ population 𝛒 and the general clinical πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ population 𝛒 * regarding ethnicity, race, πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ and income (covariates named E ). πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ οΏ½ 7

  3. Two Challenges β€’ Some possible explanations for the discrepancy in those results are: study/source 
 entire/target 
 population 
 population 
 𝛒 𝛒 * 1. Transportability πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ There is a mismatch between the study πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ population 𝛒 and the general clinical πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ population 𝛒 * regarding ethnicity, race, πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ and income (covariates named E ). πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ P *( e ) β‰  P ( e ) οΏ½ 7

  4. Two Challenges β€’ Some possible explanations for the discrepancy in those results are: study/source 
 entire/target 
 population 
 population 
 𝛒 𝛒 * πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ οΏ½ 8

  5. Two Challenges β€’ Some possible explanations for the discrepancy in those results are: study/source 
 entire/target 
 population 
 population 
 𝛒 𝛒 * 2. Selection Bias πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ FDA's studies sampled from a πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ distinct population by excluding πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ youths with elevated baseline risk for suicide (B) from their cohorts. πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ οΏ½ 8

  6. Two Challenges β€’ Some possible explanations for the discrepancy in those results are: study/source 
 entire/target 
 population 
 population 
 𝛒 𝛒 * 2. Selection Bias πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ FDA's studies sampled from a πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ distinct population by excluding πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ youths with elevated baseline risk for suicide (B) from their cohorts. πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ sampled individuals 
 ( S=1 ) πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ οΏ½ 8

  7. Two Challenges β€’ Some possible explanations for the discrepancy in those results are: study/source 
 entire/target 
 population 
 population 
 𝛒 𝛒 * 2. Selection Bias πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ FDA's studies sampled from a πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ distinct population by excluding πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ youths with elevated baseline risk for suicide (B) from their cohorts. πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ sampled individuals 
 ( S=1 ) πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ πŸ‘₯ P ( y , b , e | do ( x ), S = 1) β‰  P ( y , b , e | do ( x )) P ( x , y , b , e | S = 1) β‰  P ( x , y , b , e ) οΏ½ 8

  8. Formalizing the Problem B E X Y οΏ½ 9

  9. Formalizing the Problem We use indicator named T to mark variables with di ff erences B E T between domains 𝛒 and 𝛒 *. X Y οΏ½ 9

  10. Formalizing the Problem We use indicator named T to mark variables with di ff erences B E T between domains 𝛒 and 𝛒 *. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. X Y S οΏ½ 9

  11. Formalizing the Problem We use indicator named T to mark variables with di ff erences B E T between domains 𝛒 and 𝛒 *. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. X Y S (called selection 
 diagram D ) οΏ½ 9

  12. Formalizing the Problem We use indicator named T to mark variables with di ff erences B E T between domains 𝛒 and 𝛒 *. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. X Y S In this example, the causal e ff ect can be estimated by recalibrating (called selection 
 the experimental findings using observations from the target domain diagram D ) P *( y | do ( x )) = βˆ‘ P ( y | do ( x ), b , e , S = 1) P *( b , e ) b , e οΏ½ 9

  13. Formalizing the Problem We use indicator named T to mark variables with di ff erences B E T between domains 𝛒 and 𝛒 *. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. X Y S In this example, the causal e ff ect can be estimated by recalibrating (called selection 
 the experimental findings using observations from the target domain diagram D ) P *( y | do ( x )) = βˆ‘ P ( y | do ( x ), b , e , S = 1) P *( b , e ) b , e causal e ff ect 
 in target domain οΏ½ 9

  14. Formalizing the Problem We use indicator named T to mark variables with di ff erences B E T between domains 𝛒 and 𝛒 *. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. X Y S In this example, the causal e ff ect can be estimated by recalibrating (called selection 
 the experimental findings using observations from the target domain diagram D ) P *( y | do ( x )) = βˆ‘ P ( y | do ( x ), b , e , S = 1) P *( b , e ) b , e causal e ff ect 
 experimental data from the 
 in target domain source under selection bias οΏ½ 9

  15. Formalizing the Problem We use indicator named T to mark variables with di ff erences B E T between domains 𝛒 and 𝛒 *. Similarly, the indicator , named S, is defined such that S=1 for every unit sampled in the study, and 0, otherwise. X Y S In this example, the causal e ff ect can be estimated by recalibrating (called selection 
 the experimental findings using observations from the target domain diagram D ) P *( y | do ( x )) = βˆ‘ Observations from 
 P ( y | do ( x ), b , e , S = 1) P *( b , e ) the target domain b , e causal e ff ect 
 experimental data from the 
 in target domain source under selection bias οΏ½ 9

  16. Problem Statement οΏ½ 10

  17. Problem Statement B E T X Y S Selection Diagram D οΏ½ 10

  18. Problem Statement B E T X Y S Selection Diagram D P ( v | do ( x ), S = 1) Selection-biased Exp. 
 Distribution P 1 from 𝛒 οΏ½ 10

  19. Problem Statement B E T X Y S Selection Diagram D P ( v | do ( x ), S = 1) Selection-biased Exp. 
 Distribution P 1 from 𝛒 P *( w ) Covariate Distribution 
 P 2 from 𝛒 * οΏ½ 10

  20. 
 Problem Statement B E T X Y S Selection Diagram D Is there a function f such that 
 P ( v | do ( x ), S = 1) P *( y | do ( x )) = f ( P 1 , P 2 ) Selection-biased Exp. 
 Distribution P 1 from 𝛒 P *( w ) Covariate Distribution 
 P 2 from 𝛒 * οΏ½ 10

  21. 
 Problem Statement B E T X Y S Selection Diagram D Is there a function f such that 
 P ( v | do ( x ), S = 1) yes ( ) / no f P *( y | do ( x )) = f ( P 1 , P 2 ) 😁 ☹ Selection-biased Exp. 
 Distribution P 1 from 𝛒 P *( w ) Covariate Distribution 
 P 2 from 𝛒 * οΏ½ 10

  22. Related Work οΏ½ 11

  23. Related Work confounding type of input selection bias transportability complete οΏ½ 11

  24. Related Work confounding type of input selection bias transportability complete Backdoor Criterion [Pearl ’93] 
 obs. βœ” Extended Backdoor [Pearl and Paz ’10] οΏ½ 11

  25. Related Work confounding type of input selection bias transportability complete Backdoor Criterion [Pearl ’93] 
 obs. βœ” Extended Backdoor [Pearl and Paz ’10] Adjustment Criterion 
 obs. βœ” βœ” [Shpitser et al. ’10; Perkovic et al. ’15,’18] οΏ½ 11

  26. Related Work confounding type of input selection bias transportability complete Backdoor Criterion [Pearl ’93] 
 obs. βœ” Extended Backdoor [Pearl and Paz ’10] Adjustment Criterion 
 obs. βœ” βœ” [Shpitser et al. ’10; Perkovic et al. ’15,’18] Selection Backdoor 
 obs. βœ” βœ” [Bareinboim, Tian and Pearl ’14] οΏ½ 11

  27. Related Work confounding type of input selection bias transportability complete Backdoor Criterion [Pearl ’93] 
 obs. βœ” Extended Backdoor [Pearl and Paz ’10] Adjustment Criterion 
 obs. βœ” βœ” [Shpitser et al. ’10; Perkovic et al. ’15,’18] Selection Backdoor 
 obs. βœ” βœ” [Bareinboim, Tian and Pearl ’14] Generalized Adjustment Criterion 
 obs. βœ” βœ” βœ” [Correa, Tian and Bareinboim ’18] οΏ½ 11

  28. Related Work confounding type of input selection bias transportability complete Backdoor Criterion [Pearl ’93] 
 obs. βœ” Extended Backdoor [Pearl and Paz ’10] Adjustment Criterion 
 obs. βœ” βœ” [Shpitser et al. ’10; Perkovic et al. ’15,’18] Selection Backdoor 
 obs. βœ” βœ” [Bareinboim, Tian and Pearl ’14] Generalized Adjustment Criterion 
 obs. βœ” βœ” βœ” [Correa, Tian and Bareinboim ’18] st-Adjustment Criterion 
 β€” exp. βœ” βœ” βœ” [Correa, Tian and Bareinboim ’19] οΏ½ 11

  29. Solution: Covariate st -Adjustment β€’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. οΏ½ 12

  30. Solution: Covariate st -Adjustment β€’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. P *( y | do ( x )) οΏ½ 12

  31. Solution: Covariate st -Adjustment β€’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. P *( y | do ( x )) unbiased target 
 e ff ect in 𝛒 * οΏ½ 12

  32. Solution: Covariate st -Adjustment β€’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. = βˆ‘ P *( y | do ( x )) P ( y | do ( x ), z , S = 1) P *( z ) unbiased target 
 z e ff ect in 𝛒 * οΏ½ 12

  33. Solution: Covariate st -Adjustment β€’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. = βˆ‘ P *( y | do ( x )) P ( y | do ( x ), z , S = 1) P *( z ) unbiased target 
 z experiment results 
 e ff ect in 𝛒 * in source domain 𝛒 οΏ½ 12

  34. Solution: Covariate st -Adjustment β€’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. = βˆ‘ P *( y | do ( x )) P ( y | do ( x ), z , S = 1) P *( z ) unbiased target 
 z experiment results 
 observations from 
 e ff ect in 𝛒 * in source domain 𝛒 the target domain 𝛒 * οΏ½ 12

  35. Solution: Covariate st -Adjustment β€’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. = βˆ‘ P *( y | do ( x )) P ( y | do ( x ), z , S = 1) P *( z ) unbiased target 
 z experiment results 
 observations from 
 e ff ect in 𝛒 * in source domain 𝛒 the target domain 𝛒 * β€’ Questions: οΏ½ 12

  36. Solution: Covariate st -Adjustment β€’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. = βˆ‘ P *( y | do ( x )) P ( y | do ( x ), z , S = 1) P *( z ) unbiased target 
 z experiment results 
 observations from 
 e ff ect in 𝛒 * in source domain 𝛒 the target domain 𝛒 * β€’ Questions: 1. How to determine if st-adjustment holds for a set of covariates Z ? οΏ½ 12

  37. Solution: Covariate st -Adjustment β€’ Strategy: Recalibrate the results from experiments in the the studied population using observations from the target population. = βˆ‘ P *( y | do ( x )) P ( y | do ( x ), z , S = 1) P *( z ) unbiased target 
 z experiment results 
 observations from 
 e ff ect in 𝛒 * in source domain 𝛒 the target domain 𝛒 * β€’ Questions: 1. How to determine if st-adjustment holds for a set of covariates Z ? 2. How to find admissible covariate sets? οΏ½ 12

  38. Challenge I. Covariate Admissibility οΏ½ 13

  39. Challenge I. Covariate Admissibility β€’ In general, adjusting for some variables that are a ff ected by the treatment could introduce more bias, instead of controlling for the current, existent ones. οΏ½ 13

  40. Challenge I. Covariate Admissibility β€’ In general, adjusting for some variables that are a ff ected by the treatment could introduce more bias, instead of controlling for the current, existent ones. β€’ In our setting, in particular, special attention needs to be paid to these covariates ( a ff ected by the treatment) that are correlated with the outcome given pre-treatment covariates. οΏ½ 13

  41. Challenge I. Covariate Admissibility β€’ In general, adjusting for some variables that are a ff ected by the treatment could introduce more bias, instead of controlling for the current, existent ones. β€’ In our setting, in particular, special attention needs to be paid to these covariates ( a ff ected by the treatment) that are correlated with the outcome given pre-treatment covariates. β€’ Let’s call this set Z p . οΏ½ 13

  42. Challenge I. Covariate Admissibility β€’ In general, adjusting for some variables that are a ff ected by the treatment could introduce more bias, instead of controlling for the current, existent ones. β€’ In our setting, in particular, special attention needs to be paid to these covariates ( a ff ected by the treatment) that are correlated with the outcome given pre-treatment covariates. T β€’ Let’s call this set Z p . Z 1 X Y Z 2 Z 3 S οΏ½ 13

  43. Challenge I. Covariate Admissibility β€’ In general, adjusting for some variables that are a ff ected by the treatment could introduce more bias, instead of controlling for the current, existent ones. β€’ In our setting, in particular, special attention needs to be paid to these covariates ( a ff ected by the treatment) that are correlated with the outcome given pre-treatment covariates. T β€’ Let’s call this set Z p . Z 1 X Y β€’ For example if adjusting for Z = {Z 1 , Z 2 , Z 3 } in this model Z 2 Z 3 Z p = {Z 3 }. S οΏ½ 13

  44. Main Result I: 
 Complete Graphical Condition οΏ½ 14

  45. Main Result I: 
 Complete Graphical Condition A set of covariates Z is admissible for st-adjustment in D relative to treatment X and outcome Y if: οΏ½ 14

  46. Main Result I: 
 Complete Graphical Condition A set of covariates Z is admissible for st-adjustment in D relative to treatment X and outcome Y if: (i) Variables in Z p are independent of the treatment given all other covariates, and οΏ½ 14

  47. Main Result I: 
 Complete Graphical Condition A set of covariates Z is admissible for st-adjustment in D relative to treatment X and outcome Y if: (i) Variables in Z p are independent of the treatment given all other covariates, and (ii) The outcome Y is independent of all the transportability ( T ) and selection bias nodes ( S ) given the covariates Z and the treatment X . οΏ½ 14

  48. Main Result I: 
 Complete Graphical Condition A set of covariates Z is admissible for st-adjustment in D relative to treatment X and outcome Y if: (i) Variables in Z p are independent of the treatment given all other covariates, and (ii) The outcome Y is independent of all the transportability ( T ) and selection bias nodes ( S ) given the covariates Z and the treatment X . Thm. The causal e ff ect P * (y | do(x)) is identifiable by st-adjustment on a set Z with D if and only if the conditions above hold for Z relative to X and Y . οΏ½ 14

  49. Understanding the criterion οΏ½ 15

  50. Understanding the criterion T Task: Compute P * (y | do(x)) Z 1 X Y Z 2 Z 3 S οΏ½ 15

  51. Understanding the criterion T T Task: Compute P * (y | do(x)) Z 1 Z 1 β€’ The outcome Y is a ff ected by di ff erences in the distribution of Z 1 X X Y Y Z 2 Z 2 Z 3 Z 3 between the source and target domains. S S οΏ½ 15

  52. Understanding the criterion T T T Task: Compute P * (y | do(x)) Z 1 Z 1 Z 1 β€’ The outcome Y is a ff ected by di ff erences in the distribution of Z 1 X X X Y Y Y Z 2 Z 2 Z 2 Z 3 Z 3 Z 3 between the source and target domains. S S S β€’ The variable Z 3 a ff ects the likelihood of units being sampled. οΏ½ 15

  53. Understanding the criterion T T T Task: Compute P * (y | do(x)) Z 1 Z 1 Z 1 β€’ The outcome Y is a ff ected by di ff erences in the distribution of Z 1 X X X Y Y Y Z 2 Z 2 Z 2 Z 3 Z 3 Z 3 between the source and target domains. S S S β€’ The variable Z 3 a ff ects the likelihood of units being sampled. T Z 1 β€’ If we adjust for Z 3 to control for selection bias, we introduce spurious correlation. Hence, we should also control for Z 2 . X Y Z 2 Z 3 S οΏ½ 15

  54. Understanding the criterion T T T Task: Compute P * (y | do(x)) Z 1 Z 1 Z 1 β€’ The outcome Y is a ff ected by di ff erences in the distribution of Z 1 X X X Y Y Y Z 2 Z 2 Z 2 Z 3 Z 3 Z 3 between the source and target domains. S S S β€’ The variable Z 3 a ff ects the likelihood of units being sampled. T T Z 1 Z 1 β€’ If we adjust for Z 3 to control for selection bias, we introduce spurious correlation. Hence, we should also control for Z 2 . X X Y Y Z 2 Z 2 Z 3 Z 3 S S οΏ½ 15

  55. Getting the intuition behind the rules 
 Example T Z 1 X Y Z 2 Z 3 S οΏ½ 16

  56. Getting the intuition behind the rules 
 Example By making Z ={ Z 1 , Z 2 , Z 3 }, we can verify the st-adjustment conditions, T i.e.: Z 1 X Y Z 2 Z 3 S οΏ½ 16

  57. Getting the intuition behind the rules 
 Example By making Z ={ Z 1 , Z 2 , Z 3 }, we can verify the st-adjustment conditions, T i.e.: Z 1 (i) The variable in Z p ={Z 3 } is independent of X given the other X Y covariates {Z 1 , Z 2 } . Z 2 Z 3 S οΏ½ 16

  58. Getting the intuition behind the rules 
 Example By making Z ={ Z 1 , Z 2 , Z 3 }, we can verify the st-adjustment conditions, T i.e.: Z 1 (i) The variable in Z p ={Z 3 } is independent of X given the other X Y covariates {Z 1 , Z 2 } . Z 2 Z 3 (ii) The outcome Y is independent of S and T given Z . S οΏ½ 16

  59. Getting the intuition behind the rules 
 Example By making Z ={ Z 1 , Z 2 , Z 3 }, we can verify the st-adjustment conditions, T i.e.: Z 1 (i) The variable in Z p ={Z 3 } is independent of X given the other X Y covariates {Z 1 , Z 2 } . Z 2 Z 3 (ii) The outcome Y is independent of S and T given Z . S Hence, the st-adjustment is guaranteed to hold, i.e.: P *( y | do ( x )) = βˆ‘ P ( y | do ( x ), z 1 , z 2 , z 3 , S = 1) P *( z 1 , z 2 , z 3 ) z 1 , z 2 , z 3 οΏ½ 16

  60. Getting the intuition behind the rules 
 Example By making Z ={ Z 1 , Z 2 , Z 3 }, we can verify the st-adjustment conditions, T i.e.: Z 1 (i) The variable in Z p ={Z 3 } is independent of X given the other X Y covariates {Z 1 , Z 2 } . Z 2 Z 3 (ii) The outcome Y is independent of S and T given Z . S Hence, the st-adjustment is guaranteed to hold, i.e.: P *( y | do ( x )) = βˆ‘ P ( y | do ( x ), z 1 , z 2 , z 3 , S = 1) P *( z 1 , z 2 , z 3 ) z 1 , z 2 , z 3 measurements from 
 causal e ff ect 
 experimental data from the 
 the target domain in target domain source under selection bias οΏ½ 16

  61. Challenge II. 
 Searching for Admissible Sets οΏ½ 17

  62. Challenge II. 
 Searching for Admissible Sets β€’ Given a candidate set Z, we have a condition to determine if it is admissible or not. οΏ½ 17

  63. Challenge II. 
 Searching for Admissible Sets β€’ Given a candidate set Z, we have a condition to determine if it is admissible or not. β€’ The natural question that follows is how to find an admissible set without resorting to trial and error. There could be exponentially many candidates (and even valid ones!). οΏ½ 17

  64. Challenge II. 
 Searching for Admissible Sets β€’ Given a candidate set Z, we have a condition to determine if it is admissible or not. β€’ The natural question that follows is how to find an admissible set without resorting to trial and error. There could be exponentially many candidates (and even valid ones!). β€’ How to determine the existence of at least one admissible set? οΏ½ 17

  65. Challenge II. 
 Searching for Admissible Sets β€’ Given a candidate set Z, we have a condition to determine if it is admissible or not. β€’ The natural question that follows is how to find an admissible set without resorting to trial and error. There could be exponentially many candidates (and even valid ones!). β€’ How to determine the existence of at least one admissible set? β€’ There are sets that could be preferred among other admissible ones due to certain properties (e.g., cost, variance). οΏ½ 17

  66. Main Result II: Listing Algorithm οΏ½ 18

  67. Main Result II: Listing Algorithm B E T X Y S Selection Diagram D οΏ½ 18

  68. Main Result II: Listing Algorithm B E T X Y S Selection Diagram D P ( v | do ( x ), S = 1) Selection-biased Exp. 
 Distribution from 𝛒 οΏ½ 18

  69. Main Result II: Listing Algorithm B E T X Y S Selection Diagram D P ( v | do ( x ), S = 1) Selection-biased Exp. 
 Distribution from 𝛒 Set W of covariates 
 measurable in 𝛒 * οΏ½ 18

  70. Main Result II: Listing Algorithm B E T X Y S Selection Diagram D What are all the admissible sets satisfying st-adjustment? P ( v | do ( x ), S = 1) Selection-biased Exp. 
 Distribution from 𝛒 Set W of covariates 
 measurable in 𝛒 * οΏ½ 18

  71. 
 Main Result II: Listing Algorithm B E T X Y S List of of sets 
 Selection Diagram D What are all the admissible sets Z 1 , Z 2 , … βŠ† W such that for 
 satisfying st-adjustment? P ( v | do ( x ), S = 1) each Z i : P *( y | do ( x )) = βˆ‘ P ( y | do ( x ), z i , S = 1) P *( z i ) Selection-biased Exp. 
 z i Distribution from 𝛒 Set W of covariates 
 measurable in 𝛒 * οΏ½ 18

Recommend


More recommend