learning models from data with measurement error tackling
play

Learning Models from Data with Measurement Error: Tackling - PowerPoint PPT Presentation

Learning Models from Data with Measurement Error: Tackling Underreporting Roy Adams, Yuelong Ji, Xiaobin Wang, and Suchi Saria Introduction Goal: Estimate the distribution of outcome Y given exposure A and covariates X from non-experimental data.


  1. Learning Models from Data with Measurement Error: Tackling Underreporting Roy Adams, Yuelong Ji, Xiaobin Wang, and Suchi Saria

  2. Introduction Goal: Estimate the distribution of outcome Y given exposure A and covariates X from non-experimental data.

  3. Introduction Goal: Estimate the distribution of outcome Y given exposure A and covariates X from non-experimental data. Measurement error is common source of bias when using non- experimental data.

  4. Introduction Goal: Estimate the distribution of outcome Y given exposure A and covariates X from non-experimental data. Measurement error is common source of bias when using non- experimental data. • We focus on underreporting error.

  5. Introduction Goal: Estimate the distribution of outcome Y given exposure A and covariates X from non-experimental data. Measurement error is common source of bias when using non- experimental data. • We focus on underreporting error. • E.g. survey data of sensitive variables such as drug use.

  6. Model Model Updated goal: Estimate the distribution of outcome Y given exposure A and covariates X when exposure observations à are subject to underreporting errors . X à A Y

  7. Model Model Updated goal: Estimate the distribution of outcome Y given exposure A and covariates X when exposure observations à are subject to underreporting errors . Assumptions: X à A Y

  8. Model Model Updated goal: Estimate the distribution of outcome Y given exposure A and covariates X when exposure observations à are subject to underreporting errors . Assumptions: 1. Strict underreporting ( A = 0 ⟹ à = 0 ) X à A Y

  9. Model Model Updated goal: Estimate the distribution of outcome Y given exposure A and covariates X when exposure observations à are subject to underreporting errors . Assumptions: 1. Strict underreporting ( A = 0 ⟹ à = 0 ) X 2. à is independent of X given A à A Y

  10. Model Model Outcome model … p 𝜄 (Y | A, X) X Exposure model … p 𝜚 (A | X) Error model ……… p 𝜐 (Ã | A) Ã A Y )

  11. Model Model Outcome model … p 𝜄 (Y | A, X) X Exposure model … p 𝜚 (A | X) Error model ……… p 𝜐 (Ã | A) Ã A Y ) Maximize the log marginal likelihood : θ , ϕ , τ ∑ log ∑ p θ ( y i | a , x i ) p τ ( ˜ a i | a ) p ϕ ( a | x i ) max i a

  12. Identifiability Identifiability

  13. Identifiability Identifiability We prove three separate identifiability conditions:

  14. Identifiability Identifiability We prove three separate identifiability conditions: 1. The error distribution is known

  15. Identifiability Identifiability We prove three separate identifiability conditions: 1. The error distribution is known 2. We have a second error-prone exposure observation

  16. Identifiability Identifiability We prove three separate identifiability conditions: 1. The error distribution is known 2. We have a second error-prone exposure observation 3. Under assumptions about the form of the exposure distribution (see paper/poster for details)

  17. Identifiability Identifiability We prove three separate identifiability conditions: 1. The error distribution is known 2. We have a second error-prone exposure observation 3. Under assumptions about the form of the exposure distribution (see paper/poster for details) In particular: If X is not independent of A and p(A | X) is a logit, probit, or cloglog regression model , then p(Y, Ã | X) is identifiable.

  18. Maternal drug use and childhood obesity Drug use and childhood obesity 6mRking (a) 0.25 sensitivity analysis subject-reSRrt 5isk difference Average causal e ff ect 0.20 subject-reSRrt + cRtinine 0.15 0.10 0.05 0.00 0.0 0.1 0.2 0.3 0.4 0.5 0.6 τ Underreporting rate

  19. Thanks! Drug use and childhood obesity Come see poster #75

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend