

Mar 04, 2023


SLIDE 1

Learning Models from Data with Measurement Error: Tackling Underreporting

Roy Adams, Yuelong Ji, Xiaobin Wang, and Suchi Saria

SLIDE 2

Introduction

Goal: Estimate the distribution of outcome Y given exposure A and covariates X from non-experimental data.

SLIDE 3

Introduction

Measurement error is a common source of bias when using non-experimental data.

Goal: Estimate the distribution of outcome Y given exposure A and covariates X from non-experimental data.

SLIDE 4

Introduction

Measurement error is a common source of bias when using non-experimental data.

We focus on underreporting error.

Goal: Estimate the distribution of outcome Y given exposure A and covariates X from non-experimental data.

SLIDE 5

Introduction

Measurement error is a common source of bias when using non-experimental data.

We focus on underreporting error.
E.g., survey data on sensitive variables such as drug use.

Goal: Estimate the distribution of outcome Y given exposure A and covariates X from non-experimental data.

SLIDE 6

Model

[Figure: graphical model over the variables A, X, Y, and Ã]

Updated goal: Estimate the distribution of outcome Y given exposure A and covariates X when exposure observations Ã are subject to underreporting errors.

SLIDE 7

Model

[Figure: graphical model over the variables A, X, Y, and Ã]

Updated goal: Estimate the distribution of outcome Y given exposure A and covariates X when exposure observations Ã are subject to underreporting errors.

Assumptions:

SLIDE 8

Model

[Figure: graphical model over the variables A, X, Y, and Ã]

Updated goal: Estimate the distribution of outcome Y given exposure A and covariates X when exposure observations Ã are subject to underreporting errors.

Assumptions:

1. Strict underreporting (A = 0 ⟹ Ã = 0)

SLIDE 9

Model

[Figure: graphical model over the variables A, X, Y, and Ã]

Updated goal: Estimate the distribution of outcome Y given exposure A and covariates X when exposure observations Ã are subject to underreporting errors.

Assumptions:

1. Strict underreporting (A = 0 ⟹ Ã = 0)
2. Ã is independent of X given A
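The two assumptions can be illustrated with a small simulation. The sketch below is not from the slides: it assumes binary A, Ã, and Y, a single Gaussian covariate X, logistic forms for the exposure and outcome, and arbitrary coefficient values. The function name `simulate_underreported` and the parameter `report_rate` are hypothetical.

```python
import numpy as np


def sigmoid(z):
    """Logistic function, used here for both illustrative models."""
    return 1.0 / (1.0 + np.exp(-z))


def simulate_underreported(n, report_rate=0.6, seed=0):
    """Draw (x, a, a_obs, y) where a_obs underreports the true exposure a.

    All functional forms and coefficients are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    x = rng.normal(size=n)                        # covariate X
    a = rng.binomial(1, sigmoid(0.5 * x - 0.2))   # true exposure A depends on X
    # A true exposure is reported with probability report_rate.
    # The draw does not use x, so Ã is independent of X given A (assumption 2).
    reported = rng.binomial(1, report_rate, size=n)
    # Multiplying by a forces Ã = 0 whenever A = 0 (assumption 1).
    a_obs = a * reported
    y = rng.binomial(1, sigmoid(-1.0 + 1.5 * a + 0.8 * x))  # outcome Y depends on A and X
    return x, a, a_obs, y


x, a, a_obs, y = simulate_underreported(5000)
print("true exposure rate:    ", a.mean())
print("reported exposure rate:", a_obs.mean())
```

In this setup the reported exposure rate falls below the true rate, which is the sense in which treating Ã as if it were A would bias any downstream estimate.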

SLIDE 10

Model

[Figure: graphical model over the variables A, X, Y, and Ã]

Outcome model: $p_\theta(Y \mid A, X)$
Exposure model: $p_\phi(A \mid X)$
Error model: $p_\tau(\tilde{A} \mid A)$
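As a worked reading of the three components, the block below writes out the joint distribution they imply and what strict underreporting does to the error model. It is a sketch under stated assumptions: A and Ã are taken to be binary, Ã is taken to depend only on A (as the form of the error model suggests), and reusing τ as the scalar reporting probability is a notational choice made here, not something the slides state.

```latex
% Joint distribution implied by the outcome, exposure, and error models
p(Y, A, \tilde{A} \mid X)
  = p_\theta(Y \mid A, X)\, p_\phi(A \mid X)\, p_\tau(\tilde{A} \mid A)

% Strict underreporting (A = 0 \implies \tilde{A} = 0) fixes one row of the
% error model, leaving a single free parameter when A is binary
p_\tau(\tilde{A} = 1 \mid A = 0) = 0, \qquad
p_\tau(\tilde{A} = 1 \mid A = 1) = \tau
```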

SLIDE 11

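Model

[Figure: graphical model over the variables A, X, Y, and Ã]

Outcome model: $p_\theta(Y \mid A, X)$
Exposure model: $p_\phi(A \mid X)$
Error model: $p_\tau(\tilde{A} \mid A)$

Maximize the log marginal likelihood:

$$\max_{\theta,\phi,\tau} \sum_i \log \left( \sum_a p_\theta(y_i \mid a, x_i)\, p_\tau(\tilde{a}_i \mid a)\, p_\phi(a \mid x_i) \right)$$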
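To make the log marginal likelihood concrete, here is a minimal sketch that evaluates it for binary A, Ã, and Y by summing over the two possible values of the unobserved exposure. The logistic parameterizations, the scalar covariate, and the names `log_marginal_likelihood` and `bernoulli_pmf` are assumptions for illustration, not the authors' implementation.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def bernoulli_pmf(value, prob_one):
    """P(V = value) for a binary V with P(V = 1) = prob_one."""
    return np.where(value == 1, prob_one, 1.0 - prob_one)


def log_marginal_likelihood(theta, phi, tau, x, a_obs, y):
    """Sum over i of log sum_a p(y_i | a, x_i) p(a_obs_i | a) p(a | x_i).

    Assumed parameterization (illustrative only):
      theta = (b0, b_a, b_x): logistic outcome model p(Y = 1 | A, X)
      phi   = (c0, c_x):      logistic exposure model p(A = 1 | X)
      tau   = P(a_obs = 1 | A = 1), with P(a_obs = 1 | A = 0) = 0
    """
    marginal = np.zeros(len(x))
    for a in (0, 1):  # marginalize over the unobserved true exposure
        p_y = bernoulli_pmf(y, sigmoid(theta[0] + theta[1] * a + theta[2] * x))
        p_a = bernoulli_pmf(a, sigmoid(phi[0] + phi[1] * x))
        # tau * a is 0 when a = 0, which encodes strict underreporting
        p_obs = bernoulli_pmf(a_obs, tau * a)
        marginal += p_y * p_obs * p_a
    return np.sum(np.log(marginal))


# Tiny example: one record with a reported exposure, one without
x = np.array([0.0, 0.0])
a_obs = np.array([1, 0])
y = np.array([1, 1])
print(log_marginal_likelihood((0.0, 0.0, 0.0), (0.0, 0.0), 0.5, x, a_obs, y))
```

For records with a reported exposure the a = 0 term vanishes, so only records with Ã = 0 require a genuine sum over both exposure values. One way to carry out the maximization would be to pass the negative of this function to a general-purpose optimizer such as `scipy.optimize.minimize`, keeping tau strictly between 0 and 1.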

SLIDE 12