Practical and Theoretical Advances for Inference in Partially - PowerPoint PPT Presentation

Practical and Theoretical Advances for Inference in Partially Identified Models by Azeem M. Shaikh, University of Chicago August 2015 amshaikh@uchicago.edu Collaborator: Ivan Canay, Northwestern University 1

Introduction Partially Identified Models : – Param. of interest is not uniquely determined by distr. of obs. data. – Instead, limited to a set as a function of distr. of obs. data. (i.e., the identified set) – Due largely to pioneering work by C. Manski, now ubiquitous. (many applications!) Inference in Partially Identified Models : – Focused mainly on the construction of confidence regions. – Most well-developed for moment inequalities. – Important practical issues remain subject of current research. 2

Outline of Talk 1. Definition of partially identified models 2. Confidence regions for partially identified models – Importance of uniform asymptotic validity 3. Moment inequalities – Common framework to describe five distinct approaches 4. Subvector inference for moment inequalities 5. More general framework – Unions of functional moment inequalities 3

Partially Identified Models Obs. data X ∼ P ∈ P = { P γ : γ ∈ Γ } . ( γ is possibly infinite-dim.) Identified set for γ : Γ 0 ( P ) = { γ ∈ Γ : P γ = P } . Typically, only interested in θ = θ ( γ ). Identified set for θ : Θ 0 ( P ) = { θ ( γ ) ∈ Θ : γ ∈ Γ 0 ( P ) } , where Θ = θ (Γ). 4

Partially Identified Models (cont.) θ is identified relative to P if Θ 0 ( P ) is a singleton for all P ∈ P . θ is unidentified relative to P if Θ 0 ( P ) = Θ for all P ∈ P . Otherwise, θ is partially identified relative to P . Θ 0 ( P ) has been characterized in many examples ... ... can often be characterized using moment inequalities. 5

Confidence Regions If θ is identified relative to P (so, θ = θ ( P )), then we require that lim inf n →∞ inf P ∈ P P { θ ( P ) ∈ C n } ≥ 1 − α . Now we require that lim inf n →∞ inf θ ∈ Θ 0 ( P ) P { θ ∈ C n } ≥ 1 − α . inf P ∈ P Refer to as conf. region for points in id. set unif. consistent in level. Remark : May also be interested in conf. regions for identified set itself: lim inf n →∞ inf P ∈ P P { Θ 0 ( P ) ⊆ C n } ≥ 1 − α . See Chernozkukov et al. (2007) and Romano & Shaikh (2010). 6

Confidence Regions (cont.) Unif. consistency in level vs. pointwise consistency in level, i.e., lim inf n →∞ P { θ ∈ C n } ≥ 1 − α for all P ∈ P and θ ∈ Θ 0 ( P ) . May be for every n there is P ∈ P and θ ∈ Θ 0 ( P ) with cov. prob. ≪ 1 − α . In well-behaved prob., distinction is entirely technical issue. (e.g., conf. regions for the univariate mean with i.i.d. data.) In less well-behaved prob., distinction is more important. (e.g., conf. regions in even simple partially id. models!) Some “natural” conf. reg. may need to restrict P in non-innocuous ways. (e.g., may need to assume model is “far” from identified.) See Imbens & Manski (2004). 7

Moment Inequalities Henceforth, W i , i = 1 , . . . , n are i.i.d. with common marg. distr. P ∈ P . Numerous ex. of partially identified models give rise to mom. ineq., i.e., Θ 0 ( P ) = { θ ∈ Θ : E P [ m ( W i , θ )] ≤ 0 } , where m takes values in R k . Goal : Conf. reg. for points in the id. set that are unif. consistent in level. Remark : Assume throughout mild uniform integrability condition ... ... ensures CLT and LLN hold unif. over P ∈ P and θ ∈ Θ 0 ( P ). 8

Moment Inequalities (cont.) How : Construct tests φ n ( θ ) of H θ : E P [ m ( W i , θ )] ≤ 0 that provide unif. asym. control of Type I error, i.e., lim sup n →∞ sup sup E P [ φ n ( θ )] ≤ α . P ∈ P θ ∈ Θ 0 ( P ) Given such φ n ( θ ), C n = { θ ∈ Θ : φ n ( θ ) = 0 } satisfies desired coverage property. Below describe five different tests, all of form φ n ( θ ) = I { T n ( θ ) > ˆ c n ( θ, 1 − α ) } . 9

Moment Inequalities (cont.) Some Notation : µ ( θ, P ) = E P [ m ( W i , θ )]. m n ( θ ) = sample mean of m ( W i , θ ). ¯ ˆ Ω n ( θ ) = sample correlation of m ( W i , θ ). σ 2 j ( θ, P ) = Var P [ m j ( W i , θ )]. σ 2 ˆ n,j ( θ ) = sample variance of m j ( W i , θ ). ˆ D n ( θ ) = diag(ˆ σ n, 1 ( θ ) , . . . , ˆ σ n,k ( θ )). 10

Moment Inequalities (cont.) Test Statistic : In all cases, n ( θ ) √ n ¯ T n ( θ ) = T ( ˆ m n ( θ ) , ˆ D − 1 Ω n ( θ )) for an appropriate choice of T ( x, V ), e.g., 1 ≤ j ≤ k max { x j , 0 } 2 – modified method of moments: � – maximum: max 1 ≤ j ≤ k max { x j , 0 } – quasi-likelihood ratio: inf t ≤ 0 ( x − t ) ′ V − 1 ( x − t ) Main requirement is that T weakly increasing in first argument. 11

Moment Inequalities (cont.) Critical Value : Useful to define � � T ( ˆ n ( θ ) Z n ( θ ) + ˆ n ( θ ) s ( θ ) , ˆ D − 1 D − 1 J n ( x, s ( θ ) , θ, P ) = P Ω n ( θ )) ≤ x , where Z n ( θ ) = √ n ( ¯ m n ( θ ) − µ ( θ, P )) , which is easy to estimate. On the other hand, J n ( x, √ nµ ( θ, P ) , θ, P ) = P { T n ( θ ) ≤ x } is difficult to estimate. See, e.g., Andrews (2000). Indeed, not even possible to estimate √ nµ ( θ, P ) consistently! Five diff. tests distinguished by how they circumvent this problem. 12

Moment Inequalities (cont.) Test #1: Least Favorable Tests : Main Idea : √ nµ ( θ, P ) ≤ 0 for any P ∈ P and θ ∈ Θ 0 ( P ) n (1 − α, √ nµ ( θ, P ) , θ, P ) ≤ J − 1 ⇒ J − 1 = n (1 − α, 0 , θ, P ) . Choosing c n (1 − α, θ ) = estimate of J − 1 ˆ n (1 − α, 0 , θ, P ) therefore leads to valid tests. See Rosen (2008) and Andrews & Guggenberger (2009). Closely related work by Kudo (1963) and Wolak (1987, 1991). 13

Moment Inequalities (cont.) Test #1: Least Favorable Tests (cont.) : Remark : Deemed “conservative,” but criticism not entirely fair: – In Gaussian setting, these tests are ( α - and d -) admissible. – Some are even maximin optimal among restricted class of tests. – See Lehmann (1952) and Romano & Shaikh (unpublished). Nevertheless, unattractive: – Tend to have best power against alternatives with all moments > 0. – As θ varies, many alternatives with only some moments > 0. – May therefore not lead to smallest confidence regions. Following tests incorporate info. about √ nµ ( θ, P ) in some way. = ⇒ better power against such alternatives. 14

Moment Inequalities (cont.) Test #2: Subsampling : See Politis & Romano (1994). Main Idea : Fix b = b n < n with b → ∞ and b/n → 0. � n � Compute T n ( θ ) on each of subsamples of data. b Denote by L n ( x, θ ) the empirical distr. of these quantities. Use L n ( x, θ ) as estimate of distr. of T n ( θ ), i.e., J n ( x, √ nµ ( θ, P ) , θ, P ) . Choosing c n (1 − α, θ ) = L − 1 ˆ n (1 − α, θ ) leads to valid tests. See Romano & Shaikh (2008) and Andrews & Guggenberger (2009). 15

Moment Inequalities (cont.) Test #2: Subsampling (cont.) : Why : L n ( x, θ ) is a “good” estimate of distr. of T b ( θ ), i.e., √ J b ( x, bµ ( θ, P ) , θ, P ) . See general results in Romano & Shaikh (2012). Moreover, √ √ nµ ( θ, P ) ≤ bµ ( θ, P ) for any P ∈ P and θ ∈ Θ 0 ( P ) √ n (1 − α, √ nµ ( θ, P ) , θ, P ) ≤ J − 1 ⇒ J − 1 = n (1 − α, bµ ( θ, P ) , θ, P ) . Desired conclusion follows. Remark : Incorporates information about √ nµ ( θ, P ) ... ... but remains unattractive because choice of b problematic. 16

Moment Inequalities (cont.) Test #3: Generalized Moment Selection : See Andrews & Soares (2010). Main Idea : Perhaps possible to estimate √ nµ ( θ, P ) “well enough”? n,k ( θ )) ′ with s gms s gms s gms Consider, e.g., ˆ ( θ ) = (ˆ n, 1 ( θ ) , . . . , ˆ n √ n ¯  m n,j ( θ ) 0 if > − κ n  s gms σ n,j ( θ ) ˆ ˆ n,j ( θ ) = , −∞ otherwise  where 0 < κ n → ∞ and κ n / √ n → 0. Choosing c n (1 − α, θ ) = estimate of J − 1 s gms ˆ n (1 − α, ˆ ( θ ) , θ, P ) n leads to valid tests. 17

Moment Inequalities (cont.) Test #3: Generalized Moment Selection (cont.) : Why : For any sequence P n ∈ P and θ n ∈ Θ 0 ( P n ) if √ nµ j ( θ n , P n ) → c ≤ 0  0  s gms ˆ n,j ( θ n ) = w.p.a.1 . if √ nµ j ( θ n , P n ) → −∞ −∞  ( θ ) provides an asymp. upper bound on √ nµ ( θ, P ). s gms In this sense, ˆ n Remark : Also incorporates information about √ nµ ( θ, P ) ... ... and, for typical κ n and b , more powerful than subsampling. Main drawback is choice of κ n : – In finite-samples, smaller choice always more powerful. – First- and higher-order properties do not depend on κ n . See Bugni (2014). – Precludes data-dependent rules for choosing κ n . 18

Practical and Theoretical Advances for Inference in Partially - PowerPoint PPT Presentation

Practical and Theoretical Advances for Inference in Partially Identified Models by Azeem M. Shaikh, University of Chicago August 2015 amshaikh@uchicago.edu Collaborator: Ivan Canay, Northwestern University 1 Introduction Partially

Inference in Bayesian networks Chapter 14.45 Chapter 14.45 1 Outline Exact inference

Practical Experience with Practical Experience with Practical Experience with Practical

Advances in QCD Theory Christopher Lee Los Alamos National Laboratory Theoretical Division

Change from a Practical Perspective Change from a Practical Perspective Change from a Practical

Soft Inference and Posterior Marginals September 19, 2013 Soft vs. Hard Inference Hard

Advances in Chemo-RT for cervical and Advances in Chemo-RT for cervical and Head & Neck

Post-Selection Inference Todd Kuffner Washington University in St. Louis PhyStat 2016

Type Inference 75 Definition Type Inference Type inference = Java compiler's ability

Inference in Bayesian networks Chapter 14.45 Chapter 14.45 1 Outline Exact inference

Exact Inference Inference Basic task for inference: Compute

GAME THEORETICAL INFERENCE OF HUMAN BEHAVIOR IN SOCIAL NETWORKS Symposium on Resilience and

Theoretical physics and theoretical astrophysics John Campbell 2015 Institutional Review

Software Re-engineering - Theoretical and Practical Approaches By Daniel Kinneryd Software

Practical Software Design & Style Practical Software Design & Style | 15 Sep 2017 | 1/25

CSpace CSpace CSpace CSpace A More Practical and A More Practical and A

Real-World applications of Boosting Yoav Freund UCSD Practical Advantages of AdaBoost

Da Data ta Re Requirements quirements Lothar Linde ADB GMS Environment Operations Center

Whats New in Alibabas X-DB SQL Engine Min Qiu, Alibaba Group Santa Clara, California |

Benefits for All Overview History What is Simplify? How Data Sharing Works

Strategies for Policing Innovation 2018 New Site Orientation Webinar January 15, 2019

Post-Zeroizing Obfuscation New Mathematical Tools and the Case of

Standards and Interoperability Lab - Asia 1604 Asian Development Bank HQ, ADB Avenue, Ortigas CBD,

Graphical Causal Models for Time Series Econometrics: Some Recent Developments and Applications

Solving Energy System Models with GAMS on HPC Platforms Michael R. Bussieck GAMS Development

Practical and Theoretical Advances for Inference in Partially - PowerPoint PPT Presentation

Practical and Theoretical Advances for Inference in Partially Identified Models by Azeem M. Shaikh, University of Chicago August 2015 amshaikh@uchicago.edu Collaborator: Ivan Canay, Northwestern University 1 Introduction Partially

Inference in Bayesian networks Chapter 14.45 Chapter 14.45 1 Outline Exact inference

Practical Experience with Practical Experience with Practical Experience with Practical

Advances in QCD Theory Christopher Lee Los Alamos National Laboratory Theoretical Division

Change from a Practical Perspective Change from a Practical Perspective Change from a Practical

Soft Inference and Posterior Marginals September 19, 2013 Soft vs. Hard Inference Hard

Advances in Chemo-RT for cervical and Advances in Chemo-RT for cervical and Head &amp; Neck

Post-Selection Inference Todd Kuffner Washington University in St. Louis PhyStat 2016

Type Inference 75 Definition Type Inference Type inference = Java compiler's ability

Inference in Bayesian networks Chapter 14.45 Chapter 14.45 1 Outline Exact inference

Exact Inference Inference Basic task for inference: Compute

GAME THEORETICAL INFERENCE OF HUMAN BEHAVIOR IN SOCIAL NETWORKS Symposium on Resilience and

Theoretical physics and theoretical astrophysics John Campbell 2015 Institutional Review

Software Re-engineering - Theoretical and Practical Approaches By Daniel Kinneryd Software

Practical Software Design &amp; Style Practical Software Design &amp; Style | 15 Sep 2017 | 1/25

CSpace CSpace CSpace CSpace A More Practical and A More Practical and A

Real-World applications of Boosting Yoav Freund UCSD Practical Advantages of AdaBoost

Da Data ta Re Requirements quirements Lothar Linde ADB GMS Environment Operations Center

Whats New in Alibabas X-DB SQL Engine Min Qiu, Alibaba Group Santa Clara, California |

Benefits for All Overview History What is Simplify? How Data Sharing Works

Strategies for Policing Innovation 2018 New Site Orientation Webinar January 15, 2019

Post-Zeroizing Obfuscation New Mathematical Tools and the Case of

Standards and Interoperability Lab - Asia 1604 Asian Development Bank HQ, ADB Avenue, Ortigas CBD,

Graphical Causal Models for Time Series Econometrics: Some Recent Developments and Applications

Solving Energy System Models with GAMS on HPC Platforms Michael R. Bussieck GAMS Development

Advances in Chemo-RT for cervical and Advances in Chemo-RT for cervical and Head & Neck

Practical Software Design & Style Practical Software Design & Style | 15 Sep 2017 | 1/25