Overlap in Observational Studies with High-Dimensional Covariates

11/07/2017
by   Alexander D'Amour, et al.
0

Causal inference in observational settings typically rests on a pair of identifying assumptions: (1) unconfoundedness and (2) covariate overlap, also known as positivity or common support. Investigators often argue that unconfoundedness is more plausible when many covariates are included in the analysis. Less discussed is the fact that covariate overlap is more difficult to satisfy in this setting. In this paper, we explore the implications of overlap in high-dimensional observational studies, arguing that this assumption is stronger than investigators likely realize. Our main innovation is to frame (strict) overlap in terms of bounds on a likelihood ratio, which allows us to leverage and expand on existing results from information theory. In particular, we show that strict overlap bounds discriminating information (e.g., Kullback-Leibler divergence) between the covariate distributions in the treated and control populations. We use these results to derive explicit bounds on the average imbalance in covariate means under strict overlap and a range of assumptions on the covariate distributions. Importantly, these bounds grow tighter as the dimension grows large, and converge to zero in some cases. We examine how restrictions on the treatment assignment and outcome processes can weaken the implications of certain overlap assumptions, but at the cost of stronger requirements for unconfoundedness. Taken together, our results suggest that adjusting for high-dimensional covariates does not necessarily make causal identification more plausible.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/09/2018

A note on strict functional covariate overlap in causal inference problems with high-dimensional covariates

A powerful tool for the analysis of nonrandomized observational studies ...
research
10/19/2021

Addressing Positivity Violations in Causal Effect Estimation using Gaussian Process Priors

In observational studies, causal inference relies on several key identif...
research
11/30/2021

Contrasting Identifying Assumptions of Average Causal Effects: Robustness and Semiparametric Efficiency

Semiparametric inference about average causal effects from observational...
research
03/15/2022

Distributed Design for Causal Inferences on Big Observational Data

A fundamental issue in causal inference for Big Observational Data is co...
research
04/28/2022

Mahalanobis balancing: a multivariate perspective on approximate covariate balancing

In the past decade, various exact balancing-based weighting methods were...
research
05/09/2018

Comparing Covariate Prioritization via Matching to Machine Learning Methods for Causal Inference using Five Empirical Applications

Matching methods have become one frequently used method for statistical ...
research
07/01/2020

Identifying Causal Effect Inference Failure with Uncertainty-Aware Models

Recommending the best course of action for an individual is a major appl...

Please sign up or login with your details

Forgot password? Click here to reset