Causal Inference in High Dimensions – Without Sparsity

03/23/2022
by   Steve Yadlowsky, et al.
0

We revisit the classical causal inference problem of estimating the average treatment effect in the presence of fully observed, high dimensional confounding variables, where the number of confounders d is of the same order as the sample size n. To make the problem tractable, we posit a generalized linear model for the effect of the confounders on the treatment assignment and outcomes, but do not assume any sparsity. Instead, we only require the magnitude of confounding to remain non-degenerate. Despite making parametric assumptions, this setting is a useful surrogate for some machine learning methods used to adjust for confounding in two-stage methods. In particular, the estimation of the first stage adds variance that does not vanish, forcing us to confront terms in the asymptotic expansion that normally are brushed aside as finite sample defects. Using new theoretical results and simulations, we compare G-computation, inverse propensity weighting (IPW), and two common doubly robust estimators – augmented IPW (AIPW) and targeted maximum likelihood estimation (TMLE). When the outcome model estimates are unbiased, G-computation outperforms the other estimators in both bias and variance. Among the doubly robust estimators, the TMLE estimator has the lowest variance. Existing theoretical results do not explain this advantage because the TMLE and AIPW estimators have the same asymptotic influence function. However, our model emphasizes differences in performance between these estimators beyond first-order asymptotics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/21/2020

Machine learning for causal inference: on the use of cross-fit estimators

Modern causal inference methods allow machine learning to be used to wea...
research
03/23/2018

Robust semiparametric estimators: missing data and causal inference

Semiparametric inference with missing outcome data (including causal inf...
research
05/20/2022

A New Central Limit Theorem for the Augmented IPW Estimator: Variance Inflation, Cross-Fit Covariance and Beyond

Estimation of the average treatment effect (ATE) is a central problem in...
research
06/14/2021

Adaptive normalization for IPW estimation

Inverse probability weighting (IPW) is a general tool in survey sampling...
research
09/11/2023

Inverse probability of treatment weighting with generalized linear outcome models for doubly robust estimation

There are now many options for doubly robust estimation; however, there ...
research
12/03/2019

Confounding Adjustment Methods for Multi-level Treatment Comparisons Under Lack of Positivity and Unknown Model Specification

Imbalances in covariates between treatment groups are frequent in observ...
research
02/25/2022

Off-Policy Evaluation with Policy-Dependent Optimization Response

The intersection of causal inference and machine learning for decision-m...

Please sign up or login with your details

Forgot password? Click here to reset