Comparing Covariate Prioritization via Matching to Machine Learning Methods for Causal Inference using Five Empirical Applications
Matching methods have become one frequently used method for statistical adjustment under a selection on observables identification strategy. Matching methods typically focus on modeling the treatment assignment process rather than the outcome. Many of the recent advances in matching allow for various forms of covariate prioritization. This allows analysts to emphasize the adjustment of some covariates over others, typically based on subject matter expertise. While flexible machine learning methods have a long history of being used for statistical prediction, they have generally seen little use in causal modeliing. However, recent work has developed flexible machine learning methods based on outcome models for the esimation of causal effects. These methods are designed to use little analyst input. All covariate prioritization is done by the learner. In this study, we replicate five published studies that used customized matching methods for covariate prioritization. In each of these studies, subsets of covariates were given priority in the match based on substantive expertise. We replicate these studies using three different machine learning methods that have designed for causal modeling. We find that in almost every case matching and machine learning methods produce identical results. We conclude by discussing the implications for applied analysts.
READ FULL TEXT