MIRACLE: Causally-Aware Imputation via Learning Missing Data Mechanisms

11/04/2021
by   Trent Kyono, et al.
25

Missing data is an important problem in machine learning practice. Starting from the premise that imputation methods should preserve the causal structure of the data, we develop a regularization scheme that encourages any baseline imputation method to be causally consistent with the underlying data generating mechanism. Our proposal is a causally-aware imputation algorithm (MIRACLE). MIRACLE iteratively refines the imputation of a baseline by simultaneously modeling the missingness generating mechanism, encouraging imputation to be consistent with the causal structure of the data. We conduct extensive experiments on synthetic and a variety of publicly available datasets to show that MIRACLE is able to consistently improve imputation over a variety of benchmark methods across all three missingness scenarios: at random, completely at random, and not at random.

READ FULL TEXT

page 9

page 20

research
10/05/2022

Dimensional Data KNN-Based Imputation

Data Warehouses (DWs) are core components of Business Intelligence (BI)....
research
02/28/2022

Missing Value Estimation using Clustering and Deep Learning within Multiple Imputation Framework

Missing values in tabular data restrict the use and performance of machi...
research
06/07/2021

Proper Scoring Rules for Missing Value Imputation

Given the prevalence of missing data in modern statistical research, a b...
research
04/10/2023

Missing Data Imputation with Graph Laplacian Pyramid Network

Data imputation is a prevalent and important task due to the ubiquitousn...
research
08/13/2022

GEDI: A Graph-based End-to-end Data Imputation Framework

Data imputation is an effective way to handle missing data, which is com...
research
11/05/2022

Towards a methodology for addressing missingness in datasets, with an application to demographic health datasets

Missing data is a common concern in health datasets, and its impact on g...
research
01/12/2018

Multiple Imputation: A Review of Practical and Theoretical Findings

Multiple imputation is a straightforward method for handling missing dat...

Please sign up or login with your details

Forgot password? Click here to reset