Imputation procedures in surveys using nonparametric and machine learning methods: an empirical comparison

07/13/2020
by Mehdi Dagdoug, et al.

Nonparametric and machine learning methods are flexible methods for obtaining accurate predictions. Nowadays, data sets with a large number of predictors and complex structures are fairly common. In the presence of item nonresponse, nonparametric and machine learning procedures may thus provide a useful alternative to traditional imputation procedures for deriving a set of imputed values. In this paper, we conduct an extensive empirical investigation that compares a number of imputation procedures in terms of bias and efficiency in a wide variety of settings, including high-dimensional data sets. The results suggest that a number of machine learning procedures perform very well in terms of bias and efficiency.
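To make "bias and efficiency" concrete, here is a minimal sketch of the kind of Monte Carlo comparison the abstract describes. It is not the authors' simulation design: the population model, the response mechanism, the sample sizes, and the function name `simulate` are all assumptions made for illustration, and ordinary linear regression imputation stands in for the nonparametric and machine learning procedures studied in the paper.

```python
import numpy as np


def simulate(n_rep=500, pop_size=10_000, n=500, seed=0):
    """Compare two imputation procedures by Monte Carlo (illustrative only)."""
    rng = np.random.default_rng(seed)

    # Hypothetical finite population: y depends linearly on one predictor x.
    x = rng.uniform(0.0, 1.0, pop_size)
    y = 2.0 + 3.0 * x + rng.normal(0.0, 0.5, pop_size)
    true_mean = y.mean()

    estimates = {"mean": [], "regression": []}
    for _ in range(n_rep):
        # Simple random sample without replacement.
        idx = rng.choice(pop_size, size=n, replace=False)
        xs, ys = x[idx], y[idx]

        # Assumed response mechanism: response probability increases with x,
        # so respondents are not representative of the sample.
        resp = rng.uniform(size=n) < 0.3 + 0.6 * xs

        # Procedure 1: impute the respondent mean for every missing value.
        y_mean_imp = np.where(resp, ys, ys[resp].mean())

        # Procedure 2: impute predictions from a line fitted on respondents.
        slope, intercept = np.polyfit(xs[resp], ys[resp], 1)
        y_reg_imp = np.where(resp, ys, intercept + slope * xs)

        # Imputed estimator of the population mean under each procedure.
        estimates["mean"].append(y_mean_imp.mean())
        estimates["regression"].append(y_reg_imp.mean())

    summary = {}
    for name, values in estimates.items():
        values = np.asarray(values)
        summary[name] = {
            # Monte Carlo relative bias, in percent.
            "rel_bias_pct": 100.0 * (values.mean() - true_mean) / true_mean,
            # Monte Carlo mean squared error (used to compare efficiency).
            "mse": float(np.mean((values - true_mean) ** 2)),
        }
    return summary


if __name__ == "__main__":
    for name, stats in simulate().items():
        print(name, stats)
```

Under these assumptions, mean imputation ignores the link between x and the response probability, so its estimator drifts upward, while the regression-based procedure uses x and stays close to the true mean. Dividing one procedure's MSE by another's gives a relative efficiency measure.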

