JointAI: Joint Analysis and Imputation of Incomplete Data in R

by   Nicole S. Erler, et al.

Missing data occur in many types of studies and typically complicate the analysis. Multiple imputation, either using joint modelling or the more flexible fully conditional specification approach, are popular and work well in standard settings. In settings involving non-linear associations or interactions, however, incompatibility of the imputation model with the analysis model is an issue often resulting in bias. Similarly, complex outcomes such as longitudinal or survival outcomes cannot be adequately handled by standard implementations. In this paper, we introduce the R package JointAI, which utilizes the Bayesian framework to perform simultaneous analysis and imputation in regression models with incomplete covariates. Using a fully Bayesian joint modelling approach it overcomes the issue of uncongeniality while retaining the attractive flexibility of fully conditional specification multiple imputation by specifying the joint distribution of analysis and imputation models as a sequence of univariate models that can be adapted to the type of variable. JointAI provides functions for Bayesian inference with generalized linear and generalized linear mixed models as well as survival models, that take arguments analogous to their corresponding and well known complete data versions from base R and other packages. Usage and features of JointAI are described and illustrated using various examples and the theoretical background is outlined.


page 10

page 12

page 13

page 37

page 38


Remiod: Reference-based Controlled Multiple Imputation of Longitudinal Binary and Ordinal Outcomes with non-ignorable missingness

Missing data on response variables are common in clinical studies. Corre...

Joint distribution properties of Fully Conditional Specification under the normal linear model with normal inverse-gamma priors

Fully conditional specification (FCS) is a convenient and flexible multi...

gcimpute: A Package for Missing Data Imputation

This article introduces the Python package gcimpute for missing data imp...

Multiple Imputation: A Review of Practical and Theoretical Findings

Multiple imputation is a straightforward method for handling missing dat...

Comparison of different Methods for Univariate Time Series Imputation in R

Missing values in datasets are a well-known problem and there are quite ...

Joint data imputation and mechanistic modelling for simulating heart-brain interactions in incomplete datasets

The use of mechanistic models in clinical studies is limited by the lack...

merlin - a unified modelling framework for data analysis and methods development in Stata

merlin can do a lot of things. From simple stuff, like fitting a linear ...

Please sign up or login with your details

Forgot password? Click here to reset