UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

Multiple imputation methods for handling incomplete longitudinal and clustered data where the target analysis is a linear mixed effects model

Huque, MH; Moreno-Betancur, M; Quartagno, M; Simpson, JA; Carlin, JB; Lee, KJ; (2020) Multiple imputation methods for handling incomplete longitudinal and clustered data where the target analysis is a linear mixed effects model. Biometrical Journal , 62 (2) , Article ISCB ASC 2018. 10.1002/bimj.201900051. Green open access

[thumbnail of Quartagno_multiple imputation for random effect model.pdf]
Preview
Text
Quartagno_multiple imputation for random effect model.pdf - Accepted Version

Download (309kB) | Preview

Abstract

Multiple imputation (MI) is increasingly popular for handling multivariate missing data. Two general approaches are available in standard computer packages: MI based on the posterior distribution of incomplete variables under a multivariate (joint) model, and fully conditional specification (FCS), which imputes missing values using univariate conditional distributions for each incomplete variable given all the others, cycling iteratively through the univariate imputation models. In the context of longitudinal or clustered data, it is not clear whether these approaches result in consistent estimates of regression coefficient and variance component parameters when the analysis model of interest is a linear mixed effects model (LMM) that includes both random intercepts and slopes with either covariates or both covariates and outcome contain missing information. In the current paper, we compared the performance of seven different MI methods for handling missing values in longitudinal and clustered data in the context of fitting LMMs with both random intercepts and slopes. We study the theoretical compatibility between specific imputation models fitted under each of these approaches and the LMM, and also conduct simulation studies in both the longitudinal and clustered data settings. Simulations were motivated by analyses of the association between body mass index (BMI) and quality of life (QoL) in the Longitudinal Study of Australian Children (LSAC). Our findings showed that the relative performance of MI methods vary according to whether the incomplete covariate has fixed or random effects and whether there is missingnesss in the outcome variable. We showed that compatible imputation and analysis models resulted in consistent estimation of both regression parameters and variance components via simulation. We illustrate our findings with the analysis of LSAC data.

Type: Article
Title: Multiple imputation methods for handling incomplete longitudinal and clustered data where the target analysis is a linear mixed effects model
Open access status: An open access version is available from UCL Discovery
DOI: 10.1002/bimj.201900051
Publisher version: http://doi.org/10.1002/bimj.201900051
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: clustered data, fully conditional specification, joint modeling, missing data, multiple imputation, repeated measurement.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Inst of Clinical Trials and Methodology
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Inst of Clinical Trials and Methodology > MRC Clinical Trials Unit at UCL
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10093966
Downloads since deposit
41,648Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item