Impact of missing data due to drop-outs on estimators for rates of change in longitudinal studies: a simulation study

Touloumi, G; Babiker, AG; Pocock, SJ; Darbyshire, JH; (2001) Impact of missing data due to drop-outs on estimators for rates of change in longitudinal studies: a simulation study. Statistics in medicine, 20 (24). pp. 3715-28. ISSN 0277-6715

Full text not available from this repository.


Many cohort studies and clinical trials are designed to compare rates of change over time in one or more disease markers in several groups. One major problem in such longitudinal studies is missing data due to patient drop-out. The bias and efficiency of six different methods to estimate rates of changes in longitudinal studies with incomplete observations were compared: generalized estimating equation estimates (GEE) proposed by Liang and Zeger (1986); unweighted average of ordinary least squares (OLSE) of individual rates of change (UWLS); weighted average of OLSE (WLS); conditional linear model estimates (CLE), a covariate type estimates proposed by Wu and Bailey (1989); random effect (RE), and joint multivariate RE (JMRE) estimates. The latter method combines a linear RE model for the underlying pattern of the marker with a log-normal survival model for informative drop-out process. The performance of these methods in the presence of missing data completely at random (MCAR), at random (MAR) and non-ignorable (NIM) were compared in simulation studies. Data for the disease marker were generated under the linear random effects model with parameter values derived from realistic examples in HIV infection. Rates of drop-out, assumed to increase over time, were allowed to be independent of marker values or to depend either only on previous marker values or on both previous and current marker values. Under MACR all six methods yielded unbiased estimates of both group mean rates and between-group difference. However, the cross-sectional view of the data in the GEE method resulted in seriously biased estimates under MAR and NIM drop-out process. The bias in the estimates ranged from 30 per cent to 50 per cent. The degree of bias in the GEE estimates increases with the severity of non-randomness and with the proportion of MAR data. Under MCAR and MAR all the other five methods performed relatively well. RE and JMRE estimates were more efficient(that is, had smaller variance) than UWLS, WLS and CL estimates. Under NIM, WLS and particularly RE estimates tended to underestimate the average rate of marker change (bias approximately 10 per cent). Under NIM, UWLS, CL and JMRE performed better in terms of bias (3-5 per cent) with the JMRE giving the most efficient estimates. Given that markers are key variables related to disease progression, missing marker data are likely to be at least MAR. Thus, the GEE method may not be appropriate for analysing such longitudinal marker data. The potential biases due to incomplete data require greater recognition in reports of longitudinal studies. Sensitivity analyses to assess the effect of drop-outs on inferences about the target parameters are important.

Item Type: Article
Keywords: Clinical-trials, regression-models, categorical-data, linear-, models, progression, statistics, mechanism, inference, outcomes, time, Adolescence, Adult, Bias (Epidemiology), CD4 Lymphocyte Count, Child, Child, Preschool, Cohort Studies, Comparative Study, Computer Simulation, Data Interpretation, Statistical, Disease Progression, Greece, HIV Infections, pathology, HIV-1, growth & development, Human, Longitudinal Studies, Male, Middle Age, Patient Dropouts, Statistics, methods
Faculty and Department: Faculty of Epidemiology and Population Health > Dept of Medical Statistics
PubMed ID: 11782028
Web of Science ID: 173013900006


Download activity - last 12 months
Downloads since deposit
Accesses by country - last 12 months
Accesses by referrer - last 12 months
Additional statistics for this record are available via IRStats2

Actions (login required)

Edit Item Edit Item