G-formula with multiple imputation for causal inference with incomplete data.

Jonathan W Bartlett ORCID logo ; Camila Olarte Parra ORCID logo ; Emily Granger ORCID logo ; Ruth H Keogh ORCID logo ; Erik W van Zwet ORCID logo ; Rhian M Daniel ORCID logo ; (2025) G-formula with multiple imputation for causal inference with incomplete data. Statistical methods in medical research. 9622802251316971-. ISSN 0962-2802 DOI: 10.1177/09622802251316971
Copy

G-formula is a popular approach for estimating the effects of time-varying treatments or exposures from longitudinal data. G-formula is typically implemented using Monte-Carlo simulation, with non-parametric bootstrapping used for inference. In longitudinal data settings missing data are a common issue, which are often handled using multiple imputation, but it is unclear how G-formula and multiple imputation should be combined. We show how G-formula can be implemented using Bayesian multiple imputation methods for synthetic data, and that by doing so, we can impute missing data and simulate the counterfactuals of interest within a single coherent approach. We describe how this can be achieved using standard multiple imputation software and explore its performance using a simulation study and an application from cystic fibrosis.


picture_as_pdf
Bartlett-etal-2025-G-formula-with-multiple-imputation-for-causal-inference-with-incomplete-data.pdf
subject
Published Version
Available under Creative Commons: Attribution 4.0

View Download

Atom BibTeX OpenURL ContextObject in Span Multiline CSV OpenURL ContextObject Dublin Core Dublin Core MPEG-21 DIDL Data Cite XML EndNote HTML Citation JSON MARC (ASCII) MARC (ISO 2709) METS MODS RDF+N3 RDF+N-Triples RDF+XML RIOXX2 XML Reference Manager Refer Simple Metadata ASCII Citation EP3 XML
Export

Downloads