Comparison of causal inference methods for observational data with a hierarchical structure

Kostouraki, A

(2024) Comparison of causal inference methods for observational data with a hierarchical structure. PhD thesis, London School of Hygiene & Tropical Medicine. DOI: 10.17037/PUBS.04674980

Copy

Randomised controlled trials (RCTs) are the gold standard for estimating causal treatment effects. When RCTs are not feasible or ethical, causal questions can be answered through observational data. However, analyses of observational data are prone to confounding bias and adequate statistical methods (under explicit assumptions) are needed for addressing it. Causal inference methods have been proposed to estimate treatment effects when covariates, treatment and outcome are defined at the same unit level (single-level setting). These can be broadly classified into those focusing on the outcome mechanism (e.g., g-computation), those modelling the treatment assignment (e.g., propensity scores) and those combining both (doubly robust methods). In practice, research questions could be about a system-level characteristic (e.g., general versus specialized hospital) or a patient-level characteristic (e.g., type of surgery), while patients are nested within hospital, geographic region, etc. Only few studies have extended causal techniques to hierarchical data, but they were not empirically evaluated for binary outcomes. In this thesis, we began with a review of three different causal methods, namely, g-computation, inverse probability-of-treatment weighting (IPTW) and augmented inverse probability-of-treatment weighting. We next extended them to hierarchical settings and evaluated their performance under different scenarios of unmeasured cluster-level confounding via Monte Carlo simulations, for the estimation of the population averaged treatment effect with treatment assigned at the individual level. We then derived unified formulas for IPTW variance estimators when targeting different populations like the treated or the overlap population, first starting within the single-level setting. These approximate closed formulas will form the basis of analytical extensions to the hierarchical setting, as an alternative to the bootstrap. To conclude, although common practice, ignoring clustering in causal analysis of observational studies can lead to incorrect inferences, so we extended tools to this setting and provided practical recommendations to applied researchers.

Item Type	Thesis (Doctoral)
Thesis Type	Doctoral
Thesis Name	PhD
Contributors	Belot, A; Leyrat, C; Rachet, B
Research Group	Inequalities in Cancer Outcomes research group
Grant number	C7923/A30945
Copyright Holders	Andriana Kostouraki
Date Deposited	11 Feb 2025 14:34

Explore Further

Kostouraki, A

Cancer Research UK

Dept of Non-Communicable Disease Epidemiology

picture_as_pdf: 2024_EPH_PhD_Kostouraki_A.pdf
subject: Accepted Version
: Available under Creative Commons: Attribution-NonCommercial-No Derivative Works 4.0

View

Download

Atom

BibTeX

OpenURL ContextObject in Span

Multiline CSV

OpenURL ContextObject

Dublin Core

MPEG-21 DIDL

Data Cite XML

EndNote

HTML Citation

JSON

MARC (ASCII)

MARC (ISO 2709)

METS

MODS

RDF+N3

RDF+N-Triples

RDF+XML

RIOXX2 XML

Reference Manager

Refer

Simple Metadata

ASCII Citation

EP3 XML

Export

Downloads