Comparison of Bayesian model averaging and stepwise methods for model selection in logistic regression.
Wang, Duolao;
Zhang, Wenyang;
Bakhai, Ameet;
(2004)
Comparison of Bayesian model averaging and stepwise methods for model selection in logistic regression.
Statistics in medicine, 23 (22).
pp. 3451-3467.
ISSN 0277-6715
DOI: https://doi.org/10.1002/sim.1930
Permanent Identifier
Use this Digital Object Identifier when citing or linking to this resource.
Logistic regression is the standard method for assessing predictors of diseases. In logistic regression analyses, a stepwise strategy is often adopted to choose a subset of variables. Inference about the predictors is then made based on the chosen model constructed of only those variables retained in that model. This method subsequently ignores both the variables not selected by the procedure, and the uncertainty due to the variable selection procedure. This limitation may be addressed by adopting a Bayesian model averaging approach, which selects a number of all possible such models, and uses the posterior probabilities of these models to perform all inferences and predictions. This study compares the Bayesian model averaging approach with the stepwise procedures for selection of predictor variables in logistic regression using simulated data sets and the Framingham Heart Study data. The results show that in most cases Bayesian model averaging selects the correct model and out-performs stepwise approaches at predicting an event of interest.