American Journal of Epidemiology Advance Access originally published online on April 10, 2009
American Journal of Epidemiology 2009 169(10):1182-1190; doi:10.1093/aje/kwp035
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
PRACTICE OF EPIDEMIOLOGY |
Methods of Covariate Selection: Directed Acyclic Graphs and the Change-in-Estimate Procedure
Correspondence to Dr. Hsin-Yi Weng, 2430 Veterinary Medicine Basic Sciences Building, 2001 South Lincoln Avenue, Urbana, IL 61802 (e-mail: hweng{at}illinois.edu).
Received for publication August 10, 2007. Accepted for publication January 26, 2009.
Four covariate selection approaches were compared: a directed acyclic graph (DAG) full model and 3 DAG and change-in-estimate combined procedures. Twenty-five scenarios with case-control samples were generated from 10 simulated populations in order to address the performance of these covariate selection procedures in the presence of confounders of various strengths and under DAG misspecification with omission of confounders or inclusion of nonconfounders. Performance was evaluated by standard error, bias, square root of the mean-squared error, and 95% confidence interval coverage. In most scenarios, the DAG full model without further covariate selection performed as well as or better than the other procedures when the DAGs were correctly specified, as well as when confounders were omitted. Model reduction by using change-in-estimate procedures showed potential gains in precision when the DAGs included nonconfounders, but underestimation of regression-based standard error might cause reduction in 95% confidence interval coverage. For modeling binary outcomes in a case-control study, the authors recommend construction of a "conservative" DAG, determination of all potential confounders, and then change-in-estimate procedures to simplify this full model. The authors advocate that, under the conditions investigated, the selection of final model should be based on changes in precision: Adopt the reduced model if its standard error (derived from logistic regression) is substantially smaller; otherwise, the full DAG-based model is appropriate.
bias (epidemiology); computer simulation; confounding factors (epidemiology); epidemiologic methods; logistic models; models, statistical; models, theoretical
Abbreviations: DAG, directed acyclic graph; lnOR, natural logarithm-transformed odds ratio; OR, odds ratio
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
L. Nie Re: "Methods of Covariate Selection: Directed Acyclic Graphs and the Change-in-Estimate Procedure" Am. J. Epidemiol., November 15, 2009; 170(10): 1320 - 1320. [Full Text] [PDF] |
||||
![]() |
H.-Y. Weng, L. L. McV. Messam, and I. Hertz-Picciotto Three of the Authors Reply Am. J. Epidemiol., November 15, 2009; 170(10): 1320 - 1321. [Full Text] [PDF] |
||||
![]() |
A. J Gaskins, S. L Mumford, C. Zhang, J. Wactawski-Wende, K. M Hovey, B. W Whitcomb, P. P Howards, N. J Perkins, E. Yeung, and E. F Schisterman Effect of daily fiber intake on reproductive function: the BioCycle Study Am. J. Clinical Nutrition, October 1, 2009; 90(4): 1061 - 1069. [Abstract] [Full Text] [PDF] |
||||

