American Journal of Epidemiology Advance Access originally published online on July 10, 2008
American Journal of Epidemiology 2008 168(4):384-388; doi:10.1093/aje/kwn148
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Invited Commentary: Evidence-based Evaluation of p Values and Bayes Factors
From the Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD
Correspondence to Dr. Hormuzd A. Katki, Division of Cancer Epidemiology and Genetics, National Cancer Institute, 6120 Executive Blvd., Room 8014, Executive Plaza South, MSC 7244, Rockville, MD 20852-4910 (e-mail: katkih{at}mail.nih.gov).
Received for publication February 1, 2008. Accepted for publication March 13, 2008.
Despite clear deficiencies of the p value as a summary of statistical evidence, compelling alternatives with strong theoretical justification, such as the Bayes factor and the related likelihood ratio, are rarely presented in epidemiologic publications. Comparison of the historical performance of the p value with that of its competitors in the epidemiologic literature may help epidemiologists evaluate whether Bayes factors or likelihood ratios lead to conclusions more quickly and reliably than a p value, given the same data. Empirical evidence presented by Ioannidis (Am J Epidemiol 2008;168:374–83) demonstrates that findings with p values near 0.05 tend not to be confirmed in future studies. Similarly, Bayes factors interpret p values near 0.05 as having, at best, promising evidence against the null hypothesis. However, the different types of Bayes factors require empirical evaluation of their performance in practice. P values remain popular because miniscule p values are unlikely to mislead and p values do not require alternative hypotheses. Publishing p values near 0.05 could be considered a low-threshold screen to allow many (possibly null) results to be published for follow-up consideration. Meta-analyses and studies meant to decisively convince skeptics require a stronger standard (p values much below 0.05) and a Bayes factor to interpret the p value and to facilitate incorporation of background expertise necessary for drawing comprehensive conclusions.
Bayes theorem; empirical research; epidemiologic methods; meta-analysis; observation; statistics
Abbreviations: FDR, false discovery rate; SNP, single nucleotide polymorphism
![]()
CiteULike
Connotea
Del.icio.us What's this?
Related articles in Am. J. Epidemiol.:
- Effect of Formal Statistical Significance on the Credibility of Observational Associations
- John P. A. Ioannidis
Am. J. Epidemiol. 2008 168: 374-383.[Abstract] [FREE Full Text]
This article has been cited by other articles:
![]() |
U. Stromberg, J. Bjork, P. Vineis, K. Broberg, and E. Zeggini Ranking of genome-wide association scan signals by different measures Int. J. Epidemiol., October 1, 2009; 38(5): 1364 - 1373. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. A. Ioannidis The Author Responds to "Evaluating p Values and Bayes Factors" Am. J. Epidemiol., August 15, 2008; 168(4): 389 - 390. [Full Text] [PDF] |
||||

