Skip Navigation


American Journal of Epidemiology Advance Access originally published online on September 8, 2005
American Journal of Epidemiology 2005 162(9):919-920; doi:10.1093/aje/kwi288
This Article
Right arrow Extract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
162/9/919-a    most recent
kwi288v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (1)
Right arrowRequest Permissions
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Yawn, B. P.
Right arrow Articles by Wollan, P. M.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Yawn, B. P.
Right arrow Articles by Wollan, P. M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

American Journal of Epidemiology Copyright © 2005 by the Johns Hopkins Bloomberg School of Public Health All rights reserved; printed in U.S.A.

Letter to the Editor

THE AUTHORS REPLY

Barbara P. Yawn and Peter M. Wollan

Department of Research, Olmsted Medical Center, Rochester, MN 55904

The letter from Dr. Sheikh (1Go) about our article (2Go) raises some interesting issues that are important to address. The kappa statistic is most clearly useful when a single item is being evaluated, the number of observers is small, the number of possible responses is small, the number of cases observed is large, and agreement due to chance is an important concern. If the question being addressed is whether agreement is significantly better than chance, where "chance" is defined as multinomial independence, then the hypothesis test based on kappa is clear. Otherwise, interpretation is difficult and rather arbitrary (3Go–5Go).

In our context, we needed to assess agreement across several observers, on items of widely varying response types. It is naïve to believe that misclassification does not occur when copying items from the medical record or applying standard algorithms or indices, and it is important to assess reliability for these items. Although kappa can be computed for agreement on a subject's gender, stochastic independence is really not an issue. It is not at all clear what probability model should be used for chance agreement on date of birth, and combining agreement on gender and date of birth into a single weighted kappa would seem to muddy things considerably. Then, we consider it important to distinguish between "demographic" items such as gender, date of birth, or date of a physician visit and "judgment" items, in particular whether cardiovascular disease was considered at the visit. Although kappa is certainly relevant to this last item, the issues of interpretation remain, and we did not think that presenting a separate weighted kappa value would have furthered the discussion.

The point of view we have taken is more common in the quality assurance literature, where relatively straightforward, easily interpreted statistics are used to evaluate how well one is doing, and the critical issues are whether that is good enough and how to do better (6Go, 7Go). It may be useful to use separate terms for agreement—where failure to agree is frequent and agreement by chance is of concern—and reliability, where agreement is high and each disagreement may represent a system or process failure that may be correctable. Although kappa statistics and similar measures are often of value, we continue to believe that, in our context, they are not helpful.

ACKNOWLEDGMENTS

Conflict of interest: none declared.

References

  1. Sheikh K. Re: "Interrater reliability: completing the methods description in medical records review studies." (Letter). Am J Epidemiol 2005;162:919.[Free Full Text]
  2. Yawn BP, Wollan P. Interrater reliability: completing the methods description in medical records review studies. Am J Epidemiol 2005;161:974–7.[Abstract/Free Full Text]
  3. Bishop YMM, Feinberg SE, Holland PW. Discrete multivariate analysis: theory and practice. Cambridge, MA: MIT Press, 1975.
  4. Fleiss JL, ed. Statistical methods for rates and proportions: the measurement of interrater agreement. New York, NY: John Wiley & Sons, 1981.
  5. Landis JR, Koch GG. A review of statistical methods in the analysis of data arising from observer reliability studies, part 1. Statistica Neerlandica 1975;29:101–25.
  6. Rosander AC. The quest for quality in services. Milwaukee, WI: American Society for Quality Control, 1989.
  7. Juran JM. The quality trilogy: a universal approach to managing for quality. Quality Progress 1986 Aug;19:19–24.

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Am J EpidemiolHome page
T. L. Lash, M. P. Fox, S. S. Thwin, A. M. Geiger, D. S. M. Buist, F. Wei, T. S. Field, M. U. Yood, F. J. Frost, V. P. Quinn, et al.
Using Probabilistic Corrections to Account for Abstractor Agreement in Medical Record Reviews
Am. J. Epidemiol., June 15, 2007; 165(12): 1454 - 1461.
[Abstract] [Full Text] [PDF]


Home page
Am J EpidemiolHome page
B. P. Yawn and P. M. Wollan
THE AUTHORS REPLY
Am. J. Epidemiol., November 1, 2005; 162(9): 919 - 920.
[Full Text] [PDF]


This Article
Right arrow Extract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
162/9/919-a    most recent
kwi288v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (1)
Right arrowRequest Permissions
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Yawn, B. P.
Right arrow Articles by Wollan, P. M.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Yawn, B. P.
Right arrow Articles by Wollan, P. M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?