Skip Navigation

This Article
Right arrow Extract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Levy, P. S.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Levy, P. S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Am J Epidemiol 2004; 159:911-912.
Copyright © 2004 by the Johns Hopkins Bloomberg School of Public Health


BOOK REVIEWS

Measurement Error and Misclassification in Statistics and Epidemiology: Impacts and Bayesian Adjustments

Paul S. Levy

RTI International, Research Triangle Park, NC 27709

By Paul Gustafson

ISBN 1-58488-335-9, Chapman and Hall/CRC Press, Boca Raton, Florida (Telephone: 800-272-7737, Fax: 800-374-3401, Website: http://www.crcpress.com), 2003, 200 pp., $79.95 (hardcover)

Epidemiologic studies that attempt to characterize associations between exposures and disease occurrence often rely on exposure assessments that are subject to mismeasurement (a term used in this book for either measurement error in a quantitative variable or misclassification in a categorical variable). The large body of existing literature on this subject deals mostly with the impact of mismeasurement on indicators of estimated exposure-disease associations. The literature is relatively scanty and unwieldy with regard to methods of adjustment for such mismeasurement. In most of the existing literature, measurement error in quantitative variables is considered separately from misclassification in categorical variables, with relatively few attempts having been made to consider these problems from a unified approach.

In this relatively small (approximately 200 pages) but ambitious volume (1), Professor Paul Gustafson takes a unified approach to 1) characterizing the consequences of ignoring mismeasurement on resulting indicators of exposure-disease association and 2) demonstrating the use of Bayesian statistical methods, enhanced by recently developed computational tools such as Markov chain Monte Carlo simulation, to adjust for mismeasurement. His formulation models the relation between a dependent variable, Y, that is not subject to mismeasurement and an explanatory variable, X, that is not directly observable. However, an observable variable X*, distributed with mean equal to X and variance {sigma}2, can be used as a surrogate for the unobservable variable X. Thus, instead of a relation between Y and X, the resulting study measures the relation between Y and X*. Using an example in epidemiology, the variable Y may indicate the presence or absence of disease; X may be an unobservable exposure variable, such as the total body burden of a pollutant; and X* may represent the measured blood level of some metabolite of the pollutant. This scenario is that of a typical analytical epidemiologic investigation, where the target is the relation between the exposure, X, and the disease, Y, but the study substitutes the surrogate X* for the immeasurable exposure X.

The book is organized into six chapters and an appendix. This includes an introduction (chapter 1), followed by two chapters on the impact of mismeasurement (chapter 2 for the scenario in which the mismeasured variable X* is quantitative and subject to measurement error and chapter 3 for the scenario in which X* is a categorical variable subject to misclassification error). These three chapters take up approximately 25 percent of the book, with the bulk of the remainder being devoted to methods of adjusting for mismeasurement (chapter 4 to developing methods of adjustment for measurement error in quantitative explanatory variables and chapter 5 to methods of adjustment for misclassification in categorical variables). Chapter 6 consists of more specialized topics, such as the situation where the mismeasured variable has been constructed by dichotomizing a quantitative variable. It also discusses additional effects caused by misspecification in the original model relating the disease variable Y to the unobservable exposure variable X. Finally, in the Appendix, the author develops the Bayes-Markov chain Monte Carlo adjustment methodology.

Most analytical epidemiologists are aware that mismeasurement in an exposure variable generally results in attenuation of the estimated regression coefficients or odds ratios characterizing the exposure-disease relation, or, as it is usually stated in the Discussion section of a paper, a "conservative bias toward the null." However, in chapter 2, the author presents the much less well known result that if another variable, Z, having no mismeasurement but correlated with the mismeasured exposure variable X*, is in the regression equation, the estimated exposure-disease regression coefficients or odds ratios can undergo further attenuation (much further if the correlation is large). Even less well known is the fact that error in the exposure variable can distort the estimated regression coefficients for all variables in the regression equation, even those that are not subject to mismeasurement. If there is more than one mismeasured exposure variable in the regression equation, the estimates of the exposure-disease relation are biased but not necessarily attenuated toward the null hypothesis. In other words, they can be biased in either direction.

The discussion of possible consequences of mismeasurement in chapter 2 for quantitative exposure variables subject to measurement error is repeated in chapter 3 for categorical variables subject to misclassification, and findings are analogous to those for quantitative variables. These two chapters, taken together, present a strong case for the use of estimation techniques that adjust for mismeasurement. This topic is addressed in chapter 4 for quantitative variables and in chapter 5 for categorical variables, with these two chapters comprising approximately 50 percent of the book. To briefly summarize the complex material in these two chapters: The adjustment for mismeasurement is made by assuming a prior distribution for the surrogate variable X* and setting up estimating equations based on the Bayesian paradigm. The parameters of interest (e.g., exposure-disease relations) are then estimated by use of the recently developed Markov chain Monte Carlo techniques; a "primer" detailing these techniques is included in the Appendix.

In his preface, Professor Gustafson states that he intended this book to be of interest to the "(bio)statistics community, and at least the more quantitative subset of the epidemiology community" (1, p. x). The book is certainly accessible to statisticians/biostatisticians, especially those trained within the last decade, during which the type of modeling developed in this book entered the mainstream of statistical analysis. Despite the author’s obvious interest and involvement in epidemiologic issues and his efforts to include examples from analytical epidemiology (e.g., discussion of the relation between smoking and bladder cancer and the discussion of the Framingham Study in chapter 4), the Bayesian concepts and computational adjuncts, especially in chapter 4 and later chapters, would probably be foreign to all but the most quantitatively trained and inclined epidemiologists. This may not be true in the near future as use of these methods becomes more widespread in epidemiologic studies and as user-friendly, off-the-shelf software enters the market. This happened in the mid- to late 1990s for generalized estimating equations and mixed-effects modeling, and we will doubtless see it for the methods described in this book.

In summary, the author has written a careful, well-organized book on an important topic that has widespread consequences for interpretation of findings from epidemiologic studies. While it was not written as a polemic, the book presents a strong case that the effects of mismeasurement can be overwhelming and that methods of adjustment for mismeasurement should be given strong consideration.

REFERENCES

  1. Gustafson P. Measurement error and misclassification in statistics and epidemiology: impacts and Bayesian adjustments. Boca Ration, FL: CRC Press, 2003.

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Extract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Levy, P. S.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Levy, P. S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?