Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (199)
Right arrowRequest Permissions
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Pepe, M. S.
Right arrow Articles by Newcomb, P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Pepe, M. S.
Right arrow Articles by Newcomb, P.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Am J Epidemiol 2004; 159:882-890.
Copyright © 2004 by the Johns Hopkins Bloomberg School of Public Health


ORIGINAL CONTRIBUTIONS

Limitations of the Odds Ratio in Gauging the Performance of a Diagnostic, Prognostic, or Screening Marker

Margaret Sullivan Pepe1,2 , Holly Janes2, Gary Longton1, Wendy Leisenring1,2,3 and Polly Newcomb1

1 Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA.
2 Department of Biostatistics, University of Washington, Seattle, WA.
3 Division of Clinical Research, Fred Hutchinson Cancer Research Center, Seattle, WA.

Received for publication June 24, 2003; accepted for publication October 28, 2003.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 ASSOCIATION VERSUS...
 HOW TO QUANTIFY DISCRIMINATION...
 MODIFIERS OF PREDICTOR...
 THE INCREMENTAL VALUE OF...
 DISCUSSION
 REFERENCES
 
A marker strongly associated with outcome (or disease) is often assumed to be effective for classifying persons according to their current or future outcome. However, for this assumption to be true, the associated odds ratio must be of a magnitude rarely seen in epidemiologic studies. In this paper, an illustration of the relation between odds ratios and receiver operating characteristic curves shows, for example, that a marker with an odds ratio of as high as 3 is in fact a very poor classification tool. If a marker identifies 10% of controls as positive (false positives) and has an odds ratio of 3, then it will correctly identify only 25% of cases as positive (true positives). The authors illustrate that a single measure of association such as an odds ratio does not meaningfully describe a marker’s ability to classify subjects. Appropriate statistical methods for assessing and reporting the classification power of a marker are described. In addition, the serious pitfalls of using more traditional methods based on parameters in logistic regression models are illustrated.

biological markers; diagnostic test; logistic regression; odds ratio; ROC curve; screening test

Abbreviations: Abbreviations: FPF, false-positive fraction; ROC, receiver operating characteristic; TPF, true-positive fraction.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 ASSOCIATION VERSUS...
 HOW TO QUANTIFY DISCRIMINATION...
 MODIFIERS OF PREDICTOR...
 THE INCREMENTAL VALUE OF...
 DISCUSSION
 REFERENCES
 
The idea of using information about a subject to detect subclinical disease states and to predict future health events has great appeal. The notion is currently motivating much biotechnological medical research. We hope to use biomarkers derived from new proteomic and genomic technologies to identify subjects who have or are very likely to develop cancer or other diseases (1). In addition, we hope to use these modern technologies and others to make precise diagnoses and more accurate prognoses of patients with disease, to help with decisions about treatment, and to monitor response to treatment. Use of biomarkers and risk factors in this way is not a new notion in medical practice. Prediction risk scores are commonly used. Examples include the Framingham risk score for cardiovascular events (2) and the Gail model risk score for breast cancer (3). Even more commonly, epidemiologists have identified a myriad of disease-specific risk factors that have been used alone or in combination in public health practice. Clinical epidemiologists have analogously identified a multitude of factors associated with the clinical course of patients diagnosed with disease.

Statistical evaluation of factors, scores, and biomarkers for assessing a person’s current status or future health outcome is the topic of this paper. We use the generic term "marker" for the factor, score, or biomarker and "outcome" for that which is predicted or detected. We show that strong statistical associations between outcome and marker do not necessarily imply that the marker can discriminate between persons likely to have the outcome and those who do not. Traditional statistical methods used by epidemiologists to assess etiologic associations are not adequate to determine the potential performance of a marker for classifying or predicting risk for persons (47). This important point is not widely appreciated and may explain to some extent the disappointing performance of many identified "markers" when they are used to predict outcome for persons. As we proceed to develop technologically sophisticated tools for individual-level prediction and classification, for so-called personalized medicine, we must be careful to use appropriate statistical techniques when evaluating research studies aimed at assessing their performance. We describe some appropriate techniques in this paper.

The performance of a marker may change with the circumstance in which it is applied. Characteristics of the population, or the assay technique (if the marker is a biomarker), for example, may lead to better or worse performance. It is important to understand and quantify variations in the performance of a marker. We show how such questions can be addressed statistically and the pitfalls of using some common epidemiologic methods for this purpose. In addition, we discuss evaluation of a marker in the presence of other information predictive of outcome, which may include risk factor data or other markers. The question is how to evaluate the incremental value of a marker for distinguishing cases from controls. Again, we show that traditional epidemiologic methods can lead to false conclusions, and we describe more appropriate statistical methods.


    ASSOCIATION VERSUS CLASSIFICATION
 TOP
 ABSTRACT
 INTRODUCTION
 ASSOCIATION VERSUS...
 HOW TO QUANTIFY DISCRIMINATION...
 MODIFIERS OF PREDICTOR...
 THE INCREMENTAL VALUE OF...
 DISCUSSION
 REFERENCES
 
Consider a binary risk factor as a marker. For example, unopposed estrogen replacement therapy is considered a strong risk factor for endometrial cancer (8). A relative risk of about 3.0 is associated with it; that is, in a case-control study, the odds ratio comparing cases with controls regarding "ever having used estrogen replacement therapy" is about 3.0. Reporting the odds ratio or relative risk as a measure of association is typical in epidemiologic studies of etiologic risk factors and is now unfortunately common in studies of predictive markers as well. Refer, for example, to studies by Cui et al. (9) in a recent issue of Science and by Rhodes et al. (10) in a recent issue of Journal of the National Cancer Institute. Examples from other popular journals are Ridker et al. (11), Zhang et al. (12), Liou et al. (13), and Hogue et al. (14). Note that the goals of etiologic risk factor studies are quite different from those in the sorts of studies we consider here, where markers are to be used for classifying persons. Therefore, the statistical considerations also differ between such studies.

The accuracy or validity of a binary marker for classifying persons is better summarized in a case-control study by reporting its true-positive fraction (TPF, also known as sensitivity) and its false-positive fraction (FPF, also known as 1-specificity). These are defined as follows: TPF = Prob[marker positive | outcome positive] and FPF = Prob[marker positive | outcome negative]. Because there are two types of errors (misclassifying positives and negatives), the study results should reflect both of these errors. A perfect marker will have TPF = 1 and FPF = 0. Obviously, to have confidence in the prediction that a marker makes, TPF and FPF should be close to these ideal values. The general public often expects that a marker offer reasonably accurate classification and confident prediction.

However, a marker can be useful even if FPF and TPF are less than ideal. The criteria by which the marker is judged useful depend entirely on the context in which it is to be used. For example, a marker for screening a healthy population for cancer needs to have an extremely low FPF because workup procedures such as biopsy that follow a positive screening test are generally invasive and expensive. Given that cancer is a rare disease in the population tested, even a low FPF will result in huge numbers of people undergoing unnecessary, costly procedures. Using a utility function, Baker (15) argues that FPF needs to be below 2 percent when screening for prostate cancer and recommends that TPF exceed 50 percent. The considerations are different for a prognostic marker, that is, a marker measured in people with disease used to predict an aspect of their prognosis. For example, van de Vijver et al. (16) evaluated a gene-expression profile of tumor tissue in stage I or II breast cancer patients as a prognostic marker for distant metastases within 5 years. A prognostic marker of poor outcome should have high sensitivity, particularly if additional therapy will be instituted in only those patients who test positive. van de Vijver et al. estimated TPF to be equal to 92 percent for the gene-expression signature. Unfortunately, the FPF estimate was rather high at 42 percent. Whether it would be acceptable for 42 percent of good-prognosis patients to undergo unnecessary additional therapy would be a key factor in deciding on the usefulness of the marker.

The odds ratio (OR) can be written as a simple function of (FPF, TPF) (5, 17): OR = {TPF/(1 – TPF)} x {(1 – FPF)/FPF}. Figure 1 shows this relation. Accuracy points (FPF, TPF) that yield the same value of the odds ratio are shown. Observe that an odds ratio of 3.0 is not consistent with an "accurate" marker. Suppose, for example, that a marker labels 10 percent of controls (outcome negatives) as positive and that the associated odds ratio is 3.0. We see from figure 1 that it identifies only about 25 percent of the cases (outcome positives); that is, 75 percent of the cases are not detected by the marker. As another example, suppose that a marker with an odds ratio of 3.0 detects 80 percent of cases. The plot shows that it must mislabel almost 60 percent of the controls. Clearly, this marker is not useful for individual-level classification or prediction. The figure shows that even weakly accurate markers are associated with odds ratios (or relative risks) far larger than those traditionally considered strong in epidemiologic studies of association. For reasonable classification accuracy of, say, FPF = 0.10 and TPF = 0.80, the odds ratio is huge: 36.0. Note, however, that even if an odds ratio as large as 36.0 is reported, one cannot conclude that the marker has good accuracy since a variety of (FPF, TPF) values are consistent with it. For example, FPF may be unacceptably large; note that (FPF = 0.50, TPF = 0.973) also yields an odds ratio of 36.0.



View larger version (27K):
[in this window]
[in a new window]
 
FIGURE 1. Correspondence between the true-positive fraction (TPF) and the false-positive fraction (FPF) of a binary marker and the odds ratio. Values of (TPF, FPF) that yield the same odds ratio are connected.

 
When the marker, denoted now by X, is continuous, its association with outcome status, D = 1 for case and D = 0 for control, is also often summarized with an odds ratio. Consider the following logistic regression model: Prob(D = 1|X) = exp({alpha} + ßX)/{1 + exp({alpha} + ßX)}. The odds ratio per unit increase in X is given by exp(ß). The size of the odds ratio depends on the units in which X is measured. In figure 2, we have scaled X so that a unit increase represents the difference between the 16th and 84th percentiles of X in the controls (i.e., two standard deviations = one unit). The distribution of X in controls is represented as normal with mean 0, which is general in the sense that data can always be transformed to this scale. Assuming that, for cases, X is normally distributed with the same standard deviation, figure 2 shows their separation from controls for various values of the odds ratio. We see again that values of the odds ratio considered large in traditional epidemiologic studies are derived from marker distributions that largely overlap (figure 2).



View larger version (13K):
[in this window]
[in a new window]
 
FIGURE 2. Probability distributions of a marker, X, in cases (solid curves) and controls (dashed curves) consistent with the logistic model logitP(D = 1|X) = {alpha} + ßX. It has been assumed that X has a mean of 0 and a standard deviation of 0.5 in controls so that a unit increase represents the difference between the 84th and 16th percentiles of X in controls. The marker is normally distributed, with the same variance in cases. The odds ratio (OR) per unit increase in X is shown.

 
Receiver operating characteristic (ROC) curves corresponding to each of the pairs of marker distributions in figure 2 are shown in figure 3. Each point on an ROC curve represents the decision criterion that is positive if X exceeds a threshold c. The FPF and TPF values associated with that criterion are one point on the curve, and, by varying c from {infty} to –{infty}, the (FPF, TPF) points corresponding to all possible thresholds are shown. Although they appear similar, figure 1 differs from figure 3. Figure 1 concerns binary markers only, with many different markers represented on the same curve if their odds ratios are the same. The odds ratios shown here in figure 3 relate to a unit increase in a continuous marker, and the ROC curve concerns different decision criteria resulting from considering all possible thresholds for a single continuous marker.



View larger version (24K):
[in this window]
[in a new window]
 
FIGURE 3. True- vs. false-positive fractions associated with dichotomous categorization of the continuous marker according to the decision criteria X > c for the six scenarios shown in figure 2. Each curve corresponds to one scenario. Points on the curve correspond to different choices of threshold c {epsilon} (–{infty}, {infty}). Solid circles represent points associated with using each quartile as the threshold criterion.

 
We see from figure 3 that, when the odds ratio associated with a unit increase in X is 3.0, regardless of the threshold chosen, the (FPF, TPF) values associated with the corresponding decision criterion generally would not be adequate for individual-level classification. In fact, unless the odds ratio per unit increase in X is at least 16.0, marker-based decision criteria seem very inaccurate. Even with an odds ratio of 16.0, a marker-based criterion that yields a 10 percent FPF at a threshold fails to detect over 40 percent of cases when that threshold is used. As another example, it will falsely detect as many as 30 percent of controls if a threshold that yields 80 percent of the cases is used.

Frequently, continuous markers are grouped to form categorical covariates. For each of the marker distributions shown in figure 2, we also categorized the marker on the basis of the quartile cutpoints from the controls. The odds ratios for each quartile relative to the lowest quartile are shown in table 1. The solid-circle points on the ROC curves in figure 3 are those associated with using each of the three quartile cutpoints to classify subjects as positive or negative for disease. Similar to our observations for the binary and continuous marker settings, even if the magnitude of the odds ratio for an upper quartile versus the lowest quartile is large, the corresponding points on the ROC curves show a poor ability to classify cases and controls. For example, when the odds ratio for the upper quartile versus the lowest quartile is 4.1, and if we use the upper quartile to define a positive test result, then only 45 percent of cases are correctly classified whereas 25 percent of controls are incorrectly identified.


View this table:
[in this window]
[in a new window]
 
TABLE 1. Odds ratios for each quartile* relative to the first quartile corresponding to the pairs of continuous marker distributions shown in figure 2
 
In summary, there are two key points to make based on the information in figures 1, 2, and 3. The first, as stated previously, is that markers for which the odds ratios are considered strong in traditional epidemiologic research are not adequate for discriminating between those persons who do and do not have an outcome of interest. Extremely strong associations are needed. The second is that odds ratios in and of themselves do not characterize the discriminatory capacity of a marker. The odds ratio is a simple scalar measure of association between marker and outcome. It does not characterize the discrimination between cases and controls that can be achieved by a marker since many different pairs of TPFs and FPFs are consistent with a particular odds ratio value. Neither does it relate to the notion of utility (15). In the next section, we discuss alternatives to the odds ratio that can be used to evaluate markers in case-control studies.


    HOW TO QUANTIFY DISCRIMINATION IN A CASE-CONTROL STUDY
 TOP
 ABSTRACT
 INTRODUCTION
 ASSOCIATION VERSUS...
 HOW TO QUANTIFY DISCRIMINATION...
 MODIFIERS OF PREDICTOR...
 THE INCREMENTAL VALUE OF...
 DISCUSSION
 REFERENCES
 
Classification error rates
Characterization of the discriminatory capacity of a binary marker has already been addressed in this paper. The TPFs and FPFs provide a description. Although predictive values are also of interest, where positive predictive value = P[D = 1 | marker positive] and negative predictive value = P[D = 0 | marker negative], these entities essentially require either a cohort study design in which the sample prevalence reflects the population prevalence or some external estimates of prevalence that pertain to the population from which cases and controls are drawn. On the other hand, TPF and FPF are defined conditional on outcome status and so can be estimated from a case-control study in which sampling depends on outcome. Diagnostic likelihood ratios have also been promoted as measures to characterize the accuracy of a binary marker (18) but are not very popular in practice.

It is interesting that characterization of accuracy requires two parameters, for example, TPF and FPF, whereas association measures such as the odds ratio or correlation coefficient (for continuous markers) are generally one dimensional. Characterization with (FPF, TPF) acknowledges that false-positive and false-negative errors are not equivalent and must be reported separately (19).

ROC curves
The ROC curve is the natural generalization of (FPF, TPF) to accommodate settings in which the marker is continuous. It describes the whole set of potential (FPF, TPF) combinations possible with positivity criteria based on the marker. The raw distributions and ROC curves for two pancreatic cancer biomarkers are shown in figures 4 and 5, respectively (20). Observe that the ROC curve does not depend on how the marker is coded. Changing the units in which the marker is measured has no impact on its ROC curve in contrast to logistic regression models in which, as noted above, the odds ratio must be interpreted according to a unit increase in the value of X. Moreover, ROC curves provide a natural common scale for comparing different markers even when they are measured in completely different units. For example, a marker that measures a serum concentration can be compared with one that measures spectral height at a given mass/charge ratio derived from protein mass spectrometry. In contrast, because odds ratios are interpreted per unit increase in the marker, odds ratios for two markers may not be comparable. This is a key advantage of ROC curves.



View larger version (14K):
[in this window]
[in a new window]
 
FIGURE 4. Frequency distributions of two markers for pancreatic cancer. Refer to Wieand et al. (20) for a description of source data.

 


View larger version (17K):
[in this window]
[in a new window]
 
FIGURE 5. Receiver operating characteristic (ROC) curves for the pancreatic cancer markers shown in figure 4.

 

    MODIFIERS OF PREDICTOR PERFORMANCE
 TOP
 ABSTRACT
 INTRODUCTION
 ASSOCIATION VERSUS...
 HOW TO QUANTIFY DISCRIMINATION...
 MODIFIERS OF PREDICTOR...
 THE INCREMENTAL VALUE OF...
 DISCUSSION
 REFERENCES
 
A variety of factors (or covariates) may affect how well a marker performs. For example, higher breast density makes mammographic readings less accurate (21). Factors other than those that are subject related are often important, too. The assay technique or the expertise of the lab technician can affect how well a biomarker performs. In audiology, the location in which the hearing test is conducted can affect the capacity of the test to detect hearing loss. If one can establish which covariates influence the performance of a marker, this information may be used to optimize the marker measurement. On the other hand, it can suggest settings or populations for which the marker is less useful and where alternative markers should be sought.

Not traditional effect modification
We continue to denote the marker by X and the covariates that may affect the performance of the marker by Z. In epidemiology, one typically uses a logistic regression model with statistical interaction between covariates and the marker of interest to evaluate whether "effect modification" occurs. Mathematically, we write logitP[D = 1|X, Z] = {alpha} + ß1X + ß2Z + ß3XZ. Evaluating the size and significance of ß3, the interaction term, asks whether the odds ratio associated with X varies with Z. However, since we have already established that the odds ratio does not properly characterize marker performance, it follows that this approach does not address questions about Z affecting the performance of X as a marker.

As an example, consider the data shown in table 2, where X and Z are binary. The marker X is being considered as a screening device, and, to make the discussion concrete, suppose that the covariate Z is gender (Z = 1 for females). The odds ratio associated with X is exactly the same for males and females. That is, Z is not an effect modifier in the sense that it alters the association between X and disease D when association is parameterized by the odds ratio. However, it seems that, at least for the purposes of disease screening, X performs better for females than for males. Almost all cases are detected in both circumstances (TPF = 0.97 and TPF = 0.94), but twice as many controls screen positive in the male population, where FPF = 0.06, versus in the female population, where FPF = 0.03. For widespread screening of healthy populations, it is critical to keep the number of false-positive results extremely low, and the lower FPF observed in females makes it a better screening marker in that population.


View this table:
[in this window]
[in a new window]
 
TABLE 2. Data* showing that a covariate (Z) can affect the performance of X as a marker (for disease D) but is not necessarily an effect modifier in the usual sense of having odds ratios that vary with Z
 
Statistical assessment
How then should one assess whether a covariate affects the performance of a marker? When the marker is binary, one can simply determine to what extent the TPFs vary with Z and to what extent the FPFs vary with Z. We already made this assessment informally for the data shown in table 2. Formal statistical techniques can be applied to test a hypothesis such as H0 : FPF(Z = 1) = FPF(Z = 0) or to quantify the difference between FPF(Z = 1) and FPF(Z = 0). The FPFs and TPFs are binomial proportions, and the usual techniques of Pearson chi-square statistics and so forth can be applied. Inference about FPFs uses data for controls (D = 0) only; inference about TPFs uses data for cases (D = 1) only. For example, in table 2, comparison of FPFs yields p = 0.002 from a chi-square test. Writing FPF(Z) = P(X = 1|D = 0, Z), we see that logistic regression techniques can be applied to data for controls with the marker as the dependent variable and covariates Z as the independent variables to establish how FPF varies with Z. Regression techniques may be preferable when there are multiple components to Z or if Z is continuous. Similarly, logistic regression can be applied to data for the cases to establish how TPF(Z) = P(X = 1|D = 1, Z) varies with Z. Refer to Leisenring et al. (22), Smith and Hadgu (23), and Pepe (24, section 3.5) for illustrations.

For a continuous marker, one needs to determine whether the ROC curves for X vary with Z. If Z is dichotomous, one can plot separate ROC curves for X by using data for the two groups or circumstances defined by Z. Statistical techniques to compare ROC curves have been developed and are included in some software packages such as Stata (25). Data for prostate-specific antigen reported by Etzioni et al. (26) are shown in figure 6 for men less than 65 years of age and men 65 years of age or older. Although the study measured prostate-specific antigen repeatedly over time, we use only those data for the last time point (prior to diagnosis for cases). The classic statistic for comparing two ROC curves is the difference in the areas under the empirical ROC curves. The difference is not statistically significant (p = 0.44). Thus, there is no evidence in this sample that age affects the capacity of prostate-specific antigen to distinguish cases with prostate cancer from age-matched controls.



View larger version (17K):
[in this window]
[in a new window]
 
FIGURE 6. Total prostate-specific antigen for 71 prostate cancer cases and 70 age-matched controls in the Beta-Carotene and Retin-A (CARET) study, Fred Hutchinson Cancer Research Center, Seattle, Washington (coordinating center), 1985–1996. Receiver operating characteristic (ROC) curves for subjects aged <65 years and >=65 years are shown.

 
Similar techniques can be used to compare the performances of two different markers in the same population. The ROC curves in figure 4 for CA-125 and CA-19-9 (20) are statistically significantly different (p < 0.01). This p value is based on the difference in empirical areas under the ROC curves applied to paired data (27). Methods based on comparing ROC curves over a relevant subinterval of FPFs are described by Pepe (24, p. 110) and are probably more appropriate for comparing screening markers (19).

As mentioned earlier in the discussion about evaluating covariate effects on binary markers, regression techniques are appropriate when Z is multidimensional or continuous. The same is true for continuous markers, but now regression models for ROC curves must be used. Some regression modeling methods for ROC curves have been described. A variety of illustrations are provided by Pepe (24, chapter 6). This area of statistical methodology is relatively new, and methods are evolving rapidly. Refer to Cai and Pepe (28) and Dodd and Pepe (29) for more recent work.


    THE INCREMENTAL VALUE OF A MARKER
 TOP
 ABSTRACT
 INTRODUCTION
 ASSOCIATION VERSUS...
 HOW TO QUANTIFY DISCRIMINATION...
 MODIFIERS OF PREDICTOR...
 THE INCREMENTAL VALUE OF...
 DISCUSSION
 REFERENCES
 
Now we discuss another way in which covariates are often considered. Suppose that there are some established markers (or predictors) for the outcome that we denote by X1. In considering a new candidate marker, X2, we want to assess how much classification is improved by using X2 in addition to X1 (4). Alternatively, we can ask whether there is predictive information in X2 that cannot be explained by associations with X1. For the purposes of illustration, suppose that CA-19-9 is an established biomarker for pancreatic cancer and we want to determine the additional contribution of CA-125 to classification accuracy. Here, X1 is CA-19-9 and X2 is CA-125. One common approach is to treat X1 and X2 as covariates in a logistic regression model,

logitP(D = 1|X1, X2) = {alpha} + ß1X1 + ß2X2,

and interpret exp(ß2) as the odds ratio for the strength of the association between X2 and the outcome, D, after "accounting for" the associations with X1. We do not dispute this approach. However, as mentioned earlier, a measure of association is not a characterization of the accuracy of prediction. Using the data displayed in figure 4, we estimate that exp(ß2) = 2.54 (p = 0.002), a statistically significant association between CA-125 and pancreatic cancer after controlling for CA-19-9.

Figure 7 shows the ROC curves for classifying subjects as having or not having pancreatic cancer by using CA-19-9 alone and by using the combination of predictors CA-19-9 and CA-125. If we assume that equation 1 fits the data reasonably well, it is known that the linear combination ß1X1 + ß2X2 is the best way to combine the markers for discriminating cases from controls (30). We see that CA-125 adds little to the capacity of CA-19-9 to discriminate between pancreatic cancer cases and controls. For example, if we are content to accept a 5 percent FPF, we can detect 68 percent of cases by using CA-19-9 alone and 71 percent by using the combination. The tangible benefit of adding the new marker CA-125 to the existing marker CA-19-9 appears to be minimal for the purposes of classification. That is, the independent contribution of CA-125 to classification is negligible despite its strong association with disease status that is independent of its association with CA-19-9.



View larger version (17K):
[in this window]
[in a new window]
 
FIGURE 7. Receiver operating characteristic (ROC) curves for classifying subjects as having or not having pancreatic cancer by using the pancreatic cancer marker CA-19-9 alone and by using the combination of predictors CA-19-9 and CA-125. The combination score is ß1X1 + ß2X2 = 1.03log(CA-19-9) + 0.93log(CA-125).

 
In our experience, this phenomenon—that a marker displaying an independent association considered strong by traditional epidemiologic standards does not contribute meaningfully to improved classification—is rather common. Another illustration is provided by Kattan (4). This finding is quite consistent with the observations made earlier. Extremely strong associations are required for meaningful classification accuracy. Again, the important message is that, for statistical evaluation of markers for classification, techniques should be used that directly address classification accuracy (e.g., ROC curves) rather than traditional logistic regression techniques for assessing associations.


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 ASSOCIATION VERSUS...
 HOW TO QUANTIFY DISCRIMINATION...
 MODIFIERS OF PREDICTOR...
 THE INCREMENTAL VALUE OF...
 DISCUSSION
 REFERENCES
 
The work for this paper was stimulated by the observation that many studies of predictive/diagnostic markers continue to use statistical methods based on the odds ratio or relative risk, despite the fact that such methods are not suited to the task of evaluating classification accuracy. Others have mentioned that the odds ratio does not quantify the classification accuracy of a marker (47). We have presented a more detailed discussion, demonstrating the pitfalls of using the odds ratio to evaluate markers, to evaluate covariate effects on marker performance, and to evaluate the incremental value of a marker over existing predictors. In addition, we have suggested more appropriate techniques that can be used to address these questions statistically. References to the literature hopefully will facilitate more widespread adoption of proper methods in practice.

Although the odds ratio does not characterize a marker’s accuracy for classifying persons, its association with the relative risk has long made it valuable for characterizing population variations in risk. A binary marker with a relative risk of 3, say, can be used to identify a population with the risk factor that has three times the risk as the population without the risk factor. This method may be used to target prevention or screening strategies. Moreover, clinical trials can often be conducted more efficiently in such populations. However, as we have noted, such a marker will be a very inaccurate tool for classifying or predicting risk for individual subjects. Markers proposed for classifying or predicting risk in individual subjects must be held to a much higher standard than merely being associated with outcome. Their sensitivities and specificities must be shown to be adequate through appropriate statistical evaluations.


    ACKNOWLEDGMENTS
 
This research was supported by US National Institutes of Health grants U01 CA86368, R01 GM54438, and P01 CA18029.


    NOTES
 
Correspondence to Dr. Margaret Sullivan Pepe, Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue, North, MZ-B500, Seattle, WA 98109 (e-mail: mspepe{at}u.washington.edu). Back


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 ASSOCIATION VERSUS...
 HOW TO QUANTIFY DISCRIMINATION...
 MODIFIERS OF PREDICTOR...
 THE INCREMENTAL VALUE OF...
 DISCUSSION
 REFERENCES
 

  1. Srivastava S, Kramer BS. Early detection cancer research network. Lab Invest 2000;80:1147–8.[Web of Science][Medline]
  2. Wilson P, D’Agostino R, Levy D, et al. Prediction of coronary heart disease using risk factor categories. Circulation 1998;97:1837–47.[Abstract/Free Full Text]
  3. Gail MH, Brinton LA, Byar DP, et al. Projecting individualized probabilities of developing breast cancer for white females who are being examined annually. J Natl Cancer Inst 1989;81:1879–86.[Abstract/Free Full Text]
  4. Kattan MW. Judging new markers by their ability to improve predictive accuracy. J Natl Cancer Inst 2003;95:634–5.[Free Full Text]
  5. Boyko EJ, Alderman BW. The use of risk factors in medical diagnosis: opportunities and cautions. J Clin Epidemiol 1990;43:851–8.[CrossRef][Web of Science][Medline]
  6. Emir B, Wieand S, Su JQ, et al. Analysis of repeated markers used to predict progression of cancer. Stat Med 1998;17:2563–78.[CrossRef][Web of Science][Medline]
  7. Baker SG, Kramer BS, Srivastava S. Markers for early detection of cancer: statistical guidelines for nested case-control studies. BMC Med Res Methodol 2002;2:4.[CrossRef][Medline]
  8. Newcomb P, Trentham-Dietz A. Patterns of postmenopausal progestin use with estrogen in relation to endometrial cancer (United States). Cancer Causes Controls 2003;14:195–201.[CrossRef][Web of Science][Medline]
  9. Cui H, Cruz-Correa M, Giardiello FM, et al. Loss of IGF2 imprinting: a potential marker of colorectal cancer risk. Science 2003;299:1753–5.[Abstract/Free Full Text]
  10. Rhodes DR, Sanda MG, Otte AP, et al. Multiplex biomarker approach for determining risk of prostate-specific antigen-defined recurrence of prostate cancer. J Natl Cancer Inst 2003;95:661–9.[Abstract/Free Full Text]
  11. Ridker PM, Hennekens CH, Buring JE, et al. C-reactive protein and other markers of inflammation in the prediction of cardiovascular disease in women. N Engl J Med 2000;342:836–43.[Abstract/Free Full Text]
  12. Zhang R, Brennan ML, Fu X, et al. Association between myeloperoxidase levels and risk of coronary artery disease. JAMA 2001;286:2136–42.[Abstract/Free Full Text]
  13. Liou SH, Lung JC, Chen YH, et al. Increased chromosome-type chromosome aberration frequencies as biomarkers of cancer risk in a blackfoot endemic area. Cancer Res 1999;59:1481–4.[Abstract/Free Full Text]
  14. Hogue A, Lippman SM, Boiko IV, et al. Quantitative nuclear morphometry by image analysis for prediction of recurrence of ductal carcinoma in situ of the breast. Cancer Epidemiol Biomarkers Prev 2001;10:249–59.[Abstract/Free Full Text]
  15. Baker SG. Identifying combinations of cancer markers for further study as triggers of early intervention. Biometrics 2000;56:1082–7.[CrossRef][Web of Science][Medline]
  16. van de Vijver MJ, He YD, van’t Veer LJ, et al. A gene-expression signature as a predictor of survival in breast cancer. N Engl J Med 2002;347:1999–2009.[Abstract/Free Full Text]
  17. Lachenbruch PA. The odds ratio. Control Clin Trials 1997;8:381–2.
  18. Boyko EJ. Ruling out or ruling in disease with the most sensitive or specific diagnostic test: short cut or wrong turn? Med Decis Making 1994;14:175–9.[Abstract/Free Full Text]
  19. Baker SG. The central role of receiver operating characteristic (ROC) curves in evaluating tests for the early detection of cancer. J Natl Cancer Inst 2003;95:511–15.[Free Full Text]
  20. Wieand S, Gail MH, James BR, et al. A family of nonparametric statistics for comparing diagnostic markers with paired or unpaired data. Biometrika 1989;76:585–92.[Abstract/Free Full Text]
  21. Carney PA, Miglioretti DL, Yankaskas BC, et al. Individual and combined effects of age, breast density, and hormone replacement therapy use on the accuracy of screening mammography. Ann Intern Med 2003;138:168–75.[Abstract/Free Full Text]
  22. Leisenring W, Pepe MS, Longton G. Regression modeling of diagnostic likelihood ratios for the evaluation of medical diagnostic tests. Biometrics 1998;16:1263–81.
  23. Smith PJ, Hadgu A. Sensitivity and specificity for correlated observations. Stat Med 1992;11:1503–9.[Web of Science][Medline]
  24. Pepe MS. The statistical evaluation of medical tests for classification and prediction. Oxford, United Kingdom: Oxford University Press, 2003.
  25. Stata Corporation. Stata 8.0 software. College Station, TX: Stata Corporation, 2003.
  26. Etzioni R, Pepe, MS, Longton G, et al. Incorporating the time dimension in receiver operating characteristic curves: a case study of prostate cancer. Med Decis Making 1999;19:242–51.[Abstract/Free Full Text]
  27. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 1988;44:837–45.[CrossRef][Web of Science][Medline]
  28. Cai T, Pepe MS. Semiparametric ROC analysis to evaluate biomarkers for disease. J Am Stat Assoc 2002;97:1099–107.[CrossRef][Web of Science]
  29. Dodd LE, Pepe MS. Semiparametric regression for the area under the receiver operating characteristic curve. J Am Stat Assoc 2003;98:409–17.[CrossRef][Web of Science]
  30. McIntosh M, Pepe MS. Combining several screening tests: optimality of the risk score. Biometrics 2002;58:657–64.[CrossRef][Web of Science][Medline]

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
JCOHome page
J. V. Lacey Jr, M. E. Sherman, B. B. Rush, B. M. Ronnett, O. B. Ioffe, M. A. Duggan, A. G. Glass, D. A. Richesson, N. Chatterjee, and B. Langholz
Absolute Risk of Endometrial Carcinoma During 20-Year Follow-Up Among Women With Endometrial Hyperplasia
J. Clin. Oncol., February 10, 2010; 28(5): 788 - 792.
[Abstract] [Full Text] [PDF]


Home page
Int J EpidemiolHome page
S. Galea, M. Riddle, and G. A Kaplan
Causal thinking and complex system approaches in epidemiology
Int. J. Epidemiol., February 1, 2010; 39(1): 97 - 106.
[Abstract] [Full Text] [PDF]


Home page
NeurologyHome page
T. Kurth and C. Stapf
Here comes the sun?: A new online weather forecast for early stroke recurrence
Neurology, January 12, 2010; 74(2): 102 - 103.
[Full Text] [PDF]


Home page
ANN INTERN MEDHome page
B. J. Wilson, N. Qureshi, P. Santaguida, J. Little, J. C. Carroll, J. Allanson, and P. Raina
Systematic Review: Family History in Risk Assessment for Common Diseases
Ann Intern Med, December 15, 2009; 151(12): 878 - 885.
[Abstract] [Full Text] [PDF]


Home page
ANN INTERN MEDHome page
B. J. Wilson, N. Qureshi, P. Santaguida, J. Little, J. C. Carroll, J. Allanson, and P. Raina
Systematic Review: Family History in Risk Assessment for Common Diseases
Ann Intern Med, November 2, 2009; (2009) 0000605-200912150-00177v1.
[Abstract] [Full Text]


Home page
PediatricsHome page
P. H. Wise
Confronting Social Disparities in Child Health: A Critical Appraisal of Life-Course Science and Research
Pediatrics, November 1, 2009; 124(Supplement_3): S203 - S211.
[Abstract] [Full Text] [PDF]


Home page
ANN INTERN MEDHome page
D. I. Buckley, R. Fu, M. Freeman, K. Rogers, and M. Helfand
C-Reactive Protein as a Risk Factor for Coronary Heart Disease: A Systematic Review and Meta-analyses for the U.S. Preventive Services Task Force
Ann Intern Med, October 6, 2009; 151(7): 483 - 495.
[Abstract] [Full Text] [PDF]


Home page
ANN INTERN MEDHome page
M. Helfand, D. I. Buckley, M. Freeman, R. Fu, K. Rogers, C. Fleming, and L. L. Humphrey
Emerging Risk Factors for Coronary Heart Disease: A Summary of Systematic Reviews Conducted for the U.S. Preventive Services Task Force
Ann Intern Med, October 6, 2009; 151(7): 496 - 507.
[Abstract] [Full Text] [PDF]


Home page
BMJHome page
B. Holmstrom, M. Johansson, A. Bergh, U.-H. Stenman, G. Hallmans, and P. Stattin
Prostate specific antigen for early detection of prostate cancer: longitudinal study
BMJ, September 24, 2009; 339(sep24_1): b3537 - b3537.
[Abstract] [Full Text] [PDF]


Home page
BMJHome page
I. A Scott
Thigh circumference and risk of heart disease and premature death
BMJ, September 3, 2009; 339(sep03_2): b3302 - b3302.
[Full Text]


Home page
Circ Cardiovasc Qual OutcomesHome page
J. F. Steiner, P. M. Ho, B. L. Beaty, L. M. Dickinson, R. Hanratty, C. Zeng, H. M. Tavel, E. P. Havranek, A. J. Davidson, D. J. Magid, et al.
Sociodemographic and Clinical Characteristics Are Not Clinically Useful Predictors of Refill Adherence in Patients With Hypertension
Circ Cardiovasc Qual Outcomes, September 1, 2009; 2(5): 451 - 457.
[Abstract] [Full Text] [PDF]


Home page
JAMAHome page
O. Melander, C. Newton-Cheh, P. Almgren, B. Hedblad, G. Berglund, G. Engstrom, M. Persson, J. G. Smith, M. Magnusson, A. Christensson, et al.
Novel and Conventional Biomarkers for Prediction of Incident Cardiovascular Events in the Community
JAMA, July 1, 2009; 302(1): 49 - 57.
[Abstract] [Full Text] [PDF]


Home page
CirculationHome page
E. Hagstrom, P. Hellman, T. E. Larsson, E. Ingelsson, L. Berglund, J. Sundstrom, H. Melhus, C. Held, L. Lind, K. Michaelsson, et al.
Plasma Parathyroid Hormone and the Risk of Cardiovascular Mortality in the Community
Circulation, June 2, 2009; 119(21): 2765 - 2771.
[Abstract] [Full Text] [PDF]


Home page
CirculationHome page
M. A. Hlatky, P. Greenland, D. K. Arnett, C. M. Ballantyne, M. H. Criqui, M. S.V. Elkind, A. S. Go, F. E. Harrell Jr, Y. Hong, B. V. Howard, et al.
Criteria for Evaluation of Novel Markers of Cardiovascular Risk: A Scientific Statement From the American Heart Association
Circulation, May 5, 2009; 119(17): 2408 - 2416.
[Abstract] [Full Text] [PDF]


Home page
RadiologyHome page
F. Pugliese, M. G. M. Hunink, K. Gruszczynska, F. Alberghina, R. Malago, N. van Pelt, N. R. Mollet, F. Cademartiri, A. C. Weustink, W. B. Meijboom, et al.
Learning Curve for Coronary CT Angiography: What Constitutes Sufficient Training?
Radiology, May 1, 2009; 251(2): 359 - 368.
[Abstract] [Full Text] [PDF]


Home page
Diabetes CareHome page
M. F. MacKay, S. M. Haffner, L. E. Wagenknecht, R. B. D'Agostino Jr., and A. J.G. Hanley
Prediction of Type 2 Diabetes Using Alternate Anthropometric Measures in a Multi-Ethnic Cohort: The Insulin Resistance Atherosclerosis Study
Diabetes Care, May 1, 2009; 32(5): 956 - 958.
[Abstract] [Full Text] [PDF]


Home page
J. Am. Soc. Nephrol.Home page
S. I. Hallan, E. Ritz, S. Lydersen, S. Romundstad, K. Kvenild, and S. R. Orth
Combining GFR and Albuminuria to Classify CKD Improves Prediction of ESRD
J. Am. Soc. Nephrol., May 1, 2009; 20(5): 1069 - 1077.
[Abstract] [Full Text] [PDF]


Home page
Cancer Epidemiol. Biomarkers Prev.Home page
A. Manduca, M. J. Carston, J. J. Heine, C. G. Scott, V. S. Pankratz, K. R. Brandt, T. A. Sellers, C. M. Vachon, and J. R. Cerhan
Texture Features from Mammographic Images and Risk of Breast Cancer
Cancer Epidemiol. Biomarkers Prev., March 1, 2009; 18(3): 837 - 845.
[Abstract] [Full Text] [PDF]


Home page
Int J EpidemiolHome page
T. Shah, J. P Casas, J. A Cooper, I. Tzoulaki, R. Sofat, V. McCormack, L. Smeeth, J. E Deanfield, G. D Lowe, A. Rumley, et al.
Critical appraisal of CRP measurement for the prediction of coronary heart disease events: new data and systematic review of 31 prospective cohorts
Int. J. Epidemiol., February 1, 2009; 38(1): 217 - 231.
[Abstract] [Full Text] [PDF]


Home page
Clin. Chem.Home page
A. D. Hingorani, T. Shah, J. P. Casas, S. E. Humphries, and P. J. Talmud
C-Reactive Protein and Coronary Heart Disease: Predictive Test or Therapeutic Target?
Clin. Chem., February 1, 2009; 55(2): 239 - 255.
[Abstract] [Full Text] [PDF]


Home page
StrokeHome page
V. Nambi, R. C. Hoogeveen, L. Chambless, Y. Hu, H. Bang, J. Coresh, H. Ni, E. Boerwinkle, T. Mosley, R. Sharrett, et al.
Lipoprotein-Associated Phospholipase A2 and High-Sensitivity C-Reactive Protein Improve the Stratification of Ischemic Stroke Risk in the Atherosclerosis Risk in Communities (ARIC) Study
Stroke, February 1, 2009; 40(2): 376 - 381.
[Abstract] [Full Text] [PDF]


Home page
ANN INTERN MEDHome page
N. P. Paynter, D. I. Chasman, J. E. Buring, D. Shiffman, N. R. Cook, and P. M. Ridker
Cardiovascular Disease Risk Prediction With and Without Knowledge of Genetic Variation at Chromosome 9p21.3
Ann Intern Med, January 20, 2009; 150(2): 65 - 72.
[Abstract] [Full Text] [PDF]


Home page
BMJHome page
I. A Scott
Evaluating cardiovascular risk assessment for asymptomatic people
BMJ, January 5, 2009; 338(jan05_1): a2844 - a2844.
[Full Text]


Home page
BiostatisticsHome page
W. Gu and M. S. Pepe
Estimating the capacity for improvement in risk prediction with a marker
Biostat., January 1, 2009; 10(1): 172 - 186.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Clin. Nutr.Home page
M. Rambod, C. P Kovesdy, R. Bross, J. D Kopple, and K. Kalantar-Zadeh
Association of serum prealbumin and its changes over time with clinical outcomes and survival in patients receiving hemodialysis
Am. J. Clinical Nutrition, December 1, 2008; 88(6): 1485 - 1494.
[Abstract] [Full Text] [PDF]


Home page
Stat Methods Med ResHome page
S. Lee
Mistakes in validating the accuracy of a prediction classifier in high-dimensional but small-sample microarray data
Statistical Methods in Medical Research, December 1, 2008; 17(6): 635 - 642.
[Abstract] [PDF]


Home page
Arch Gen PsychiatryHome page
M. King, C. Walker, G. Levy, C. Bottomley, P. Royston, S. Weich, J. A. Bellon-Saameno, B. Moreno, I. Svab, D. Rotar, et al.
Development and Validation of an International Risk Prediction Algorithm for Episodes of Major Depression in General Practice Attendees: The PredictD Study
Arch Gen Psychiatry, December 1, 2008; 65(12): 1368 - 1376.
[Abstract] [Full Text] [PDF]


Home page
Ann Fam MedHome page
R. J. Koopman, A. G. Mainous III, C. J. Everett, and R. E. Carter
Tool to Assess Likelihood of Fasting Glucose ImpairmenT (TAG-IT)
Ann. Fam. Med, November 1, 2008; 6(6): 555 - 561.
[Abstract] [Full Text] [PDF]


Home page
Circ Cardiovasc Qual OutcomesHome page
P. W.F. Wilson, M. Pencina, P. Jacques, J. Selhub, R. D'Agostino Sr, and C. J. O'Donnell
C-Reactive Protein and Reclassification of Cardiovascular Risk in the Framingham Heart Study
Circ Cardiovasc Qual Outcomes, November 1, 2008; 1(2): 92 - 97.
[Abstract] [Full Text] [PDF]


Home page
CJASNHome page
M. Rambod, C. P. Kovesdy, and K. Kalantar-Zadeh
Combined High Serum Ferritin and Low Iron Saturation in Hemodialysis Patients: The Role of Inflammation
Clin. J. Am. Soc. Nephrol., November 1, 2008; 3(6): 1691 - 1701.
[Abstract] [Full Text] [PDF]


Home page
Diabetes CareHome page
Z. T. Bloomgarden
American College of Endocrinology Pre-Diabetes Consensus Conference: Part One
Diabetes Care, October 1, 2008; 31(10): 2062 - 2069.
[Full Text] [PDF]


Home page
Clin. Cancer Res.Home page
J. M.G. Taylor, D. P. Ankerst, and R. R. Andridge
Validation of Biomarker-Based Risk Prediction Models
Clin. Cancer Res., October 1, 2008; 14(19): 5977 - 5983.
[Abstract] [Full Text] [PDF]


Home page
Ann Rheum DisHome page
E Thomas, G Peat, C Mallen, L Wood, R Lacey, R Duncan, and P Croft
Predicting the course of functional limitation among older adults with knee pain: do local signs, symptoms and radiographs add anything to general indicators?
Ann Rheum Dis, October 1, 2008; 67(10): 1390 - 1398.
[Abstract] [Full Text] [PDF]


Home page
Cancer Epidemiol. Biomarkers Prev.Home page
K. A. Lowe, C. Shah, E. Wallace, G. Anderson, P. Paley, M. McIntosh, M. R. Andersen, N. Scholler, L. Bergan, J. Thorpe, et al.
Effects of Personal Characteristics on Serum CA125, Mesothelin, and HE4 Levels in Healthy Postmenopausal Women at High-Risk for Ovarian Cancer
Cancer Epidemiol. Biomarkers Prev., September 1, 2008; 17(9): 2480 - 2487.
[Abstract] [Full Text] [PDF]


Home page
JNCI J Natl Cancer InstHome page
M. H. Gail
Discriminatory Accuracy From Single-Nucleotide Polymorphisms in Models to Predict Breast Cancer Risk
J Natl Cancer Inst, July 16, 2008; 100(14): 1037 - 1041.
[Abstract] [Full Text] [PDF]


Home page
Am J EpidemiolHome page
P. E. Castle
Invited Commentary: Is Monitoring of Human Papillomavirus Infection for Viral Persistence Ready for Use in Cervical Cancer Screening?
Am. J. Epidemiol., July 15, 2008; 168(2): 138 - 144.
[Abstract] [Full Text] [PDF]


Home page
Am J EpidemiolHome page
H. Janes and M. S. Pepe
Adjusting for Covariates in Studies of Diagnostic, Screening, or Prognostic Markers: An Old Concept in a New Setting
Am. J. Epidemiol., July 1, 2008; 168(1): 89 - 97.
[Abstract] [Full Text] [PDF]


Home page
Ann OncolHome page
S. Collette, F. Bonnetain, X. Paoletti, M. Doffoel, O. Bouche, J. L. Raoul, P. Rougier, F. Masskouri, L. Bedenne, and J. C. Barbare
Prognosis of advanced hepatocellular carcinoma: comparison of three staging systems in two French clinical trials
Ann. Onc., June 1, 2008; 19(6): 1117 - 1126.
[Abstract] [Full Text] [PDF]


Home page
NEJMHome page
B. Zethelius, L. Berglund, J. Sundstrom, E. Ingelsson, S. Basu, A. Larsson, P. Venge, and J. Arnlov
Use of Multiple Biomarkers to Improve the Prediction of Death from Cardiovascular Causes
N. Engl. J. Med., May 15, 2008; 358(20): 2107 - 2116.
[Abstract] [Full Text] [PDF]


Home page
BiostatisticsHome page
T. Cai and S. Cheng
Robust combination of multiple diagnostic tests for classifying censored event times
Biostat., April 1, 2008; 9(2): 216 - 233.
[Abstract] [Full Text] [PDF]


Home page
NEJMHome page
W. S. Weintraub and G. A. Diamond
Predicting Cardiovascular Events with Coronary Calcium Scoring
N. Engl. J. Med., March 27, 2008; 358(13): 1394 - 1396.
[Full Text] [PDF]


Home page
ANN INTERN MEDHome page
J. A. Tice, S. R. Cummings, R. Smith-Bindman, L. Ichikawa, W. E. Barlow, and K. Kerlikowske
Using Clinical Factors and Mammographic Breast Density to Estimate Breast Cancer Risk: Development and Validation of a New Predictive Model
Ann Intern Med, March 4, 2008; 148(5): 337 - 347.
[Abstract] [Full Text] [PDF]


Home page
CJASNHome page
S. G. Coca and C. R. Parikh
Urinary Biomarkers for Acute Kidney Injury: Perspectives on Translation
Clin. J. Am. Soc. Nephrol., March 1, 2008; 3(2): 481 - 490.
[Abstract] [Full Text] [PDF]


Home page
Eur Heart JHome page
T. J. Wang
New cardiovascular risk factors exist, but are they clinically useful?
Eur. Heart J., February 2, 2008; 29(4): 441 - 444.
[Abstract] [Full Text] [PDF]


Home page
Am J EpidemiolHome page
M. S. Pepe, Z. Feng, Y. Huang, G. Longton, R. Prentice, I. M. Thompson, and Y. Zheng
Integrating the Predictiveness of a Marker with Its Performance as a Classifier
Am. J. Epidemiol., February 1, 2008; 167(3): 362 - 368.
[Abstract] [Full Text] [PDF]


Home page
Clin. Chem.Home page
N. R. Cook
Statistical Evaluation of Prognostic versus Diagnostic Models: Beyond the ROC Curve
Clin. Chem., January 1, 2008; 54(1): 17 - 23.
[Abstract] [Full Text] [PDF]


Home page
Am J EpidemiolHome page
X. Gao, H. Chen, M. A. Schwarzschild, D. B. Glasser, G. Logroscino, E. B. Rimm, and A. Ascherio
Erectile Function and Risk of Parkinson's Disease
Am. J. Epidemiol., December 15, 2007; 166(12): 1446 - 1450.
[Abstract] [Full Text] [PDF]


Home page
JNCI J Natl Cancer InstHome page
R. D. Etzioni, D. P. Ankerst, N. S. Weiss, L. Y. T. Inoue, and I. M. Thompson
Is Prostate-Specific Antigen Velocity Useful in Early Detection of Prostate Cancer? A Critical Appraisal of the Evidence
J Natl Cancer Inst, October 17, 2007; 99(20): 1510 - 1515.
[Abstract] [Full Text] [PDF]


Home page
Arch Intern MedHome page
H. y. Cheng
High-Sensitivity C-Reactive Protein and Prediction of Nontraumatic Fractures
Arch Intern Med, October 8, 2007; 167(18): 2007 - 2007.
[Full Text] [PDF]


Home page
J Am Coll CardiolHome page
M. C. Meuwese, E. S.G. Stroes, S. L. Hazen, J. N. van Miert, J. A. Kuivenhoven, R. G. Schaub, N. J. Wareham, R. Luben, J. J.P. Kastelein, K.-T. Khaw, et al.
Serum Myeloperoxidase Levels Are Associated With the Future Risk of Coronary Artery Disease in Apparently Healthy Individuals: The EPIC-Norfolk Prospective Population Study
J. Am. Coll. Cardiol., July 10, 2007; 50(2): 159 - 165.
[Abstract] [Full Text] [PDF]


Home page
J Am Coll CardiolHome page
P. M. Ridker
C-Reactive Protein and the Prediction of Cardiovascular Events Among Those at Intermediate Risk: Moving an Inflammatory Hypothesis Toward Consensus
J. Am. Coll. Cardiol., May 29, 2007; 49(21): 2129 - 2138.
[Abstract] [Full Text] [PDF]


Home page
ANN INTERN MEDHome page
W. A. van der Steeg, S. M. Boekholdt, E. A. Stein, K. El-Harchaoui, E. S.G. Stroes, M. S. Sandhu, N. J. Wareham, J. W. Jukema, R. Luben, A. H. Zwinderman, et al.
Role of the Apolipoprotein B-Apolipoprotein A-I Ratio in Cardiovascular Risk Assessment: A Case-Control Analysis in EPIC-Norfolk
Ann Intern Med, May 1, 2007; 146(9): 640 - 648.
[Abstract] [Full Text] [PDF]


Home page
ANN INTERN MEDHome page
M. Berkwits and E. Guallar
Risk Factors, Risk Prediction, and the Apolipoprotein B-Apolipoprotein A-I Ratio
Ann Intern Med, May 1, 2007; 146(9): 677 - 679.
[Full Text] [PDF]


Home page
Int J EpidemiolHome page
M. J Khoury, J. Little, M. Gwinn, and J. P. Ioannidis
On the synthesis and interpretation of consistent but weak gene-disease associations in the era of genome-wide association studies
Int. J. Epidemiol., April 1, 2007; 36(2): 439 - 445.
[Abstract] [Full Text] [PDF]


Home page
CirculationHome page
N. R. Cook
Use and Misuse of the Receiver Operating Characteristic Curve in Risk Prediction
Circulation, February 20, 2007; 115(7): 928 - 935.
[Abstract] [Full Text] [PDF]


Home page
Arch Intern MedHome page
T. A. Hillier, K. L. Stone, D. C. Bauer, J. H. Rizzo, K. L. Pedula, J. A. Cauley, K. E. Ensrud, M. C. Hochberg, and S. R. Cummings
Evaluating the Value of Repeat Bone Mineral Density Measurement and Prediction of Fractures in Older Women: The Study of Osteoporotic Fractures
Arch Intern Med, January 22, 2007; 167(2): 155 - 160.
[Abstract] [Full Text] [PDF]


Home page
JNCI J Natl Cancer InstHome page
A. Dupuy and R. M. Simon
Critical Review of Published Microarray Studies for Cancer Outcome and Guidelines on Statistical Analysis and Reporting
J Natl Cancer Inst, January 17, 2007; 99(2): 147 - 157.
[Abstract] [Full Text] [PDF]


Home page
Diabetes CareHome page
C. Lorenzo, K. Williams, K. J. Hunt, and S. M. Haffner
The National Cholesterol Education Program-Adult Treatment Panel III, International Diabetes Federation, and World Health Organization Definitions of the Metabolic Syndrome as Predictors of Incident Cardiovascular Disease and Diabetes
Diabetes Care, January 1, 2007; 30(1): 8 - 13.
[Abstract] [Full Text] [PDF]


Home page
NEJMHome page
J. H. Ware
The Limitations of Risk Factors as Prognostic Tools
N. Engl. J. Med., December 21, 2006; 355(25): 2615 - 2617.
[Full Text] [PDF]


Home page
NEJMHome page
T. J. Wang, P. Gona, M. G. Larson, G. H. Tofler, D. Levy, C. Newton-Cheh, P. F. Jacques, N. Rifai, J. Selhub, S. J. Robins, et al.
Multiple Biomarkers for the Prediction of First Major Cardiovascular Events and Death
N. Engl. J. Med., December 21, 2006; 355(25): 2631 - 2639.
[Abstract] [Full Text] [PDF]


Home page
Arterioscler. Thromb. Vasc. Bio.Home page
Y. Gerber, J. P. McConnell, A. S. Jaffe, S. A. Weston, J. M. Killian, and V. L. Roger
Lipoprotein-Associated Phospholipase A2 and Prognosis After Myocardial Infarction in the Community
Arterioscler Thromb Vasc Biol, November 1, 2006; 26(11): 2517 - 2522.
[Abstract] [Full Text] [PDF]


Home page
CirculationHome page
M. J. Bos, C. M. A. Schipper, P. J. Koudstaal, J. C.M. Witteman, A. Hofman, and M. M.B. Breteler
High Serum C-Reactive Protein Level Is Not an Independent Predictor for Stroke: The Rotterdam Study
Circulation, October 10, 2006; 114(15): 1591 - 1598.
[Abstract] [Full Text] [PDF]


Home page
JNCI J Natl Cancer InstHome page
R. Simon
Development and evaluation of therapeutically relevant predictive classifiers using gene expression profiling.
J Natl Cancer Inst, September 6, 2006; 98(17): 1169 - 1171.
[Full Text] [PDF]


Home page
HypertensionHome page
A. Simon, G. Chironi, and J. Levenson
Performance of Subclinical Arterial Disease Detection as a Screening Test for Coronary Heart Disease
Hypertension, September 1, 2006; 48(3): 392 - 396.
[Abstract] [Full Text] [PDF]


Home page
JNCI J Natl Cancer InstHome page
I. M. Thompson, C. Chi, D. P. Ankerst, P. J. Goodman, C. M. Tangen, S. M. Lippman, M. S. Lucia, H. L. Parnes, and C. A. Coltman Jr.
Effect of Finasteride on the Sensitivity of PSA for Detecting Prostate Cancer.
J Natl Cancer Inst, August 16, 2006; 98(16): 1128 - 1133.
[Abstract] [Full Text] [PDF]


Home page
CirculationHome page
C. Kistorp
Risk Stratification in Secondary Prevention: Advances in Multimarker Profiles, or Back to Basics?
Circulation, July 18, 2006; 114(3): 184 - 186.
[Full Text] [PDF]


Home page
ANN INTERN MEDHome page
D. M. Lloyd-Jones, K. Liu, L. Tian, and P. Greenland
Narrative Review: Assessment of C-Reactive Protein in Risk Prediction for Cardiovascular Disease
Ann Intern Med, July 4, 2006; 145(1): 35 - 42.
[Abstract] [Full Text] [PDF]


Home page
Arch Intern MedHome page
P. W. F. Wilson, M. Pencina, R. B. D'Agostino Sr, and C. J. O'Donnell
C-Reactive Protein and Cardiovascular Risk in the Framingham Study--Reply
Arch Intern Med, June 26, 2006; 166(12): 1328 - 1328.
[Full Text] [PDF]


Home page
CirculationHome page
R. S. Vasan
Biomarkers of Cardiovascular Disease: Molecular Basis and Practical Considerations
Circulation, May 16, 2006; 113(19): 2335 - 2362.
[Full Text] [PDF]


Home page
JNCI J Natl Cancer InstHome page
I. M. Thompson, D. P. Ankerst, C. Chi, P. J. Goodman, C. M. Tangen, M. S. Lucia, Z. Feng, H. L. Parnes, and C. A. Coltman Jr.
Assessing prostate cancer risk: results from the Prostate Cancer Prevention Trial.
J Natl Cancer Inst, April 19, 2006; 98(8): 529 - 534.
[Abstract] [Full Text] [PDF]


Home page
JCOHome page
T. A. Alonzo
Standards for Reporting Prognostic Tumor Marker Studies
J. Clin. Oncol., December 20, 2005; 23(36): 9053 - 9054.
[Full Text] [PDF]


Home page
BioinformaticsHome page
S. Ma and J. Huang
Regularized ROC method for disease classification and biomarker selection with microarray data
Bioinformatics, December 15, 2005; 21(24): 4356 - 4362.
[Abstract] [Full Text] [PDF]


Home page
CirculationHome page
A. J.G. Hanley, A. J. Karter, K. Williams, A. Festa, R. B. D'Agostino Jr, L. E. Wagenknecht, and S. M. Haffner
Prediction of Type 2 Diabetes Mellitus With Alternative Definitions of the Metabolic Syndrome: The Insulin Resistance Atherosclerosis Study
Circulation, December 13, 2005; 112(24): 3713 - 3721.
[Abstract] [Full Text] [PDF]


Home page
CirculationHome page
T. Pischon, C. J. Girman, F. M. Sacks, N. Rifai, M. J. Stampfer, and E. B. Rimm
Non-High-Density Lipoprotein Cholesterol and Apolipoprotein B in the Prediction of Coronary Heart Disease in Men
Circulation, November 29, 2005; 112(22): 3375 - 3383.
[Abstract] [Full Text] [PDF]


Home page
Arch Intern MedHome page
P. Greenland and P. G. O'Malley
When Is a New Prediction Marker Useful?: A Consideration of Lipoprotein-Associated Phospholipase A2 and C-Reactive Protein for Stroke Risk
Arch Intern Med, November 28, 2005; 165(21): 2454 - 2456.
[Full Text] [PDF]


Home page
DiabetesHome page
M. K. Rutter, J. B. Meigs, L. M. Sullivan, R. B. D'Agostino Sr, and P. W. Wilson
Insulin Resistance, the Metabolic Syndrome, and Incident Cardiovascular Events in the Framingham Offspring Study
Diabetes, November 1, 2005; 54(11): 3252 - 3257.
[Abstract] [Full Text] [PDF]


Home page
Diabetes CareHome page
J. Liu, C. Sempos, R. P. Donahue, J. Dorn, M. Trevisan, and S. M. Grundy
Joint Distribution of Non-HDL and LDL Cholesterol and Coronary Heart Disease Risk Prediction Among Individuals With and Without Diabetes
Diabetes Care, August 1, 2005; 28(8): 1916 - 1921.
[Abstract] [Full Text] [PDF]


Home page
GutHome page
W-Q Wei, C C Abnet, N Lu, M J Roth, G-Q Wang, B A Dye, Z-W Dong, P R Taylor, P Albert, Y-L Qiao, et al.
Risk factors for oesophageal squamous dysplasia in adult inhabitants of a high risk region of China
Gut, June 1, 2005; 54(6): 759 - 763.
[Abstract] [Full Text] [PDF]


Home page
JAMAHome page
M. G. Shlipak, L. F. Fried, M. Cushman, T. A. Manolio, D. Peterson, C. Stehman-Breen, A. Bleyer, A. Newman, D. Siscovick, and B. Psaty
Cardiovascular Mortality Risk in Chronic Kidney Disease: Comparison of Traditional and Novel Risk Factors
JAMA, April 13, 2005; 293(14): 1737 - 1745.
[Abstract] [Full Text] [PDF]


Home page
CirculationHome page
D. M. Lloyd-Jones and P. Greenland
Letter Regarding Article by Ridker et al, "Should C-Reactive Protein Be Added to Metabolic Syndrome and to Assessment of Global Cardiovascular Risk?"
Circulation, December 14, 2004; 110(24): e532 - e532.
[Full Text] [PDF]


Home page
Diabetes CareHome page
M. P. Stern, K. Williams, C. Gonzalez-Villalpando, K. J. Hunt, and S. M. Haffner
Does the Metabolic Syndrome Improve Identification of Individuals at Risk of Type 2 Diabetes and/or Cardiovascular Disease?
Diabetes Care, November 1, 2004; 27(11): 2676 - 2681.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (199)
Right arrowRequest Permissions
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Pepe, M. S.
Right arrow Articles by Newcomb, P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Pepe, M. S.
Right arrow Articles by Newcomb, P.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?