American Journal of Epidemiology Vol. 155, No. 2 : 176-184
Copyright © 2002 by The Johns Hopkins University School of Hygiene and Public Health
PRACTICE OF EPIDEMIOLOGY |
Causal Knowledge as a Prerequisite for Confounding Evaluation: An Application to Birth Defects Epidemiology
1 Department of Epidemiology, Harvard School of Public Health, Boston, MA.
2 Slone Epidemiology Unit, Boston University School of Public Health, Brookline, MA.
| ABSTRACT |
|---|
|
|
|---|
Common strategies to decide whether a variable is a confounder that should be adjusted for in the analysis rely mostly on statistical criteria. The authors present findings from the Slone Epidemiology Unit Birth Defects Study, 19921997, a case-control study on folic acid supplementation and risk of neural tube defects. When statistical strategies for confounding evaluation are used, the adjusted odds ratio is 0.80 (95% confidence interval: 0.62, 1.21). However, the consideration of a priori causal knowledge suggests that the crude odds ratio of 0.65 (95% confidence interval: 0.46, 0.94) should be used because the adjusted odds ratio is invalid. Causal diagrams are used to encode qualitative a priori subject matter knowledge.
abnormalities; causality; confounding factors (epidemiology); inference; selection bias
Abbreviations: CI, confidence interval; DAG, directed acyclic graph; OR, odds ratio; RR, risk ratio
| INTRODUCTION |
|---|
|
|
|---|
In epidemiologic studies, statistical analyses are typically organized around three different sets of variables: the exposure, the outcome, and the confounder(s). The exposure and outcome are usually determined by the causal question under investigation. The confounders, on the other hand, are not so clearly defined; they must first be identified and then appropriately adjusted for in the analysis.
A number of authors have emphasized that confounder identification must be grounded on an understanding of the causal network linking the variables under study (i.e., a priori subject-matter or expert knowledge) (1![]()
![]()
![]()
![]()
![]()
![]()
8
). Yet, some widely used approaches to confounder identification are centered on statistical associations. One common approach, which we will call strategy 1, has been the application of automatic variable selection procedures, such as stepwise selection (9
). The implicit assumption underlying this approach is that, although not all variables selected will be confounders, all important confounders will be selected. A second common approach, strategy 2, compares adjusted and unadjusted effect estimates. If the relative change after adjustment for certain variable(s) is greater than 10 percent, for example, then the variable(s) is selected (10
). Implicit in this approach is that any variable substantially associated with an estimate change is worth adjusting for. Most epidemiology textbooks recommend a third approach, strategy 3, that consists of checking whether some necessary criteria for confounding are met. Generally, it is stated that a confounder is a variable associated with the exposure in the population, associated with the outcome conditional on the exposure (e.g., among the unexposed), and not in the causal pathway between the exposure and the outcome. A further refinement is to replace the second condition by the condition that the potential confounder is a causal risk factor or a marker for a causal risk factor (11
). Strategies 1 and 2 rest only on statistical associations that can easily be identified from the data. Strategy 3 combines statistical associations from the data with some background knowledge about the causal network that links exposure, outcome, and potential confounders.
All three strategies may lead to bias from the omission of important confounders or inappropriate adjustment for nonconfounders (3![]()
, 7
, 8
, 12
). Here, we will describe a real example from research on birth defects in which all three strategies prefer the adjusted effect estimate over the crude effect estimate. However, we will use our a priori subject-matter knowledge to argue that the crude estimate should probably be preferred. We will utilize causal diagrams (4
, 5
, 13
, 14
) to represent our qualitative a priori assumptions about the underlying biologic mechanisms. First, we briefly review confounding and causal diagrams.
| CONFOUNDING, CONFOUNDERS, AND CAUSAL DIAGRAMS |
|---|
|
|
|---|
Intuitively, two variables E and D will be statistically associated if one is a cause of the other (e.g., smoking and lung cancer), if they share a common cause (e.g., yellow fingers and lung cancer share smoking as a common cause), or both. If E precedes D, the overall association between E and D will have two components: a spurious one that is due to the sharing of common causes and another due to the causal effect of E on D. The goal of etiologic research from observational data is to estimate the latter. The former component produces confounding (4
One way to eliminate a spurious association is to adjust, stratify, or condition on the common cause; for example, we would find no association between yellow fingers and lung cancer among nonsmokers. Confounders are variables that when stratified on or adjusted for will eliminate (or diminish) the spurious component of the association between exposure and disease.
The presence of common causes, and therefore of confounding, can be represented by causal diagrams known as directed acyclic graphs (DAGs) (12![]()
14
). Briefly, these diagrams link variables by arrows that represent direct causal effects (protective or causative) of one variable on another. Figures 1![]()
![]()
![]()
![]()
![]()
8 are selected examples of causal diagrams that link the variables E, D, and C. We use U to depict unmeasured variables. Because causes precede their effects, these graphs are acyclic: One can never start from one variable and, following the direction of the arrows, end up at the same variable. In figure 1, E causes D, and both D and E are causes of C; in figure 2, E >does not cause C but both share an unmeasured common cause U1.
|
|
|
|
|
|
|
|
In figures 1
Let us first concentrate on figures 5![]()
8. Figure 5 depicts C as a common cause of E and D, whereas in figures 6
8 U is the common cause. We say that C is a confounder in figure 5 and that U is a confounder in figures 6
8. To eliminate the spurious component of the association between exposure and outcome, we can condition on the confounder and calculate the ORED|C that is, we adjust for the common cause. Thus, in figure 5 the ORED|C adjusted for C is a valid estimator of the causal effect on the odds ratio scale within levels of C. Furthermore, if (as we shall assume for simplicity) the stratified odds ratio ORED|C is constant over levels of C and the disease is rare (at each joint level of E and C), then the stratified odds ratio closely approximates the stratified risk ratio. Then ORED|C, unlike the crude ORED, also quantifies the causal effect of E on D in the whole population. But what about figures figures 6
8? Here the common cause is unmeasured, and therefore we cannot adjust for it. In figure 6, the causal pathway from U to D is mediated through C. Intuitively, if we condition on a specific value of C, then U cannot affect D because U only affects D by changing the value of C. In other words, within levels of C, U is no longer a cause of D, and therefore the spurious association (confounding) disappears. Therefore, ORED|C adjusted for C is a valid estimator of the causal effect of the exposure E on the outcome D. (This would still be true even if C occurred temporally after E.) Similar reasoning can be used to deduce that C should be adjusted for in figure 7. In both cases, we say that C is a confounder, although it is not a causal confounder in the sense that C itself is not a common cause of exposure and disease. Once we adjust for the confounder C, U ceases to be a confounder because it no longer induces a spurious association between exposure and disease.
The situation is different in figure 8, where C is not in the causal pathway between the unmeasured confounder U and either E or D. As a result, adjusting for C will not remove the spurious association between E and D due to its common cause U. However, if C is strongly associated with U, adjusting for C will remove a large part of the confounding. In the limit, if C were perfectly correlated with U, then all confounding would be removed when adjusting for either C or U. We say that C is a surrogate confounder in figure 8. Often, when a confounder cannot be adequately measured, it is better to adjust for a surrogate confounder than to use the crude odds ratio (1
). For example, if C were a misclassified version of U, the stronger the association between them (i.e., the smaller the measurement error), the better is confounding taken into account.
Let us now turn our attention to figures 1![]()
4. Even though C is not a confounder, is the adjusted ORED|C a valid estimator of the causal effect? No. Adjustment for C is not only unnecessary but harmful. To explain why, let us focus on figure 1. Suppose E represents being on a diet and D represents a recent diagnosis of a non-diet-related cancer. Let C = 1 if the person had recent weight loss greater than 5 kg, and C = 0 otherwise. Assume that dieting does not cause cancer and therefore erase the arrow from E to D. We have seen that two variables may be associated when one is the cause of the other or when they have common causes. Neither case is true in this example, so dieting and cancer are statistically independent (OR = risk ratio (RR) = 1). In other words, knowing that someone was dieting does not change the probability that she develops cancer. Now let us condition on the common effect C (common effects are known as colliders in causal graph theory) and check if E and D remain independent within levels of C. Among those who lost weight (C = 1), does the probability of someone's having cancer change if we know that she was not dieting? Yes, it does. Given that a person lost weight, it is more likely that she had cancer if she was not dieting. Thus, within those who lost weight, dieting and cancer are inversely associated. See tables 1 and 2 for a numerical example.
|
|
In general, conditioning on a common effect or collider C creates a spurious association between E and D (15
Thus, C is a confounder and one needs to adjust for it in figures 5 ![]()
8, but it is a nonconfounder and one should not adjust for it in figures 1![]()
4. This definition of confounding and confounders is not based on the statistical associations found in our data but rather on qualitative background knowledge about the causal structure of the problem under study, which we encoded in causal diagrams. This approach contrasts with the causally blind strategies 1 and 2, which use only statistical associations to decide whether C should be adjusted for, and with strategy 3, which uses statistical conditions supplemented with partial but insufficient a priori causal information.
In fact, the conditions implied by strategies 13 hold true for all figures 1![]()
![]()
![]()
![]()
![]()
8. However, in figure 9, C would be excluded as a confounder by the causal restriction of strategy 3 (that C is not in the causal pathway). The additional causal restriction that C must be a causal risk factor or a marker for a causal risk factor (8
) further restricts the set of possible causal structures in which C may be a confounder to those in figures 3![]()
![]()
![]()
8. Another widely recognized restriction is that the potential confounder cannot be affected by either exposure or outcome (1
, 6
, 16
, 17
), which excludes figures 1
3. As more causal restrictions are applied, fewer causal diagrams are consistent with C's being a confounder. Whether C is or is not truly a confounder depends on the causal structure of the problem under study. No generally applicable statistical approaches will substitute for using a priori causal knowledge to characterize such structure.
|
| AN EXAMPLE FROM BIRTH DEFECTS EPIDEMIOLOGY |
|---|
|
|
|---|
Supplementation with 0.4 mg of folic acid per day around the time of conception has been shown to decrease the risk of neural tube defects in randomized experiments (18
We found that 18 percent of cases and 25 percent of controls used folic acid daily during the exposure period (table 3). The crude (unadjusted) ORED was 0.65 (95 percent confidence interval (CI): 0.45, 0.94), which approximates the crude risk ratio. ORED can be obtained directly from table 3 or as exp(ß1) from the logistic model logit Pr(D = 1|E) = ß0 + ß1E. Table 4 displays the data by levels of C. To estimate the adjusted odds ratio, we stratify (i.e., condition) on all levels of the third variable C, compute the stratum-specific odds ratio, and then calculate a pooled summary measure across strata (e.g., using the Mantel-Haenszel method). We did not detect heterogeneity of the odds ratio between the two strata defined by C (p = 0.43 from the Breslow-Day test for homogeneity), so for our purposes we assume that the no interaction logistic model logit Pr(D = 1|E, C) = ß0 + ß1E + ß2C is correct. The ORED|C adjusted by C (exp(ß1)) was 0.80 (95 percent CI: 0.53, 1.20).
|
|
Which analysis is more appropriate, the crude or the adjusted? We first consider the three common strategies described above:
- Automatic variable selection. We force exposure E as a covariate in the logistic model with D as the outcome. We consider an automatic forward selection procedure, available in most standard statistical software packages in which the variable C is added if the p value associated with its parameter estimate is less than 0.10. As the p value in our data set is less than 0.001, variable C is selected.
- Relative change in estimate greater than 10 percent. The adjusted ORED|C was 0.80, a 23 percent relative change with respect to the crude ORED = 0.65, so the adjusted estimate will be selected.
- Standard rules for confounding. First, we check that C is associated with E in the population; in our data ORCE|D = 0 is 0.50 (95 percent CI: 0.23, 1.07). Second, we check that C is associated with D within the unexposed; in our data ORCD|E = 0 is 15.22 (95 percent CI: 10.09, 22.95). Third, we need to exclude the possibility that C may be in the causal pathway between E and D. The data by themselves are never sufficient to rule out the possibility; however, in our case it was known that C was not plausibly on the causal pathway. Because all three conditions are met, the adjusted estimate will be selected.
We have not as yet unveiled the variable encoded by C in table 4 in order to emphasize that no additional information about C beyond that contained in the data is required by stategies 1 and 2, and only limited external background information is required by strategy 3. However, we have seen in the previous section that knowledge of the causal structure is crucial if we are to decide whether C is a confounder and needs to be adjusted for. In fact, the adjusted ORED|C is biased in four of our diagrams.
In our example, the variable C stands for the event that pregnancy ends either in stillbirth or therapeutic abortion. Should we regard C as a confounder? To answer this question, we would need the true, but possibly unknown, underlying causal structure. Most investigators would agree that figures 1![]()
4 are more likely to represent the true causal structure than figures 5![]()
8. In fact, figures 5
7 are rapidly eliminated because they assume that C occurs before the exposure E or the outcome D.
Yet it is not uncommon to find epidemiologic analyses that adjust for stillbirth/induced abortion, either by stratification or by restricting the analysis to livebirths. This practice is often the unintended consequence of the difficulty of identifying stillbirths and/or ascertaining their maternal exposures. Similarly, analyses of the effects of prenatal exposures frequently adjust for variables, such as maternal weight gain during pregnancy, gestational age, or birth weight, that are likely to be affected by either the exposure or the outcome. The decision to adjust is usually based on statistical criteria only. (Here we assume that the goal is to estimate the total effect. The section "Adjusting for Variables Affected by Exposure and Causal Diagrams" below discusses direct effects.) How much bias is introduced by this decision depends on the strength of the statistical associations between the potential confounder and exposure and outcome (22
, 23
). In our study, the apparent bias was moderate despite the fact that the association between the potential confounder and the outcome was very strong.
For expositional purposes, we have assumed throughout that there were no other confounders of the causal effect of E on D other than possibly the covariate C. This is not a realistic assumption, but it was useful to simplify the problem. In a more realistic analysis in which we adjusted for region, maternal age, whether the pregnancy was planned, and maternal education, the odds ratio was 0.72 (95 percent CI: 0.49, 1.05).
In general, crude and adjusted odds ratios can differ not because of confounding but because of the noncollapsibility property of the odds ratio; that is, the crude odds ratio does not necessarily equal a common stratified odds ratio even if the exposure and stratifying factor are unassociated in the population (24
, 25
). This is an additional reason to avoid the change-in-estimate method for the odds ratio. We did not consider this issue because we assumed that, in our study, the disease was rare so the odds ratio was approximately equal to the risk ratio, which is a collapsible measure.
| SELECTION BIAS, RECALL BIAS, AND CAUSAL DIAGRAMS |
|---|
|
|
|---|
Causal diagrams are useful to represent biases other than confounding, such as selection and recall bias (5
Selection bias induces noncomparability or, equivalently, lack of exchangeability of the exposed and the unexposed, even if they were comparable before the selection. Many authors use noncomparability as a synonym for confounding (7
). We are being careful to separate confounding due to unmeasured common causes from noncomparability induced by selection.
We have so far made the simplifying assumption that exposure is perfectly measured before the outcome occurs. However, case-control studies often ascertain the exposure E' after the outcome is known, as represented in figures 10 and 11. In figure 10, E' is determined by the actual exposure E but not by the outcome D, so misclassification is nondifferential. Any association between E' and D (ORE'D) is therefore due solely to the causal effect of E on D. In figure 11, E' is determined by the actual exposure E and the outcome D, representing, for example, a setting in which there is recall bias because mothers of infants with birth defects have more complete recall of the exposure than mothers of healthy infants. In figure 11, ORE'D can differ from one even under the causal null hypothesis, because part of the association between E' and D is spurious as a result of the causal effect of D on E'. Our study used mothers of infants with other birth defects in an attempt to eliminate the arrow between D and E'. Another example of the situation depicted in figure 11 occurs when studying the effect of an exposure through a biomarker. For example, women with disease-associated weight loss could have altered blood levels of a pesticide residue due to mobilization of residues stored in fat.
|
|
| ADJUSTING FOR VARIABLES AFFECTED BY EXPOSURE AND CAUSAL DIAGRAMS |
|---|
|
|
|---|
An inspection of the causal diagram in figure 9 reveals the two main reasons why adjustment for a variable on the causal pathway is discouraged in the epidemiologic literature (1
The second argument makes clear that, even to estimate the direct effect of E (not mediated through C) on D, 1) it is not valid to adjust for C when there is an unmeasured common cause of C and D, and 2) C's being on a causal pathway from E to D is not a necessary condition for this spurious association to appear. The source of the problem is that C is a marker for an unmeasured causal risk factor U for the outcome, and C is either causally affected by exposure (figure 9) or shares common causes with the exposure (similar to figure 4).
| CONCLUSION |
|---|
|
|
|---|
We have argued that knowledge of the causal structure is a prerequisite to accurately label a variable as a confounder. Taken literally, this statement may impose such an unrealistically high standard on the epidemiologist that many studies simply could not be done at all. Instead, we wish to emphasize that causal inference from observational data requires prior causal assumptions or beliefs, which must be derived from subject-matter knowledge, not from statistical associations detected in the data.
Our goal was to highlight potential inconsistencies between beliefs and actions in data analysis. In general, investigators should not adjust for a variable C unless they believe it may be a confounder. At the very least, researchers should generally avoid stratifying on variables affected by either the exposure or the outcome. Of course, thoughtful and knowledgeable epidemiologists could believe that two or more causal structures, possibly leading to different conclusions regarding confounding, are equally plausible. In that case they should perform multiple analyses and explicitly state the assumptions about causal structure required for the validity of each. One can never be certain that the set of causal structures under consideration includes the true one; this uncertainty and the attendant model uncertainty are unavoidable with observational data.
Causal diagrams are a useful way to summarize, clarify, and communicate one's qualitative beliefs about the causal structure. The use of causal diagrams in epidemiology has been proposed by Greenland et al. (4
). The main advantage of this graphical method is that, while being a natural and simple way to approach causal inference from observational data, it is also rigorous, being mathematically identical to Robins' "g-computation theory" (29![]()
31
).
We have used causal diagrams to describe three possible sources of statistical association between two variables: cause and effect, sharing of common causes, and calculation of the association within levels of a common effect. There is confounding when the association between exposure and disease includes a noncausal component attributable to their having an uncontrolled common cause. There is selection bias when the association between exposure and disease includes a noncausal component attributable to restricting the analysis to certain level(s) of a common effect of exposure and disease or, more generally, to conditioning on a common effect of variables correlated with exposure and disease. In either case, the exposed and the unexposed in the study are not comparable, or exchangeable, which is the ultimate source of the bias. Statistical criteria are insufficient to characterize either confounding or selection bias.
| APPENDIX |
|---|
|
|
|---|
We have used causal DAGs to encode our qualitative a priori assumptions about the underlying biologic mechanisms. These diagrams consist of nodes (variables) and directed edges (arrows). The absence of an arrow between two variables indicates that the investigator believes there is no direct effect of one variable on the other (i.e., a causal effect not mediated through other variables in the DAG). The presence of an arrow indicates that the investigator is unable to assume the absence of a direct effect of one variable on the other. Further, conditioning on its direct causes, each variable is statistically independent of all the variables it does not cause. DAGs are acyclic because the arrows never point from a given variable to any other variable in its past. If, for example, one is interested in representing the causal interplay between nutritional status and infection in children, the DAG could contain an arrow from a variable that represents nutritional status at time t to another one that represents infection at time t + 1 and a second arrow from infection at time t + 1 to nutritional status at time t + 2. We say that a DAG is causal if the common causes of any pair of variables in the graph are also in the DAG.
Our neural tube defects example may be partially represented by the DAG in figure 1. Subject-matter knowledge suggests that neural tube defects (D) are a direct cause of stillbirths/therapeutic abortions (C), and that folic acid supplementation (E) may prevent stillbirths/abortions (C) through its effects on birth defects other than neural tube defects. (In our study, stillborn infants and fetuses therapeutically aborted because of a malformation are identified through review of admissions and discharges at major referral hospitals and clinics and through regular contact with newborn nurseries in community hospitals. Medical records and autopsies, if available, are then reviewed to ascertain birth defects.)
To draw a more realistic DAG according to our causal assumptions, we added the following variables: pregnancy planning (because women who are trying to conceive a pregnancy often take prenatal vitamins in preparation, and they may take better care of themselves in general, which in turn may affect the outcome), maternal education (for similar reasons to pregnancy planning), region (because both exposure and outcome may display geographic variations mediated through socioeconomic, behavioral, dietary, ethnic, cultural, and other factors), and maternal age (a risk factor for some birth defects, maternal age may also affect behavior regarding folic acid supplementation). In our hypothesized DAG, these preexposure variables would be the origin of arrows pointing to E and D. Note that the inclusion of these variables in the DAG does not imply that we are certain about the existence of their causal connections with E and D (e.g., maternal age may not affect the risk of neural tube defects relative to that of other birth defects), but that we are not willing to assume a priori that those connections are absent.
The arguments we used to support the statistical statements derived from causal DAGs were heuristic and relied on our causal intuitions. These arguments, however, have been formalized and mathematically proven (12![]()
14
). Here we present a brief overview of a graphical method called "d-separation" ("d-" stands for directional) (12
, 13
) that allows us to determine whether two given variables are (marginally or conditionally on other variables) independent.
The method of d-separation consists of a set of graphical rules to decide whether two variables are d-separated, which implies that they are independent, or are d-connected, which generally implies that they are not independent. If two variables are d-separated without conditioning on any other variables in the DAG, then they are marginally independent. If two variables are d-separated after conditioning on a set of third variables, then they are conditionally independent (i.e., independent within every joint stratum of the third variables). To explain the method we first need to define the terms "path" and "blocked path." A path is any arrow-based route between two variables in the graph. We define each path to be either blocked or open according to the following graphical rules.
Rule 1. If there are no variables being conditioned on, a path is blocked if and only if two arrowheads on the path collide at some variable on the path. For example, in appendix figure 1, the path L
A
D is open, whereas the path A
D
L is blocked because two arrowheads on the path collide in D. We call D a collider on the path A
D
L.
|
Rule 2. Any path that contains a noncollider that has been conditioned on is blocked. For example, in appendix figure 2, the path between L and D is blocked after conditioning on A. We use a square box around a variable to indicate that we are conditioning on it.
|
Rule 3. A collider that has been conditioned on does not block a path. For example, in appendix figure 3, the path between L and A is open after conditioning on D.
|
Rule 4. A collider that has a descendant that has been conditioned on does not block a path. For example, in appendix figure 4, the path between L and A is open after conditioning on S, a descendant of the collider D.
|
Rules 14 can be summarized as follows. A path is blocked if and only if it contains a noncollider that has been conditioned, or it contains a collider that has not been conditioned on and has no descendants that have been conditioned on.
Two variables are d-separated if all paths between them are blocked (otherwise they are d-connected). Thus, A and L are not marginally independent (d-connected) in appendix figure 1 because there is one open path between them (L
A), despite the other path (A
D
L)'s being blocked by the collider D. In appendix figure 3, however, A and L are marginally independent (d-separated) because the only path between them is blocked by the collider D. In appendix figure 2, we conclude that D is conditionally independent of L, given A. From appendix figure 3 we infer that L is not conditionally independent of A, given D. Appendix figure 4 includes the variable S, representing the symptoms caused by the disease. If conditioning on D opens the path and therefore creates an association between L and A, then conditioning on an effect of the disease (S) also creates an association between L and A. In general, the farther the descendant of the collider is in the chain of causation, the weaker this association will be.
Some conclusions that follow from the method of d-separation are that causes (ancestors) are not independent of their effects (descendants) and vice versa, and that generally two variables are associated if they share a common cause. Another important conclusion is that sharing a common effect does not imply that two causes are associated. Intuitively, whether two variables (the common causes) are correlated cannot be influenced by an event in the future (their effect) (4
), but two causes of a given effect generally become associated once we stratify on the common effect.
Finally, we explain why two variables that are not d-separated may actually be statistically independent. The reason is that it is logically possible that causal effects in opposite directions may exactly cancel out. For example, in appendix figure 5, if the arrow L
D is causative for half the population and preventive for the other half, and if the magnitude of the causative and protective effects is exactly the same, then L and D will be marginally independent despite the fact that they are not d-separated. Because exact cancellation of causal effects is probably a very rare event in epidenidologic applications, d-separation and independence may be treated in practice as equivalent concepts with little risk. In the probably rare occasions in which two variables are simultaneously d-connected and statistically independent, we say that the joint distribution of the variables in the DAG is not faithful to the DAG (14
).
|
| ACKNOWLEDGMENTS |
|---|
This work was supported in part by National Institutes of Health grant R01-AI32475. The Slone Epidemiology Unit Birth Defects Study was supported in part by National Institute of Child Health and Human Development grant HD27697 and National Heart, Lung, and Blood Institute grant HL50763. Additional support for the Slone Epidemiology Unit Birth Defects Study was provided by Hoechst Marion Roussel, Inc. (Kansas City, Missouri), Pfizer, Inc. (New York, New York), the Glaxo-Wellcome Company (Research Triangle Park, North Carolina), and Rhone Pouleuc Rorer (College Park, Pennsylvania).
The authors thank James Robins for inspiring them to write this article and for his many comments, which led to a substantial improvement in the manuscript. The authors also thank Sander Greenland for his detailed suggestions.
| NOTES |
|---|
Reprint requests to Dr. Miguel Hernán, Department of Epidemiology, Harvard School of Public Health, 677 Huntington Avenue, Boston, MA 02115 (e-mail: miguel_hernan{at}post.harvard.edu).
| REFERENCES |
|---|
|
|
|---|
- Greenland S, Neutra R. Control of confounding in the assessment of medical technology. Int J Epidemiol 1980;9:3617.
-
Robins JM, Greenland S. The role of model selection in causal inference from nonexperimental data. Am J Epidemiol 1986;123:392402.
[Free Full Text] -
Weinberg CR. Towards a clearer definition of confounding. Am J Epidemiol 1993;137:18.
[Abstract/Free Full Text] - Greenland S, Pearl J, Robins JM. Causal diagrams for epidemiologic research. Epidemiology 1999;10:3748.[Web of Science][Medline]
- Robins JM. Data, design, and background knowledge in etiologic inference. Epidemiology 2001;12:31320.[Web of Science][Medline]
- Rothman KJ, Greenland S. Modern epidemiology. 2nd ed. Philadelphia, PA: Lippincott-Raven, 1998.
-
Greenland S, Robins JM. Identifiability, exchangeability, and epidemiological confounding. Int J Epidemiol 1986;15:41319.
[Abstract/Free Full Text] -
Greenland S. Modeling and variable selection in epidemiologic analysis. Am J Public Health 1989;79:3409.
[Abstract/Free Full Text] - Kleinbaum DG, Kupper LL, Muller KE, et al. Applied regression analysis and other multivariable methods. 3rd ed. Pacific Grove, CA: Duxbury Press, 1998.
-
Grayson DA. Confounding confounding. Am J Epidemiol 1987;126:54653.
[Abstract/Free Full Text] - Szklo M, Nieto FJ. Epidemiology: beyond the basics. Gaithersburg, MD: Aspen Publishers, Inc, 1999.
- Pearl J. Causality. Cambridge, United Kingdom: Cambridge University Press, 2000.
-
Pearl J. Causal diagrams for empirical research. Biometrika 1995;82:669710.
[Abstract/Free Full Text] - Spirtes P, Glymour C, Scheines R. Causation, prediction, and search. Lecture notes in statistics 81. New York, NY: Springer-Verlag, 1993.
- Hernán MA, Robins JM. Method for conducting sensitivity analysis. (Letter). Biometrics 1999;55:131618.[Medline]
- Cox DR. Planning of experiments. New York, NY: John Wiley & Sons, 1958.
-
Miettinen O. Confounding and effect-modification. Am J Epidemiol 1974;100:3503.
[Free Full Text] - MRC Vitamin Study Research Group. Prevention of neural tube defects: results of the Medical Research Council Vitamin Study. Lancet 1991;338:1317.[Web of Science][Medline]
-
Werler AM, Shapiro S, Mitchell AA. Periconceptional folic acid exposure and risk of occurrent neural tube defects. JAMA 1993;269:125761.
[Abstract/Free Full Text] -
Berry RJ, Li Z, Erickson JD, et al. Prevention of neural-tube defects with folic acid in China. N Engl J Med 1999;341:148590.
[Abstract/Free Full Text] - Use of folic acid-containing supplements among women of childbearing ageUnited States, 1997. MMWR Morb Mortal Wkly Rep 1998;47:1314.[Medline]
- Bross IDJ. Spurious effects from an extraneous variable. J Chronic Dis 1966;19:63747.[Web of Science][Medline]
- Walker AM. Observation and inference: an introduction to the methods of epidemiology. Newton Lower Falls, MA: Epidemiology Resources, Inc, 1991.
-
Miettinen OS, Cook EF. Confounding: essence and detection. Am J Epidemiol 1981;114:593603.
[Abstract/Free Full Text] - Greenland S, Robins JM, Pearl J. Confounding and collapsibility in causal inference. Stat Sci 1999;14:2946.
-
Khoury MJ, Flanders WD, James LM, et al. Human teratogens, prenatal mortality, and selection bias. Am J Epidemiol 1989;130:36170.
[Abstract/Free Full Text] - Hook EB, Regal RR. Conceptus viability, malformation, and suspect mutagens or teratogens in humans. The Yule-Simpson paradox and implications for inferences of causality in studies of mutagenicity or teratogenicity limited to human livebirths. Teratology 1991;43:539.[Web of Science][Medline]
- Robins JM, Greenland S. Identifiability and exchangeability for direct and indirect effects. Epidemiology 1992;3:14355.[Web of Science][Medline]
- Robins JM. A graphical approach to the identification and estimation of causal parameters in mortality studies with sustained exposure periods. J Chronic Dis 1987;40(suppl 2):139s61s.
-
Robins JM. Comment. Biometrika 1995;82:6958.
[Free Full Text] - Robins JM. Causal inference from complex longitudinal data. In: Berkane M, ed. Latent variable modeling with applications to causality. New York, NY: Springer-Verlag, 1997:69117.
- Scholl TO, Johnson WG. Folic acid: influence on the outcome of pregnancy. Am J Clin Nutr 2000;71(suppl):1295s303s.
-
Mili F, Edmonds LD, Khoury MJ, et al. Prevalence of birth defects among low-birth-weight infants. A population study. Am J Dis Child 1991;145:131318.
[Abstract/Free Full Text] -
Mathews F, Yudkin P, Smith R, et al. Nutrient intakes during pregnancy: the influence of smoking status and age. J Epidemiol Community Health 2000;54:1723.
[Abstract/Free Full Text] - Roth J, Hendrickson J, Schilling M, et al. The risk of teen mothers having low birth weight babies: implications of recent medical research for school health personnel. J Sch Health 1998;68:2715.[Web of Science][Medline]
-
Werler MM, Louik C, Shapiro S, et al. Prepregnant weight in relation to risk of neural tube defects. JAMA 1996;275:108992.
[Abstract/Free Full Text] - Pastuszak A, Bhatia D, Okotore B, et al. Preconception counseling and women's compliance with folic acid supplementation. Can Fam Physician 1999;45:20537.[Web of Science][Medline]
-
Lie RT, Wilcox AJ, Skjaerven R. A population-based study of the risk of recurrence of birth defects. N Engl J Med 1994;331:14.
[Abstract/Free Full Text] - Van der Put NMJ, Gabreels F, Stevens EMB, et al. A second common mutation in the methylenetetrahydrofolate reductase gene: an additional risk factor for neural-tube defects. Am J Hum Genet 1998;62:104451.[Web of Science][Medline]
-
Botto LD, Yang Q. 5,10-Methylenetetrahydrofolate reductase gene variants and congenital anomalies: a HuGE review. Am J Epidemiol 2000;151:86277.
[Abstract/Free Full Text]
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. G. Barnett, R. Batra, N. Graves, J. Edgeworth, J. Robotham, and B. Cooper Using a Longitudinal Model to Estimate the Effect of Methicillin-resistant Staphylococcus aureus Infection on Length of Stay in an Intensive Care Unit Am. J. Epidemiol., November 1, 2009; 170(9): 1186 - 1194. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Hernan and S. R. Cole Invited Commentary: Causal Diagrams and Measurement Bias Am. J. Epidemiol., October 15, 2009; 170(8): 959 - 962. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. H. Pedersen, T. B. Henriksen, M. Vestergaard, J. Olsen, and B. H. Bech Selective serotonin reuptake inhibitors in pregnancy and congenital malformations: population based cohort study BMJ, September 23, 2009; 339(sep23_1): b3569 - b3569. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. W. Platt, E. F. Schisterman, and S. R. Cole Time-modified Confounding Am. J. Epidemiol., September 15, 2009; 170(6): 687 - 694. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. L. Monda, C. M. Ballantyne, and K. E. North Longitudinal impact of physical activity on lipid profiles in middle-aged adults: the Atherosclerosis Risk in Communities Study J. Lipid Res., August 1, 2009; 50(8): 1685 - 1691. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-Y. Weng, Y.-H. Hsueh, L. L. McV. Messam, and I. Hertz-Picciotto Methods of Covariate Selection: Directed Acyclic Graphs and the Change-in-Estimate Procedure Am. J. Epidemiol., May 15, 2009; 169(10): 1182 - 1190. [Abstract] [Full Text] [PDF] |
||||
![]() |
S O Shaheen, K Northstone, R B Newson, P M Emmett, A Sherriff, and A J Henderson Dietary patterns in pregnancy and respiratory and atopic outcomes in childhood Thorax, May 1, 2009; 64(5): 411 - 417. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. N Baker, S. J Wheeler, T. A Sanders, J. E Thomas, C. J Hutchinson, K. Clarke, J. L Berry, R. L Jones, P. T Seed, and L. Poston A prospective study of micronutrient status in adolescent pregnancy Am. J. Clinical Nutrition, April 1, 2009; 89(4): 1114 - 1124. [Abstract] [Full Text] [PDF] |
||||
![]() |
M Ruiz-Canela, J I Valle-Mansilla, and D P Sulmasy Researchers' preferences and attitudes on ethical aspects of genomics research: a comparative study between the USA and Spain J. Med. Ethics, April 1, 2009; 35(4): 251 - 257. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Sonestedt, S. Borgquist, U. Ericson, B. Gullberg, H. Olsson, H. Adlercreutz, G. Landberg, and E. Wirfalt Enterolactone Is Differently Associated with Estrogen Receptor {beta}-Negative and -Positive Breast Cancer in a Swedish Nested Case-Control Study Cancer Epidemiol. Biomarkers Prev., November 1, 2008; 17(11): 3241 - 3251. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Xu, J.-P. Collet, S. Shapiro, Y. Lin, T. Yang, R. W. Platt, C. Wang, and J. Bourbeau Independent Effect of Depression and Anxiety on Chronic Obstructive Pulmonary Disease Exacerbations and Hospitalizations Am. J. Respir. Crit. Care Med., November 1, 2008; 178(9): 913 - 920. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. M. Grice, P. M. McGovern, and B. H. Alexander Flexible work arrangements and work-family conflict after childbirth Occup. Med., October 1, 2008; 58(7): 468 - 474. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. R. Cole and M. A. Hernan Constructing Inverse Probability Weights for Marginal Structural Models Am. J. Epidemiol., September 15, 2008; 168(6): 656 - 664. [Abstract] [Full Text] [PDF] |
||||
![]() |
L Richiardi, F Barone-Adesi, F Merletti, and N Pearce Using directed acyclic graphs to consider adjustment for socioeconomic status in occupational cancer studies J Epidemiol Community Health, July 1, 2008; 62(7): e14 - e14. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. O. Asvold, T. Bjoro, T. I. Nilsen, and L. J. Vatten Tobacco Smoking and Thyroid Function: Is Weight Gain a Confounder? Reply Arch Intern Med, January 14, 2008; 168(1): 114 - 114. [Full Text] [PDF] |
||||
![]() |
S. Kim, T. O. Keku, C. Martin, J. Galanko, J. T. Woosley, J. C. Schroeder, J. A. Satia, S. Halabi, and R. S. Sandler Circulating Levels of Inflammatory Cytokines and Risk of Colorectal Adenomas Cancer Res., January 1, 2008; 68(1): 323 - 328. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Shrier, J.-F. Boivin, R. J. Steele, R. W. Platt, A. Furlan, R. Kakuma, J. Brophy, and M. Rossignol Should Meta-Analyses of Interventions Include Observational Studies in Addition to Randomized Controlled Trials? A Critical Examination of Underlying Principles Am. J. Epidemiol., November 15, 2007; 166(10): 1203 - 1209. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. E. Gilman Invited Commentary: The Life Course Epidemiology of Depression Am. J. Epidemiol., November 15, 2007; 166(10): 1134 - 1137. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. J. VanderWeele and J. M. Robins Directed Acyclic Graphs, Sufficient Causes, and the Properties of Conditioning on a Common Effect Am. J. Epidemiol., November 1, 2007; 166(9): 1096 - 1104. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Fewell, G. Davey Smith, and J. A. C. Sterne The Impact of Residual and Unmeasured Confounding in Epidemiologic Studies: A Simulation Study Am. J. Epidemiol., September 15, 2007; 166(6): 646 - 655. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. K. Callaway, H. D. McIntyre, M. O'Callaghan, G. M. Williams, J. M. Najman, and D. A. Lawlor The Association of Hypertensive Disorders of Pregnancy with Weight Gain over the Subsequent 21 Years: Findings from a Prospective Cohort Study Am. J. Epidemiol., August 15, 2007; 166(4): 421 - 428. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. W. Whitcomb, E. F. Schisterman, M. A. Klebanoff, M. Baumgarten, A. Rhoton-Vlasak, X. Luo, and N. Chegini Circulating Chemokine Levels and Miscarriage Am. J. Epidemiol., August 1, 2007; 166(3): 323 - 331. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Hernandez-Diaz, L. J. Van Marter, M. M. Werler, C. Louik, and A. A. Mitchell Risk Factors for Persistent Pulmonary Hypertension of the Newborn Pediatrics, August 1, 2007; 120(2): e272 - e282. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Lawlor, D. A. Leon, and F. Rasmussen Growth Trajectory Matters: Interpreting the Associations among Birth Weight, Concurrent Body Size, and Systolic Blood Pressure in a Cohort Study of 378,707 Swedish Men Am. J. Epidemiol., June 15, 2007; 165(12): 1405 - 1412. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A Beydoun, J. S Kaufman, J. A Satia, W. Rosamond, and A. R Folsom Plasma n-3 fatty acids and the risk of cognitive decline in older adults: the Atherosclerosis Risk in Communities Study Am. J. Clinical Nutrition, April 1, 2007; 85(4): 1103 - 1111. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Rasmussen and L. M. Irgens Pregnancy-Induced Hypertension in Women Who Were Born Small Hypertension, April 1, 2007; 49(4): 806 - 812. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Einarsdottir, K. Humphreys, C. Bonnard, Y. Li, Y. Li, K. S. Chia, E. T. Liu, P. Hall, J. Liu, and S. Wedren Effect of ATM, CHEK2 and ERBB2 TAGSNPs and haplotypes on endometrial cancer risk Hum. Mol. Genet., January 15, 2007; 16(2): 154 - 164. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. U. Rosenberg, K. Einarsdottir, E. I. Friman, S. Wedren, P. W. Dickman, P. Hall, and C. Magnusson Risk Factors for Hormone Receptor-Defined Breast Cancer in Postmenopausal Women Cancer Epidemiol. Biomarkers Prev., December 1, 2006; 15(12): 2482 - 2488. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Hernandez-Diaz, E. F. Schisterman, and M. A. Hernan Hernandez-Diaz et al. Respond to "The Perils of Birth Weight" Am. J. Epidemiol., December 1, 2006; 164(11): 1124 - 1125. [Full Text] [PDF] |
||||
![]() |
S. Hernandez-Diaz, E. F. Schisterman, and M. A. Hernan The Birth Weight "Paradox" Uncovered? Am. J. Epidemiol., December 1, 2006; 164(11): 1115 - 1120. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Shih, T. A. Glass, K. Bandeen-Roche, M. C. Carlson, K. I. Bolla, A. C. Todd, and B. S. Schwartz Environmental lead exposure and cognitive function in community-dwelling older adults Neurology, November 14, 2006; 67(9): 1556 - 1562. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. F. Carlson, D. Langner, B. H. Alexander, J. G. Gurney, S. G. Gerberich, A. D. Ryan, C. M. Renier, and S. J. Mongin The Association Between Parents' Past Agricultural Injuries and Their Children's Risk of Injury: Analyses From the Regional Rural Injury Study-II. Arch Pediatr Adolesc Med, November 1, 2006; 160(11): 1137 - 1142. [Abstract] [Full Text] [PDF] |
||||
![]() |
L S Kurszewski, S G Gerberich, R C Serfass, A D Ryan, C M Renier, B H Alexander, K F. Carlson, and A S Masten Sports and recreational injuries: regional rural injury study-II: impact on agricultural households and operations Br. J. Sports Med., June 1, 2006; 40(6): 527 - 535. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Blignaut, L.L. Patton, W. Nittayananta, V. Ramirez-Amador, K. Ranganathan, and A. Chattopadhyay (A3) HIV Phenotypes, Oral Lesions, and Management of HIV-related Disease Advances in Dental Research, April 1, 2006; 19(1): 122 - 129. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. B. Louis, V. Dukic, P. J Heagerty, T. A Louis, C. D Lynch, L. M Ryan, E. F Schisterman, A. Trumble, and Pregnancy Modeling Working Group Analysis of repeated pregnancy outcomes Statistical Methods in Medical Research, April 1, 2006; 15(2): 103 - 126. [Abstract] [PDF] |
||||
![]() |
M. Joffe and J. Mindell Complex Causal Process Diagrams for Analyzing the Health Impacts of Policy Interventions Am J Public Health, March 1, 2006; 96(3): 473 - 479. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. D. Chambers, S. Hernandez-Diaz, L. J. Van Marter, M. M. Werler, C. Louik, K. L. Jones, and A. A. Mitchell Selective Serotonin-Reuptake Inhibitors and Risk of Persistent Pulmonary Hypertension of the Newborn N. Engl. J. Med., February 9, 2006; 354(6): 579 - 587. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. L. De Stavola, D. Nitsch, I. dos Santos Silva, V. McCormack, R. Hardy, V. Mann, T. J. Cole, S. Morton, and D. A. Leon Statistical Issues in Life Course Epidemiology Am. J. Epidemiol., January 1, 2006; 163(1): 84 - 96. [Abstract] [Full Text] [PDF] |
||||
![]() |
N M Nachreiner, S G Gerberich, P M McGovern, T R Church, H E Hansen, M S Geisser, and A D Ryan Relation between policies and work related assault: Minnesota Nurses' Study Occup. Environ. Med., October 1, 2005; 62(10): 675 - 681. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. M. Glymour, J. Weuve, L. F. Berkman, I. Kawachi, and J. M. Robins When Is Baseline Adjustment Useful in Analyses of Change? An Example with Education and Cognitive Change Am. J. Epidemiol., August 1, 2005; 162(3): 267 - 278. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. J. Newschaffer and S. R. Cole Invited Commentary: Risk Factors for Autism--Perinatal Factors, Parental Psychiatric History, and Socioeconomic Status Am. J. Epidemiol., May 15, 2005; 161(10): 926 - 928. [Full Text] [PDF] |
||||
![]() |
G. D. Smith, D. A. Lawlor, R. Harbord, N. Timpson, A. Rumley, G. D.O. Lowe, I. N.M. Day, and S. Ebrahim Association of C-Reactive Protein With Blood Pressure and Hypertension: Life Course Confounding and Mendelian Randomization Tests of Causality Arterioscler Thromb Vasc Biol, May 1, 2005; 25(5): 1051 - 1056. [Abstract] [Full Text] [PDF] |
||||
![]() |
L Richiardi, F Forastiere, P Boffetta, L Simonato, and F Merletti Effect of different approaches to treatment of smoking as a potential confounder in a case-control study on occupational exposures Occup. Environ. Med., February 1, 2005; 62(2): 101 - 104. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-K. Tu, R. West, G. T. H. Ellison, and M. S. Gilthorpe Why Evidence for the Fetal Origins of Adult Disease Might Be a Statistical Artifact: The "Reversal Paradox" for the Relation between Birth Weight and Blood Pressure in Later Life Am. J. Epidemiol., January 1, 2005; 161(1): 27 - 32. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A Lawlor, G. Davey Smith, and S. Ebrahim Socioeconomic Position and Hormone Replacement Therapy Use: Explaining the Discrepancy in Evidence From Observational and Randomized Controlled Trials Am J Public Health, December 1, 2004; 94(12): 2149 - 2154. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. J. S. Christenfeld, R. P. Sloan, D. Carroll, and S. Greenland Risk Factors, Confounding, and the Illusion of Statistical Control Psychosom Med, November 1, 2004; 66(6): 868 - 875. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Hoffmann and M. M. Bergmann RE: "MIDLIFE DIETARY INTAKE OF ANTIOXIDANTS AND RISK OF LATE-LIFE INCIDENT DEMENTIA: THE HONOLULU-ASIA AGING STUDY" Am. J. Epidemiol., October 1, 2004; 160(7): 717 - 717. [Full Text] [PDF] |
||||
![]() |
T. Harder and A. Plagemann RE: "ASSOCIATION OF A WOMAN'S OWN BIRTH WEIGHT WITH HER SUBSEQUENT RISK FOR PREGNANCY-INDUCED HYPERTENSION" Am. J. Epidemiol., September 1, 2004; 160(5): 507 - 508. [Full Text] [PDF] |
||||
![]() |
S G Gerberich, T R Church, P M McGovern, H E Hansen, N M Nachreiner, M S Geisser, A D Ryan, S J Mongin, and G D Watt An epidemiological study of the magnitude and consequences of work related violence: the Minnesota Nurses' Study Occup. Environ. Med., June 1, 2004; 61(6): 495 - 503. [Abstract] [Full Text] |
||||
![]() |
S. Hernandez-Diaz, M. A. Hernan, K. Meyer, M. M. Werler, and A. A. Mitchell Case-Crossover and Case-Time-Control Designs in Birth Defects Epidemiology Am. J. Epidemiol., August 15, 2003; 158(4): 385 - 391. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Greenland and B. Brumback An overview of relations among causal modelling methods Int. J. Epidemiol., October 1, 2002; 31(5): 1030 - 1037. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
































