American Journal of Epidemiology Advance Access originally published online on January 12, 2006
American Journal of Epidemiology 2006 163(4):384-391; doi:10.1093/aje/kwj052
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Practice of Epidemiology |
Agreement between Responses to a Standardized Asthma Questionnaire and a Questionnaire following a Demonstration of Asthma Symptoms in Adults
1 Department of Public Health Sciences, King's College London, London, United Kingdom
2 Institute of Bio-Medical Science, Faculty of Medicine, University of Chile, Santiago, Chile
Correspondence to Nigel C. Smeeton, Department of Public Health Sciences, King's College London, Capital House, Fifth Floor, 42 Weston Street, London SE1 3QD, United Kingdom (e-mail: nigel.smeeton{at}kcl.ac.uk).
Received for publication August 1, 2005. Accepted for publication October 7, 2005.
| ABSTRACT |
|---|
|
|
|---|
Asthma epidemiology relies heavily on standardized questionnaires, but little is known about the understanding of asthma symptoms among adults in the community. In 2004, the authors assessed the level of agreement between responses to a standardized questionnaire and responses to a questionnaire completed by participants after viewing a demonstration of asthma symptoms. The study involved 601 young adults from Chile. The field-workers were trained to explain and demonstrate the asthma symptoms to the participants. The symptoms were wheeze, waking at night with breathlessness, breathlessness following exercise, and waking with cough. The kappa statistic did not exceed 0.4, and the recorded prevalence of asthma symptoms following the demonstration was 3060% lower than that for the standardized questionnaire. Using bronchial responsiveness as the proxy gold standard, the positive likelihood ratios for wheeze and waking short of breath were higher following symptom demonstration. The low agreement between the standardized questionnaire and the postdemonstration questionnaire and the likelihood ratios' closeness to 1 for the standardized questionnaire decreases the authors' confidence in the appropriateness of this tool for estimating the prevalence of asthma in the community. For etiologic studies of asthma, it may contribute to the lack of consistency between different studies analyzing the same etiologic exposures.
asthma; data collection; epidemiologic methods; questionnaires; statistics
Abbreviations: ECRHS, European Community Respiratory Health Survey; FEV1, forced expiratory volume in 1 second; ISAAC, International Study of Asthma and Allergies in Childhood
| INTRODUCTION |
|---|
|
|
|---|
Although the identification of asthma by questionnaire remains contentious (1
The value of standardized asthma questionnaires could be supported on the basis of the strong association between asthma symptoms and objective measurements such as lung function, bronchial hyperresponsiveness to challenge, and sensitization to allergens (7
9
). There is also evidence that the ISAAC questionnaire and the ISAAC video questionnaire are of similar value for identifying adolescents with bronchial hyperresponsiveness (10
). However, in community studies, only a small proportion of persons with asthma symptoms are also sensitized to allergens, are positive for bronchial hyperresponsiveness, or have a low forced expiratory volume in 1 second (FEV1) (9
, 11
, 12
), and none of these objective measurements are specific to asthma.
Although standardized asthma questionnaires have fulfilled an essential role in epidemiologic studies, it is important to understand their limitations. Peat et al. (13
) advised caution in the interpretation of responses to asthma questionnaires. In ISAAC, the agreement between responses to a questionnaire and responses following a video has been reported in several papers (10
, 14
, 15
). Equivalent information is not available for adults, who may differ in their interpretation of symptoms in comparison with adolescents or mothers reporting on behalf of their children. In a semirural area of Chile, we studied the level of agreement between responses to the European Community Respiratory Health Survey (ECRHS) questionnaire and responses to a demonstration of asthma symptoms based on the video developed in ISAAC (4
).
| MATERIALS AND METHODS |
|---|
|
|
|---|
Population and design
The current analysis was based on information obtained on two occasions. On the first occasion, 1,232 subjects randomly selected from persons born at the hospital in Limache, Chile, between 1974 and 1978 were studied at the ages of 2228 years (9
Limache is an agricultural town in central Chile with a population of approximately 53,000 Spanish-speaking inhabitants (17
). Many of its agricultural products are exported, and its level of poverty broadly corresponds to the median for Chile (18
, 19
). Most young adults in Limache have received at least 10 years of full-time education.
Data collection
In the first stage of the study, participants completed the Spanish version of the ECRHS questionnaire at home (16
, 19
). The occurrence of the following asthma symptoms in the previous 12 months was considered for this analysis: wheezing, waking with shortness of breath, breathlessness following exercise, and waking with a cough. We asked about having ever reported asthma, this being related mostly to physician-diagnosed asthma. Skin tests for sensitization to eight common allergens were carried out (16
, 19
). We assessed bronchial hyperresponsiveness to a methacholine challenge using the tidal breathing method (20
). Increasing concentrations of methacholine (0.5, 1, 4, 8, and 16 mg/ml) were used with a Hudson nebulizer (Hudson Inc., Temecula, California). FEV1 was measured using a Vitalograph 2120 (Vitalograph Ltd., Buckingham, United Kingdom) and Spirotrac IV software (Vitalograph Ltd.) following American Thoracic Society norms (21
). A participant with an FEV1 decrease of 20 percent in comparison with baseline FEV1 at any concentration up to 16 mg/ml was considered positive for bronchial hyperresponsiveness. These measurements were carried out in a hospital environment. For the purpose of this analysis, we used as a measure of socioeconomic status the presence of material belongings in the household (gas-fueled water-heating devices, personal computers, refrigerators, washing machines, and microwave ovens). Smoking behavior was assessed using the items in the ECRHS questionnaire. In addition, we analyzed the level of agreement in relation to length of full-time education.
Prior to the symptom demonstration, four field-workers, three of whom were university students and the other a psychologist, were given a tutorial about asthma symptoms. They then observed the ISAAC video several times and listened to a recording of the sounds of wheeze and cough. The field-workers had to mimic wheeze and an asthmatic cough separately in front of the rest of the team. The procedure was pilot-tested with several subjects who were not participating in the study in Limache. The field-workers did not commence data collection until they were considered proficient and were confident in the use of procedures. Their supervisor assessed the quality of the entire process at the beginning of the data collection period. To ensure that procedures were carried out as described in the instructions to field-workers, the supervisor visited households unannounced while the field-workers were administering the questionnaire. We did not assess repeatability between observers, since we would have needed another sample and this would have been time-consuming.
At the second study visit, the field-workers administered a list of 50 physical symptoms to assess somatization (22
). We excluded from the analysis four respiratory symptoms on the list. Somatization score was calculated as the number of symptoms indicated by the participant (23
), grouped as low (010), medium (11
20
), or high (>20).
During the second visit, the field-workers demonstrated respiratory symptoms as shown in the ISAAC video in relation to wheeze and cough. Following the demonstration, the participants were asked the same questions as those in the ISAAC video questionnaire. The sequence shown was related to wheezing at rest, exercise-induced wheezing, waking at night with wheeze, and waking at night with cough. The participants were asked whether they had had respiratory symptoms similar to those shown by the field-worker; if yes, whether the symptoms had occurred in the past year; and, if so, whether the symptoms had occurred more than once per month in relation to each item in the sequence. The field-worker demonstrated the symptom until he or she was convinced that the participant indicated an understanding of the characteristics of the symptom shown. We could not use the ISAAC video in our study, because few homes in Limache have video equipment, and in any case it is inappropriate to use a video portraying only adolescents in a study of adults.
Statistical analysis
For each of the four symptoms, the overall proportion of agreement between the questionnaire item and the demonstration by the field-worker was calculated. In addition, the proportions of positive and negative agreement (24
) and the chance-corrected kappa statistic (25
) were determined. Kappa has the value 0 when agreement is the same as that expected by chance, 1 if agreement is perfect, and 1 for complete disagreement.
For each symptom, we investigated the effect of certain characteristics on the level of agreement by comparing kappa statistics by category. Variables considered were gender (male, female), atopy (yes, no), bronchial hyperresponsiveness (positive, negative), current smoking (yes, no), somatization score (number of symptoms (010, 1120, or >20)), number of material belongings (from the five items specified above (01, 23, or 45)), and length of full-time education (010, 1112, or
13 years). The equality of subgroup kappa values was tested using the method of Donner et al. (26
). Analyses were performed using Stata, version 8.0 (27
). Mean kappa values were compared by category across symptoms using the paired t test (two categories) or repeated-measures analysis of variance (more than two categories).
The validity of the questionnaire and symptom demonstration as screening tools against a proxy gold standard for asthma was investigated for each of the four symptoms in turn. Although bronchial hyperresponsiveness cannot be considered specific to asthma, it is the best proxy measure available for assessing the validity of reported asthma in population studies. Thus, the bronchial hyperresponsiveness result (positive or negative) was taken as the indicator of "true" asthma. We describe it as "relative validity" to stress the lack of a true gold standard in this analysis. The positive likelihood ratio (sensitivity/(1 specificity)) and negative likelihood ratio ((1 sensitivity)/specificity) and their 95 percent confidence intervals (28
) were used to compare the relative validity of each symptom in screening for asthma with the questionnaire and symptom demonstration.
| RESULTS |
|---|
|
|
|---|
The characteristics of the subjects are shown in table 1. Physician-diagnosed asthma was uncommon in this population. Tables 2 and 3 show the prevalences of wheeze in the past 12 months and shortness of breath upon exercising in the past 12 months. The results for awakening short of breath in the past 12 months and being awakened by an attack of coughing in the past 12 months (not shown) displayed a similar pattern. Because of the oversampling of subjects with asthma symptoms from the initial survey, prevalences according to the standardized questionnaire were high. Regardless of the symptom explored, the prevalence values based on the questionnaire were higher than those obtained from symptom demonstration (tables 2 and 3). The proportion of agreement between the questionnaire and the demonstration was typically between 0.6 and 0.8, with positive agreement being considerably smaller than negative agreement. All kappa values were small but positive.
|
|
|
For wheeze in the previous 12 months (table 2), there was an indication of a higher level of agreement for persons with bronchial hyperresponsiveness than for those with a negative response (p = 0.057). Agreement was greater for persons with a medium level of somatization (p = 0.019), and there was a negative association between agreement and number of possessions (p = 0.027). Shortness of breath following exercise showed no agreement between questionnaire and demonstration for subjects with the most possessions, unlike the other possession groups, which had low but positive agreement (p = 0.040) (table 3). Kappa values did not show a pattern according to the length of full-time education for any of the symptoms (not shown).
Differences between subgroups were assessed by mean kappa values across the four symptoms (wheeze, awakening short of breath, shortness of breath on exercise, and cough attack) (table 4). Males had consistently higher agreement than did females (p = 0.0153), as did persons positive for bronchial hyperresponsiveness as compared with the other participants (p = 0.0466). There was a similar trend of higher agreement in subjects with atopy (p = 0.0846) and some differences for number of possessions (p = 0.0882). For the smoking and somatization groups, there were no consistent trends in level of agreement across the groups.
|
For wheeze and awakening short of breath, the positive and negative likelihood ratio 95 percent confidence intervals obtained using the standardized questionnaire contained the value 1, indicating no discriminatory value (table 5). In contrast, the positive and negative likelihood ratio 95 percent confidence intervals following symptom demonstration did not include the value 1 for any of the four symptoms, indicating discriminatory value, albeit only of a moderate level. Awakening short of breath had a relatively high positive likelihood ratio (2.62) when symptom demonstration was used. Apart from cough attack, for which the difference was small, positive likelihood ratios were greater for symptom demonstration than for the questionnaire. Negative likelihood ratios were all close to 1 for both the questionnaire and symptom demonstration.
|
| DISCUSSION |
|---|
|
|
|---|
In this study, levels of agreement between asthma symptoms reported on a questionnaire and symptoms reported following a demonstration of asthma symptoms were poor, with kappa values always less than 0.4, which is considered the lowest cutoff point for a fair degree of agreement (29
Few studies have assessed level of agreement using two administrations of the same questionnaire. A study conducted in Malaysia showed that most questions from the ISAAC questionnaire and 60 percent of the questions from the American Thoracic Society questionnaire had kappa values greater than 0.4 when the two questionnaires were administered on two occasions 1 month apart (30
). However, the time gap in this study was much longer, being approximately 18 months between the completion of the ECRHS questionnaire and the postdemonstration questionnaire. Undeniably, some of the persons with asthma symptoms would have changed their status over this period of time. Estimation of within-person change in status is difficult with the approaches currently available, as investigators from a large follow-up study argued (31
). These changes could not be associated with the management of the condition, since appropriate treatment in this population was not available (32
). The time gap cannot explain the marked decrease in prevalence after the demonstration of symptoms, since net changes per 10 years of follow-up have been shown to be small (31
).
Variations in symptoms due to seasonal effects could have lowered the level of agreement between the two questionnaires, but this lack of consistency would not explain the generally better performance of the questionnaire following a demonstration.
Our study extends to adults the current knowledge regarding the perception of asthma symptoms by adolescents. Crane et al. (4
) hypothesized that adults and children may perceive asthma symptoms differently. We are not aware of any comparative studies of asthma symptoms in adolescents and adults. However, the adult participants in our study had the same lack of agreement between responses upon questionnaire assessment and after symptom demonstration as that observed with adolescents. As is well documented, large differences in prevalence between the methods resulted in low kappa values (4
, 24
); negative agreement tended to be high, but positive agreement was usually low (4
). These results extend our knowledge by demonstrating that a higher degree of agreement can be associated with the subject's clinical, social, and gender traits, but these increased agreement levels were still disappointing. In addition, we have shown an increase in relative validity following the demonstration of asthma symptoms compared with the use of a standardized questionnaire alone.
One consideration from this study is whether results from a semirural geographic area in a middle-income country can be extrapolated to developed countries (16
, 19
). Our sample was based on young adults with a reasonable level of education. The ECRHS questionnaire was administered to the participants, and the participants had ample opportunity for clarifying the questions. In our study, persons with a higher socioeconomic level had a lower degree of agreement than persons from poorer sectors, although there was no association between agreement and length of full-time education. It is not easy to explain why the level of agreement would be slightly higher for persons with fewer possessions with respect to wheeze and shortness of breath following exercise. Respiratory symptoms are more frequent in less-affluent groups, and severe asthma is more common in poor sectors of the community (19
). It is possible that persons in lower socioeconomic groups might have more experience with the characteristics of asthma symptoms, enabling them to recognize the symptoms more easily from the demonstration.
We believe that our findings can be extrapolated with caution to older age groups or other ethnic groups, but replication of our findings in other populations would reinforce our beliefs. It is important to conduct further studies, because the exact sound of wheeze is subjective and difficult to replicate.
We expected that persons with atopy would be more consistent in recognizing asthma symptoms than persons without this trait, which is highly associated with asthma. The reason for such a conjecture was that persons with atopy are more likely to have experienced definite asthma symptoms. In our study, there was some evidence for such a trend, but the kappa statistics for differences were inconsistent across the four asthma symptoms explored.
We included somatization in the analysis because we hypothesized that subjects with many physical symptoms indicative of somatization would be more inconsistent in their responses than subjects with fewer physical symptoms. This relation was not seen; persons with an intermediate number of physical symptoms had a higher level of agreement than persons with few symptoms or a high number of symptoms.
The consequences, for community studies, of low agreement between the standardized questionnaire and the postdemonstration questionnaire and the lower relative validity of the standardized questionnaire when it was not preceded by a demonstration are twofold. Firstly, the low agreement provides a degree of uncertainty as to whether estimates of asthma prevalence in the community are appropriate. The two estimates of prevalence in this study were markedly different. If the postdemonstration questionnaire were considered appropriate, it would greatly decrease the prevalence of asthma usually reported in studies using the ISAAC or ECRHS questionnaire.
In terms of etiologic studies of asthma, the low agreement between methods of symptom assessment and the lower relative validity of not using a symptom demonstration provide a possible explanation for the lack of consistency between studies regarding the impact of diet, poverty, and infections on asthma. It is possible that by focusing on obvious symptoms of asthma, the demonstration could have caused participants to overlook minor symptoms that they would have recorded on a questionnaire completed without any prompting. However, the problem may be more related to the chronic nature of asthma symptoms. Proposals for analyzing asthma using a continuous asthma score have started to emerge (33
). These new techniques may help to resolve the current unsatisfactory situation.
| ACKNOWLEDGMENTS |
|---|
This study was funded by the Wellcome Trust.
The authors are indebted to Drs. Patricia Bustos and Hugo Amigo for their contribution to the supervision of the fieldwork and to Dr. Javier Mallol for facilitating the use of the video of asthma symptoms.
Conflict of interest: none declared.
| References |
|---|
|
|
|---|
- The International Study of Asthma and Allergies in Childhood (ISAAC) Steering Committee. World variation in the prevalence of asthma symptoms. Eur Respir J 1998;12:31535.[Abstract]
- Burney P, Chinn S, Jarvis D, et al. Variations in the prevalence of respiratory symptoms, self reported asthma attacks, and use of asthma medication in the European Community Respiratory Health Survey (ECRHS). Eur Respir J 1996;9:68795.[Abstract]
- Cane RS, Ranganathan SC, McKenzie SA. What do parents of wheezy children understand by "wheeze"? Arch Dis Child 2000;82:32732.
[Abstract/Free Full Text] - Crane M, Mallol J, Beasley R, et al. Agreement between written and video questions for comparing asthma symptoms in ISAAC. International Study of Asthma and Allergies in Childhood (ISAAC) Phase I Study Group. Eur Respir J 2003;21:45561.
[Abstract/Free Full Text] - Barnes KC, Freidhoff KR, Horowitz EM, et al. Physician-derived asthma diagnoses made on the basis of questionnaire data are in good agreement with interview-based diagnoses and are not affected by objective tests. J Allergy Clin Immunol 1999;104:7916.[CrossRef][ISI][Medline]
- Osborne ML, Vollmer WM, Pedula KL, et al. Lack of correlation of symptoms with specialist-assessed long-term asthma severity. Chest 1999;115:8591.
[Abstract/Free Full Text] - Jakeways N, McKeever T, Lewis SA, et al. Relationship between FEV1 reduction and respiratory symptoms in general population. Eur Respir J 2003;21:65863.
[Abstract/Free Full Text] - Jenkins MA, Clarke JR, Carlin JB, et al. Validation of questionnaire and bronchial hyperresponsiveness against respiratory physician assessment in the diagnosis of asthma. Int J Epidemiol 1996;25:60916.
[Abstract/Free Full Text] - Mackenney J, Amigo H, Bustos P, et al. The prevalence of asthma, atopy and bronchial hyper responsiveness and their interrelation in a semi-rural area of Chile. Int J Tuberculosis Lung Dis 2005;9:128893.
- Lai C, Chan J, Wong J, et al. Comparison of the ISAAC video questionnaire (AVQ3.0) with the ISAAC written questionnaire for estimating asthma associated with bronchial hyperreactivity. Clin Exp Allergy 1997;27:5405.[CrossRef][ISI][Medline]
- Pattemore PK, Asher MI, Harrison AC, et al. The interrelationship among bronchial hyperresponsiveness, the diagnosis of asthma, and asthma symptoms. Am Rev Respir Dis 1990;142:54954.[ISI][Medline]
- Burney PGJ, Chinn S, Britton JR, et al. What symptoms predict the bronchial response to histamine? Evaluation in a community survey of the bronchial symptoms questionnaire (1984) of the International Union against Tuberculosis and Lung Disease. Int J Epidemiol 1989;18:16573.
[Abstract/Free Full Text] - Peat JK, Salome CM, Toelle BG, et al. Reliability of a respiratory history questionnaire and effect of mode of administration on classification of asthma in children. Chest 1992;102:1537.
[Abstract/Free Full Text] - Gibson PG, Henry R, Shah S, et al. Validation of the ISAAC video questionnaire (AVQ3.0) in adolescents from a mixed ethnic background. Clin Exp Allergy 2000:30:11817.[CrossRef][ISI][Medline]
- Pizzichini MM, Rennie D, Senthilselvan A, et al. Limited agreement between written and video asthma symptom questionnaires. Pediatr Pulmonol 2000;30:30712.[CrossRef][ISI][Medline]
- Rona RJ, Smeeton NC, Bustos P, et al. The early origins hypothesis with emphasis on growth rate in the first year of life and asthma: a prospective study in Chile. Thorax 2005;60:54954.
[Abstract/Free Full Text] - Instituto Nacional de Estadística, Chile. Cuantos somos los Chilenos? Comunas, provincias y regiones: V Región: Limache y Olmue. Santiago, Chile: Instituto Nacional de Estadística, 2002. (http://www.censo2002.cl/menu_superior/cuantos_somos/poblacion_comunas.htm). (Accessed May 8, 2005).
- Ministerio Nacional de Desarrollo y Planificación (MIDEPLAN). Encuesta CASEN/estadísticas. Indicadores de pobreza. Santiago, Chile: MIDEPLAN, 2005. (http://www.mideplan.cl/publico/casen.php?ini=&apo=2000&subcatid=20&tema=9&secid=4&catid=20). (Accessed March 28, 2005).
- Corvalan C, Amigo H, Bustos P, et al. Socioeconomic risk factors for asthma in Chilean young adults. Am J Public Health 2005;95:137581.
[Abstract/Free Full Text] - American Thoracic Society. Guidelines for methacholine and exercise challenge testing1999. (Official statement). Am J Respir Crit Care Med 2000;161:30929.
[Free Full Text] - American Thoracic Society. Standardization of spirometry, 1994 update. Am Rev Respir Dis 1995;152:110736.
- Unwin C, Blatchley N, Coker W, et al. Health of UK servicemen who served in Persian Gulf War. Lancet 1999;353:16978.[CrossRef][ISI][Medline]
- Simon G, Von Korff M. Somatization and psychiatric disorder in the NIMH Epidemiologic Catchment Area Study. Am J Psychiatry 1991;148:1494500.
[Abstract/Free Full Text] - Cicchetti DV, Feinstein AR. High agreement but low kappa: II. Resolving the paradoxes. J Clin Epidemiol 1990;43:5518.[CrossRef][ISI][Medline]
- Cohen J. A coefficient of agreement for nominal scales. Educ Psychol Meas 1960;70:3746.
- Donner A, Eliasziw M, Klar N. Testing the homogeneity of kappa statistics. Biometrics 1996;52:17683.[CrossRef][ISI][Medline]
- Stata Corporation. Stata statistical software, release 8. College Station, TX: Stata Corporation, 2003.
- Simal DL, Samsa GP, Matchar DB. Likelihood ratios with confidence: sample size estimation for diagnostic test studies. J Clin Epidemiol 1991;44:76370.[CrossRef][ISI][Medline]
- Fleiss JL, Levin B, Paik MC. Statistical methods for rates and proportions. 3rd ed. Hoboken, NJ: Wiley InterScience, 2003:598626.
- Norzila MZ, Haifa AL, Deng CT, et al. Prevalence of childhood asthma and allergy in an inner city Malaysian community: intra-observer reliability of two translated international questionnaires. Med J Malaysia 2000;55:339.[Medline]
- Chinn S, Jarvis D, Burney P, et al. Increase in diagnosed asthma but not in symptoms in the European Community Respiratory Health Survey. Thorax 2004;59:64651.
[Abstract/Free Full Text] - Rona RJ, Smeeton NC, Vargas C, et al. Untreated asthma, final height and sitting height/leg length ratio in Chile. Respir Med 2005;Oct 10:17 (electronic publication ahead of print). (DOI:10.1016/j.rmed.2005.07.018).
- Pekkanen J, Sunyer J, Anto JM, et al. Operational definitions of asthma in studies on its aetiology. Eur Respir J 2005;26:2835.
[Abstract/Free Full Text]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||