AJCN Tufts Nutrition Symposium, Boston Sept 24-26
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Purchase Article
Right arrow View Shopping Cart
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Kroke, A.
Right arrow Articles by Boeing, H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kroke, A.
Right arrow Articles by Boeing, H.
Agricola
Right arrow Articles by Kroke, A.
Right arrow Articles by Boeing, H.
American Journal of Clinical Nutrition, Vol. 70, No. 4, 439-447, October 1999
© 1999 American Society for Clinical Nutrition


Original Research Communications

Validation of a self-administered food-frequency questionnaire administered in the European Prospective Investigation into Cancer and Nutrition (EPIC) Study: comparison of energy, protein, and macronutrient intakes estimated with the doubly labeled water, urinary nitrogen, and repeated 24-h dietary recall methods1,2,3

Anja Kroke, Kerstin Klipstein-Grobusch, Susanne Voss, Jutta Möseneder, Frank Thielecke, Rudolf Noack and Heiner Boeing

1 From the Departments of Epidemiology, Intervention Studies, and Nutritional Biochemistry and Physiology, German Institute of Human Nutrition, Potsdam-Rehbruecke, Germany.

2 Supported by the Federal Ministry of Technology, Germany, and by grant SOC 95 201408 05F02 from the Europe Against Cancer Program of the European Community.

3 Address reprint requests to A Kroke, German Institute of Human Nutrition, Postsdam-Rehbruecke, Arthur-Scheunert-Allee 114-116, 14558 Bergholz-Rehbruecke, Germany. E-mail: Kroke{at}www.dife.de.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 SUBJECTS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Background: The validation of dietary assessment instruments is critical in the evaluation of diet as a chronic disease risk factor.

Objective: The objective was to assess the validity of a self-administered food-frequency questionnaire by comparison with dietary recall, urinary nitrogen excretion, and total energy expenditure data.

Design: Over a 1-y period, data from twelve 24-h dietary recalls, a food-frequency questionnaire, and four 24-h urine samples were obtained from 134 study participants of the European Prospective Investigation into Cancer and Nutrition (EPIC) Study in Potsdam, Germany. In a substudy of 28 participants, total energy expenditure from doubly labeled water measurements was assessed.

Results: Energy-adjusted, deattenuated correlation coefficients between the questionnaire and the recalls ranged from 0.54 for dietary fiber to 0.86 for alcohol. Cross-classification of quintiles of nutrient intakes from the questionnaire and recalls indicated severe misclassification to be <4%. Reported protein intake correlated with estimated protein excretion (r = 0.46). Energy intake and total energy expenditure were also significantly correlated (r = 0.48); however, all but one subject underreported their energy intake. The magnitude of underreporting varied considerably, by 22% on average, and increased slightly with increasing energy intake. A similar pattern of underreporting was observed when energy intakes from the 24-h dietary recalls were compared with total energy expenditure.

Conclusions: These data indicate an acceptable relative validity of the food-frequency questionnaire in this study population. Compared with measurements of total energy expenditure and protein excretion, however, only moderate agreement with both the food-frequency questionnaire and the 24-h dietary recalls was observed.

Key Words: Validity • biomarkers • energy expenditure • epidemiology • dietary recalls • food-frequency questionnaire • doubly labeled water • FFQ • nutrition • EPIC Study • Germany • humans


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 SUBJECTS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
In epidemiologic studies, dietary assessments of the relation between diet and disease most frequently rely on self-administered, quantitative food-frequency questionnaires (FFQs) aiming at the assessment of usual long-term dietary intake and designed to rank subjects into quantiles of dietary intake. The relative validity of FFQs is usually assessed by comparing their data with those of a reference method, eg, repeated 24-h dietary recalls or weighed food records, because a gold standard as a "truth reference" is not available. Most validation studies have reported a strong correlation between estimates from the 2 methods and a generally accepted amount of misclassification when subjects were cross-classified into quantiles of the distribution of each nutrient under investigation (14). The underlying assumption of this validation approach requires that the measurement error of the test method be independent of the measurement error of the reference method (5). Several studies, however, have reported that both methods are affected by underreporting, a frequently observed phenomenon in dietary assessment methods that refers to the discrepancy between reported energy intakes and energy requirements (6). Therefore, biomarkers of dietary intake have become increasingly important for an unbiased ascertainment of nutrient intake in nutritional validation studies (7, 8) because their measurement error is uncorrelated with the errors of the test method (5, 9, 10).

A biomarker frequently applied in nutritional validation studies is the measurement of urinary nitrogen excretion, which allows the determination of protein intake (11, 12). The doubly labeled water (DLW) method, developed by Lifson et al (13) in the early 1950s and first used in humans by Schoeller and van Santen (14) in 1982, is commonly accepted as the only method that provides valid estimates of energy expenditure in free-living subjects.

The present validation study was conducted during the baseline recruitment phase of the European Prospective Investigation into Cancer and Nutrition Study at the Potsdam, Germany, study center (EPIC-Potsdam Study). The FFQ used in the baseline examination of the EPIC-Potsdam Study was validated previously for reproducibility and relative validity in a West German study population (1517). Because the performance of a dietary assessment instrument depends on characteristics of the study population, additional validation studies are recommended when a previously validated instrument is used in another population (5, 10). The 2 aims of this validation study were therefore to investigate the validity of the FFQ against biomarkers as reference methods and to obtain data on relative validity by comparing FFQ data with data from 24-h dietary recalls, thereby determining the validity of the FFQ in this East German study population.


    SUBJECTS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 SUBJECTS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Background
EPIC is a multicenter cohort study conducted in 9 European countries (18, 19). The cohort from the EPIC-Potsdam Study, 1 of 2 German cohorts, will contribute {approx}27000 subjects to the overall cohort size of {approx}475000. Study participants were randomly selected from population registries in Potsdam and adjacent communities. Approval for all study procedures, including those of the validation study, was given by the Ethical Committee of the State of Brandenburg, Germany, and informed consent was obtained from all study participants.

Subjects
The 160 subjects who volunteered to participate in the validation study were recruited from participants of the baseline examination of the EPIC-Potsdam Study. Women between 35 and 66 y of age and men between 40 and 67 y of age were divided into 3 age- and sex-specific strata according to the EPIC protocol (18). Only those 134 subjects who completed at least ten 24-h dietary recalls and at least three 24-h urine collections [>=85% p-aminobenzoic acid (PABA) recovery] were included in the present analysis. In a subset (15 men and 15 women) of the validation study, total energy expenditure (TEE) was determined with the DLW method. Two persons did not complete the required measurements and were therefore excluded from this analysis.

Dietary assessment
Food-frequency questionnaire
A self-administered, scanner-readable FFQ was the basic instrument used to assess habitual food intake in this cohort study. Development, reproducibility, and relative validity of the FFQ were described in detail previously (1517). The 146-item FFQ included questions about specific food items, such as the frequency of sauce consumption, the fat content of several food items, and seasonal consumption of fruit and vegetables. For certain food items, portion sizes were estimated on the basis of colored photographs of portion sizes; otherwise, standard portion sizes such as a cup (150 mL), jars, and pieces were used. Questions about general patterns of consumption of the main food groups (bread, luncheon meat and cheese on bread, meat, fruit, and vegetables) at the end of the FFQ were used to adjust the consumption patterns that were reported when several single food items from these food groups were requested. After the FFQ was optically read, a software program was used to check the questionnaire data for completeness and plausibility. Missing or implausible information was then corrected with the particpant, resulting in complete and consistent dietary data. The FFQ was administered at the end of the validation study period, thereby covering the 1-y study phase.

Twenty-four–hour dietary recall
Three computer-assisted 24-h dietary recall interviews (EPIC-SOFT, German version; developed in collaboration with the International Agency of Research on Cancer, Lyon, France) were conducted per season during the 1-y study period by one trained interviewer. This software program was specifically developed for the standardized assessment of foods and recipes consumed on the preceding day and was adapted to the specific dietary habits of the EPIC-Potsdam Study population (20). The interviews were meal-sequence based and involved a detailed assessment and description of the food consumed. Colored photographs of different portion sizes of foods were provided to help estimate the quantity of food consumed. Interviews were conducted on all weekdays except Fridays because of organizational restrictions. Recalls of food consumed on Saturdays and Sundays were obtained on Mondays.

Nutrient database
Energy and nutrient intakes from both dietary assessments were calculated by using data from the German Food Code (21).

Lifestyle factors and anthropometry
Information on other lifestyle factors, such as smoking habits and physical activity, were obtained by conducting computer-assisted person-to-person interviews. Anthropometric measures were obtained by trained and quality-monitored personnel (22) while subjects wore no shoes and only light underwear. Body weight was measured on a digital scale to the nearest 100 g; body height was measured with a flexible anthropometer to the nearest 0.1 cm.

Relative energy intake
For all subjects in the validation study, basal metabolic rate (BMR) was calculated based on the formulas of Schofield et al (23) from weight and age. Energy intake reported on the FFQ was then divided by the estimated BMR, thereby obtaining a measure of relative energy intake that accounted for age and body weight (24).

Doubly labeled water method
The DLW method was used to assess the carbon dioxide production values needed to calculate TEE with the classic equations of Coward and Cole (25). This method is based on the differential disappearance rates of the stable isotopes 2H and 18O. We applied the multipoint approach (26) over a time period of 14 d after the oral administration of an individually calculated amount of doubly labeled water per kilogram body weight (27). The isotopes disperse throughout the body water and gradually leave the body over subsequent days. Urine samples from each day of the sampling period were stored by the participants at -20°C. When the sampling was completed, the urine samples were stored at -80°C until analyzed by mass spectrometry with an automated equilibration device. The CV for the applied analytic method was 3%; further details of the isotope analysis were described previously (28).

Twenty-four–hour urinary nitrogen
Four 24-h urine samples were collected from each participant while PABA was administered simultaneously to assess the completeness of urine collection (29). Recovery of <85% PABA indicated incomplete urine collection (30, 31) and resulted in exclusion of 93 of 640 urine samples. Urea nitrogen excretion was measured by spectrophotometry and converted to urea nitrogen excretion in g/d. Assuming that urea nitrogen excretion is a constant proportion (85%) of total urinary nitrogen (12), protein intake was derived from the formula 6.25 x (urinary nitrogen + 2), as suggested by Isaksson (11).

Statistical analysis
Nutrients not normally distributed were logarithmically transformed to achieve normal distributions, which were obtained for all nutrients except alcohol. Means and SDs of energy, protein, and macronutrient intakes were determined for the test (FFQ) and reference (24-h dietary recall, urinary nitrogen, and DLW) methods; differences and ratios between mean values obtained with the FFQ and with the reference methods were calculated. A paired-difference t test and Wilcoxon's signed-rank test for alcohol were used to determine significant differences between means. The variance ratios of the repeated 24-h dietary recalls and the urinary nitrogen measurements were determined by dividing within-person variations by between-person variations. Pearson's and Spearman's correlation coefficients, respectively, were obtained for crude and deattenuated data as well as for energy-adjusted crude and deattenuated data. Deattenuation to correct for intraindividual variability was accomplished by using the formula suggested by Beaton et al (32). Values were adjusted for energy by using the residual method (33).

To assess agreement between the FFQ and reference methods and to detect any bias with the test method relative to the reference method, differences between the 2 respective methods were plotted against the means, as suggested by Bland and Altman (34). To compare the FFQ with the multiple 24-h dietary recall, study participants were classified into quintile categories of energy and macronutrient intakes based on the distribution of data from the test and reference methods. Proportions of subjects classified into the same, adjacent, or extreme quintiles were derived for both crude and energy-adjusted cross-classifications. To compare the test method with the urinary nitrogen and DLW methods, study participants were classified into quartiles of energy and protein intakes according to the distribution of the test and the respective reference methods. Proportions of subjects classified into the same, adjacent, or extreme quartiles were determined. Results were considered statistically significant at a two-tailed {alpha} level of 0.05. Statistical analysis was performed by using SAS software (release 6.12; SAS Institute Inc, Cary, NC).


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 SUBJECTS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
General characteristics of the study population are presented in Table 1Go. Mean ages, body weights, and energy intakes relative to estimated BMRs were significantly different between men and women. Seventeen percent of the men and 3% of the women were current smokers. Regular sporting activities were reported by 39% of the men and by 44% of the women. Mean energy and macronutrient intakes estimated with the FFQ and 24-h dietary recalls, mean differences between these 2 methods, mean ratios of the 2 methods, and variance ratios for the 24-h dietary recall data are presented in Table 2Go. Intakes of energy, total protein, and total carbohydrate were significantly higher with the FFQ than with the 24-h dietary recalls, whereas intakes of total fat, saturated fat, monounsaturated fat, cholesterol, and monosaccharides were lower with the FFQ than with the 24-h dietary recall. For several nutrients, there was considerably more within-person than between-person variation as indicated by the variance ratios for 24-h dietary recall data. Therefore, crude data as well as correlation coefficients deattenuated for within-person variation and energy-adjusted crude and deattenuated correlation coefficients are presented in Table 2Go. Pearson's correlation coefficients for crude data ranged between 0.47 (polyunsaturated fat) and 0.83 (alcohol); deattenuation improved the correlation coefficients for energy intake and all nutrients, especially for those with the highest variance ratio (polyunsaturated fat and cholesterol). Energy adjustment generally improved the correlation coefficients, except for alcohol. Further deattenuation improved the energy-adjusted correlation coefficients, which ranged from 0.54 for dietary fiber to 0.86 for alcohol.


View this table:
[in this window]
[in a new window]
 
TABLE 1. General characteristics of the study population: EPIC-Potsdam Study1
 

View this table:
[in this window]
[in a new window]
 
TABLE 2. Macronutrient intakes estimated from the food-frequency questionnaire (FFQ) and twelve 24-h dietary recalls, differences between the methods, the ratio of the 2 methods, and correlation coefficients between the methods: EPIC-Potsdam Study1
 
Cross-classification of energy-adjusted nutrient intakes derived from the FFQ and 24-h dietary recalls into quintiles of intake did not show extreme misclassification for total fat, saturated fat, monounsaturated fat, and alcohol (Table 3Go); for the other nutrients, severe misclassification ranged from 0.8% to 3.7%. Classification into the same or adjacent quintiles ranged from 70% for monosaccharides to 87% for total protein and alcohol intakes.


View this table:
[in this window]
[in a new window]
 
TABLE 3. Cross-classification of mean daily macronutrient intakes derived from the food-frequency questionnaire and from twelve 24-h dietary recalls into quintiles: EPIC-Potsdam Study1
 
Comparisons of energy and protein intakes estimated with the FFQ and the biomarkers as the reference methods (urinary nitrogen and DLW) are presented in Table 4Go. Protein intake reported on the FFQ was {approx}23% lower than estimates derived from urinary nitrogen measurements. The correlation was slightly improved by deattenuation for the within-person variation observed in urinary nitrogen excretion. Severe misclassification was observed in 2% of the subjects; 46% of the subjects were classified into the adjacent quartiles and 35% were correctly classified. The Bland and Altman plot (Figure 1Go), showing differences between methods for each subject, did not indicate that differences tended to increase as absolute energy intake increased. The mean difference was negative because of low reported protein intakes. The spread around the mean reflected the variation in differences, reflecting the observation that the 2 methods were only moderately correlated.


View this table:
[in this window]
[in a new window]
 
TABLE 4. Energy and protein intakes estimated from the food-frequency questionnaire (FFQ) and reference methods (urinary nitrogen and doubly labeled water), differences between the methods, the ratio of the methods, and correlation coefficients between the methods: EPIC-Potsdam Study1
 


View larger version (17K):
[in this window]
[in a new window]
 
FIGURE 1. Differences between protein intake estimated from the food-frequency questionnaire and by the urinary nitrogen method plotted against the mean from the 2 methods in the European Prospective Investigation into Cancer and Nutrition (EPIC) Potsdam Study (n = 134).

 
Reported energy intake on the FFQ was 22% lower than the measured energy requirement (r = 0.48). Thirty-six percent of the subjects were cross-classified into the same quartile, 39% were cross-classified into the adjacent quartile, and 3.6% of the subjects were cross-classified into the opposite quartiles of the respective distributions of energy intakes and energy expenditure. Differences in energy intakes between the FFQ and the DLW method plotted against the mean of the 2 methods are shown in Figure 2Go. The mean difference between the 2 methods reflected a considerable amount of underreporting of energy intake for all but one subject, and the width of the limits of agreement from 2000 to -7000 kJ (±2 SD) indicated wide discrepancies between the 2 methods for individual subjects. The differences tended to increase as the absolute energy values increased. Further analysis of this phenomenon indicated that the difference between the FFQ and the DLW method, as well as the difference between the FFQ and the urinary nitrogen method, were significantly correlated with BMI: r = 0.50 (P = 0.007) and r = 0.30 (P = 0.0004), respectively.



View larger version (17K):
[in this window]
[in a new window]
 
FIGURE 2. Differences between energy intake estimated from the food-frequency questionnaire and total energy expenditure estimated with the doubly labeled water method plotted against the mean from the 2 methods in the European Prospective Investigation into Cancer and Nutrition (EPIC) Potsdam Study (n = 28).

 
To evaluate the phenomenon of underreporting, we plotted the mean of the energy intake from the 24-h dietary recall and DLW data against the difference between these methods (Figure 3Go). Basically, we observed the same pattern as in the comparison with the FFQ: energy intake from the 24-h dietary recalls was clearly underreported compared with the reference method, and the limits of agreement, though slightly smaller, indicated discrepancies between the 24-h dietary recall and reference methods as well. The difference between the 24-h dietary recall and DLW methods was strongly and highly significantly correlated with the corresponding difference between the FFQ and DLW method (r = 0.74, P = 0.0001). Weaker, but also highly significant correlations for these differences were observed for urinary nitrogen (r = 0.58, P = 0.0001). These 2 correlation coefficients clearly indicated correlation of measurement error, which was supported by the plot of energy intakes from the FFQ and 24-h dietary recalls (Figure 4Go). This comparison did not indicate underreporting; however, a slight tendency for increasing differences with increasing energy intakes was also detectable.



View larger version (18K):
[in this window]
[in a new window]
 
FIGURE 3. Differences between energy intake estimated from the 24-h dietary recalls and total energy expenditure estimated with the doubly labeled water method plotted against the mean from the 2 methods in the European Prospective Investigation into Cancer and Nutrition (EPIC) Potsdam Study (n = 28).

 


View larger version (20K):
[in this window]
[in a new window]
 
FIGURE 4. Differences between energy intake estimated from the food-frequency questionnaire and from the 24-h dietary recalls plotted against the mean from the 2 methods in the European Prospective Investigation into Cancer and Nutrition (EPIC) Potsdam Study (n = 134).

 

    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 SUBJECTS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
This validation study is among the first to use the DLW method to evaluate the validity of a self-administered FFQ with respect to energy intake. Compared with the quasi gold standard of TEE measurement by the DLW method, energy intake was underreported by all but one subject, by 22% on average. The observed differences between reported energy intake and TEE may have been due to measurement error with both methods. The conclusion that energy intake was underreported is based on the assumption that energy requirements are not overestimated by the DLW method, a method that has been successfully validated against gas-exchange measurements in lean and obese volunteers (27, 3537). The discrepancy between these methods was found to vary from 2.5% to 3% and, thus, could not be the reason for the observed differences between reported energy intake and TEE. Attribution of the error mainly to the FFQ method is supported by results of other studies investigating the relation of reported energy intake to TEE measured by the DLW method. The only published validation study that compared data from 2 different FFQs with DLW measurements described underreporting of energy intake as well (38). Several validation studies of other dietary assessment methods observed underreporting of energy intake in most but not all studies (3941). However, conclusions about the validity of the FFQ are difficult to make unless 2 underlying assumptions hold true: that the subjects are in energy balance and that the short-term (14 d) TEE measurement represents habitual usual energy expenditure. The DLW method permits validation of reported energy intake only if subjects are in energy balance (41). Subjects in this study were instructed not to change their dietary and lifestyle habits during the 14 d of sampling. Mean weight changes during the sampling period were small (<1 kg); therefore, the subjects likely were in energy balance. With respect to the second assumption, there was only one 14-d TEE measurement period; however, the FFQ was designed to assess usual dietary intake over the preceding year. The observed discrepancies between TEE and reported energy intake might therefore partly be explained by the incomplete coverage of the FFQ data by the DLW measurement and the lack of data on within-person variability.

A second biomarker, urinary nitrogen, was used to validate protein intake reported on the FFQ. Previous studies comparing urinary nitrogen with FFQ data observed correlation coefficients between 0.07 and 0.54 (10, 30, 42, 43), indicating that our data were within the range observed previously. Similar to reported energy intakes, protein intakes were underreported when compared with urinary nitrogen. The use of urinary nitrogen as a reference method for dietary intake requires stable nitrogen balance, eg, no gain or loss of tissue. For >90% of the study population, the estimated weight change during the 1-y study period was <=2 kg. However, larger short-term changes in weight might have occurred during the study period. Furthermore, it was assumed that extrarenal nitrogen losses are a constant proportion of overall nitrogen excretion (12). This might not be the case for all subjects. These limitations might therefore partly explain the observed discrepancies between estimated protein excretion and intake.

Relative validity of the FFQ was assessed in comparison with twelve 24-h dietary recalls administered over 1 y to obtain information on performance of the FFQ in our East German study population. Procedures and data analysis were adapted to the previous validation study of the FFQ in the other German EPIC study population in Heidelberg, where twelve 24-h dietary recalls were obtained in face-to-face interviews (15). Limitations of the applied reference instrument and study procedures were described and discussed previously (15, 16); therefore, only aspects specific to the Potsdam validation study are discussed here.

For the Potsdam validation study, a computer-supported, structured, face-to-face 24-h dietary recall interview was administered by one trained interviewer. In contrast with the Heidelberg Study, for which 24-h dietary recall data were not available for Fridays and Saturdays, we obtained data for Saturdays using a 24-h dietary recall applied on Monday. However, no data were obtained for Fridays because of logistic reasons. Because dietary habits on Fridays typically differ from those on other weekdays, our 24-h dietary recall might have underestimated some nutrients as well as energy intake. Crude correlation coefficients were similar in the 2 studies. Weaker correlations were observed in Potsdam than in Heidelberg only for intakes of alcohol and dietary fiber. Comparison of cross-classifications indicated similar results with the FFQ in our study population; there were no significant differences between the results of the Heidelberg and Potsdam studies. Note, however, that methodologic differences in the 2 validation studies did not allow an unrestricted comparison. Our correlation coefficients were in a range comparable with those of other validation studies (10) and thus we concluded that the relative validity of the FFQ was acceptable.

The validation approach in which a second dietary assessment instrument is used as a reference method assumes uncorrelated errors between the test and reference methods. However, when the dietary assessment instruments used in the present validation study were compared with biomarkers, measurement error between the test methods (FFQ, 24-h recalls) and the reference methods (DLW, urinary nitrogen) were correlated. The validation of dietary assessment instruments has been discussed not only in terms of the appropriateness of the reference instrument but also with respect to the statistics used to compare the validity of the methods (44, 45). As pointed out by Bland and Altman (34), it is inappropriate to use correlation coefficients to analyze agreement between methods. Therefore, the mean difference is calculated to obtain information on bias in the group estimate, and limits of agreement (±2 SD of mean difference) indicate the scatter of individual results. Applying this analytic concept to the energy data, we showed a strongly biased mean of differences and a wide scatter of differences between reported energy intakes and total energy expenditure, despite significant correlations between reported energy intake and TEE. The wide scattering of the differences showed clearly that some subjects underreported their energy intakes more so than did others. This would not have been detected if only 24-h dietary recall data were used as a reference, as shown in Figure 4Go.

A limitation in the interpretation of our results with respect to the validity of the FFQ in the entire study population is the fact that participants in the validation study were a selected, highly motivated sample of the study population. In addition, 16 subjects dropped out of the validation study. For these reasons, there may have been smaller within-person variation and, therefore, stronger correlations in the validation study than in the entire study population.

Even though some uncertainties remain about the absolute amount of underreporting of energy intake on the FFQ, we observed a systematic increase in measurement error with increasing absolute reported energy intakes. This observation could have serious implications on risk estimates of diet-disease relations. Differential underreporting depending on subject characteristics and selective underreporting of certain macronutrients (46) are issues to be considered in this regard. Our data indicated, for example, that underreporting was higher with higher energy intakes. Because subjects with a high BMI have higher energy requirements, underreporting could be related to obesity, as described previously (4649). These assumptions are supported by the strong correlations observed in our data between measurement error and BMI. However, the sample size was too small to detect significant determinants of underreporting. Several investigators have noted the importance of obtaining data on random and systematic measurement error of the exposure assessment for correcting linear regression (50) or logistic regression estimates (51). A detailed characterization of measurement error is also considered to be important in the use and interpretation of energy-adjustment models because measurement error may unpredictably distort the estimated effects of energy-adjustment models (52).

In summary, energy intake was underreported on the FFQ when compared with TEE, and protein intakes were underreported on the FFQ when compared with urinary nitrogen. These observations were similar to those described for other dietary assessment instruments. Correlation coefficients between macronutrient intakes estimated with the FFQ and the 24-h dietary recalls were within the range observed in a previous validation study (16), indicating comparable relative validity for the FFQ used in the present study population.


    ACKNOWLEDGMENTS
 
We thank the interviewers of the EPIC-Potsdam Study.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 SUBJECTS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. Fidanza F, Gentile MG, Porrini M. A self-administered semi-quantitative food-frequency questionnaire with optical reading and its concurrent validation. Eur J Epidemiol 1995;11:163–70.[Medline]
  2. Gnardellis C, Trichopulou A, Katsouyanni K, et al. Reproducibility and validity of an extensive semiquantitative food frequency questionnaire among Greek school teachers. Epidemiology 1995;6:74–7.[Medline]
  3. Goldbohm RA, van den Brandt PA, Brants HAM, et al. Validation of a dietary questionnaire used in a large-scale prospective cohort study on diet and cancer. Eur J Clin Nutr 1994;48:253–65.[Medline]
  4. Larkin FA, Metzner HL, Thompson FE, et al. Comparison of estimated nutrient intakes by food frequency and dietary records in adults. J Am Diet Assoc 1989;89:215–23.[Medline]
  5. Willett W. Reproducibility and validity of food-frequency questionnaires. In: Willett W, ed. Nutritional epidemiology. Oxford, United Kingdom: Oxford University Press, 1990:92–126.
  6. Black AE, Goldberg GR, Jebb SA, et al. Critical evaluation of energy intake data using fundamental principles of energy physiology. 2. Evaluating the results of published surveys. Eur J Clin Nutr 1991;45:583–99.[Medline]
  7. Bingham SA. Limitations of the various methods for collecting dietary intake data. Ann Nutr Metab 1991;35:117–27.[Medline]
  8. van't Veer P, Kardinaal AFM, Bausch-Goldbohm RA, Kok FJ. Biomarkers for validation. Eur J Clin Nutr 1993;47:S58–63.
  9. Mahalko JR, Johnson LK, Gallagher SK, Milne DB. Comparison of dietary histories and seven-day food records in a nutritional assessment of older adults. Am J Clin Nutr 1985;42:542–53.[Abstract/Free Full Text]
  10. Willett W, Lenart E. Reproducibility and validity of food-frequency questionnaires. In: Willett W, ed. Nutritional epidemiology. Oxford, United Kingdom: Oxford University Press, 1998:101–47.
  11. Isaksson B. Urinary nitrogen output as a validity test in dietary surveys. Am J Clin Nutr 1980;33:4–5 (letter).[Free Full Text]
  12. Bingham SA, Williams R, Cole TJ, et al. Reference values for analytes of 24-h urine collections known to be complete. Ann Clin Biochem 1988;25:610–9.
  13. Lifson N, Gordon GB, McClintock R. Measurement of total carbon dioxide production by means of D218O. J Appl Physiol 1955;7:704–10.[Free Full Text]
  14. Schoeller DA, van Santen E. Measurement of energy expenditure in humans by doubly labeled water. J Appl Physiol 1982;53:955–9.[Abstract/Free Full Text]
  15. Bohlscheid-Thomas S, Hoting I, Boeing H, Wahrendorf J. Reproducibility and relative validity of food group intake in a food frequency questionnaire developed for the German part of the EPIC project. Int J Epidemiol 1997;26(suppl):S59–70.[Abstract/Free Full Text]
  16. Bohlscheid-Thomas S, Hoting I, Boeing H, Wahrendorf J. Reproducibility and relative validity of energy and macronutrient intake of a food frequency questionnaire developed for the German part of the EPIC project. Int J Epidemiol 1997;26(suppl):S71–81.[Abstract/Free Full Text]
  17. Boeing H, Bohlscheidt-Thomas S, Voss S, et al. The relative validity of vitamin intakes derived from a food frequency questionnaire compared to 24-hr-recalls and biological measurements. Results from the EPIC pilot study in Germany. Int J Epidemiol 1997;26(suppl):S82–90.[Abstract/Free Full Text]
  18. Riboli E. Nutrition and cancer: background and rationale of the European Prospective Investigation into Cancer and Nutrition (EPIC). Ann Oncol 1992;3:783–91.[Abstract/Free Full Text]
  19. Riboli E, Kaaks R. The EPIC project: rationale and study design. Int J Epidemiol 1997;26(suppl):S6–14.[Abstract/Free Full Text]
  20. Voss S, Charrondiere UR, Slimani N, et al. EPIC-SOFT—ein Europaweites Erfassungs- und Kodierungssystem fuer 24-Stunden-Erinnerungsprotokolle. (EPIC-SOFT—a European wide dietary assessment instrument for 24-hour recalls.) Z Ernahrungswiss 1998; 37:227–33 (in German).[Medline]
  21. Federal Institute for Health Protection of Consumers and Veterinary Medicine. The German Food Code and nutrient data base, BLS II.1. Berlin: BgVV-Journal 2, 1998 (in German).
  22. Klipstein-Grobusch K, Georg T, Boeing H. Interviewer variability in anthropometric measurements and estimates of body composition. Int J Epidemiol 1997;26(suppl):S174–80.[Abstract/Free Full Text]
  23. Schofield WN, Schofield C, James WPT. Basal metabolic rate. Hum Nutr Clin Nutr 1985;39C(suppl):1–96.
  24. Cunningham JJ. A reanalysis of factors influencing basal metabolic rate in normal adults. Am J Clin Nutr 1980;33:2372–4.[Abstract/Free Full Text]
  25. Coward WA, Cole TJ. The doubly labeled water method for the measurement of energy expenditure in humans: risks and benefits. In: Whitehead RG, ed. New techniques in nutritional research. San Diego: Academic Press, 1990:139–76.
  26. Coward WA. Measurement of energy expenditure in clinical practice. Proc Nutr Soc 1991;50:227–37.[Medline]
  27. Coward WA, Roberts SB, Prentice AM, et al. The 2H218O method for energy expenditure measurements—clinical possibilities, necessary assumptions and limitations. In: Dietze JG, Grünert A, Kleinberger G, Wolfram G, eds. Clinical nutrition and metabolic research. Basel, Switzerland: Karger, 1986.
  28. Thielecke F, Noack R. Evaluation of an automated equilibration technique for deuterium/hydrogen isotope ratio measurements with respect to assessing total energy expenditure by the doubly labelled water method. J Mass Spectrom 1997;32:323–7.
  29. Bingham SA, Cummings JH. The use of 4-aminobenzoic acid as a marker to validate the completeness of 24 h urine collections in man. Clin Sci 1983;64:629–35.[Medline]
  30. Bingham SA, Cassidy A, Cole TJ, et al. Validation of weighed records and other methods of dietary assessment using the 24 h urine nitrogen technique and other biological markers. Br J Nutr 1995;73:531–50.[Medline]
  31. Black AE, Jebb SA, Bingham SA, et al. The validation of energy and protein intakes by doubly labelled water and 24-hour urinary nitrogen excretion in post-obese subjects. J Hum Nutr Diet 1995;8:51–64.
  32. Beaton GH, Milner J, Corey P, et al. Sources of variance in 24-hour dietary recall data: implications for nutrition study design and interpretation. Am J Clin Nutr 1979;32:2546–9.[Free Full Text]
  33. Willett W, Stampfer MJ. Total energy intake: implications for epidemiologic analyses. Am J Epidemiol 1986;124:17–27.[Free Full Text]
  34. Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1986;1:307–10.[Medline]
  35. Klein PD, James WPT, Wong WW, et al. Calorimetric validation of the doubly-labelled water method for determination of energy expenditure in man. Hum Nutr Clin Nutr 1984;38C:95–106.[Medline]
  36. Schoeller DA, Ravussin E, Schutz Y, et al. Energy expenditure by doubly labeled water: validation in humans and proposed calculation. Am J Physiol 1986;240:R823–30.
  37. Ravussin E, Ingborg TH, Rising R, et al. Energy expenditure by doubly-labeled water: validation in lean and obese subjects. Am J Physiol 1991;261:E402–9.[Abstract/Free Full Text]
  38. Sawaya AL, Tucker K, Tsay R, et al. Evaluation of four methods for determining energy intake in young and older women: comparison with doubly labeled water measurements of total energy expenditure. Am J Clin Nutr 1996;63:491–9.[Abstract/Free Full Text]
  39. Livingstone MBE, Prentice AM, Strain JJ, et al. Accuracy of weighed dietary records in studies of diet and health. Br Med J 1990;300:708–12.
  40. Black AE, Prentice AM, Goldberg GR, et al. Measurements of total energy expenditure provide insights into the validity of dietary measurements of energy intake. J Am Diet Assoc 1993;93:572–9.[Medline]
  41. Westerterp KR, Verboeket-Van de Venne WPHG, Meijer GAL, ten Hoor F. Self-reported intake as a measure for energy intake. A validation against doubly labelled water. In: Aihaud G, ed. Obesity in Europe 91. London: Libbey, 1991:17–22.
  42. O'Donnell MG, Nelson M, Wise PH, Walker DM. A computerized diet questionnaire for use in health education. 1. Development and validation. Br J Nutr 1991;66:3–15.[Medline]
  43. Porrini M, Gentile MG, Fidanza F. Biochemical validation of a self-administered semi-quantitative food-frequency questionnaire. Br J Nutr 1995;74:323–33.[Medline]
  44. Burema J, van Staveren WA, Feunekes GIJ. Guidelines for reports on validation studies. Eur J Clin Nutr 1995;49:932–3 (letter).[Medline]
  45. Garrow JS. Validation of methods for estimating habitual diet: proposed guidelines. Eur J Clin Nutr 1995;49:231–2.[Medline]
  46. Voss S, Kroke A, Klipstein-Grobusch K, Boeing H. Is macronutrient composition of dietary intake affected by underreporting? Results from the EPIC-Potsdam study. Eur J Clin Nutr 1998;52:119–26.[Medline]
  47. Pryer JA, Vrijheid M, Nichols R, et al. Who are the `low energy reporters' in the Dietary and Nutritional Survey of British Adults? Int J Epidemiol 1997;26:146–54.[Abstract/Free Full Text]
  48. Rothenberg E, Bosaeus I, Steen B. Evaluation of energy intake estimated by a diet history in three free-living 70 year old populations in Gothenburg, Sweden. Eur J Clin Nutr 1997;51:60–6.[Medline]
  49. Voss S, Kroke A, Klipstein-Grobusch K, Boeing H. Obesity as a major determinant of underreporting in a self-administered food frequency questionnaire: results from the EPIC-Potsdam study. Z Ernahrungswiss 1997;36:229–36.[Medline]
  50. Spiegelman D, McDermott A, Rosner B. Regression calibration method for correcting measurement-error bias in nutritional epidemiology. Am J Clin Nutr 1997;65(suppl):1179S–86S.[Abstract/Free Full Text]
  51. Rosner B, Willett WC, Spiegelman D. Correction of logistic regression relative risk estimates and confidence intervals for systematic within-person measurement error. Stat Med 1989;8:1051–69.[Medline]
  52. Kipnis V, Freedman LS, Brown CC, et al. Effect of measurement error on energy-adjustment models in nutritional epidemiology. Am J Epidemiol 1997;146:842–55.[Abstract/Free Full Text]
Received for publication November 12, 1998. Accepted for publication March 9, 1999.




This article has been cited by other articles:


Home page
AMERICAN JOURNAL OF LIFESTYLE MEDICINEHome page
C. E. O'Neil and T. A. Nicklas
A Review of the Relationship Between 100% Fruit Juice Consumption and Weight in Children and Adolescents
American Journal of Lifestyle Medicine, July 1, 2008; 2(4): 315 - 354.
[Abstract] [PDF]


Home page
Am J EpidemiolHome page
K. Hoffmann, T. Pischon, M. Schulz, M. B. Schulze, J. Ray, and H. Boeing
A Statistical Test for the Equality of Differently Adjusted Incidence Rate Ratios
Am. J. Epidemiol., March 1, 2008; 167(5): 517 - 522.
[Abstract] [Full Text] [PDF]


Home page
J. Nutr.Home page
S. Bo, M. Durazzo, R. Gambino, C. Berutti, N. Milanesio, A. Caropreso, L. Gentile, M. Cassader, P. Cavallo-Perin, and G. Pagano
Associations of Dietary and Serum Copper with Inflammation, Oxidative Stress, and Metabolic Variables in Adults
J. Nutr., February 1, 2008; 138(2): 305 - 310.
[Abstract] [Full Text] [PDF]


Home page
J. Nutr.Home page
F. E. Thompson, D. Midthune, G. C. Williams, A. L. Yaroch, T. G. Hurley, K. Resnicow, J. R. Hebert, D. J. Toobert, G. W. Greene, K. Peterson, et al.
Evaluation of a Short Dietary Assessment Instrument for Percentage Energy from Fat in an Intervention Study
J. Nutr., January 1, 2008; 138(1): 193S - 199S.
[Abstract] [Full Text] [PDF]


Home page
Am J EpidemiolHome page
M. L. Molag, J. H. M. de Vries, M. C. Ocke, P. C. Dagnelie, P. A. van den Brandt, M. C. J. F. Jansen, W. A. van Staveren, and P. van't Veer
Design Characteristics of Food Frequency Questionnaires in Relation to Their Validity
Am. J. Epidemiol., December 15, 2007; 166(12): 1468 - 1478.
[Abstract] [Full Text] [PDF]


Home page
J. Nutr.Home page
U. Nothlings, K. Hoffmann, M. M. Bergmann, and H. Boeing
Fitting Portion Sizes in a Self-Administered Food Frequency Questionnaire
J. Nutr., December 1, 2007; 137(12): 2781 - 2786.
[Abstract] [Full Text] [PDF]


Home page
J. Nutr.Home page
D. Drogan, K. Hoffmann, M. Schulz, M. M. Bergmann, H. Boeing, and C. Weikert
A Food Pattern Predicting Prospective Weight Change Is Associated with Risk of Fatal but Not with Nonfatal Cardiovascular Disease
J. Nutr., August 1, 2007; 137(8): 1961 - 1967.
[Abstract] [Full Text] [PDF]


Home page
Int J EpidemiolHome page
G. B. Gori
Metaphorical measurements and theories
Int. J. Epidemiol., August 1, 2007; 36(4): 931 - 932.
[Full Text] [PDF]


Home page
Arch Intern MedHome page
M. B. Schulze, M. Schulz, C. Heidemann, A. Schienkiewitz, K. Hoffmann, and H. Boeing
Fiber and Magnesium Intake and Incidence of Type 2 Diabetes: A Prospective Study and Meta-analysis
Arch Intern Med, May 14, 2007; 167(9): 956 - 965.
[Abstract] [Full Text] [PDF]


Home page
Diabetes CareHome page
M. B. Schulze, K. Hoffmann, H. Boeing, J. Linseisen, S. Rohrmann, M. Mohlig, A. F.H. Pfeiffer, J. Spranger, C. Thamer, H.-U. Haring, et al.
An Accurate Risk Score Based on Anthropometric, Dietary, and Lifestyle Factors to Predict the Development of Type 2 Diabetes
Diabetes Care, March 1, 2007; 30(3): 510 - 515.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Clin. Nutr.Home page
E. B Levitan, C. W Westgren, S. Liu, and A. Wolk
Reproducibility and validity of dietary glycemic index, dietary glycemic load, and total carbohydrate intake in 141 Swedish men
Am. J. Clinical Nutrition, February 1, 2007; 85(2): 548 - 553.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Clin. Nutr.Home page
S. Bo, M. Durazzo, S. Guidi, M. Carello, C. Sacerdote, B. Silli, R. Rosato, M. Cassader, L. Gentile, and G. Pagano
Dietary magnesium and fiber intakes and inflammatory and metabolic indicators in middle-aged subjects from a population-based cohort.
Am. J. Clinical Nutrition, November 1, 2006; 84(5): 1062 - 1069.
[Abstract] [Full Text] [PDF]


Home page
J. Am. Coll. Nutr.Home page
T. S. Hudson, M. R. Forman, M. M. Cantwell, A. Schatzkin, P. S. Albert, and E. Lanza
Dietary Fiber Intake: Assessing the Degree of Agreement between Food Frequency Questionnaires and 4-Day Food Records.
J. Am. Coll. Nutr., October 1, 2006; 25(5): 370 - 381.
[Abstract] [Full Text] [PDF]


Home page
Int J EpidemiolHome page
C. Sacerdote, L. Fiorini, R. Rosato, M. Audenino, M. Valpreda, and P. Vineis
Randomized controlled trial: effect of nutritional counselling in general practice
Int. J. Epidemiol., April 1, 2006; 35(2): 409 - 415.
[Abstract] [Full Text] [PDF]


Home page
JCOHome page
J. M. Chan, P. H. Gann, and E. L. Giovannucci
Role of Diet in Prostate Cancer Development and Progression
J. Clin. Oncol., November 10, 2005; 23(32): 8152 - 8160.
[Abstract] [Full Text] [PDF]


Home page
Am J EpidemiolHome page
D. R. Paul, D. G. Rhodes, M. Kramer, D. J. Baer, and W. V. Rumpler
Validation of a Food Frequency Questionnaire by Direct Measurement of Habitual ad Libitum Food Intake
Am. J. Epidemiol., October 15, 2005; 162(8): 806 - 814.
[Abstract] [Full Text] [PDF]


Home page
Crop Sci.Home page
V. S. Primomo, V. Poysa, G. R. Ablett, C.-J. Jackson, and I. Rajcan
Agronomic Performance of Recombinant Inbred Line Populations Segregating for Isoflavone Content in Soybean Seeds
Crop Sci., September 23, 2005; 45(6): 2203 - 2211.
[Abstract] [Full Text] [PDF]


Home page
J. Nutr.Home page
C. Weikert, K. Hoffmann, J. Dierkes, B.-C. Zyriax, K. Klipstein-Grobusch, M. B. Schulze, R. Jung, E. Windler, and H. Boeing
A Homocysteine Metabolism-Related Dietary Pattern and the Risk of Coronary Heart Disease in Two Independent German Study Populations
J. Nutr., August 1, 2005; 135(8): 1981 - 1988.
[Abstract] [Full Text] [PDF]


Home page
J. Nutr.Home page
K. J. Petzke, H. Boeing, S. Klaus, and C. C. Metges
Carbon and Nitrogen Stable Isotopic Composition of Hair Protein and Amino Acids Can Be Used as Biomarkers for Animal-Derived Dietary Protein Intake in Humans
J. Nutr.,