|
|
||||||||
Original Research Communications |
1 From the Departments of Epidemiology, Intervention Studies, and Nutritional Biochemistry and Physiology, German Institute of Human Nutrition, Potsdam-Rehbruecke, Germany.
2 Supported by the Federal Ministry of Technology, Germany, and by grant SOC 95 201408 05F02 from the Europe Against Cancer Program of the European Community.
3 Address reprint requests to A Kroke, German Institute of Human Nutrition, Postsdam-Rehbruecke, Arthur-Scheunert-Allee 114-116, 14558 Bergholz-Rehbruecke, Germany. E-mail: Kroke{at}www.dife.de.
| ABSTRACT |
|---|
|
|
|---|
Objective: The objective was to assess the validity of a self-administered food-frequency questionnaire by comparison with dietary recall, urinary nitrogen excretion, and total energy expenditure data.
Design: Over a 1-y period, data from twelve 24-h dietary recalls, a food-frequency questionnaire, and four 24-h urine samples were obtained from 134 study participants of the European Prospective Investigation into Cancer and Nutrition (EPIC) Study in Potsdam, Germany. In a substudy of 28 participants, total energy expenditure from doubly labeled water measurements was assessed.
Results: Energy-adjusted, deattenuated correlation coefficients between the questionnaire and the recalls ranged from 0.54 for dietary fiber to 0.86 for alcohol. Cross-classification of quintiles of nutrient intakes from the questionnaire and recalls indicated severe misclassification to be <4%. Reported protein intake correlated with estimated protein excretion (r = 0.46). Energy intake and total energy expenditure were also significantly correlated (r = 0.48); however, all but one subject underreported their energy intake. The magnitude of underreporting varied considerably, by 22% on average, and increased slightly with increasing energy intake. A similar pattern of underreporting was observed when energy intakes from the 24-h dietary recalls were compared with total energy expenditure.
Conclusions: These data indicate an acceptable relative validity of the food-frequency questionnaire in this study population. Compared with measurements of total energy expenditure and protein excretion, however, only moderate agreement with both the food-frequency questionnaire and the 24-h dietary recalls was observed.
Key Words: Validity biomarkers energy expenditure epidemiology dietary recalls food-frequency questionnaire doubly labeled water FFQ nutrition EPIC Study Germany humans
| INTRODUCTION |
|---|
|
|
|---|
A biomarker frequently applied in nutritional validation studies is the measurement of urinary nitrogen excretion, which allows the determination of protein intake (11, 12). The doubly labeled water (DLW) method, developed by Lifson et al (13) in the early 1950s and first used in humans by Schoeller and van Santen (14) in 1982, is commonly accepted as the only method that provides valid estimates of energy expenditure in free-living subjects.
The present validation study was conducted during the baseline recruitment phase of the European Prospective Investigation into Cancer and Nutrition Study at the Potsdam, Germany, study center (EPIC-Potsdam Study). The FFQ used in the baseline examination of the EPIC-Potsdam Study was validated previously for reproducibility and relative validity in a West German study population (1517). Because the performance of a dietary assessment instrument depends on characteristics of the study population, additional validation studies are recommended when a previously validated instrument is used in another population (5, 10). The 2 aims of this validation study were therefore to investigate the validity of the FFQ against biomarkers as reference methods and to obtain data on relative validity by comparing FFQ data with data from 24-h dietary recalls, thereby determining the validity of the FFQ in this East German study population.
| SUBJECTS AND METHODS |
|---|
|
|
|---|
27000 subjects to the overall cohort size of
475000. Study participants were randomly selected from population registries in Potsdam and adjacent communities. Approval for all study procedures, including those of the validation study, was given by the Ethical Committee of the State of Brandenburg, Germany, and informed consent was obtained from all study participants.
Subjects
The 160 subjects who volunteered to participate in the validation study were recruited from participants of the baseline examination of the EPIC-Potsdam Study. Women between 35 and 66 y of age and men between 40 and 67 y of age were divided into 3 age- and sex-specific strata according to the EPIC protocol (18). Only those 134 subjects who completed at least ten 24-h dietary recalls and at least three 24-h urine collections [
85% p-aminobenzoic acid (PABA) recovery] were included in the present analysis. In a subset (15 men and 15 women) of the validation study, total energy expenditure (TEE) was determined with the DLW method. Two persons did not complete the required measurements and were therefore excluded from this analysis.
Dietary assessment
Food-frequency questionnaire
A self-administered, scanner-readable FFQ was the basic instrument used to assess habitual food intake in this cohort study. Development, reproducibility, and relative validity of the FFQ were described in detail previously (1517). The 146-item FFQ included questions about specific food items, such as the frequency of sauce consumption, the fat content of several food items, and seasonal consumption of fruit and vegetables. For certain food items, portion sizes were estimated on the basis of colored photographs of portion sizes; otherwise, standard portion sizes such as a cup (150 mL), jars, and pieces were used. Questions about general patterns of consumption of the main food groups (bread, luncheon meat and cheese on bread, meat, fruit, and vegetables) at the end of the FFQ were used to adjust the consumption patterns that were reported when several single food items from these food groups were requested. After the FFQ was optically read, a software program was used to check the questionnaire data for completeness and plausibility. Missing or implausible information was then corrected with the particpant, resulting in complete and consistent dietary data. The FFQ was administered at the end of the validation study period, thereby covering the 1-y study phase.
Twenty-fourhour dietary recall
Three computer-assisted 24-h dietary recall interviews (EPIC-SOFT, German version; developed in collaboration with the International Agency of Research on Cancer, Lyon, France) were conducted per season during the 1-y study period by one trained interviewer. This software program was specifically developed for the standardized assessment of foods and recipes consumed on the preceding day and was adapted to the specific dietary habits of the EPIC-Potsdam Study population (20). The interviews were meal-sequence based and involved a detailed assessment and description of the food consumed. Colored photographs of different portion sizes of foods were provided to help estimate the quantity of food consumed. Interviews were conducted on all weekdays except Fridays because of organizational restrictions. Recalls of food consumed on Saturdays and Sundays were obtained on Mondays.
Nutrient database
Energy and nutrient intakes from both dietary assessments were calculated by using data from the German Food Code (21).
Lifestyle factors and anthropometry
Information on other lifestyle factors, such as smoking habits and physical activity, were obtained by conducting computer-assisted person-to-person interviews. Anthropometric measures were obtained by trained and quality-monitored personnel (22) while subjects wore no shoes and only light underwear. Body weight was measured on a digital scale to the nearest 100 g; body height was measured with a flexible anthropometer to the nearest 0.1 cm.
Relative energy intake
For all subjects in the validation study, basal metabolic rate (BMR) was calculated based on the formulas of Schofield et al (23) from weight and age. Energy intake reported on the FFQ was then divided by the estimated BMR, thereby obtaining a measure of relative energy intake that accounted for age and body weight (24).
Doubly labeled water method
The DLW method was used to assess the carbon dioxide production values needed to calculate TEE with the classic equations of Coward and Cole (25). This method is based on the differential disappearance rates of the stable isotopes 2H and 18O. We applied the multipoint approach (26) over a time period of 14 d after the oral administration of an individually calculated amount of doubly labeled water per kilogram body weight (27). The isotopes disperse throughout the body water and gradually leave the body over subsequent days. Urine samples from each day of the sampling period were stored by the participants at -20°C. When the sampling was completed, the urine samples were stored at -80°C until analyzed by mass spectrometry with an automated equilibration device. The CV for the applied analytic method was 3%; further details of the isotope analysis were described previously (28).
Twenty-fourhour urinary nitrogen
Four 24-h urine samples were collected from each participant while PABA was administered simultaneously to assess the completeness of urine collection (29). Recovery of <85% PABA indicated incomplete urine collection (30, 31) and resulted in exclusion of 93 of 640 urine samples. Urea nitrogen excretion was measured by spectrophotometry and converted to urea nitrogen excretion in g/d. Assuming that urea nitrogen excretion is a constant proportion (85%) of total urinary nitrogen (12), protein intake was derived from the formula 6.25 x (urinary nitrogen + 2), as suggested by Isaksson (11).
Statistical analysis
Nutrients not normally distributed were logarithmically transformed to achieve normal distributions, which were obtained for all nutrients except alcohol. Means and SDs of energy, protein, and macronutrient intakes were determined for the test (FFQ) and reference (24-h dietary recall, urinary nitrogen, and DLW) methods; differences and ratios between mean values obtained with the FFQ and with the reference methods were calculated. A paired-difference t test and Wilcoxon's signed-rank test for alcohol were used to determine significant differences between means. The variance ratios of the repeated 24-h dietary recalls and the urinary nitrogen measurements were determined by dividing within-person variations by between-person variations. Pearson's and Spearman's correlation coefficients, respectively, were obtained for crude and deattenuated data as well as for energy-adjusted crude and deattenuated data. Deattenuation to correct for intraindividual variability was accomplished by using the formula suggested by Beaton et al (32). Values were adjusted for energy by using the residual method (33).
To assess agreement between the FFQ and reference methods and to detect any bias with the test method relative to the reference method, differences between the 2 respective methods were plotted against the means, as suggested by Bland and Altman (34). To compare the FFQ with the multiple 24-h dietary recall, study participants were classified into quintile categories of energy and macronutrient intakes based on the distribution of data from the test and reference methods. Proportions of subjects classified into the same, adjacent, or extreme quintiles were derived for both crude and energy-adjusted cross-classifications. To compare the test method with the urinary nitrogen and DLW methods, study participants were classified into quartiles of energy and protein intakes according to the distribution of the test and the respective reference methods. Proportions of subjects classified into the same, adjacent, or extreme quartiles were determined. Results were considered statistically significant at a two-tailed
level of 0.05. Statistical analysis was performed by using SAS software (release 6.12; SAS Institute Inc, Cary, NC).
| RESULTS |
|---|
|
|
|---|
|
|
|
23% lower than estimates derived from urinary nitrogen measurements. The correlation was slightly improved by deattenuation for the within-person variation observed in urinary nitrogen excretion. Severe misclassification was observed in 2% of the subjects; 46% of the subjects were classified into the adjacent quartiles and 35% were correctly classified. The Bland and Altman plot (Figure 1
|
|
|
|
|
| DISCUSSION |
|---|
|
|
|---|
A second biomarker, urinary nitrogen, was used to validate protein intake reported on the FFQ. Previous studies comparing urinary nitrogen with FFQ data observed correlation coefficients between 0.07 and 0.54 (10, 30, 42, 43), indicating that our data were within the range observed previously. Similar to reported energy intakes, protein intakes were underreported when compared with urinary nitrogen. The use of urinary nitrogen as a reference method for dietary intake requires stable nitrogen balance, eg, no gain or loss of tissue. For >90% of the study population, the estimated weight change during the 1-y study period was
2 kg. However, larger short-term changes in weight might have occurred during the study period. Furthermore, it was assumed that extrarenal nitrogen losses are a constant proportion of overall nitrogen excretion (12). This might not be the case for all subjects. These limitations might therefore partly explain the observed discrepancies between estimated protein excretion and intake.
Relative validity of the FFQ was assessed in comparison with twelve 24-h dietary recalls administered over 1 y to obtain information on performance of the FFQ in our East German study population. Procedures and data analysis were adapted to the previous validation study of the FFQ in the other German EPIC study population in Heidelberg, where twelve 24-h dietary recalls were obtained in face-to-face interviews (15). Limitations of the applied reference instrument and study procedures were described and discussed previously (15, 16); therefore, only aspects specific to the Potsdam validation study are discussed here.
For the Potsdam validation study, a computer-supported, structured, face-to-face 24-h dietary recall interview was administered by one trained interviewer. In contrast with the Heidelberg Study, for which 24-h dietary recall data were not available for Fridays and Saturdays, we obtained data for Saturdays using a 24-h dietary recall applied on Monday. However, no data were obtained for Fridays because of logistic reasons. Because dietary habits on Fridays typically differ from those on other weekdays, our 24-h dietary recall might have underestimated some nutrients as well as energy intake. Crude correlation coefficients were similar in the 2 studies. Weaker correlations were observed in Potsdam than in Heidelberg only for intakes of alcohol and dietary fiber. Comparison of cross-classifications indicated similar results with the FFQ in our study population; there were no significant differences between the results of the Heidelberg and Potsdam studies. Note, however, that methodologic differences in the 2 validation studies did not allow an unrestricted comparison. Our correlation coefficients were in a range comparable with those of other validation studies (10) and thus we concluded that the relative validity of the FFQ was acceptable.
The validation approach in which a second dietary assessment instrument is used as a reference method assumes uncorrelated errors between the test and reference methods. However, when the dietary assessment instruments used in the present validation study were compared with biomarkers, measurement error between the test methods (FFQ, 24-h recalls) and the reference methods (DLW, urinary nitrogen) were correlated. The validation of dietary assessment instruments has been discussed not only in terms of the appropriateness of the reference instrument but also with respect to the statistics used to compare the validity of the methods (44, 45). As pointed out by Bland and Altman (34), it is inappropriate to use correlation coefficients to analyze agreement between methods. Therefore, the mean difference is calculated to obtain information on bias in the group estimate, and limits of agreement (±2 SD of mean difference) indicate the scatter of individual results. Applying this analytic concept to the energy data, we showed a strongly biased mean of differences and a wide scatter of differences between reported energy intakes and total energy expenditure, despite significant correlations between reported energy intake and TEE. The wide scattering of the differences showed clearly that some subjects underreported their energy intakes more so than did others. This would not have been detected if only 24-h dietary recall data were used as a reference, as shown in Figure 4
.
A limitation in the interpretation of our results with respect to the validity of the FFQ in the entire study population is the fact that participants in the validation study were a selected, highly motivated sample of the study population. In addition, 16 subjects dropped out of the validation study. For these reasons, there may have been smaller within-person variation and, therefore, stronger correlations in the validation study than in the entire study population.
Even though some uncertainties remain about the absolute amount of underreporting of energy intake on the FFQ, we observed a systematic increase in measurement error with increasing absolute reported energy intakes. This observation could have serious implications on risk estimates of diet-disease relations. Differential underreporting depending on subject characteristics and selective underreporting of certain macronutrients (46) are issues to be considered in this regard. Our data indicated, for example, that underreporting was higher with higher energy intakes. Because subjects with a high BMI have higher energy requirements, underreporting could be related to obesity, as described previously (4649). These assumptions are supported by the strong correlations observed in our data between measurement error and BMI. However, the sample size was too small to detect significant determinants of underreporting. Several investigators have noted the importance of obtaining data on random and systematic measurement error of the exposure assessment for correcting linear regression (50) or logistic regression estimates (51). A detailed characterization of measurement error is also considered to be important in the use and interpretation of energy-adjustment models because measurement error may unpredictably distort the estimated effects of energy-adjustment models (52).
In summary, energy intake was underreported on the FFQ when compared with TEE, and protein intakes were underreported on the FFQ when compared with urinary nitrogen. These observations were similar to those described for other dietary assessment instruments. Correlation coefficients between macronutrient intakes estimated with the FFQ and the 24-h dietary recalls were within the range observed in a previous validation study (16), indicating comparable relative validity for the FFQ used in the present study population.
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
C. E. O'Neil and T. A. Nicklas A Review of the Relationship Between 100% Fruit Juice Consumption and Weight in Children and Adolescents American Journal of Lifestyle Medicine, July 1, 2008; 2(4): 315 - 354. [Abstract] [PDF] |
||||
![]() |
K. Hoffmann, T. Pischon, M. Schulz, M. B. Schulze, J. Ray, and H. Boeing A Statistical Test for the Equality of Differently Adjusted Incidence Rate Ratios Am. J. Epidemiol., March 1, 2008; 167(5): 517 - 522. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Bo, M. Durazzo, R. Gambino, C. Berutti, N. Milanesio, A. Caropreso, L. Gentile, M. Cassader, P. Cavallo-Perin, and G. Pagano Associations of Dietary and Serum Copper with Inflammation, Oxidative Stress, and Metabolic Variables in Adults J. Nutr., February 1, 2008; 138(2): 305 - 310. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. E. Thompson, D. Midthune, G. C. Williams, A. L. Yaroch, T. G. Hurley, K. Resnicow, J. R. Hebert, D. J. Toobert, G. W. Greene, K. Peterson, et al. Evaluation of a Short Dietary Assessment Instrument for Percentage Energy from Fat in an Intervention Study J. Nutr., January 1, 2008; 138(1): 193S - 199S. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Molag, J. H. M. de Vries, M. C. Ocke, P. C. Dagnelie, P. A. van den Brandt, M. C. J. F. Jansen, W. A. van Staveren, and P. van't Veer Design Characteristics of Food Frequency Questionnaires in Relation to Their Validity Am. J. Epidemiol., December 15, 2007; 166(12): 1468 - 1478. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. Nothlings, K. Hoffmann, M. M. Bergmann, and H. Boeing Fitting Portion Sizes in a Self-Administered Food Frequency Questionnaire J. Nutr., December 1, 2007; 137(12): 2781 - 2786. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Drogan, K. Hoffmann, M. Schulz, M. M. Bergmann, H. Boeing, and C. Weikert A Food Pattern Predicting Prospective Weight Change Is Associated with Risk of Fatal but Not with Nonfatal Cardiovascular Disease J. Nutr., August 1, 2007; 137(8): 1961 - 1967. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. B. Gori Metaphorical measurements and theories Int. J. Epidemiol., August 1, 2007; 36(4): 931 - 932. [Full Text] [PDF] |
||||
![]() |
M. B. Schulze, M. Schulz, C. Heidemann, A. Schienkiewitz, K. Hoffmann, and H. Boeing Fiber and Magnesium Intake and Incidence of Type 2 Diabetes: A Prospective Study and Meta-analysis Arch Intern Med, May 14, 2007; 167(9): 956 - 965. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. B. Schulze, K. Hoffmann, H. Boeing, J. Linseisen, S. Rohrmann, M. Mohlig, A. F.H. Pfeiffer, J. Spranger, C. Thamer, H.-U. Haring, et al. An Accurate Risk Score Based on Anthropometric, Dietary, and Lifestyle Factors to Predict the Development of Type 2 Diabetes Diabetes Care, March 1, 2007; 30(3): 510 - 515. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. B Levitan, C. W Westgren, S. Liu, and A. Wolk Reproducibility and validity of dietary glycemic index, dietary glycemic load, and total carbohydrate intake in 141 Swedish men Am. J. Clinical Nutrition, February 1, 2007; 85(2): 548 - 553. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Bo, M. Durazzo, S. Guidi, M. Carello, C. Sacerdote, B. Silli, R. Rosato, M. Cassader, L. Gentile, and G. Pagano Dietary magnesium and fiber intakes and inflammatory and metabolic indicators in middle-aged subjects from a population-based cohort. Am. J. Clinical Nutrition, November 1, 2006; 84(5): 1062 - 1069. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. S. Hudson, M. R. Forman, M. M. Cantwell, A. Schatzkin, P. S. Albert, and E. Lanza Dietary Fiber Intake: Assessing the Degree of Agreement between Food Frequency Questionnaires and 4-Day Food Records. J. Am. Coll. Nutr., October 1, 2006; 25(5): 370 - 381. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Sacerdote, L. Fiorini, R. Rosato, M. Audenino, M. Valpreda, and P. Vineis Randomized controlled trial: effect of nutritional counselling in general practice Int. J. Epidemiol., April 1, 2006; 35(2): 409 - 415. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Chan, P. H. Gann, and E. L. Giovannucci Role of Diet in Prostate Cancer Development and Progression J. Clin. Oncol., November 10, 2005; 23(32): 8152 - 8160. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. R. Paul, D. G. Rhodes, M. Kramer, D. J. Baer, and W. V. Rumpler Validation of a Food Frequency Questionnaire by Direct Measurement of Habitual ad Libitum Food Intake Am. J. Epidemiol., October 15, 2005; 162(8): 806 - 814. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. S. Primomo, V. Poysa, G. R. Ablett, C.-J. Jackson, and I. Rajcan Agronomic Performance of Recombinant Inbred Line Populations Segregating for Isoflavone Content in Soybean Seeds Crop Sci., September 23, 2005; 45(6): 2203 - 2211. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Weikert, K. Hoffmann, J. Dierkes, B.-C. Zyriax, K. Klipstein-Grobusch, M. B. Schulze, R. Jung, E. Windler, and H. Boeing A Homocysteine Metabolism-Related Dietary Pattern and the Risk of Coronary Heart Disease in Two Independent German Study Populations J. Nutr., August 1, 2005; 135(8): 1981 - 1988. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. J. Petzke, H. Boeing, S. Klaus, and C. C. Metges Carbon and Nitrogen Stable Isotopic Composition of Hair Protein and Amino Acids Can Be Used as Biomarkers for Animal-Derived Dietary Protein Intake in Humans J. Nutr., |