|
|
||||||||
Original Research Communication |
1 From the Cancer Prevention Research Program, Fred Hutchinson Cancer Research Center (NKH, REP, MLN, JWL, SAB, and RLP), and the Department of Epidemiology, University of Washington (REP, JWL, and SAB), Seattle.
2 Supported by the Fannie E Rippel Foundation, the Fred Hutchinson Cancer Research Center, the National Institutes of Health (T32 CA09661), and the National Cancer Institute (R03 CA80648 and R01 CA53996).
3 Address reprint requests to RE Patterson, Cancer Prevention Research Program, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, MP-702, Seattle, WA 98109-1024. E-mail: rpatters{at}fhcrc.org.
| ABSTRACT |
|---|
|
|
|---|
Objective: The objective was to quantify the magnitude, direction, and predictors of errors associated with energy intakes estimated from the Womens Health Initiative FFQ.
Design: Postmenopausal women (n = 102) provided data on sociodemographic and psychosocial characteristics that relate to errors in self-reported energy intake. Energy intake was objectively estimated as total energy expenditure, physical activity expenditure, and the thermic effect of food (10% addition to other components of total energy expenditure).
Results: Participants underreported energy intake on the FFQ by 20.8%; this error trended upward with younger age (P = 0.07) and social desirability (P = 0.09) but was not associated with body mass index (P = 0.95). The correlation coefficient between reported energy intake and total energy expenditure was 0.24; correlations were higher among women with less education, higher body mass index, and greater fat-free mass, social desirability, and dissatisfaction with perceived body size (all P < 0.10).
Conclusions: Energy intake is generally underreported, and both the magnitude of the error and the association of the self-reporting with objectively estimated intake appear to vary by participant characteristics. Studies relying on self-reported intake should include objective measures of energy expenditure in a subset of participants to identify person-specific bias within the study population for the dietary self-reporting tool; these data should be used to calibrate the self-reported data as an integral aspect of diet and disease association studies.
Key Words: Dietary records systematic bias dietary measurement error energy expenditure postmenopausal women Womens Health Initiative
| INTRODUCTION |
|---|
|
|
|---|
To identify factors associated with dietary energy underreporting, it is necessary to have an objective measure of energy intake. In weight-stable participants, total energy expenditure (TEE) is used as a proxy for energy intake (17, 18), and the doubly labeled water method is generally considered the gold standard for assessment of TEE (19). However, that method requires the costly oxygen-18 isotope and isotope ratio mass spectrometry, which limits its use in large-scale epidemiologic studies. Predictive equations for basal metabolic rate (based on sex, age, weight, or height or on all of these factors) can be used to identify persons whose self-reported energy intakes fall below some physiologically plausible cutoff (2, 6, 10, 17). This method is easy and inexpensive, but predictive equations may not represent some population subgroups (20), include no measure of activity-related energy expenditure (AREE), and identify only extremes of reporting error (18, 21).
Here we use the factorial approach to measure TEE for comparison to self-reported energy intake, as estimated with the use of an FFQ. Factorial determination of objective energy intake involves measuring or estimating the components of TEE: resting metabolic rate (RMR), thermic effect of food, and AREE. Specifically, our objective was to examine the magnitude, direction, and predictors of energy intake underreporting on the FFQ used in the Womens Health Initiative (WHI) Clinical Trial and Observational Study, a nationwide study of health and disease risk in > 160 000 postmenopausal women (21). An understanding of the participant characteristics associated with errors in self-reported dietary intake is critical for the interpretation of diet-disease relations in analyses that rely on self-reported dietary intake, and the development of a comprehensive measurement-error model has the potential to increase the validity and reliability of diet-disease association analysis.
| SUBJECTS AND METHODS |
|---|
|
|
|---|
Participants completed questionnaires and collected their urine for 24 h before each of 2 clinic visits scheduled 1 wk apart. At the visits, we received the 24-h urine collections, reviewed completed questionnaires, took body measurements, conducted indirect calorimetry, and drew blood. To measure activity, participants wore an accelerometer for 3 consecutive days during the week between clinic visits. The Institutional Review Board at the Fred Hutchinson Cancer Research Center approved all procedures.
Energy intake reporting on the FFQ
Before the clinic visits, participants received the WHI FFQ in the mail (visit 1) and in person (visit 2). Both FFQs were completed at home by the participants without assistance. A registered dietitian checked the FFQs for completeness at each visit. The WHI FFQ asks about usual dietary intake during the previous 3 mo and contains 3 sections: 1) 19 adjustment questions that are used in the analysis software to calculate the nutrient content of specific food items, 2) a listing of 122 foods or food groups with questions about the usual frequency of intake and portion size for each entry, and 3) 4 summary questions about the usual intake of fruit, vegetables, and fat added to foods and used in cooking. We analyzed the FFQ dietary data by using the University of Minnesota Nutrition Coordinating Center Nutrient Database (22), and the algorithms for analysis are described elsewhere (23). Compared with food records and recalls in a study of 113 WHI study participants, the completed WHI FFQs underestimated energy intakes by
8% and correlated with energy estimates from records or recalls at 0.4 (24). However, the WHI FFQ has not been validated against the doubly labeled water method.
Total energy expenditure
We objectively estimated energy intake by summing the components of TEE (RMR and AREE) with a 10% addition to this summation for the thermic effect of food (25). Below we provide details of our protocol and measures.
Resting metabolic rate
We measured RMR with a VMAX 2900 indirect calorimeter (SensorMedics, Loma Linda, CA). After 30 min for machine warm-up, volume calibration was conducted daily to
3% of 3 L according to the manufacturers recommendations. Gas calibrations were done before each subject was measured, with the use of 2 mixtures: 26% O2 and 0% CO2 and 20% O2 and 0.75% CO2.
We instructed participants to abstain from food and beverages, except water, for
8 h and to avoid strenuous activity for 48 h before each indirect calorimetry measurement. To begin the visit, participants rested quietly on a recliner for 30 min in the thermally neutral testing room. We explained the procedure and oriented participants to the equipment. The mixing chamber pump was turned on, and the plastic canopy was placed over the head and neck of the recumbent participant, with the vinyl skirt covering the torso. For the participants acclimation to the apparatus and our adjustment of pump speed, we allowed 2 min of data to expire before initiating formal data collection. We collected data points every 30 s and defined steady state as 10 min during which the volume of oxygen consumed, the minute ventilation, and the respiratory quotient did not vary by > 10%. If 10 min of steady state was achieved by 30 min of data collection, the test was concluded. If not, the test was continued until 10 min of steady state was achieved or to 45 min of data collection, whichever occurred first. Analyses indicated that RMR estimates from the first 5 min of calorimetry were significantly higher than the remainder of the measures. However, RMR estimates from indirect calorimetry segments meeting steady state criteria or from calorimetry extended beyond 30 min did not significantly differ from those at 30 min (26). Therefore, we used the 530-min segment of calorimetry measurements for our calculation of RMR and the mean of the 2 RMR measures in our estimation of TEE.
Activity assessment
We measured AREE with a uniaxial accelerometer (Caltrac; Muscle Dynamics Fitness Network, Torrance, CA), a 7-cm2 unit that is worn at the waist and that measures vertical accelerations of the bodys center of gravity. When movement occurs, a cantilevered beam in the monitor bends and emits a current proportional to the force acting on it. A computer in the monitor plots an acceleration curve and uses the area under the curve for the estimation of activity (27).
We instructed participants to maintain their usual activities while wearing the monitor during waking hours for 3 consecutive days, including one weekend day. We input female sex and each womans height, weight, and age into her unit at visit 1. The participants cleared the screen to zero and attached the unit to their waistbands or belts under the right arm on arising each day. As needed, they used weightlifting and pedal modes for stationary, isotonic activities and for bicycling or rowing, which switched the accelerometer settings to prediction equations that more closely estimated the energy expended for these activities. Before retiring, participants recorded the readings in the "CALS USED ACTM" screen reflecting AREE only. Water activities precluded the wearing of the unit. The energy expenditure for these activities, though rare (n = 9 of 278 d), was added to the days monitor total (28). We included data if the accelerometer was worn for
22 waking h/d collectively and if the participant followed protocol instructions. We used the mean of the accelerometer measurements for 23 d for our estimate of TEE. Each participant completed the Physical Activity Scale for the Elderly (PASE) questionnaire before visit 1 (29), and scores were calculated for comparison with accelerometer data. The PASE questionnaire assesses the frequency and duration of low-, moderate-, and high-intensity activities over the previous week, and it has been used in studies of physical activity levels in older adults (30). The scale gives a total score that can range from 0 to > 400, with higher scores representing higher activity levels.
Subject characteristics
Anthropometry
We measured height, weight, and waist and hip circumferences at each clinic visit. We estimated body composition by using urinary creatinine concentrations from two 24-h urine collections 1 wk apart. Participants documented collection times and completeness, stored their urine at
4.4 °C in a refrigerator or cooler at home, and submitted collections at each clinic visit. Urinary creatinine concentrations were determined with a kinetic modification of the Jaffé alkaline picrate-reaction procedure using a Cobas Mira Plus Analyzer (Roche Diagnostics, Brandburg, NJ) according to the manufacturers instructions. The interassay CVs for low, medium, and high urine quality-control pool levels were 1.2%, 1.6%, and 1.6%, respectively. Daily creatinine excretions were excluded if they were < 780 mg/d or if urine was collected for < 23 h or > 25 h (31). We were unable to estimate body composition for 7 women because of urine collections that were incomplete, as calculated by Welless formula for older adults (32). The percentage of body fat (%BF) was derived from the fat-free mass (FFM) (weight FFM = kg fat; kg fat/weight x 100% = %BF).
Psychosocial measures
We administered the Crowne-Marlowe Social Desirability Scale during the second visit (33). This is a 33-item, true-false questionnaire that measures a persons tendency to provide the most socially desirable answer regardless of the truth. The tool discriminates between high total scores, representing a strong tendency to choose the socially desirable answer, and low total scores, representing a weaker tendency to do so, at a level of P = 0.05 or better. The scale has an internal consistency coefficient of 0.88 (Cronbachs alpha) (14, 33).
We used the Stunkard-Sorensen silhouettes to assess the participants perceptions of body size (34). This instrument shows 9 silhouettes from thin to large, which are given a score of 1 to 9, respectively. Participants were asked to identify the silhouettes that best represented their perceived body size, the desired body size, and the healthiest body size. Dissatisfaction with body size was calculated as perceived body size minus desired body size. We also calculated the difference between current body size and the body size perceived to be the healthiest to determine its influence on energy reporting.
Other measures
We collected data on age, race, education, marital status, and household income from a self-administered questionnaire.
Statistical analyses
We present descriptive data (
± SD) for our measures of energy expenditure, the energy intake reported on the FFQ (FFQ energy), and energy underreporting (FFQ energy - TEE), as well as Pearson correlation coefficients for multiple measures. To provide a general assessment of our factorial approach to estimating TEE, we present associations with participant characteristics that should influence TEE [age, body mass index (BMI; in kg/m2), FFM, %BF] and with characteristics that should not influence TEE [waist-to-hip ratio (WHR) and social desirability].
We used multiple linear regression to model the effects of participant characteristics on energy underreporting. The dependent variables were ratio measures of energy reporting. Specifically, we calculated a ratio of FFQ energy divided by TEE for our primary measure of energy reporting error. For analyses, these ratio variables were naturally log transformed to yield approximate normality, and the back-transformed means for FFQ energy divided by TEE are given in tables in the book by Fleiss (35). Independent variables were sociodemographic characteristics (age, income, and education), adiposity-related measures (BMI, FFM, %BF, and WHR), and psychosocial factors (social desirability, perceived body size, dissatisfaction with body size, and disparity from healthy size). To ensure the stability of our estimates, we divided our independent variables into tertiles, except in cases where meaningful categories existed, as follows. Age was divided into decades (5059, 6069, and 7079 y), and education was grouped by level of attainment (high school or some college, bachelors degree, and advanced degree). BMI was categorized by using cutoffs from a 1998 consensus conference of the Expert Panel on the Identification, Evaluation, and Treatment of Overweight in Adults: < 25, 2529.9, and
30 (36). A WHR > 0.80 is associated with increased health risks in women, and 0.80 was used as an upper cutoff (37).
For purposes of comparison, we also examined the ratio of FFQ energy to RMR. Linear regression models using demographic characteristics and adiposity-related measures were controlled for age, education, and household income. Models using psychosocial factors were controlled for age, education, household income, and BMI. We used a linear contrast to test for trends across ordered categories. Pearson correlation coefficients were calculated for FFQ energy reported and TEE measured, and partial correlations were calculated as for the regression models. The z scores were used to test for differences between coefficients and for trends (38). PASE score tertiles were compared with average accelerometer measures. Statistical analyses used SAS version 6.12 (SAS Institute, Inc, Cary, NC). Trends were assigned significance if P < 0.05.
| RESULTS |
|---|
|
|
|---|
$35 000. Participants had a mean (± SD) BMI of 26.4 ± 4.8, %BF estimates of 30 ± 10%, FFM of 47.8 ± 4.7, and WHR of 0.78 ± 0.07.
The mean values and within-subject Pearson correlation coefficients for our measures of energy expenditure, FFQ energy, and energy underreporting are shown in Table 1
. Duplicate indirect calorimetry measures of RMR were highly correlated (r = 0.92), as were the first and last days of AREE estimates measured on the accelerometer (r = 0.70). Analyses of the accelerometer data using paired t tests found no significant differences in AREE estimates between weekends and weekdays or between consecutive and nonconsecutive days of data collection (data not shown). FFQ energy at visit 1 and at visit 2 was also highly correlated (r = 0.79), and a two-tailed t test found no significant differences between the 2 measures.
|
2400 kJ (574 kcal) according to a comparison of FFQ and TEE estimates. The median for energy underreporting estimates in this sample was 20.8%, ranging from 80% underreporting to 140% overreporting (data not shown).
|
|
|
|
Because the accelerometer measure of AREE is novel and vulnerable to bias (participants were aware that we were monitoring their activity levels), we also examined the ratio of FFQ energy to RMR for purposes of comparison (data not shown). Overall, the associations of participant characteristics with ratios of FFQ energy to RMR were similar to those observed for ratios of FFQ energy to TEE, with P values differing only slightly. Specifically, for this outcome, the negative association of educational attainment and underreporting was suggestive (P for trend = 0.06), whereas trends in age (P for trend = 0.17) and FFM (P for trend = 0.20) were not. PASE scores were positively correlated with average accelerometer measures (r = 0.25, P < 0.01). PASE score tertiles (ie, low, medium, and high activity levels) corresponded to accelerometer means of 1723 kJ (412 kcal), 2029 kJ (485 kcal), and 2142 kJ (512 kcal), respectively, with a significant test for trend (P < 0.01).
Data on energy underreporting by psychosocial factors are given in Table 5
. Women with high social desirability scores tended to be more likely to underreport energy intakes (ie, have lower FFQ energy-to-TEE ratios) than women with lower scores (P for trend = 0.09) (33). Women perceiving they were thin according to the Stunkard-Sorensen silhouettes (34) tended to underreport their energy intake more than women perceiving themselves as heavy (P for trend = 0.11). Again, similar trends resulted when ratios of FFQ energy to RMR were used, with a significant association between underreporting energy and social desirability (P for trend = 0.04) (data not shown). A suggestive trend in energy underreporting was seen for dissatisfaction with perceived body size (perceived size - desired size) by use of the FFQ energy:RMR outcome variable (P for trend = 0.09); that is, women closer to their desired size underreported energy more.
|
| DISCUSSION |
|---|
|
|
|---|
21% on the WHI FFQ. This degree of underreporting is consistent with that seen in doubly labeled water method studies (1, 4043). Overall, the most important finding is that energy underreporting appeared to vary with participant characteristics, and evidence suggested more underreporting among women who were younger or had high social desirability scores. Johansson et al (2) reported an increase in energy underreporting with increasing age in a younger group (age 44 ± 9 y). However, studies of reporting errors in older women such as those in our sample found no significant age differences (1, 11, 15, 4446). Our results may vary from those of others because of cultural and age differences among populations studied (47, 48), limited sample sizes, or differences in the measures of energy expenditure and energy intake (ie, the FFQ) (49). We found no association between BMI and energy underreporting such as was found in numerous other studies (3, 4, 612), An interesting point regarding the association of adiposity with underreporting is that there is no plausible biologic reason that excess body fat would, in and of itself, cause women to underreport energy intake. Therefore, the various measures of body size and adiposity must be serving as surrogates for psychosocial characteristics that result in underreporting energy, such as poor awareness of intake or portion sizes, deliberate underreporting, and subconscious biasing toward intake that is perceived to be appropriate. We did observe several suggestive associations of psychosocial factors with underreporting. Postmenopausal women with high social desirability scores, which theoretically reflect a desire to answer questions in a socially desirable manner, underreported energy intake more than women with lower scores (P for trend = 0.09); this finding is consistent with findings in other studies (14, 16). We found that postmenopausal women perceiving themselves to be thin underreported their energy intake more than those perceiving themselves to be heavy, even when we controlled for actual BMI (P for trend = 0.11). These findings support the idea that body image, independent of measured BMI, is an important predictor of energy-reporting errors.
This study has many limitations. In particular, the modest sample size limited our ability to detect many associations at conventional significance levels. The generalizability of our results is limited because we included only postmenopausal women from the United States. Although the correlation coefficient for reported FFQ energy and measured TEE (r = 0.24) was similar to the findings of others who used different tools in different populations (all: r = 0.22, P = 0.20) (13, 50), our measure of TEE was likely limited by
3 features of the AREE measurement. First, although we specifically instructed women to refrain from strenuous activity for 48 h before the indirect calorimetry measurements, 20 women (19.6%) recorded accelerometer data between 36 and 48 h beforehand. Second, the direction of the measurement error inherent in the accelerometer is unclear. AREE may be underestimated because the accelerometer measures movement in only one plane, does not account for fidgeting or anxiety-related energy expenditures, and is unable to discern variations in grade (51). Third, these participants did not wear the monitor during sleeping hours. One study that compared physical activity estimates determined with this accelerometer to those determined with indirect calorimetry and the doubly labeled water method found that the accelerometer underestimated by 55% in 4584-y-old women (n = 35) (52). Conversely, another study found that this accelerometer overestimated activity > 2 mph in women aged 71.2 ± 3.5 y (51). Others described this accelerometer as unable to adequately discriminate between running speeds of 58 mph (53). Finally, although physical activity is theoretically an objective measure, participants may have altered their usual physical activity levels in response to wearing an accelerometer (54). RMR alone accounted for 6070% of TEE (quartiles 1 and 3), and, when we examined the associations of participant characteristics with the ratio of FFQ energy to RMR (which excludes the accelerometer component), trends were almost identical to those seen for the ratio of FFQ energy to TEE, which indicates that errors in the accelerometer measures cannot entirely explain our findings. Nonetheless, one could postulate that errors in this measure of physical activity were associated with participant characteristics (eg, age and BMI) and could influence our results.
The ability to establish relative ranking of dietary energy intake from low to high was affected by participant characteristics, as evidenced by the correlation coefficients between FFQ energy and TEE. For example, dietary energy intake was ranked with more precision in relation to the objective TEE marker among women with lower levels of education, higher income, and greater BMI, social desirability, or dissatisfaction with perceived body size. These observations are limited by small sample sizes, and we know of no similar data in the literature with which to compare our findings.
This study supports the hypothesis that types of errors in dietary energy self-reporting vary by participant characteristics. These errors, when associated with dietary exposures or confounding factors of interest, could result in incorrect inferences regarding diet-disease relations. Studies relying on self-reported dietary intake to assess exposure to dietary components should include objective measures of energy intake (energy expenditure) in at least a subset of participants to identify sources of systematic error within the particular population studied and in the specific self-reporting tool used. The objective measure of energy expenditure in a subset of the study cohort should follow the self-reported intake. These data will allow an assessment of systematic bias for the self-reporting tool in the context of the study population and can be used to calibrate the self-reported data as an integral aspect of diet and disease association analyses. Future objective-measures substudies are needed that have the following characteristics: 1) larger sample sizes, 2) more-diverse populations, 3) additional biomarkers that could assist in determining whether underreporting varies by macronutrient source, and 4) a wider array of psychosocial measures to elucidate the observation that adiposity is associated with underreporting. Identification of characteristics associated with energy-reporting errors can lay the groundwork for the development of statistical methods equipped to adjust for person-specific systematic error in dietary self-reporting.
| ACKNOWLEDGMENTS |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
M. L. Neuhouser, L. Tinker, P. A. Shaw, D. Schoeller, S. A. Bingham, L. V. Horn, S. A. A. Beresford, B. Caan, C. Thomson, S. Satterfield, et al. Use of Recovery Biomarkers to Calibrate Nutrient Consumption Self-Reports in the Women's Health Initiative Am. J. Epidemiol., May 15, 2008; 167(10): 1247 - 1259. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Watson, T. Baranowski, D. Thompson, R. Jago, J. Baranowski, and L. M. Klesges Innovative application of a multidimensional item response model in assessing the influence of social desirability on the pseudo-relationship between self-efficacy and behavior Health Educ. Res., December 1, 2006; 21(suppl_1): i85 - i97. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Halkjaer, A. Tjonneland, B. L Thomsen, K. Overvad, and T. I. Sorensen Intake of macronutrients as predictors of 5-y changes in waist circumference. Am. J. Clinical Nutrition, October 1, 2006; 84(4): 789 - 797. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Blanton, A. J. Moshfegh, D. J. Baer, and M. J. Kretsch The USDA Automated Multiple-Pass Method Accurately Estimates Group Total Energy and Nutrient Intake J. Nutr., October 1, 2006; 136(10): 2594 - 2599. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. E. Foster-Schubert, A. McTiernan, R. S. Frayo, R. S. Schwartz, K. B. Rajan, Y. Yasui, S. S. Tworoger, and D. E. Cummings Human Plasma Ghrelin Levels Increase during a One-Year Exercise Program J. Clin. Endocrinol. Metab., February 1, 2005; 90(2): 820 - 825. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A Tooze, A. F Subar, F. E Thompson, R. Troiano, A. Schatzkin, and V. Kipnis Psychosocial predictors of energy underreporting in a large doubly labeled water study Am. J. Clinical Nutrition, May 1, 2004; 79(5): 795 - 804. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |