|
|
||||||||
ORIGINAL RESEARCH COMMUNICATION |
1 From the Nutritional Sciences, University of WisconsinMadison (DAS); Preventive Medicine, Health Science Center, University of Tennessee, Memphis, TN (FAT); the Beltsville Human Nutrition Research Center, US Department of Agriculture, Beltsville, MD (DJB); the National Center for Health Statistics, Centers for Disease Control and Prevention, Hyattsville, MD (LGB); the Lifespan Health Research Center, Department of Community Health, Wright State University School of Medicine, Kettering, OH (WCC and RMS); Virginia Polytechnic Institute and State University, Blacksburg, VA (CPE); the University of California, San Francisco (TF); the National Institute on Aging, National Institutes of Health, Bethesda, MD (TBH); the Obesity Research Center, St Luke's Roosevelt Hospital, New York, NY (SBH and HM); the University of Arizona, Tucson, AZ (TGL); the Grand Forks Human Nutrition Research Center, US Department of Agriculture, Grand Forks, ND (HCL); and Radiology, University of California, San Francisco (JS)
2 Mention of a trademark or proprietary product does not constitute a guarantee of warranty of the product and does not imply its approval to the exclusion of other products that may also be suitable by any of the following agencies: USDA, CDC/DHHS, or NIH/DHHS. 3 Supported in part by grants from the National Institute on Aging (N01-AG-2106, N01-AG-6-2102, and N01-AG-6-2103) and by Virginia Polytechnic Biological Initiative Funds. The initial workshop was supported by the National Center for Health Statistics, Centers for Disease Control and Prevention, and the National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health. 4 Reprints not available. Address correspondence to DA Schoeller, Nutritional Sciences, University of WisconsinMadison, 1415 Linden Drive, Madison, WI 53706. E-mail: dschoell{at}nutrisci.wisc.edu.
| ABSTRACT |
|---|
|
|
|---|
Objective: The objective was to compare the calibration of the QDR 4500A against criterion methods in a large heterogeneous population.
Design: DXA-derived body-composition data were obtained from 7 studies: 6 data sets were provided by the investigators, one of which was published. The data included fat mass and fat-free mass measured with a QDR 4500A and criteria measurements of body composition from total body water by dilution at 4 centers, densitometry from 1 center, and four-compartment analysis at 2 centers.
Results: In the cohort of 1195 subjects, 602 men and 593 women aged 1982 y with a body mass index (in kg/m2) of 1644, the fan-beam DXA overestimated fat-free mass (P < 0.05). A significant difference was observed in all 7 data sets, and the mean (±SE) was 5 ± 1%.
Conclusions: It is recommended that the lean soft tissue mass estimate with the fan-beam QDR 4500A be reduced by 5% and that for fat mass be increased by that same mass. This finding is particularly important because the National Health and Nutrition Examination Survey is using the QDR 4500A to assess body composition in a nationally representative sample of persons in the United States.
Key Words: Body composition hydration total body water
| INTRODUCTION |
|---|
|
|
|---|
A direct measure of fat-free mass (FFM) and fat mass (FM) is, therefore, often preferred for assessing obesity. Dual-energy X-ray absorptiometry (DXA) estimates of FFM and FM have been validated and generally are reported to correlate highly with values determined with criterion methods (814). In addition, DXA can provide estimates of fat distribution by body region, yet is rapid and simple to perform in most subjects. The DXA information on body composition from large multiethnic studies may help to identify factors that might explain differences seen in cardiovascular disease risk factors and other markers of chronic disease, including bone mineral density (BMD) (1518). For these reasons, the National Center for Health Statistics, Centers for Disease Control and Prevention, included DXA in the current National Health and Nutrition Examination Survey (NHANES) (19).
DXA, however, is not without limitations. Although highly correlated with criterion methods, modest systematic variation in the absolute estimates of body composition by DXA can arise from different hardware and software accommodations to several factors, including interpolations for soft tissues located over bone (8) and treatment of pixels for which a small portion is bone (20). For newer and faster fan-beam instruments, parallax error (beam magnification) due to the variation in heights between the source and detector of tissues is an additional concern (11). Because of these inaccuracies in a given DXA instrument (11), it is important to either cross-validate new DXA models or substantive software revisions against an established DXA instrument or an established criterion method.
The accuracy of the QDR 4500A (Hologic, Bedford, MA) is of particular interest because it is being used in the continuous NHANES. The results from NHANES will provide national estimates for body composition, which should be a valuable baseline for future surveys or smaller independent samples. Future studies are likely to use other instruments or techniques; therefore, establishing the accuracy of the fan-beam QDR 4500A is critical. Recent studies, however, have indicated that the Hologic QDR 4500A overestimates FFM and underestimates FM compared with criterion methods (12, 21, 22) and the pencil-beam QDR 2000 (11, 13).
The aim of this study was to compare the QDR 4500A measurement of FM and FFM against that of measurements made with criterion methods at multiple laboratory sites. The use of multiple laboratories ensured that individual bias from any one instrument, criterion method, or protocol would be minimized and, thus, the NHANES results should be the most reliable estimates of body composition in the US population that are possible.
| SUBJECTS AND METHODS |
|---|
|
|
|---|
|
|
Whole-body DXA
A Hologic model QDR 4500A fan-beam X-ray absorptiometer was used to measure body composition at each site. Lean soft tissue mass (LSTM), FM, and bone mineral content (BMC) were assessed by using software versions 8.21, 8.25A, or 8.26A (Whole Body Analysis software), which all use pixel-specific adjustment for beam magnification and the same routine for estimating body composition. Participants were positioned for whole-body scans according to the protocol recommended by the manufacturer. Participants lay supine on the DXA table with limbs close to their bodies. Only those with all body parts in the scan field were included in this study. FFMDXA was the sum of LSTM and BMC.
DXA phantom analysis
To test whether the instruments were comparable, the Hologic whole-body phantom was circulated among laboratories 1 to 5 as well as the 3 NHANES mobile examination centers. At each site, the phantom was scanned 10 times with repositioning of the phantom between each scan; thus, the results included positioning error. The phantom measurements were analyzed and reviewed for accuracy errors by trained staff at the University of California, San Francisco. The mean and SD for the most commonly used variables from each phantom were calculated by using standard methods, including body mass, LSTM, FM, percentage fat, BMD, and BMC. The ratio of LSTM obtained with each instrument to the mean value averaged across all instruments was calculated and compared with the ratio of FFMDXA to FFMcriterion for each individual DXA instrument by linear regression.
Criterion methods
Data set 1 was from the Virginia Polytechnic Institute and State University. The criterion method was TBW measured by deuterium dilution. After a baseline venous blood sample was collected, subjects were given a weighed dose of deuterium oxide (99.8 atom%; Isotec Inc, Miamisburg, OH), equivalent to 0.30 g/kg body weight in 100 mL distilled water, followed by a 100-mL distilled water rinse. After a 3-h equilibration period, during which subjects did not ingest anything, a second postdose blood specimen was drawn. Plasma samples were purified by diffusion after the method described by Davis et al (27). Plasma samples were purified by incubating equal volumes of plasma and distilled deionized water at 37 °C for 48 h in incubation dishes (Bel-Air Products, Pequannock, NJ). Purified plasma samples were subsequently analyzed for deuterium enrichment by isotope ratio mass spectrometry (MAT 251; Finnigan, Bremen, Germany). TBW was corrected for urinary deuterium losses and reduced by 4% to correct for exchange with protein and carbohydrate during the 3-h equilibration period (28).
Data set 2 was from the energy expenditure subset of the Health, Aging, and Body Composition (Health ABC) Study conducted at the University of Pittsburgh and the University of Tennessee Health Science Center as described elsewhere (29, 30). The criterion method was TBW measured by isotope dilution;
4 g deuterium and 8 g 18O-labeled water were given to fasted subjects after collection of a baseline urine sample. Three additional urine samples were collected in the next 6 h. Enrichments of tracer were measured in the final 2 urine samples relative to baseline by isotope ratio mass spectrometry (Finnigan Delta-S and Delta Plus). Corrections were made for water intake during urine collections, and dilution spaces were reduced by 4% and 0.7% for deuterium and 18O dilution, respectively, and averaged (28).
Data set 3 was from the US Department of Agriculture, Beltsville, MD. The criterion method was TBW measured by isotope dilution. After an overnight fast, subjects provided a urine sample that was used to measure background isotope enrichments and were then dosed orally with 0.1 g/kg body weight each of deuterium and 18O. The bottle containing the dose was rinsed with 100 mL of deionized water and consumed. After that, the subjects were offered a muffin and juice and were allowed coffee, tea, or water for up to 1 h postdose. Food and beverages were not allowed after the first hour. Saliva samples were collected 4 and 4.5 h postdose. Isotopic enrichment was measured by continuous-flow, isotope ratio mass spectroscopy (Europa Scientific Hydra, Cheshire, United Kingdom). TBW was calculated as the average of the deuterium and 18O dilution spaces calculated from the enrichment of the last saliva sample and corrected for 4% and 0.7% in vivo exchange, respectively (28).
Data set 4 was from the University of Tennessee Health Science Center. The criterion method was TBW measured by deuterium dilution as described elsewhere (29, 30). An oral dose of 4 g deuterium oxide was administered to each participant after a 612 h fast. Plasma samples were collected into a dry EDTA-coated tube before and 4 h after the isotope administration. Plasma protein was removed by ultrafiltration, and deuterium enrichment above baseline was measured by isotope ratio mass spectrometry (Finnigan Delta Plus). Subjects were allowed fluid at 1 h after the dose, and corrections were made by subtracting this water intake from TBW, but only in those in whom intake exceeded 0.2 kg. A 1% correction was made for isotope exchange during plasma filtration (30) and 4% for in vivo exchange (28).
Data set 5 was from the University of California, San Francisco, as described elsewhere (31). A four-compartment model was used for the criterion method. TBW was measured by using the same methods as described for data set 4. Body density was measured by underwater weighing (UWW) while the subjects wore a bathing suit. Water temperature was set at 3235 °C. Five of the most consistent trials (underwater weights within a range of 0.02 kg) from 10 replicates were averaged. Before submersion, residual lung volume was measured in triplicate by using a respirometer (model SVR/PLUS; Collins, Braintree, MA). Total-body mineral mass was calculated from measured total-body BMC in the skeleton by the QDR 4500A. To account for the mineral in nonosseous tissue, total mineral from DXA was multiplied by 1.23. The four-compartment model equation of Lohman (32) was used except as noted above, where total-body mineral was used rather than bone mineral (33).
Data set 6 was from the Life Span Health Research Center at the Wright State University School of Medicine. The criterion method was UWW. Underwater weight was measured in a tank of water (4 ft wide, 6 ft long, and 5 ft deep) at 34 °C. The chair was suspended by 4 load cells whose weights were summed and the weight printed. Residual lung volume was measured on land with a SensorMedics model 2450 (Yorba Linda, CA) Pulmonary Function Laboratory. Weight in air was measured on a Seca scale (Hamburg, Germany) to 0.1 kg. The body density data for each adult participant were converted into FFM by using the Siri equation as described in detail by Guo et al (26).
Data set 7 was extracted from published data (22) and included to show external validity. It was the only other study that met our criteria for comparing the QDR 4500A against TBW, UWW, or a four-compartment model as the criterion method. Briefly, these investigators used the four-compartment model of Baumgartner et al (25). Body density was measured by air plethysmography. TBW was measured by deuterium dilution 3 h after a 10-g oral dose of deuterium oxide using plasma. Deuterium was measured by infrared spectroscopy, and a 5% correction for in vivo isotope exchange was applied. Bone mineral was determined by using the QDR 4500A and multiplied by 1.167 to adjust it to the Lunar DPXL equivalent and again by 1.235 to calculate total-body mineral to match the assumptions used in the development of the four-compartment model (25).
Calculation of FFM
When the criterion method was TBW, FFM was calculated as TBW/0.73. FM was calculated as body mass minus FFM. The selection of 0.73 as the hydration of FFM was based on an extensive review of the literature (Table 3
). When the criterion method was the four-compartment model, percentage fat was calculated by using the equations referenced above. FM was calculated as percentage fat/100 multiplied by body mass, and FFM was calculated as the difference.
|
0.05. Statistical calculations were performed by using JMP version 4 (SAS Institute Inc, Cary, NC). | RESULTS |
|---|
|
|
|---|
Data were combined for men and women, and the DXA estimate of FFM was compared with that from the criterion method individually for data sets 16 (Table 2
). FFM was 1.84.7 kg larger by DXA than by the criterion method, and FM was 1.35.1 kg smaller by DXA in laboratories 1- 6 (Table 3
). DXA-determined body mass was significantly different from scale mass in all laboratories, but the differences included both negative and positive offsets, the largest of which was 1.1 kg. Data on FFM and FM were not directly available from sample 7, but it was reported that the DXA-derived percentage fat was 2 and 4 percentage points smaller in women and men, respectively, than that from the four-compartment criterion method. From this we calculated that FFM averaged 1.9 kg greater by DXA and FM 1.9 kg less by DXA.
FFMDXA was found to have a strong linear relation with FFMcriterion in each of the data sets. The correlation coefficients ranged from 0.972 in data set 4 to 0.991 in data set 5. When FFMDXA was regressed on FFMcriterion, the slopes ranged from 0.909 to 0.967 (Table 4
). The average of the individual slopes was 0.946, which indicates that DXA overestimated FFM by 5.4% with an SE of 0.7% compared with the criterion methods. The SE of the slope for data set indicated that the precision of the linear fit was significantly greater for data sets 1 and 3 than for the other data sets (P < 0.05; F test). Of note, however, the variance about the slope within a laboratory (VWL) for any given data set is less than the variance in the slopes (Vtot) when averaged between data sets (P < 0.01; F test). Calculation of the betweendata set (laboratory) variance (VBL = Vtotal
VWL) indicates that the betweendata set SD for the slope was 0.015. This value is more than twice the largest withindata set SE and, therefore, the betweendata set (laboratory) variance has a greater influence on the final average across data sets than does the individual variance within a data set.
|
|
Because the criterion method might also influence the comparison of FFMDXA with FFMcriterion, we compared the slopes of the regression of FFMDXA on FFMcriterion using multiple criterion methods for those data sets in which comparisons between several criterion methods could be made (Table 6
). This was only possible in 2 data sets, however. The slopes and SDs (0.952 ± 0.019) for the various criterion methods from these 2 data sets were not different from those for the between-laboratory variance (0.945 ± 0.016; Table 4
).
|
Given that between-laboratory variance in the relation between FFM from DXA and the criterion methods was larger than the within-laboratory variance and that the variance could not be explained by variance in the analysis of the phantom data, we concluded that the variance resulted from small between-laboratory biases in the criterion methods. Because of this, the average of the mean proportional error in FFM for the 7 data sets was used to estimate the bias in the body-composition estimates of the QDR 4500A. This average was 0.946 and it was significantly different from zero (P < 0.001). Thus, the calibration of the QDR 4500A was in error, which resulted in an overestimate of FFM and an underestimate of FM.
A correction factor for QDR 4500A estimates of FFM and FM was determined by using the assumption that BMC obtained from the QDR 4500A is correct and should not be adjusted. Corrected FFMDXA was obtained by multiplying LSTM by the determined correction factor. Corrected FM was then calculated by subtracting the sum of LSTM and BMC from total body weight determined by DXA as follows:
![]() | (1) |
![]() | (2) |
![]() | (3) |
BMC was not corrected.
| DISCUSSION |
|---|
|
|
|---|
One potential criticism of our findings was that 4 of the 7 data sets used TBW as the criterion method. The use of TBW requires an assumption of a constant hydration of FFM. Wang et al (49) recently provided a theoretical basis for variation in the hydration of FFM in healthy adults and, in so doing, predicted a maximal range for the individual hydration factors of 0.66 to 0.77. Our review of the literature indicated that group mean values are less variable than are those predicted by Wang et al (49), ranging from 0.70 to 0.75 in healthy adults (Table 1
). Our assumed value of 0.73 is at the center of these ranges. Individual studies within this compilation indicate that hydration of FFM is a few percentage points larger during pregnancy, in severe protein malnutrition, and in muscle builders (34, 40, 41); these observations are consistent with the theoretical basis of hydration reviewed by Wang et al (49). After exclusion of these 3 subsets, we performed a meta-analysis to determine whether there were effects of age in adulthood, sex, or ethnicity on hydration of FFM. The average hydration of FFM, excluding the 3 groups listed above with significant increases in hydration, was 72.6 ± 1.4% (between-study SD) with no significant effect of age in adulthood, sex, or ethnicity. These lines of evidence support our use of a hydration factor of 0.73 for our populations.
We also questioned the use of different criterion methods in our analysis. To address this concern, we performed further data analyses in 2 of our data sets that had measurements that permitted the calculation of FFM by several different criterion methods (Table 6
). The mean FFM was similar among the various criterion methods, and the slopes between DXA and the criterion method were all <1.0. Moreover, the within-criterion method variances for the slopes were comparable with those for TBW. This provides further support for the use of multiple criterion methods and substantiates the use of TBW as a criterion method for assessing the accuracy of the calibration of the QDR 4500A.
Our use of an unweighted mean to calculate the correction factor for FFM rather than a weighted mean for sample size could also be questioned. Our choice of the unweighted mean was based on the between-laboratory variance being larger than the within-laboratory variance. Both the literature review of the FFM hydration data and the data in Tables 4
and 6
indicate a larger between-laboratory variation than would be predicted from the SE for the within-laboratory mean. This indicates that there was a systematic difference in the criterion methods between laboratories and that this difference became limiting for the interlaboratory comparison. Thus, we used the between-laboratory average for the slope of FFMDXA on FFMcriterion without weighting, despite greatly different numbers of participants in the data sets.
It is also possible that the between-laboratory variance may have resulted from between-instrument variance of the DXA instruments. A common whole-body phantom was circulated among the laboratories to determine interlaboratory differences for the assessment of FFM. The results of phantom data analysis showed no difference between the DXA instruments used in this calibration study. Because the circulation of the phantom was done 15 y after the human data were collected, it may not represent the actual accuracy of the particular instrument during the collection of the human data. We speculated, however, that the differences that existed between these instruments during the period of data collection were not any greater than those measured during the phantom analysis for the following reasons. Although calibration of a DXA instrument can be altered as a result of a major repair to the DXA hardware or the upgrade of software, none of the laboratories reported any major repair to their DXA instrument nor were there any differences in the software versions used to collect the human or phantom data. It should also be noted that the 3 NHANES DXA instruments being used to collect the national data were found to provide estimates of FFM similar to those obtained by the laboratories used to determine our correction factor (Table 5
). This finding supports the use of the above-suggested corrections for the NHANES data.
The error for the FFMDXA value for the QDR 4500A may appear surprising in light of the large number of publications that have found DXA to be an accurate means of assessing FFM and FM in adults. Many of these validations, however, were performed with the use of pencil-beam instruments. The QDR 4500A is a fan-beam instrument and thus involves a correction for beam magnification and a new software routine for converting the X-ray absorption data to body composition. Beam magnification has been shown to influence the measurement of FFM and FM by DXA (13). This and the use of the new software appear to have introduced a modest systematic error in the FFM calibration that requires correction. This error is specific to the QDR 4500A and these software versions. Other fan-bean instruments require independent validation to determine whether they too are subject to systematic bias.
The calibration of the QDR 4500A is of particular importance because it is currently being used to acquire body-composition data for a nationally representative sample of individuals in the United States as part of the NHANES. Because these data may be used to make national policy decisions, comparisons over time within the United States, comparisons between countries, and comparisons of study cohorts with the NHANES sample, an assessment of the accuracy of the QDR 4500A was critical. Based on our findings of a biologically significant bias in the QDR 4500A, this correction was implemented in NHANES 19992004 data before its release. We suggest that others who use the QDR 4500A should also use this correction for FFM, FM, and percentage fat.
| ACKNOWLEDGMENTS |
|---|
LGB conceived and organized the combined effort of the authors. DAS, FAT, and TGL organized the data and data analysis and prepared the first draft of the manuscript. DJB, WCC, CPE, TF, TBH, SBH, HCL, HM, JS, and RMS provided the data sets and edited the manuscript. None of the authors reported having a conflict of interest.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
J. S. Lee, M. Visser, F. A. Tylavsky, S. B. Kritchevsky, A. V. Schwartz, N. Sahyoun, T. B. Harris, A. B. Newman, and for the Health ABC study Weight Loss and Regain and Effects on Body Composition: The Health, Aging, and Body Composition Study J Gerontol A Biol Sci Med Sci, April 14, 2009; (2009) glp042v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Fantin, V. D. Francesco, G. Fontana, A. Zivelonghi, L. Bissoli, E. Zoico, A. Rossi, R. Micciolo, O. Bosello, and M. Zamboni Longitudinal Body Composition Changes in Old Men and Women: Interrelationships With Worsening Disability J. Gerontol. A Biol. Sci. Med. Sci., December 1, 2007; 62(12): 1375 - 1381. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |