|
|
||||||||
Review Article |
1 From the Department of Internal Medicine, the Center for Human Nutrition, Washington University, St Louis (DAF); the Department of Preventive Medicine, the Institute for Preventive Research, the Keck School of Medicine, the University of Southern California, Los Angeles (MIG); and the Energy Metabolism Laboratory, the Jean Mayer US Department of Agriculture Human Nutrition Research Center on Aging, Tufts University, Boston (MAM).
2 The contents of this publication do not necessarily reflect the views or policies of the US Department of Agriculture.
3 Address reprint requests to MA McCrory, Energy Metabolism Laboratory, Jean Mayer USDA Human Nutrition Research Center on Aging, Tufts University, 711 Washington Street, Boston, MA 02111-1524. E-mail: mmccrory{at}hnrc.tufts.edu.
| ABSTRACT |
|---|
|
|
|---|
23% BF by both the BOD POD and HW. Individual variations between 2compartment models compared with DXA and 4compartment models are partly attributable to deviations from the assumed chemical composition of the body. Wide variations among study means, 4.0% to 1.9% BF for BOD POD HW and 3.0% to 1.7% BF for BOD POD DXA, are likely due in part to differences in laboratory equipment, study design, and subject characteristics and in some cases to failure to follow the manufacturer's recommended protocol. Wide intersubject variations between methods are partly attributed to technical precision and biological error but to a large extent remain unexplained. On the basis of this review, future research goals are suggested.
Key Words: Body-composition methods air-displacement plethysmography hydrostatic weighing dual-energy X-ray absorptiometry isotopic dilution total body water multicompartment body-composition models thoracic gas volume residual lung volume review
| INTRODUCTION |
|---|
|
|
|---|
In this review, we summarize the principal findings from studies published between December 1995 (the time at which the BOD POD was initially validated) and August 2001 that compared the BOD POD with reference methods. Specifically, we compared in both adults and children the reliability and validity of the BOD POD with the reliability and validity of established reference methods, ie, hydrostatic weighing (HW), dual-energy X-ray absorptiometry (DXA), and multicompartment [3-compartment (3C) and 4-compartment (4C)] models. To fully comprehend the significance of the viability of the BOD POD today, it is necessary to gain an understanding of the history of the development of air-displacement plethysmography. Therefore, we provided a brief description and historical overview of air-displacement plethysmography in general and of the BOD POD in particular and reviewed the operating principles of the BOD POD. Finally, we discuss the potential applicability of air-displacement plethysmography for use in a wide range of populations and summarize areas in need of further research.
| BACKGROUND AND BRIEF HISTORICAL PERSPECTIVE |
|---|
|
|
|---|
In air-displacement plethysmography, the volume of an object is measured indirectly by measuring the volume of air it displaces inside an enclosed chamber (plethysmograph). Thus, human body volume is measured when a subject sits inside the chamber and displaces a volume of air equal to his or her body volume. Body volume is calculated indirectly by subtracting the volume of air remaining inside the chamber when the subject is inside from the volume of air in the chamber when it is empty. The air inside the chamber is measured by applying relevant physical gas laws. Boyle's Law states that at a constant temperature, volume (V) and pressure (P) are inversely related:
![]() | (1) |
The principles of plethysmography were first applied to the measurement of the body volume and composition of infants in the early 1900s (11, 12), but it was not until the 1960s that relatively stable measurements were achieved (13, 14). However, these systems required that ambient conditions be maintained constant. Therefore, to deal with rapid fluctuations in temperature, humidity, and pressure generated by humans inside the enclosed chamber, the measurement process by necessity included procedures that were difficult and laborious and by modern standards would be considered impractical and unacceptable. For example, the infant plethysmograph developed by Friis-Hansen (13) needed to undergo a 12-h calibration procedure before each measurement, and the test procedure took an additional 23 h. The technique also required that a plastic catheter be inserted through the infant's nose into the stomach to achieve a direct connection between the air inside the infant and the air in the surroundings. Another example in which extreme measures were necessary is the use of Gundlach and Visscher's adult plethysmograph (14). This procedure required that the test chamber be filled with polyurethane foam to maintain isothermal conditions. In addition, the adult subject had to be wrapped in a goose-down blanket and was required to hold his or her breath for
10 s during the measurement. Because of inconveniences such as these and various technologic difficulties, none of the early air-displacement plethysmographs were ever developed for common, everyday use.
Later experimental air-displacement plethysmographs developed in the 1980s were more advanced technologically. Petty et al (9) used a motor-driven pump and oscillating piston to create pressure changes within their system designed for adults; they also used advanced electronics and material to absorb moisture build-up in the chamber during the 5-min test period. An infant plethysmograph developed by Taylor et al (10) used a 2-chambered, dynamic, pressure-differential system. Pistons between the 2 chambers moved in concert and were controlled by a sinusoidal crank. A high-pass filter (controlled leak) was also incorporated, and harmonic analysis was done to interpret the pressure signal. Despite major improvements over previous systems, the results from these newer systems were still not sufficiently accurate and repeatable for routine human body-composition measurements.
| BASIC PRINCIPLES OF THE BOD POD |
|---|
|
|
|---|
450 and 300 L, respectively. A diaphragm oscillates between the chambers, producing sinusoidal volume perturbations that are equal in magnitude but opposite in sign. The perturbations result in very small pressure changes within the chambers (±
1 cm water), which are monitored by transducers and analyzed for pressure at the frequency of oscillation (3 Hz). The ratio of the pressures is a measure of the test chamber volume. Unlike with early air-displacement plethysmographs, it is not necessary to conduct measurements under isothermal conditions in the BOD POD. Instead, the air in the chambers is allowed to compress and expand adiabatically (ie, it freely gains and loses heat during compression and expansion). In this case, the BOD POD makes use of Poisson's Law, which describes the pressure-volume relation under adiabatic conditions:
![]() | (2) |
is the ratio of the specific heat of the gas at constant pressure to that of constant volume and is equal to 1.4 for air (16).
Although body-volume measurements in the BOD POD occur under mostly adiabatic conditions, there is some volume of air maintained under isothermal conditions that must be taken into account. The reason for this is that when there are small changes in pressure, isothermal air volumes are compressed 40% more than are adiabatic air volumes. The largest sources of isothermal air are those contained in the lungs, near skin or hair, and in clothing. Isothermal air from clothing and hair on the head are minimized by having the subject wear a tight-fitting swimsuit and swim cap. (The manufacturer of the BOD POD recommends the use of swimsuits and caps made from either Lycra (DuPont, Wilmington, DE) or other spandex-type material, for reasons discussed later.) The average amount of air in the lungs during normal tidal breathing, thoracic gas volume (VTG), is measured with the procedure described below. Alternatively, VTG can be predicted. Finally, the effect of isothermal air near the skin's surface is estimated by calculating a surface area artifact (SAA). The SAA is automatically computed by the BOD POD's software as
![]() | (3) |
-1.0 L for average-sized adults; it is negative because it represents the apparent negative volume produced by the isothermal air space near the skin's surface. Note that if the wrong height is entered into the BOD POD software, the calculation of %BF will be in error because of inappropriate estimates of SAA. For example, for a 70-kg average-sized person, a 25-cm (
10-in) error in height [168 cm (66 in) instead of 193 cm (76 in)] will result in a miscalculation of SAA (via BSA) and an error in body fatness of
0.50.7% BF. Thus, all of the BOD POD results should be routinely screened to determine whether any software data entry errors were made.
The measurement of body volume involves 3 steps. The first step is a standard 2-point calibration process: first with the chamber empty to establish baseline and then with a calibration cylinder (
50 L) to establish range (duration: 50 s each). In the second step, the subject's volume in the chamber is measured (duration: 50 s). At this point, the measured body volume is "raw" (Vbraw), ie, it has not been corrected for VTG and SAA. This step is then repeated to check for agreement. If these 2 Vbraw measurements are within 0.2% or 150 mL, whichever is larger, they are averaged. If the first 2 Vbraw measurements do not meet these criteria, a third Vbraw determination is made and the 2 values that are closest and within the criteria for agreement are averaged. If ambient conditions are relatively stable and the subject is breathing quietly in a relaxed fashion, it is common for the 2 Vbraw measurements to agree within the predefined criteria. If the criteria are not met, the manufacturer suggests that the entire procedure be repeated, including the 2-point calibration step. Situations that could cause nonagreement between individual Vbraw measurements include changing environmental conditions, other environmental impositions (eg, pressure changes in the room due to opening and closing doors or air drafts), or irregular tidal breathing by the subject (eg, yawning, throat clearing, or breath-holding). In the third step, VTG is measured with the use of a procedure similar to that used in standard pulmonary plethysmography, sometimes called the panting maneuver by respiratory physiologists (18).
In contrast with traditional pulmonary plethysmography in which VTG is determined at end-tidal exhalation [ie, functional residual capacity (FRC)], the BOD POD measures VTG at midtidal exhalation. This is done because it is necessary to correct Vbraw for the average amount of air in the lungs during normal tidal breathing, which is reflected by taking the measurement at midtidal exhalation. (A key assumption is that the subject is breathing normally during both the Vbraw measurement and the VTG measurement.) Thus, VTG values derived from the BOD POD should be directly compared with VTG values derived from a pulmonary plethysmograph only after correction for this difference (eg, a difference of
50% of the tidal volume). The VTG measurement procedure begins with the subject breathing room air quietly through a disposable tube and antimicrobial filter while wearing a nose clip. After a few normal tidal breaths, a shutter valve in the airway closes, occluding it for
2 s. During occlusion, the subject makes 2 or 3 gentle quick puffs by alternately contracting and relaxing the diaphragm (ie, the panting maneuver). This leads to small changes in the gas volume of the airways, simultaneously with changes in body volume that are equal but opposite. These volume changes produce pressure changes that are monitored throughout the procedure. Comparison of the magnitudes of the changes in airway and chamber pressure allows calculation of VTG via proprietary methods (Life Measurement, Inc, personal communication, 2001).
Two indicators are used to assess good compliance with the VTG procedure: the figure of merit and airway pressure. The figure of merit is an index that estimates the degree of agreement between pressures measured inside the chamber and in the breathing airway (after scaling and translation). A smaller merit value indicates better agreement. Situations that may lead to poor agreement in these pressure values include lack of a tight lip seal around the tube, failure to wear a nose clip, significant puffing of the cheeks, or contraction of the abdominal muscles. Calculation of the figure of merit is discussed in detail by Dempster and Aitkens (1). If the airway pressure is too high, it may indicate closure of the glottis (ie, a Valsalva maneuver) or significant alveolar compression; both of these factors would result in falsely low VTG values. If the figure of merit is >1.0 or the airway pressure is
35 cm water, the manufacturer recommends that the VTG value be rejected and the procedure be repeated.
The BOD POD also allows for the prediction of VTG. This feature is useful when it is necessary to test many subjects in a short period of time. Predicted VTG was used in some studies when subjects were not able to satisfactorily perform the VTG measurement procedure (1921). The VTG prediction equations currently used by the BOD POD (software version 1.69; Life Measurement, Inc) are based on FRC predictions by Crapo et al (22) from the heights and ages of subjects aged 1791 y and include a further estimate for 50% of tidal volume. The accuracy of predicted VTG and the effect of its use instead of measured VTG on body-composition measurements are discussed below. Body volume in the BOD POD is calculated with the following formula:
![]() | (4) |
| RELIABILITY OF THE BOD POD |
|---|
|
|
|---|
2 measurements. The reliability of the BOD POD in different studies has been reflected by many statistical terms, such as SD, CV, precision (see definition below), intraclass correlation, and mean differences between tests. For the purposes of this review, we chose to limit the discussion of the BOD POD's reliability to only the most consistently reported statistics: SD, CV, and precision (defined as [(SD/n)/
d], where n is the sample size and d is the number of repeated measurements).
Inanimate objects
The reliability of the BOD POD in measuring the body volume of inanimate objects is reported to be excellent. Twenty consecutive measurements of a 50.039-L aluminum cylinder resulted in a mean (±SD) volume of 50.027 ± 0.00127 L and a corresponding CV of 0.025% (1). Results were similar when the experiment was repeated on another day. In another study, repeated measurements over 4 d of smaller volumes ranging from 4.643 to 50.0 L resulted in a mean CV of 0.67 ± 0.70% (20).
Adults
Reliability of percentage body fat
Seven studies reported the reliability of %BF measured by the BOD POD (8, 20, 2832) as CVs; these values are shown in Table 1
. Reported mean within-subject CVs for %BF ranged from 1.7% to 4.5% within a day and from 2.0% to 2.3% between days. These CVs are within the range of those measured previously by HW (8, 28, 33, 34) and DXA (3537). In the 2 studies that examined the within-day repeatability of the BOD POD and HW in the same subjects, CVs did not differ significantly between methods: 1.7% compared with 2.3% in the study by McCrory et al (28) and 3.7% compared with 4.3% in the study by Iwaoka et al (8). Miyatake et al (31) reported similar mean CVs for tests conducted on the same day and on different days (over 3 d). They also reported a mean intertester CV of 4.5% (3 different operators). Examination of the individual data showed that this unexpectedly high CV was due to one abnormal test result in 1 of 5 subjects measured by 1 of 3 operators and may have been an anomaly. [Note that Wells and Fuller (38) suggest routinely conducting 2 tests per subject, enabling detection of infrequent rogue BOD POD results such as these.] Recalculation of the mean CV without the abnormal test result gave a mean intertester CV of 2.7%. Further studies in different populations and with larger numbers of subjects are needed to determine usual values for within-day, between-day, and intertester CVs.
|
Children
Reliability of percentage body fat
The CV for repeated %BF measurements by the BOD POD in children has not been reported. Using the precision statistic described above, Wells and Fuller (38) described the precision of 2 repeat measurements of %BF to be 0.83% for 11 boys (
: 12.6%) and 0.99% for 16 girls (
: 19.7%). Precision was not related to body size because duplicate measurements in 30 men and women with 18.0% and 27.5% BF, respectively, had similar values for precision (0.99% and 0.76% BF).
Reliability of body volume
Dewit et al (39) and Wells et al (7) reported the precision of body-volume measurements in children aged 714 y. Precision of Vbcorr was 0.07 and 0.08 L in the 2 studies, respectively, which was just as good as or slightly better than the precision in adults in the same studies (0.07 and 0.11 L, respectively). Therefore, the precision of body-volume measurements in children and adults was comparable in these 2 studies, despite the smaller body volumes of the children. Similar body-volume precision was reported in another study by the same research group (38). It has been suggested that a relatively small ratio of chamber volume to subject volume would optimize the precision of body-volume measurements (5, 9). For example, Gnaedinger et al (5) calculated a mean ratio of chamber volume to subject volume of 6:1 in their plethysmograph and suggested that a smaller ratio would have improved their data. Assuming a BOD POD test chamber volume of 450 L, the mean ratio of chamber volume to subject volume can be calculated from data provided by Dewit et al (39). Despite the larger ratio for children (14:1 for children compared with 8:1 for adults), the precision of measurements in children and adults was similar. This finding indicates that within the range of body sizes studied thus far, the ratio of chamber volume to subject volume may be irrelevant in the BOD POD.
| VALIDITY OF THE BOD POD RELATIVE TO HW |
|---|
|
|
|---|
86 y of age. BMI ranged from 17 to 40 across the different studies.
|
10) (8, 39). Ethnicity did not contribute significantly to differences between the 2 methods in the 2 studies that had a wide enough range of ethnicities to examine this possibility (20, 28); however, the potential effects of ethnicity were not reported in 2 studies that included both whites and blacks (19, 42).
In the 8 studies that reported regression analysis for the prediction of %BF measured by HW from %BF measured by the BOD POD, the slope of this relation ranged from 0.76 to 0.96; the mean value was much lower than the desired value (1.00) in 4 of these studies (8, 30, 32, 42). Not all of the studies reported whether this slope differed significantly from 1.00; of those that did (19, 28, 30, 32, 40, 43), only 2 studies (19, 30) had slopes that differed significantly from 1.00, as indicated in Table 1
. %BF measured by the BOD POD explained 7894% of the variance in %BF measured by HW, whereas the SEEs reported in 4 of the 12 studies ranged from 1.8% to 2.3% BF. These SEEs are in the excellent to ideal range (
2.5 %BF) according to Lohman (47).
Bland-Altman limits of agreement (mean difference ± 2 SD ranges; 48) and results of trend analysis are also shown in Table 2
. In general, the limits of agreement indicated wide variations in agreement between the BOD POD and HW (range:
916% BF) for individuals, even when group mean differences were small.
Summary of findings in children
Relatively few studies have compared the BOD POD with HW in children (Table 2
). Of the 5 studies that have (7, 20, 21, 39, 44), the age range of the children studied was 619 y. Two of these studies (21, 44) reported that, on average, the BOD POD gave significantly different %BF measurements than did HW. As in the studies in adults, the results were in opposite directions (2.6 compared with -2.9% BF, respectively). The other 3 studies (7, 20, 39) reported that %BF measured by the BOD POD was somewhat higher than that measured by HW (0.61.2% BF), but not significantly so. The slope of the relation for the prediction of %BF by HW from %BF by the BOD POD was 0.86, which was not significantly different from 1.00 in the one study that reported the slope (44). In the 3 studies that reported R2 values, the BOD POD explained between 72% and 87% of the variation in HW (20, 21, 44). The only SEE available (3.3% BF) was from Fields and Goran (46), which was in the good (average) range (47). Finally, Bland-Altman limits of agreement calculated from the study by Fields and Goran (44) were -4.4% to 9.6% BF, indicating large individual variations in the difference between the BOD POD and HW.
Potential reasons for differences between the BOD POD and HW measurements
Theoretically, the BOD POD and HW should give identical values for Db and %BF because both methods are based on the principles of densitometry. Therefore, any differences between the 2 methods can be attributed to differences in either measured body mass (if the same scale is not used for both methods) or body volume. In turn, differences in body volume measured with the BOD POD can be attributed to variations in measurements of Vbraw, SAA, or VTG, and differences in body volume measured with HW can be attributed to variations in body mass measured in water, residual lung volume (VR), or other types of lung volume [eg, lung volume at submersion (7, 39)].
Interlaboratory variation
Interlaboratory variation may be an important factor contributing to the discrepant findings among studies in mean differences between the BOD POD and HW. The extent to which different BOD POD systems vary is not known, although it is hypothesized that BOD POD systems may vary less than do HW systems because there are several variations of HW equipment and methods (eg, different weighing scales and methods for measurement of lung volume) but only one type of BOD POD system manufactured by one company. Although it is possible that the variation in mean differences in the previously mentioned adult studies was random, note that there are 4 pairs of studies, with each of the 4 pairs being from a different laboratory but with each study within a pair being from the same laboratory [(7) and (39), (30) and (32), (40) and (43), and (19) and (42)], and the results within each of the study pairs are more similar than among the study pairs. For example, Dewit et al (39) and Wells et al (7) reported large negative mean differences between the 2 methods (-3.3% and -2.2% BF, respectively), as did Collins et al (19) and Millard-Stafford et al (42) (-2.0% and -2.8% BF, respectively). However, Biaggi et al (30) and Levenhagen et al (32) reported 2 of the smallest and slightly negative mean differences (-0.1% and -0.5% BF, respectively) and Fields et al (40, 43) reported mean differences that were slightly positive, with one value being close to 0 (1.2% and 0.2% BF, respectively). These similar findings within study pairs suggest that interlaboratory variation in protocol, test equipment, or both may contribute importantly to the variation in results observed among studies. To more fully understand the potential effect of interlaboratory variation on measurements of %BF, a multicenter study in which the same subjects are tested in different laboratories is needed.
Test conditions
Measurements with the BOD POD should be made under standard test conditions, ie, subjects should wear minimal but skintight clothing [Lycra (DuPont) or other spandex-style swimsuit and cap], be completely dry, and be in a resting state.
Effects of clothing.
In some of the studies discussed, subjects wore spandex-style shorts (rather than swimsuits, which are recommended by the manufacturer) while undergoing measurements with the BOD POD. This may have contributed to the relatively lower %BF values measured with the BOD POD than with HW in some of the studies (19, 21). In other studies it is unclear what type of clothing was worn during the test protocol. However, it is known that excess clothing causes a significant underestimation of body volume because air that comes in contact with cloth will remain isothermal as pressure fluctuates. The more cloth that is worn, the larger the layer of isothermal air. Because isothermal air is 40% more compressible than is adiabatic air, body volume (Vbraw, and hence Vbcorr) is underestimated and, in turn, Db is overestimated and %BF is underestimated. The effect of excess clothing on %BF measurements with the BOD POD was illustrated in a study by Fields et al (40). No significant difference in %BF was found between women who wore a 1-piece or 2-piece swimsuit. However, %BF was
5% lower in women who wore a hospital gown than in women who wore either type of swimsuit. Although this study illustrated that extreme deviations from the manufacturer's recommended protocol (ie, wearing of loose clothing) had significant effects on estimates of %BF with the BOD POD, it did not address whether slight deviations from the recommended protocol (ie, wearing of spandex-style shorts rather than a swimsuit) would result in acceptable %BF measurements. Until studies are conducted that confirm or deny that alternative clothing is acceptable, it is suggested that the clothing protocol recommended by the manufacturer be rigorously followed.
Effects of testing under nondry, nonresting conditions.
In 2 studies (21, 32), the order in which the 2 methods were conducted was randomized; therefore, in some cases the BOD POD measurements were made first and in others the HW measurements were made first. However, neither of these studies reported whether the subjects were still wet when the BOD POD measurements were made or how much time passed between the 2 tests. Tests with the BOD POD should be conducted only when the subjects are completely dry and in a rested state. Moisture on the body, in the hair, and in the swimsuit will artificially increase body weight. Furthermore, if subjects are recovering from situations that elevate metabolism (eg, exercise or presence in a tank of warm water for 1015 min as part of the HW procedure), breathing patterns are likely to change over time. In BOD POD testing, a key assumption is that breathing patterns are similar during the Vbraw and VTG measurements; however, this will not be the case if subjects are recovering from a physical stress. This situation is somewhat analogous to HW when VR is measured on land and it is assumed that the subject exhales to the same end point both on land and in the water. In both cases, the exact lung volume is not a concern, but the lung volume should be the same during the HW and VR measurement procedures and, likewise, during the Vbraw and VTG measurement procedures.
The effect of testing under nondry, nonresting conditions was illustrated in a preliminary study (DA Fields, GR Hunter, unpublished observations, 2000). When the BOD POD tests were conducted 1015 min after HW, BF was 2.3% lower than it was when measured before HW. In that study, subjects had dried with a towel after HW but their hair and swimsuits were still damp when the BOD POD measurements were made.
VTG prediction
In some studies, predicted VTG was used when some subjects could not adequately perform the panting maneuver to obtain measured VTG (20, 21), whereas in others (7, 39) it was used routinely simply to save time (JKC Wells, personal communication, 2001). McCrory et al (49) reported no significant difference between mean predicted and measured VTG in 50 men and women aged 1856 y (BMI: 1935) with the use of software versions 1.50 and 1.53 (Life Measurement, Inc). Further findings indicated that for 82% of the subjects, the use of predicted VTG resulted in a value within ±2% BF of that calculated with the use of measured VTG. The difference between predicted and measured VTG was not related to the magnitude of VTG (MA McCrory, PA Molé, TD Gomez, KG Dewey, EM Bernauer, unpublished observations, 1998). In contrast with the results of the above study, 2 later studies (software version not reported) showed that, on average, predicted VTG was significantly higher than measured VTG, by 344 mL in 69 collegiate football players (19) and by 190 mL in 37 children aged 1018 y (21). These findings suggest that the BOD POD's current method for prediction of VTG may not be valid for all populations and illustrate the need to report software versions used in all studies to help determine whether different versions of software may be responsible for conflicting findings among studies.
Errors in VTG prediction generally have only a small effect on %BF. As can be deduced from Equation 4
, overestimation of VTG results in overestimation of Vbcorr and, in turn, underestimation of Db and overestimation of %BF. However, because only 40% of VTG is incorporated into the equation to calculate Vbcorr, the magnitude of the overestimation of VTG reported in the above studies should only have caused a very small overestimation of %BF (<1.0%). Note that, in the studies by Collins et al (19) and Lockner et al (21), %BF measured with the BOD POD was significantly lower (rather than higher as would be caused by overprediction of VTG) than that by HW (Table 2
). This finding indicates that other factors (eg, clothing) may have contributed to the observed differences between the BOD POD and HW measurements.
The study by Lockner et al (21) indicates that some children may have more difficulty performing the VTG procedure than do adults; only 69% of their study population adequately performed the VTG measurement procedure in 3 trials. In contrast, Fields and Goran (44), who studied children of a similar age range, obtained VTG measurements in all of their subjects. Valid measurements, on the basis of the standard merit and airway criteria, were obtained in
80% of the children in
3 trials and in
20% of the children in > 3 trials. Two studies conducted in children aged 514 y (39, 45) substituted child-specific prediction equations for FRC (50) and tidal volume (51) to calculate child-specific VTG and body-composition measurements with the BOD POD. In these studies, neither measured nor predicted VTG with the BOD POD was reported; therefore, it is not possible to assess the utility of these child-specific prediction equations. However, Dewit et al (39) noted that when the child-specific equations were used, rather than the adult equations that were incorporated into the BOD POD's software, the mean difference in %BF (calculated as BOD POD - HW) changed from 0.8% to -0.9% BF. This finding suggests that the use of the adult equations overpredicts VTG in children. This is understandable because the BOD POD was originally designed for use in adults. Although usual errors in VTG have only a relatively small influence on %BF as discussed above, more work is needed to improve both the VTG measurement process and the accuracy of VTG prediction in different populations.
Subject sex
Whether the sex of the subject systematically affects the results obtained with the BOD POD or HW remains to be determined. This possibility was first raised by Biaggi et al (30), who reported a significant sex effect and found that the mean difference between the BOD POD and HW was positive for females (1.0 ± 2.5% BF) and negative for males (-1.2 ± 3.1% BF). The same research group also reported findings similar to those of Levenhagen et al (32). However, an additional 2 studies that included both males and females and that examined whether there was a significant effect of sex on the difference between %BF measured by HW and with the BOD POD found no effect of sex (20, 28). Additionally, the studies by Biaggi et al (30) and Levenhagen et al (32) were the 2 of the only 3 studies to report a significant upward trend in the Bland-Altman plot (Table 2
), indicating a negative difference between the BOD POD and HW measurements in leaner subjects and a positive difference in fatter subjects. Millard-Stafford et al (42) also reported a significant upward trend, but did not specifically test for a sex effect, possibly because of the relatively small number of females in their study (10 females and 40 males). Because males tend to be leaner than females, it is difficult to determine whether the significant effect of sex reported in the studies by Biaggi et al (30) and Levenhagen et al (32) were due to an effect of sex per se or to body fatness. Examination of the Bland-Altman plots from these studies showed little overlap in %BF between men and women, although it would be possible in future studies to recruit men and women matched for %BF in an attempt to disentangle the separate influences of %BF and sex.
To further examine the question of whether differences between the BOD POD and HW measurements are dependent on the sex of the subject or on %BF, we plotted the sex-specific means in Bland-Altman fashion for studies in which mean differences were reported separately for males and females (8, 19, 20, 28, 30, 32, 40, 41, 43) (Figure 1
). An upward trend was seen (r = 0.66, P = 0.014), with no overlap in mean %BF between males and females. Therefore, in this analysis, as in the individual studies, it is impossible to separate the confounding effects of subject sex and %BF.
|
It is possible that body hair on humans does not routinely influence the accuracy of body-volume measurements, except in subjects who have an unusually thick layer of body hair and that the men in Biaggi et al's study (30) were unusually hairy. To definitively answer the question of whether body hair significantly influences air-displacement plethysmography measurements of body volume, a study is needed in which these measurements are conducted before and after the body is shaved. This was done to a limited extent in men (52). The men in the study grew beards for 3 wk and then the BOD POD measurements were made before and after the beards were shaved. Although there were large individual variations, mean Vbraw was 157 mL lower and %BF was 0.9% lower after shaving. These findings suggest that for men who have beards, an additional factor could be built into the BOD POD software to adjust for the small effect of additional isothermal air associated with a beard. The findings further suggest that during longitudinal studies in which the BOD POD is used to measure body composition, men should either remain clean shaven or maintain the same amount of facial hair throughout the study.
Subject size
Lockner et al (21) reported that the difference between Db by HW and the BOD POD in children was significantly related to height, body mass, and body surface area, with the largest differences (calculated as BOD POD HW) seen in the smallest children. The suggestion that a smaller ratio of chamber volume to subject volume would improve measurement precision (5, 9) and the above-mentioned findings of Lockner et al suggest that body-volume measurements with the BOD POD may be less accurate in smaller children than in larger children. However, as discussed above, there were other possible confounding factors in Lockner et al's study. Furthermore, the possibility that smaller (younger) children may have had more difficulty complying with the requirements of the HW procedure should not be overlooked.
Fasting compared with postprandial conditions
It is known that gas in the stomach or intestine that is not accounted for leads to an underestimate of Db and an overestimate of %BF when measured by HW. This can be seen in the following formula used to calculate Db by HW:
![]() | (5) |
Errors in VR compared with errors in VTG
The largest contributor to HW variability is the error in measuring VR (59, 60). Depending on the measurement technique used, VR can vary by as much as 300 mL and consequently affect %BF estimates by HW up to
4% (61). When VR is measured on land, errors in HW also arise when there is a mismatch between the amount of air exhaled on land relative to that in water. Friedl et al (62) conducted HW measurements on 3 d in a single week. In one-half of the subjects, a learning effect on the maximal exhalation procedure under water was noted such that over time subjects exhaled a greater amount of air (and thus had a higher mass in water). In contrast, a concomitant change in the VR measurements on land was not observed. This learning effect under water, but not on land, resulted in an average BF measurement that was 1% lower on the third day than on the first day in these subjects. It is possible that simultaneous determination of VR and body mass in water would have alleviated the mismatch observed in Friedl et al's study (62); however, this solution is controversial because some studies suggest that the measurement of VR in water by gas dilution may be underestimated because of pulmonary gas trapping (63, 64). As can be seen by comparing Equations 4 and 5![]()
and as discussed by McCrory et al (49), an error in VTG has less of an effect on measurements made with the BOD POD than an error in VR of the same magnitude has on HW. However, the variability in VTG relative to that in VR has not yet been reported. In addition, the validity of VTG measured with the BOD POD needs to be established. One way to do this is to compare VTG measurements made with the BOD POD with those made by standard pulmonary plethysmography (considered by pulmonary physiologists as the gold standard method for measuring lung volume; 6567), after correction for differences in tidal volume as discussed above.
| VALIDITY OF THE BOD POD RELATIVE TO DXA |
|---|
|
|
|---|
|
Potential reasons for differences between the BOD POD and DXA measurements
Many of the issues discussed above that may have contributed to the differences between the BOD POD and HW measurements also pertain to the observed differences between the BOD POD and DXA measurements, particularly the clothing worn during the BOD POD test, the order in which the different body-composition tests were conducted, and the prediction of VTG. Other factors that also may be at play include limitations in DXA and errors in the assumptions inherent to the 2C models of densitometry, which are used in the BOD POD to calculate %BF. These additional factors and the potential sex effect on differences between the 2 methods are discussed below.
Limitations of the densitometric 2C model
The 2C model for converting Db to %BF divides the body into components of fat mass and fat-free mass. Among the assumptions inherent to this model are that the densities of these 2 components are 0.9 and 1.1 kg/L, respectively, and that these densities do not vary among individuals or populations (47).
Although these numbers appear to be relatively accurate for the general population, it is known that the density of the fat-free mass can differ substantially from 1.1 kg/L for particular groups of individuals, such as the elderly, children, and blacks (47). In individuals and groups in whom density deviates from these assumptions, body-composition estimates based on the 2C model are in error; 2C models can be improved, however. For example, for children and adolescents, who have not yet matured chemically and whose fat-free mass has a greater proportion of water and lesser proportion of mineral, Lohman (46) used average estimates for the water and mineral proportions of the fat-free mass to derive age- and sex-specific 2C equations. These equations, applicable for persons aged 118 y, should improve group estimates of %BF by densitometry (including both the BOD POD and HW) and are preferable to the equations of Siri (24) or Brozek et al (25) for this age group. Data from Roemmich et al (69) support the use of these equations. They showed that in young adolescents, Lohman's equations resulted in a mean %BF estimate that was much closer to that derived by the gold standard 4C model (discussed below) than was %BF calculated with Siri's equation; however, individual errors were still high, as shown by the Bland-Altman limits of agreement.
The density of fat-free mass is influenced in large part by bone mineral because the density of bone is markedly higher than that of other components of the fat-free mass. Koda et al (68), who studied men and women aged 4079 y, reported that the difference between the BOD POD and DXA %BF measurements observed in their study was inversely associated with the bone mineral content expressed as a percentage of the fat-free mass (BMC/%FFM) in both sexes. In other words, the lower the BMC/%FFM, the more positive the difference between the BOD POD and DXA %BF measurements. Of note, there are previous reports that BMC/%FFM is also inversely associated with differences between HW and DXA measurements (7072). The mean BMC/%FFM in the study by Koda et al (68) was relatively low in both sexes [4.4% compared with Brozek et al's (25) estimate of 5.6% in a reference man]; on the basis of this information, it can be predicted from the 2C model that the BOD POD would overestimate %BF in both males and females in this age group. However, mean %BF measured with the BOD POD was significantly higher than that measured with DXA only in males. Furthermore, a large proportion of both sexes had a lower %BF as measured with the BOD POD (
25% of males and 68% of females). This suggests that other factors in addition to a relatively low BMC/%FFM were responsible for the differences in %BF measured with the BOD POD and DXA. Koda et al (68) also reported that for both sexes, the differences between the BOD POD and DXA measurements of %BF (calculated as BOD POD DXA) were positively associated with age, waist circumference, and sagittal diameter; ie, older subjects and those with larger waist circumferences and sagittal diameters had a more positive difference. However, multiple regression analysis was not used to determine whether either of these factors remained significant after accounting for BMC/%FFM. The authors hypothesized that DXA errors due to tissue thickness may have been one reason behind the observed association between the differences between methods and the anthropometric measurements. Their hypothesis, however, is not supported by studies that indicate that DXA overestimates (rather than underestimates) %BF at higher tissue thicknesses (73). Nonetheless, other potential contributors to variations in DXA should be considered, as discussed below.
Limitations of DXA
Because DXA does not rely on the assumptions of a 2C model to provide estimates of body composition and because it does not depend on subject performance, DXA is sometimes regarded as a standard against which other methods can be validated. However, like most other methods for measuring body composition, DXA is also subject to errors (7476). Compared with chemical analysis, Jebb et al (75) reported that DXA underestimated the fat mass of deboned pork shoulders by 58% on average, whereas others reported that DXA overestimated %BF in small animals by an average of
30% (77, 78). Furthermore, %BF, fat mass, fat-free mass, and bone mineral estimates have been shown to vary among brands (7982), test modes [eg, pencil beam compared with fan beam (Hologic, Waltham, MA)] (83), and software versions (84), and by tissue thickness (75, 85). Although the studies comparing the BOD POD and DXA varied in each of these respects (Table 3
), no particular aspect of DXA can be singled out as a likely candidate for the lack of agreement among these studies. However, the different machines, software brands, modes, and subject thicknesses certainly contributed to the variability in the findings.
Subject sex
Of the 4 studies that compared the BOD POD and DXA measurements of %BF in men and women (20, 31, 32, 68), only the study by Koda et al (68) reported a significant effect of sex on the difference between the 2 methods. One possibility for the discrepant findings among studies is that the influence of sex on differences between these methods exists only in older subjects because Koda et al was one of only 2 studies that included older subjects. Although Nuñez et al (20) also studied older subjects, the inclusion of younger subjects as well in their study may have masked any potential effect of age in the older subjects. To better understand whether potential differences between the 2 methods are sex specific, we performed Bland-Altman analysis on group means for studies that reported mean values separately for males and females (19, 20, 29, 32, 41, 43, 68). These data are shown in Figure 2
. There was an overall negative bias of -1.0% BF (P = 0.10) and no trend for differences in %BF between the BOD POD and DXA to vary by sex or with increasing %BF. The underlying reasons for the upward trend shown in Figure 1
(BOD POD compared with HW) but not in Figure 2
(BOD POD compared with DXA) should be addressed in future studies.
|
| VALIDITY OF THE BOD POD RELATIVE TO MULTICOMPARTMENT MODELS |
|---|
|
|
|---|
Summary of findings in adults
The BOD POD compared with the 4C model
Fields et al (43) studied young to middle-aged women with the use of the 4C model of Baumgartner et al (87) as the standard against which to compare the BOD POD. In this model, Db was assessed with the BOD POD, TBW by isotopic dilution, and the bone mineral content by DXA. %BF from the BOD POD was calculated by using the 2C model of Siri (24). Although the mean difference between methods was significant (BOD POD - 4C model = -2.2% BF), the R2 value was high (0.95) and the SEE of 2.3% BF was excellent (47). Furthermore, the 95% CI around the mean difference was relatively narrow in comparison with the wider CIs found when the BOD POD was compared with either HW or DXA in other studies, as summarized in Tables 2 and 3![]()
, ranging from -6.8% to 2.2% BF. Also of interest, the BOD POD and HW performed similarly when both were evaluated against a 4C model. As in other studies that compared the 2C densitometric model obtained from HW with a 4C model (87, 9092), the study by Fields et al (43) found that the aqueous and mineral fractions of the fat-free mass were positively and negatively associated, respectively, with the difference in %BF calculated as BOD POD - 4C model.
More recently, Millard-Stafford et al (42) assessed %BF with the BOD POD and HW with Siri's (24) 2C model and a 4C model (93) in 50 young men and women of mixed ethnicity (35 white, 15 black). Calculations of %BF with the 4C model were determined with the use of Db derived from the BOD POD and from HW. %BF determined with the BOD POD differed significantly from that determined with HW when the 2C model was used, and both values differed significantly from their respective 4C models. That is, the results with the BOD POD 2C model differed significantly from those with the BOD POD 4C model, and the results from the HW 2C model differed significantly from those with the HW 4C model. The highest %BF was found with the HW 4C model (19.3% BF), followed by the BOD POD 4C and HW 2C models (each 17.8% BF) and the BOD POD 2C model (15.0% BF). Both the aqueous fraction of the body and Db were positive predictors of the difference between the BOD POD and 4C model %BF measurements (calculated as BOD POD 4C), and the mineral fraction of the body was a negative predictor. Limits of agreement in Bland-Altman analysis were -6.1% to 3.1% BF for individual differences between measurements made with the BOD POD 4C and HW 4C models; females tended to have positive differences and males tended to have negative differences. Nevertheless, whether subject sex per se was a significant predictor of this difference independent of %BF was not ascertained because of the small proportion of females studied and the minimal overlap in %BF between the sexes. Any potential influence of ethnicity also was not reported.
The BOD POD compared with the 3C model
Collins et al (19) compared %BF measured with the BOD POD [using the Siri (24) equation] with that calculated with a 3C density-mineral model in a subset (n = 20) of their original 69 subjects in whom the BOD POD was compared with HW (Table 2
). The 3C density-mineral model was originally proposed by Lohman (47) in 1992 and later modified by Modlesky et al (94) in 1996. In this case, %BF was calculated with the use of body mineral (derived from the bone mineral content measured by DXA) and Db from HW. Although the group mean difference was small (a difference of -1.8% BF between the BOD POD and the 3C model) and the SEE from the regression analysis was excellent (2.4% BF) per Lohman (47), the regression equation showed poor agreement between the BOD POD and the 3C model (slope = 0.65, R2 = 0.64). No Bland-Altman analyses were presented. One reason for these relatively poor results may be that Db derived from HW was used in the 3C model rather than Db derived from the BOD POD. Although the advantage of this is that it allows an independent assessment of the BOD POD and the 3C model, it may have confounded the comparison because BF measurements were 2.4% low