Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Aug 2;114(2):661-668.
doi: 10.1093/ajcn/nqab074.

The perils of using predicted values in place of observed covariates: an example of predicted values of body composition and mortality risk

Affiliations

The perils of using predicted values in place of observed covariates: an example of predicted values of body composition and mortality risk

Gregory Haber et al. Am J Clin Nutr. .

Abstract

Background: Several studies have assessed the relation of body composition to health outcomes by using values of fat and lean mass that were not measured but instead were predicted from anthropometric variables such as weight and height. Little research has been done on how substituting predicted values for measured covariates might affect analytic results.

Objectives: We aimed to explore statistical issues causing bias in analytical studies that use predicted rather than measured values of body composition.

Methods: We used data from 8014 adults ≥40 y old included in the 1999-2006 US NHANES. We evaluated the relations of predicted total body fat (TF) and predicted total body lean mass (TLM) with all-cause mortality. We then repeated the evaluation using measured body composition variables from DXA. Quintiles and restricted cubic splines allowed flexible modeling of the HRs in unadjusted and multivariable-adjusted Cox regression models.

Results: The patterns of associations between body composition and all-cause mortality depended on whether body composition was defined using predicted values or DXA measurements. The largest differences were observed in multivariable-adjusted models which mutually adjusted for both TF and TLM. For instance, compared with analyses based on DXA measurements, analyses using predicted values for males overestimated the HRs for TF in splines and in quintiles [HRs (95% CIs) for fourth and fifth quintiles compared with first quintile, DXA: 1.22 (0.88, 1.70) and 1.46 (0.99, 2.14); predicted: 1.86 (1.29, 2.67) and 3.24 (2.02, 5.21)].

Conclusions: It is important for researchers to be aware of the potential pitfalls and limitations inherent in the substitution of predicted values for measured covariates in order to draw proper conclusions from such studies.

Keywords: Berkson error; all-cause mortality; bias; body composition; epidemiology; prediction equations; research methods.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1
Density plots showing the distribution of TF and TLM in 1999–2004 NHANES data. Results are stratified by gender (male: n = 3142; female: n = 3236). SR, self-reported; TF, total body fat; TLM, total body lean mass.
FIGURE 2
FIGURE 2
Association of TF, predicted TF, and predicted TF based on SR variables with all-cause mortality. (A) An unadjusted model; (B) a multiple-variable model adjusting for age, height, ethnicity, alcohol consumption, physical activity, and smoking status; (C) an additional multiple-variable model mutually adjusting for total body lean mass as a categorical variable in quintiles (in addition to the covariates in model B). TF was modeled using a restricted cubic spline fit with knots at the 5th, 35th, 50th, 65th, and 95th centiles. Results are stratified by gender (male: n = 3142; female: n = 3236). SR, self-reported; TF, total body fat.
FIGURE 3
FIGURE 3
Association of TLM, predicted TLM, and predicted TLM based on SR variables with all-cause mortality. (A) An unadjusted model; (B) a multiple-variable model adjusting for age, height, ethnicity, alcohol consumption, physical activity, and smoking status; (C) an additional multiple-variable model mutually adjusting for total body fat as a categorical variable in quintiles (in addition to the covariates in model B). TLM was modeled using a restricted cubic spline fit with knots at the 5th, 35th, 50th, 65th, and 95th centiles. Results are stratified by gender (male: n = 3142; female: n = 3236). SR, self-reported; TLM, total body lean mass.
FIGURE 4
FIGURE 4
Plot showing transportability biases for TF and TLM when predicting these values for adults aged 50–75 y. One line shows a fitted prediction function fit using adults aged 18–75 y from the 1999–2000 NHANES (Cohort 1: males: n = 1368; females: n = 1635) and the other a fitted prediction function for adults restricted to ages 50–75 y from the 2005–2006 NHANES (Cohort 2: males: n = 353; females: n = 330). TF, total body fat; TLM, total body lean mass.

Comment in

Similar articles

Cited by

References

    1. Lee DH, de Rezende LFM, Eluf-Neto J, Wu K, Tabung FK, Giovannucci EL. Association of type and intensity of physical activity with plasma biomarkers of inflammation and insulin response. Int J Cancer. 2019;145(2):360–9. - PMC - PubMed
    1. Lee DH, Keum N, Hu FB, Orav EJ, Rimm EB, Willett WC, Giovannucci EL. Comparison of the association of predicted fat mass, body mass index, and other obesity indicators with type 2 diabetes risk: two large prospective studies in US men and women. Eur J Epidemiol. 2018;33(11):1113–23. - PubMed
    1. Hanyuda A, Lee DH, Ogino S, Wu K, Giovannucci EL. Long-term status of predicted body fat percentage, body mass index and other anthropometric factors with risk of colorectal carcinoma: two large prospective cohort studies in the US. Int J Cancer. 2020;146(9):2383–93. - PMC - PubMed
    1. Jeong S-M, Lee DH, Giovannucci EL. Predicted lean body mass, fat mass and risk of lung cancer: prospective US cohort study. Eur J Epidemiol. 2019;34(12):1151–60. - PMC - PubMed
    1. Lee DH, Keum N, Hu FB, Orav EJ, Rimm EB, Willett WC, Giovannucci EL. Predicted lean body mass, fat mass, and all cause and cause specific mortality in men: prospective US cohort study. BMJ. 2018;362:k2575. - PMC - PubMed

Publication types