Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Nov 10;12(1):20230018.
doi: 10.1515/em-2023-0018. eCollection 2023 Jan.

Outliers in nutrient intake data for U.S. adults: national health and nutrition examination survey 2017-2018

Affiliations

Outliers in nutrient intake data for U.S. adults: national health and nutrition examination survey 2017-2018

Sara Burcham et al. Epidemiol Methods. .

Abstract

Objectives: An important step in preparing data for statistical analysis is outlier detection and removal, yet no gold standard exists in current literature. The objective of this study is to identify the ideal decision test using the National Health and Nutrition Examination Survey (NHANES) 2017-2018 dietary data.

Methods: We conducted a secondary analysis of NHANES 24-h dietary recalls, considering the survey's multi-stage cluster design. Six outlier detection and removal strategies were assessed by evaluating the decision tests' impact on the Pearson's correlation coefficient among macronutrients. Furthermore, we assessed changes in the effect size estimates based on pre-defined sample sizes. The data were collected as part of the 2017-2018 24-h dietary recall among adult participants (N=4,893).

Results: Effect estimate changes for macronutrients varied from 6.5 % for protein to 39.3 % for alcohol across all decision tests. The largest proportion of outliers removed was 4.0 % in the large sample size, for the decision test, >2 standard deviations from the mean. The smallest sample size, particularly for alcohol analysis, was most affected by the six decision tests when compared to no decision test.

Conclusions: This study, the first to use 2017-2018 NHANES dietary data for outlier evaluation, emphasizes the importance of selecting an appropriate decision test considering factors such as statistical power, sample size, normality assumptions, the proportion of data removed, effect estimate changes, and the consistency of estimates across sample sizes. We recommend the use of non-parametric tests for non-normally distributed variables of interest.

Keywords: CDC; NHANES; dietary intake; macronutrient; outlier.

PubMed Disclaimer

Conflict of interest statement

Competing interests: The authors state no conflict of interest.

Figures

Figure 1:
Figure 1:
Total participants in 2017–2018 NHANES dietary intake, day 1 available for analysis.
Figure 2:
Figure 2:
Variance of total energy (kcal) intake amongst the NHANES sample population.

References

    1. Lee MS, Carcone AI, Ko L, Kulik N, Ellis DA, Naar S. Managing outliers in adolescent food frequency questionnaire data. J Nutr Educ Behav. 2021;53:28–35. doi: 10.1016/j.jneb.2020.08.002. - DOI - PMC - PubMed
    1. Kwak SK, Kim JH. Statistical data preparation: management of missing values and outliers. Korean J Anesthesiol. 2017;70:407. doi: 10.4097/kjae.2017.70.4.407. - DOI - PMC - PubMed
    1. Thakwalakwa CM, Kuusipalo HM, Maleta KM, Phuka JC, Ashorn P, Cheung YB. The validity of a structured interactive 24-hour recall in estimating energy and nutrient intakes in 15-month-old rural Malawian children: the validity of 24 h recall. Matern Child Nutr. 2012;8:380–9. doi: 10.1111/j.1740-8709.2010.00283.x. - DOI - PMC - PubMed
    1. Maniruzzaman M, Rahman MJ, Al-MehediHasan M, Suri HS, Abedin MM, El-Baz A, et al. Accurate diabetes risk stratification using machine learning: role of missing value and outliers. J Med Syst. 2018;42:92. doi: 10.1007/s10916-018-0940-7. - DOI - PMC - PubMed
    1. Curran-Everett D. Explorations in statistics: the assumption of normality. Adv Physiol Educ. 2017;41:449–53. doi: 10.1152/advan.00064.2017. - DOI - PubMed

LinkOut - more resources