Use of feature importance statistics to accurately predict asthma attacks using machine learning: A cross-sectional cohort study of the US population
- PMID: 37992024
- PMCID: PMC10664888
- DOI: 10.1371/journal.pone.0288903
Use of feature importance statistics to accurately predict asthma attacks using machine learning: A cross-sectional cohort study of the US population
Abstract
Background: Asthma attacks are a major cause of morbidity and mortality in vulnerable populations, and identification of associations with asthma attacks is necessary to improve public awareness and the timely delivery of medical interventions.
Objective: The study aimed to identify feature importance of factors associated with asthma in a representative population of US adults.
Methods: A cross-sectional analysis was conducted using a modern, nationally representative cohort, the National Health and Nutrition Examination Surveys (NHANES 2017-2020). All adult patients greater than 18 years of age (total of 7,922 individuals) with information on asthma attacks were included in the study. Univariable regression was used to identify significant nutritional covariates to be included in a machine learning model and feature importance was reported. The acquisition and analysis of the data were authorized by the National Center for Health Statistics Ethics Review Board.
Results: 7,922 patients met the inclusion criteria in this study. The machine learning model had 55 out of a total of 680 features that were found to be significant on univariate analysis (P<0.0001 used). In the XGBoost model the model had an Area Under the Receiver Operator Characteristic Curve (AUROC) = 0.737, Sensitivity = 0.960, NPV = 0.967. The top five highest ranked features by gain, a measure of the percentage contribution of the covariate to the overall model prediction, were Octanoic Acid intake as a Saturated Fatty Acid (SFA) (gm) (Gain = 8.8%), Eosinophil percent (Gain = 7.9%), BMXHIP-Hip Circumference (cm) (Gain = 7.2%), BMXHT-standing height (cm) (Gain = 6.2%) and HS C-Reactive Protein (mg/L) (Gain 6.1%).
Conclusion: Machine Learning models can additionally offer feature importance and additional statistics to help identify associations with asthma attacks.
Copyright: © 2023 Huang, Huang. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures



Similar articles
-
Exploring Depression and Nutritional Covariates Amongst US Adults using Shapely Additive Explanations.Health Sci Rep. 2023 Oct 20;6(10):e1635. doi: 10.1002/hsr2.1635. eCollection 2023 Oct. Health Sci Rep. 2023. PMID: 37867784 Free PMC article.
-
Use of machine learning to identify risk factors for insomnia.PLoS One. 2023 Apr 12;18(4):e0282622. doi: 10.1371/journal.pone.0282622. eCollection 2023. PLoS One. 2023. PMID: 37043435 Free PMC article.
-
Comparison of model feature importance statistics to identify covariates that contribute most to model accuracy in prediction of insomnia.PLoS One. 2024 Jul 2;19(7):e0306359. doi: 10.1371/journal.pone.0306359. eCollection 2024. PLoS One. 2024. PMID: 38954735 Free PMC article.
-
Interpretable machine learning model to predict surgical difficulty in laparoscopic resection for rectal cancer.Front Oncol. 2024 Feb 6;14:1337219. doi: 10.3389/fonc.2024.1337219. eCollection 2024. Front Oncol. 2024. PMID: 38380369 Free PMC article. Review.
-
Machine learning for prediction of asthma exacerbations among asthmatic patients: a systematic review and meta-analysis.BMC Pulm Med. 2023 Jul 28;23(1):278. doi: 10.1186/s12890-023-02570-w. BMC Pulm Med. 2023. PMID: 37507662 Free PMC article.
Cited by
-
Association of cardiovascular health with COPD (NHANES 2007-2020): mediating potential of lean body mass.Front Endocrinol (Lausanne). 2025 Apr 4;16:1539550. doi: 10.3389/fendo.2025.1539550. eCollection 2025. Front Endocrinol (Lausanne). 2025. PMID: 40255497 Free PMC article.
-
Screening and Validation of Potential Biomarkers of Immune Cells in Childhood Asthma Patients via Mendelian Randomization and Machine Learning.J Inflamm Res. 2025 Feb 21;18:2583-2600. doi: 10.2147/JIR.S498017. eCollection 2025. J Inflamm Res. 2025. PMID: 40008080 Free PMC article.
-
The J-shaped relationship between body roundness index and adult asthma: insights from NHANES 2001-2018.Front Nutr. 2025 Mar 20;12:1516003. doi: 10.3389/fnut.2025.1516003. eCollection 2025. Front Nutr. 2025. PMID: 40181938 Free PMC article.
-
Early Recognition of Secondary Asthma Caused by Lower Respiratory Tract Infection in Children Based on Multi-Omics Signature: A Retrospective Cohort Study.Int J Gen Med. 2024 Dec 14;17:6229-6241. doi: 10.2147/IJGM.S498965. eCollection 2024. Int J Gen Med. 2024. PMID: 39703797 Free PMC article.
-
Use machine learning models to identify and assess risk factors for coronary artery disease.PLoS One. 2024 Sep 6;19(9):e0307952. doi: 10.1371/journal.pone.0307952. eCollection 2024. PLoS One. 2024. PMID: 39240939 Free PMC article.
References
-
- Price D, Wilson AM, Chisholm A, Rigazio A, Burden A, Thomas M, et al.. Predicting frequent asthma exacerbations using blood eosinophil count and other patient data routinely available in clinical practice. J Asthma Allergy. 2016;9:1–12. Epub 20160107. doi: 10.2147/JAA.S97973 ; PubMed Central PMCID: PMC4708874. - DOI - PMC - PubMed
-
- Blakey JD, Price DB, Pizzichini E, Popov TA, Dimitrov BD, Postma DS, et al.. Identifying Risk of Future Asthma Attacks Using UK Medical Record Data: A Respiratory Effectiveness Group Initiative. J Allergy Clin Immunol Pract. 2017;5(4):1015–24 e8. Epub 20161222. doi: 10.1016/j.jaip.2016.11.007 . - DOI - PubMed
-
- Mukherjee M, Stoddart A, Gupta RP, Nwaru BI, Farr A, Heaven M, et al.. The epidemiology, healthcare and societal burden and costs of asthma in the UK and its member nations: analyses of standalone and linked national databases. BMC Med. 2016;14(1):113. Epub 20160829. doi: 10.1186/s12916-016-0657-8 ; PubMed Central PMCID: PMC5002970. - DOI - PMC - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
Research Materials
Miscellaneous