Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 May 18;3(3):100282.
doi: 10.1016/j.jacig.2024.100282. eCollection 2024 Aug.

Predictive genetic panel for adult asthma using machine learning methods

Affiliations

Predictive genetic panel for adult asthma using machine learning methods

Luciano Gama da Silva Gomes et al. J Allergy Clin Immunol Glob. .

Abstract

Background: Asthma is a chronic inflammatory disease of the airways that is heterogeneous and multifactorial, making its accurate characterization a complex process. Therefore, identifying the genetic variations associated with asthma and discovering the molecular interactions between the omics that confer risk of developing this disease will help us to unravel the biological pathways involved in its pathogenesis.

Objective: We sought to develop a predictive genetic panel for asthma using machine learning methods.

Methods: We tested 3 variable selection methods: Boruta's algorithm, the top 200 genome-wide association study markers according to their respective P values, and an elastic net regression. Ten different algorithms were chosen for the classification tests. A predictive panel was built on the basis of joint scores between the classification algorithms.

Results: Two variable selection methods, Boruta and genome-wide association studies, were statistically similar in terms of the average accuracies generated, whereas elastic net had the worst overall performance. The predictive genetic panel was completed with 155 single-nucleotide variants, with 91.18% accuracy, 92.75% sensitivity, and 89.55% specificity using the support vector machine algorithm. The markers used range from known single-nucleotide variants to those not previously described in the literature. Our study shows potential in creating genetic prediction panels with tailored penalties per marker, aiding in the identification of optimal machine learning methods for intricate results.

Conclusions: This method is able to classify asthma and nonasthma effectively, proving its potential utility in clinical prediction and diagnosis.

Keywords: Asthma; genetics; machine learning; prediction; single-nucleotide variants.

PubMed Disclaimer

Conflict of interest statement

This study was supported by Brazilian funding agencies (FAPESB/CNPq 009/2014, process no. 8305/2014; FAPESB/CNPq-08/2014, process no. 8665/2014; MCTI/CNPq 14/2023 - 442337/2023-0; CAPES/PRINT, process no. 88887.911599/2023-00; and ERC/CONFAP/CNPQ–INT0002/2023, process no. 1125/2023), CNPq 306705/2022. Disclosure of potential conflict of interest: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this article.

Figures

Fig 1
Fig 1
Cumulative sequential analysis, adding one feature at a time from highest to lowest importance score. Even for algorithms with great potential, dozens of markers are necessary for predictive accuracy to be powerful in asthma outcomes. Penalized and nonpenalized systems are similar, but in some cases, such as K-nearest neighbor and C5.0, prediction becomes satisfactory with fewer markers. The vertical lines in each graph represent the first appearance of 80% predictive accuracy (black lines = original data; red lines = penalized scores). Horizontal lines: dotted = 80%, dashed = 85%, filled = 90%.
Fig 2
Fig 2
ROC curves of the genetic panel predictions. Compared with other algorithms, the ROC curve and AUC generated by the SVM model were superior. AUC, Area under the curve; KNN, K-nearest neighbor; ROC, receiver-operating characteristic.

References

    1. Global Initiative for Asthma (GINA) Global Strategy for Asthma Management and Prevention. http://www.ginasthma.org Available at:
    1. Carr T.F., Zeki A.A., Kraft M. Eosinophilic and noneosinophilic asthma. Am J Respir Crit Care Med. 2018;197:22–37. - PMC - PubMed
    1. Schoettler N., Strek M.E. Recent advances in severe asthma: from phenotypes to personalized medicine. Chest. 2020;157:516–528. - PMC - PubMed
    1. Augustine T., Al-Aghbar M.A., Al-Kowari M., Espino-Guarch M., van Panhuys N. Asthma and the missing heritability problem: necessity for multiomics approaches in determining accurate risk profiles. Front Immunol. 2022;13 - PMC - PubMed
    1. Kuruvilla M.E., Lee F.E.H., Lee G.B. Understanding asthma phenotypes, endotypes, and mechanisms of disease. Clin Rev Allergy Immunol. 2019;56:219–233. - PMC - PubMed

LinkOut - more resources