Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Sep 28:2022:2679050.
doi: 10.1155/2022/2679050. eCollection 2022.

A Methylation Diagnostic Model Based on Random Forests and Neural Networks for Asthma Identification

Affiliations

A Methylation Diagnostic Model Based on Random Forests and Neural Networks for Asthma Identification

Dong-Dong Li et al. Comput Math Methods Med. .

Abstract

Background: Asthma significantly impacts human life and health as a chronic disease. Traditional treatments for asthma have several limitations. Artificial intelligence aids in cancer treatment and may also accelerate our understanding of asthma mechanisms. We aimed to develop a new clinical diagnosis model for asthma using artificial neural networks (ANN).

Methods: Datasets (GSE85566, GSE40576, and GSE13716) were downloaded from Gene Expression Omnibus (GEO) and identified differentially expressed CpGs (DECs) enriched by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis. Random forest (RF) and ANN algorithms further identified gene characteristics and built clinical models. In addition, two external validation datasets (GSE40576 and GSE137716) were used to validate the diagnostic ability of the model.

Results: The methylation analysis tool (ChAMP) considered DECs that were up-regulated (n =121) and down-regulated (n =20). GO results showed enrichment of actin cytoskeleton organization and cell-substrate adhesion, shigellosis, and serotonergic synapses. RF (random forest) analysis identified 10 crucial DECs (cg05075579, cg20434422, cg03907390, cg00712106, cg05696969, cg22862094, cg11733958, cg00328720, and cg13570822). ANN constructed the clinical model according to 10 DECs. In two external validation datasets (GSE40576 and GSE137716), the Area Under Curve (AUC) for GSE137716 was 1.000, and AUC for GSE40576 was 0.950, confirming the reliability of the model.

Conclusion: Our findings provide new methylation markers and clinical diagnostic models for asthma diagnosis and treatment.

PubMed Disclaimer

Conflict of interest statement

This research does not include any research conducted by any author on human participants or animals. The authors declare no competing interests.

Figures

Figure 1
Figure 1
The workflow of our study.
Figure 2
Figure 2
Methylation landscape of GSE85566. (a) Heat map of the top 1000 most divergent CpGs; the gradient from dark blue to yellow represented the change in expression level. (b) Results of differential expression analysis of volcano plots (asthma vs healthy). The X-axis was log(deltaBeta), and the ordinate was -log10(adj.P.Val) value; DOWN (red): DECs with down-regulated expression, UP (gray): DECs with up-regulated expression, NOT (dark blue): meaningless. (c) Heat map of DECs. Dark blue to light blue means high to low expression, green represents asthma samples, red represented healthy samples, and a clustering tree aggregated similar samples together.
Figure 3
Figure 3
GO and KEGG analysis results. (a) GO analysis (including molecular function, cellular component, and biological process). (b) KEGG analysis.
Figure 4
Figure 4
(a) The effect of the number of decision trees on the error rate. The X-axis was the number of decision trees, and the Y-axis was the error rate. The increase of trees did not affect the reduction of the error rate. (b) After the variables were entered into the random forest, the top 10 DECs were listed in order of importance according to MeanDecreaseAccuracy (left) and MeanDecreaseGini (right). (c) Hierarchical clustering results of 10 DECs in GSE85566 dataset; dark colors represent high expression, light colors represent low expression, the red band above the heat map represents normal samples, and green represents asthma samples.
Figure 5
Figure 5
Neural network topology. (a) Artificial neural network visualization results for the train dataset. (b) ROC results (cg05075579, cg20434422, cg03907390, cg00712106, cg05696969, cg22862094, cg11733958, cg00328720, cg13570892, and cg03325522) analysis visualization for 10-fold cross-validation method.
Figure 6
Figure 6
Two datasets determine neural network classification efficiency. (a) ROC result of GSE137716 dataset. (b) ROC result of GSE40576. The points marked on ROC curve are the optimal threshold points, and the values in parentheses indicate sensitivity and specificity. The AUC value was the Area Under ROC Curve, X-axis was the specificity, and Y-axis was the sensitivity. The optimal threshold was marked at the inflection point, and sensitivity and specificity were listed in parentheses.

Similar articles

Cited by

References

    1. Mims J. W. Asthma: definitions and pathophysiology. International Forum of Allergy & Rhinology . 2015;5(Supplement 1):S2–S6. doi: 10.1002/alr.21609. - DOI - PubMed
    1. Ntontsi P., Photiades A., Zervas E., Xanthou G., Samitas K. Genetics and epigenetics in asthma. International Journal of Molecular Sciences . 2021;22(5):p. 2412. doi: 10.3390/ijms22052412. - DOI - PMC - PubMed
    1. Miller R. L., Grayson M. H., Strothman K. Advances in asthma: new understandings of asthma’s natural history, risk factors, underlying mechanisms, and clinical management. The Journal of Allergy and Clinical Immunology . 2021;148(6):1430–1441. doi: 10.1016/j.jaci.2021.10.001. - DOI - PubMed
    1. Pelaia C., Crimi C., Vatrella A., Tinello C., Terracciano R., Pelaia G. Molecular targets for biological therapies of severe asthma. Frontiers in Immunology . 2020;11, article 603312 doi: 10.3389/fimmu.2020.603312. - DOI - PMC - PubMed
    1. Weiss S. T. Emerging mechanisms and novel targets in allergic inflammation and asthma. Genome Medicine . 2017;9(1):p. 107. doi: 10.1186/s13073-017-0501-6. - DOI - PMC - PubMed