Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2023 Mar 27;13(1):4979.
doi: 10.1038/s41598-023-30904-5.

Exploiting machine learning models to identify novel Alzheimer's disease biomarkers and potential targets

Affiliations
Review

Exploiting machine learning models to identify novel Alzheimer's disease biomarkers and potential targets

Hind Alamro et al. Sci Rep. .

Abstract

We still do not have an effective treatment for Alzheimer's disease (AD) despite it being the most common cause of dementia and impaired cognitive function. Thus, research endeavors are directed toward identifying AD biomarkers and targets. In this regard, we designed a computational method that exploits multiple hub gene ranking methods and feature selection methods with machine learning and deep learning to identify biomarkers and targets. First, we used three AD gene expression datasets to identify 1/ hub genes based on six ranking algorithms (Degree, Maximum Neighborhood Component (MNC), Maximal Clique Centrality (MCC), Betweenness Centrality (BC), Closeness Centrality, and Stress Centrality), 2/ gene subsets based on two feature selection methods (LASSO and Ridge). Then, we developed machine learning and deep learning models to determine the gene subset that best distinguishes AD samples from the healthy controls. This work shows that feature selection methods achieve better prediction performances than the hub gene sets. Beyond this, the five genes identified by both feature selection methods (LASSO and Ridge algorithms) achieved an AUC = 0.979. We further show that 70% of the upregulated hub genes (among the 28 overlapping hub genes) are AD targets based on a literature review and six miRNA (hsa-mir-16-5p, hsa-mir-34a-5p, hsa-mir-1-3p, hsa-mir-26a-5p, hsa-mir-93-5p, hsa-mir-155-5p) and one transcription factor, JUN, are associated with the upregulated hub genes. Furthermore, since 2020, four of the six microRNA were also shown to be potential AD targets. To our knowledge, this is the first work showing that such a small number of genes can distinguish AD samples from healthy controls with high accuracy and that overlapping upregulated hub genes can narrow the search space for potential novel targets.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
The study workflow consists of two key paths, via ranking algorithms and feature selection methods.
Figure 2
Figure 2
An ImaGEO generated heatmap of the top-100 DEGs (Red represents the relative upregulated gene expression; green represents the relative downregulated gene expression; black represents no significant change in gene expression).
Figure 3
Figure 3
The prediction performance of the list of DEGs selected by (a) LASSO (L) and (b) Ridge (R) regression algorithms at multiple thresholds for the importance scores.
Figure 4
Figure 4
Comparing the AUC results of different genes lists. The blue bar is for the whole DEGs list (924 genes), the red bar is the overlapping list of LASSO and Ridge (5 genes). The yellow bar is the overlapping list of the six ranking algorithms (28 genes), and the green is the overlapping list of the ranking algorithms excluding MCC (53 genes).
Figure 5
Figure 5
The top-10 enriched diseases retrieved from DisGeNET for upregulated genes in the different gene sets, including the DEGs, LASSO, Ridge, and the overlapped/common hub and feature selection genes.
Figure 6
Figure 6
Network generated by miRNet. It shows six important miRNAs, represented by blue squares, and highlights JUN as the critical transcription factor.

Similar articles

Cited by

References

    1. Alzheimer's A. Alzheimer's disease facts and figures. Alzheimers. Dement. 2013;9:208–245. doi: 10.1016/j.jalz.2013.02.003. - DOI - PubMed
    1. Alzheimer's A. Alzheimer's disease facts and figures. Alzheimers. Dement. 2016;12:459–509. doi: 10.1016/j.jalz.2016.03.001. - DOI - PubMed
    1. Long JM, Holtzman DM. Alzheimer disease: An update on pathobiology and treatment strategies. Cell. 2019;179:312–339. doi: 10.1016/j.cell.2019.09.001. - DOI - PMC - PubMed
    1. Cummings JL, Morstorf T, Zhong K. Alzheimer's disease drug-development pipeline: few candidates, frequent failures. Alzheimers. Res. Ther. 2014;6:37. doi: 10.1186/alzrt269. - DOI - PMC - PubMed
    1. Wang J, Gu BJ, Masters CL, Wang Y-J. A systemic view of Alzheimer disease - insights from amyloid-β metabolism beyond the brain. Nat. Rev. Neurol. 2017;13:612–623. doi: 10.1038/nrneurol.2017.111. - DOI - PubMed

Publication types