Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Apr 21:13:880997.
doi: 10.3389/fgene.2022.880997. eCollection 2022.

Identifying Key MicroRNA Signatures for Neurodegenerative Diseases With Machine Learning Methods

Affiliations

Identifying Key MicroRNA Signatures for Neurodegenerative Diseases With Machine Learning Methods

ZhanDong Li et al. Front Genet. .

Abstract

Neurodegenerative diseases, including Alzheimer's disease (AD), Parkinson's disease, and many other disease types, cause cognitive dysfunctions such as dementia via the progressive loss of structure or function of the body's neurons. However, the etiology of these diseases remains unknown, and diagnosing less common cognitive disorders such as vascular dementia (VaD) remains a challenge. In this work, we developed a machine-leaning-based technique to distinguish between normal control (NC), AD, VaD, dementia with Lewy bodies, and mild cognitive impairment at the microRNA (miRNA) expression level. First, unnecessary miRNA features in the miRNA expression profiles were removed using the Boruta feature selection method, and the retained feature sets were sorted using minimum redundancy maximum relevance and Monte Carlo feature selection to provide two ranking feature lists. The incremental feature selection method was used to construct a series of feature subsets from these feature lists, and the random forest and PART classifiers were trained on the sample data consisting of these feature subsets. On the basis of the model performance of these classifiers with different number of features, the best feature subsets and classifiers were identified, and the classification rules were retrieved from the optimal PART classifiers. Finally, the link between candidate miRNA features, including hsa-miR-3184-5p, has-miR-6088, and has-miR-4649, and neurodegenerative diseases was confirmed using recently published research, laying the groundwork for more research on miRNAs in neurodegenerative diseases for the diagnosis of cognitive impairment and the understanding of potential pathogenic mechanisms.

Keywords: classification algorithm; expression pattern; feature selection; microRNA; neurodegenerative disease.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Analysis flowchart for this study, which consists of three main steps: 1) miRNA dataset collection; 2) filtering and ranking of miRNA features in the dataset using Boruta, mRMR, and MCFS; 3) determining the essential miRNA features and building the best classifiers and classification rules using IFS method with RF and PART algorithms.
FIGURE 2
FIGURE 2
Venn diagram to show top ten miRNA features obtained by mRMR and MCFS methods. Four miRNA features are commonly identified.
FIGURE 3
FIGURE 3
IFS curves with different number of features in RF and PART under the mRMR and MCFS feature lists. (A). With the mRMR feature list, RF reaches the highest point (MCC = 0.683) with the top 106 features, and PART obtains the highest MCC (0.359) when using the top 72 features. The RF with top 41 features also provides high performance (MCC = 0.587). (B). With the MCFS feature list, RF and PART reach the highest points (MCC = 0.681 and 0.360, respectively) at the top 106 and 89 features. The RF with top 31 features also yields high performance (MCC = 0.575).
FIGURE 4
FIGURE 4
Performance of the key RF and PART classifiers on each class based on mRMR (A) and MCFS (B) feature lists. AD, VaD, DLB, MCI, and NC stand for Alzheimer’s disease, Vascular dementia, Dementia with Lewy bodies, Mild cognitive impairment and Normal control, respectively.
FIGURE 5
FIGURE 5
Number of rules generated by the optimal PART classifiers based on mRMR and MCFS feature lists. AD, VaD, DLB, MCI, and NC stand for Alzheimer’s disease, Vascular dementia, Dementia with Lewy bodies, Mild cognitive impairment and Normal control, respectively.
FIGURE 6
FIGURE 6
Performance of the key RF and PART classifiers without SMOTE. (A). Classifiers obtained by using mRMR feature list; (B). Classifiers obtained by using MCFS feature list. AD, VaD, DLB, MCI, and NC stand for Alzheimer’s disease, Vascular dementia, Dementia with Lewy bodies, Mild cognitive impairment and Normal control, respectively.
FIGURE 7
FIGURE 7
Venn diagram to show top 41 miRNA features obtained by mRMR method and top 31 miRNA features obtained by MCFS method. Nineteen miRNA features are commonly identified.

Similar articles

Cited by

References

    1. Asanomi Y., Shigemizu D., Akiyama S., Sakurai T., Ozaki K., Ochiya T., et al. (2021). Dementia Subtype Prediction Models Constructed by Penalized Regression Methods for Multiclass Classification Using Serum microRNA Expression Data. Sci. Rep. 11, 20947. 10.1038/s41598-021-00424-1 - DOI - PMC - PubMed
    1. Bekris L. M., Lutz F., Montine T. J., Yu C. E., Tsuang D., Peskind E. R., et al. (2013). MicroRNA in Alzheimer's Disease: an Exploratory Study in Brain, Cerebrospinal Fluid and Plasma. Biomarkers 18, 455–466. 10.3109/1354750x.2013.814073 - DOI - PMC - PubMed
    1. Breiman L. (2001). Random Forests. Machine Learn. 45, 5–32. 10.1023/a:1010933404324 - DOI
    1. Callis T. E., Pandya K., Seok H. Y., Tang R.-H., Tatsuguchi M., Huang Z.-P., et al. (2009). MicroRNA-208a Is a Regulator of Cardiac Hypertrophy and Conduction in Mice. J. Clin. Invest. 119, 2772–2786. 10.1172/jci36154 - DOI - PMC - PubMed
    1. Chawla N. V., Bowyer K. W., Hall L. O., Kegelmeyer W. P. (2002). SMOTE: Synthetic Minority Over-sampling Technique. jair 16, 321–357. 10.1613/jair.953 - DOI

LinkOut - more resources