Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Apr 2;11(4):453.
doi: 10.3390/brainsci11040453.

Diagnostic Classification and Biomarker Identification of Alzheimer's Disease with Random Forest Algorithm

Affiliations

Diagnostic Classification and Biomarker Identification of Alzheimer's Disease with Random Forest Algorithm

Minseok Song et al. Brain Sci. .

Abstract

Random Forest (RF) is a bagging ensemble model and has many important advantages, such as robustness to noise, an effective structure for complex multimodal data and parallel computing, and also provides important features that help investigate biomarkers. Despite these benefits, RF is not used actively to predict Alzheimer's disease (AD) with brain MRIs. Recent studies have reported RF's effectiveness in predicting AD, but the test sample sizes were too small to draw any solid conclusions. Thus, it is timely to compare RF with other learning model methods, including deep learning, particularly with large amounts of data. In this study, we tested RF and various machine learning models with regional volumes from 2250 brain MRIs: 687 normal controls (NC), 1094 mild cognitive impairment (MCI), and 469 AD that ADNI (Alzheimer's Disease Neuroimaging Initiative database) provided. Three types of features sets (63, 29, and 22 features) were selected, and classification accuracies were computed with RF, Support vector machine (SVM), Multi-layer perceptron (MLP), and Convolutional neural network (CNN). As a result, RF, MLP, and CNN showed high performances of 90.2%, 89.6%, and 90.5% with 63 features. Interestingly, when 22 features were used, RF showed the smallest decrease in accuracy, -3.8%, and the standard deviation did not change significantly, while MLP and CNN yielded decreases in accuracy of -6.8% and -4.5% with changes in the standard deviation from 3.3% to 4.0% for MLP and 2.1% to 7.0% for CNN, indicating that RF predicts AD more reliably with fewer features. In addition, we investigated the importance of the features that RF provides, and identified the hippocampus, amygdala, and inferior lateral ventricle as the major contributors in classifying NC, MCI, and AD. On average, AD showed smaller hippocampus and amygdala volumes and a larger volume of inferior lateral ventricle than those of MCI and NC.

Keywords: Alzheimer’s disease; Gini index; Random Forest; convolutional neural network; feature importance; machine learning; magnetic resonance imaging; mild-cognitive impairment.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Brain segmentation provided by FreeSurfer software.
Figure 2
Figure 2
Structure of neural network models. MLP is the structure of the multi-layer perceptron model. The features in Input Layer were either 63, 29, or 22. CNN is the structure of the convolutional neural network model. The features in FC Layer 1 are either 1824(32 × 57), 736(32 × 23), or 512(32 × 16).
Figure 3
Figure 3
Various models’ classification accuracy with standard deviation. Pairs that did not differ significantly are marked N.S. (non-significant).
Figure 4
Figure 4
Feature importance obtained from RF classifier. Vertical red line is the chance level calculated. Feature importance is presented for 63, 29, and 22 feature groups.
Figure 5
Figure 5
Regional volume of hippocampus, amygdala, and inferior lateral ventricle for NC, MCI, and AD groups. Statistical significance is marked with * (p < 0.05).
Figure 6
Figure 6
Two representative MRI images from the NC and AD groups. The three areas identified are marked as follows: Hippocampus: Yellow, Amygdala: Cyan, and Inferior Lateral Ventricle: Purple). Notably, the NC’s hippocampus and amygdala are larger and the inferior lateral ventricle is smaller than that of the AD subject.

References

    1. Alzheimer’s Association Alzheimer’s Association 2016 Alzheimer’s Disease Facts and Figures. Alzheimers Dement. 2016;12:459–509. doi: 10.1016/j.jalz.2016.03.001. - DOI - PubMed
    1. Lahiri D.K., Farlow M.R., Greig N.H., Sambamurti K. Current Drug Targets for Alzheimer’s Disease Treatment. Drug Dev. Res. 2002;56:267–281. doi: 10.1002/ddr.10081. - DOI
    1. Morris J.C. The Clinical Dementia Rating (CDR): Current Version and Scoring Rules. Neurology. 1993;43:2412–2414. doi: 10.1212/WNL.43.11.2412-a. - DOI - PubMed
    1. Teng E.L., Chui H.C., Schneider L.S., Metzger L.E. Alzheimer’s Dementia: Performance on the Mini-Mental State Examination. J. Consult. Clin. Psychol. 1987;55:96–100. doi: 10.1037/0022-006X.55.1.96. - DOI - PubMed
    1. Gauthier S., Reisberg B., Zaudig M., Petersen R., Ritchie K., Broich K., Belleville S., Brodaty H., Bebbet D., Chertkow H., et al. Mild Cognitive Impairment. Lancet. 2006;367:1262–1270. doi: 10.1016/S0140-6736(06)68542-5. - DOI - PubMed

LinkOut - more resources