Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Clinical Trial
. 2020 Sep 21;21(18):6914.
doi: 10.3390/ijms21186914.

Classifications of Neurodegenerative Disorders Using a Multiplex Blood Biomarkers-Based Machine Learning Model

Affiliations
Clinical Trial

Classifications of Neurodegenerative Disorders Using a Multiplex Blood Biomarkers-Based Machine Learning Model

Chin-Hsien Lin et al. Int J Mol Sci. .

Abstract

Easily accessible biomarkers for Alzheimer's disease (AD), Parkinson's disease (PD), frontotemporal dementia (FTD), and related neurodegenerative disorders are urgently needed in an aging society to assist early-stage diagnoses. In this study, we aimed to develop machine learning algorithms using the multiplex blood-based biomarkers to identify patients with different neurodegenerative diseases. Plasma samples (n = 377) were obtained from healthy controls, patients with AD spectrum (including mild cognitive impairment (MCI)), PD spectrum with variable cognitive severity (including PD with dementia (PDD)), and FTD. We measured plasma levels of amyloid-beta 42 (Aβ42), Aβ40, total Tau, p-Tau181, and α-synuclein using an immunomagnetic reduction-based immunoassay. We observed increased levels of all biomarkers except Aβ40 in the AD group when compared to the MCI and controls. The plasma α-synuclein levels increased in PDD when compared to PD with normal cognition. We applied machine learning-based frameworks, including a linear discriminant analysis (LDA), for feature extraction and several classifiers, using features from these blood-based biomarkers to classify these neurodegenerative disorders. We found that the random forest (RF) was the best classifier to separate different dementia syndromes. Using RF, the established LDA model had an average accuracy of 76% when classifying AD, PD spectrum, and FTD. Moreover, we found 83% and 63% accuracies when differentiating the individual disease severity of subgroups in the AD and PD spectrum, respectively. The developed LDA model with the RF classifier can assist clinicians in distinguishing variable neurodegenerative disorders.

Keywords: Alzheimer’s disease; Parkinson’s disease; biomarkers; classification; deep learning model; frontotemporal dementia; linear discriminant analysis; multivariate imputation by chained equations; neurodegenerative disorders.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Individual plasma biomarker levels of normal controls and in different disease groups. The plasma Aβ42 (a) and Aβ40 (b) levels significantly increased in patients with AD and FTD, especially when compared to the normal control and other disease groups (p < 0.01). The plasma total tau (c) and p-tau181 (d) level significantly increased in patients with FTD and then followed by AD, MCI, and PDD group (p < 0.01). The plasma α-synuclein (e) levels were highest in the PDD group than other disease groups and controls. The mean ± one standard deviation (SD) was illustrated as horizontal lines in each disease group. * p < 0.05; ** p < 0.01.
Figure 2
Figure 2
The correlation matrix between each biomarker from four groups, including healthy controls, as well as patients with Alzheimer’s disease (AD) spectrum, Parkinson’s Disease (PD) spectrum, and Frontotemporal Dementia (FTD). The upper triangular part of the matrix is the correlation coefficients between any two biomarkers. The lower triangular part of the matrix is the scattered-plot graphs of any two biomarkers. The main diagonal part of the matrix is the distribution graphs of each biomarker. log(α) is the log of α-synuclein. * p < 0.05; *** p < 0.001.
Figure 3
Figure 3
The 3D scatter plot demonstrates that LDA separated sample points of different dementia groups. AD: AD spectrm, PD: PD spectrum.
Figure 4
Figure 4
The correlation matrix between each biomarker from three groups, including normal controls, patients with mild cognitive impairment (MCI), and AD. The upper triangular part of the matrix is the correlation coefficients between any two biomarkers. The lower triangular part of the matrix is the scattered-plot graphs of any two biomarkers. The main diagonal part of the matrix is the distribution graphs of each biomarker. log(α) is the log of α-synuclein. * p < 0.05; *** p < 0.001.
Figure 5
Figure 5
The 2D scatter plot demonstrates that LDA separated sample points of the AD spectrum.
Figure 6
Figure 6
The correlation matrix between each biomarker from four groups, including normal controls, PD-NC, PD-MCI, and PDD. The lower triangular part of the matrix is the scattered-plot graphs of any two biomarkers. The main diagonal part of the matrix is the distribution graphs of each biomarker. log(α) is the log of α-synuclein. ** p < 0.01; *** p < 0.001.
Figure 7
Figure 7
The 3D scatter plot demonstrates that LDA separated sample points of different subgroups of the PD spectrum.
Figure 8
Figure 8
The estimated accuracy for models classifying different dementia syndrome (a), AD spectrum (b), and PD spectrum (c). LDA1: the first function of the linear combinations based on the original five biomarker features using linear discriminant analysis; LDA2: the second function of the linear combinations based on the original five biomarker features using linear discriminant analysis; LDA3: the third function of the linear combinations based on the original five biomarker features using linear discriminant analysis.
Figure 9
Figure 9
Data processing flow chart in this study; MICE: multivariate imputation by the chained equation; LOOCV: leave-one-out cross-validation; NB: naïve Bayes; kNN: k-nearest neighbor; SVM: support vector machine; C4.5: C4.5 decision tree; CART: classification and regression trees; RF: random forest; LogReg: logistic regression.

Similar articles

Cited by

References

    1. Johnson I.P. Age-related neurodegenerative disease research needs aging models. Front. Aging Neurosci. 2015;7:168. doi: 10.3389/fnagi.2015.00168. - DOI - PMC - PubMed
    1. McKhann G., Drachman D., Folstein M., Katzman R., Price D., Stadlan E.M. Clinical diagnosis of Alzheimer’s disease: Report of the NINCDS-ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer’s Disease. Neurology. 1984;34:939–944. doi: 10.1212/WNL.34.7.939. - DOI - PubMed
    1. Braak H., Del Tredici K., Rub U., de Vos R.A., Jansen Steur E.N., Braak E. Staging of brain pathology related to sporadic Parkinson’s disease. Neurobiol. Aging. 2003;24:197–211. doi: 10.1016/S0197-4580(02)00065-9. - DOI - PubMed
    1. Emre M., Aarsland D., Brown R., Burn D.J., Duyckaerts C., Mizuno Y., Broe G.A., Cummings J., Dickson D.W., Gauthier S., et al. Clinical diagnostic criteria for dementia associated with Parkinson’s disease. Mov. Disord. 2007;22:1689–1707. doi: 10.1002/mds.21507. quiz 1837. - DOI - PubMed
    1. Tolosa E., Wenning G., Poewe W. The diagnosis of Parkinson’s disease. Lancet Neurol. 2006;5:75–86. doi: 10.1016/S1474-4422(05)70285-4. - DOI - PubMed

Publication types