Multimodal deep learning models for early detection of Alzheimer's disease stage

Janani Venugopalan et al. Sci Rep. 2021 Feb 5;11(1):3254.
doi: 10.1038/s41598-020-74399-w.

Abstract

Most current Alzheimer's disease (AD) and mild cognitive impairment (MCI) studies use a single data modality to make predictions such as AD stage. The fusion of multiple data modalities can provide a holistic view of AD staging analysis. Thus, we use deep learning (DL) to integrally analyze imaging (magnetic resonance imaging (MRI)), genetic (single nucleotide polymorphisms (SNPs)), and clinical test data to classify patients into AD, MCI, and controls (CN). We use stacked denoising autoencoders to extract features from clinical and genetic data, and 3D convolutional neural networks (CNNs) for imaging data. We also develop a novel data interpretation method to identify the top-performing features learned by the deep models, using clustering and perturbation analysis. Using the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset, we demonstrate that deep models outperform shallow models, including support vector machines, decision trees, random forests, and k-nearest neighbors. In addition, we demonstrate that integrating multi-modality data outperforms single-modality models in terms of accuracy, precision, recall, and mean F1 scores. Our models identified the hippocampus, the amygdala, and the Rey Auditory Verbal Learning Test (RAVLT) as the top distinguishing features, consistent with the known AD literature.


Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
(a) Description of ADNI data. Clinical data consist of demographics, neurological exams and assessments, medications, imaging volumes, and biomarkers. (b) Number of patients by modality and disease stage (CN controls, MCI mild cognitive impairment, AD Alzheimer’s disease). 220 patients have all three data modalities, 588 have SNP and EHR data, 283 have imaging and EHR data, and the remaining patients have EHR data only.
Figure 2
Deep model for data integration compared with shallow models of data integration. (a) Feature-level integration with shallow models, where the features are concatenated before being passed into the shallow models. (b) Deep intermediate-feature-level integration, where the original features are transformed separately using deep models prior to integration and prediction. (c) Decision-level integration, where voting is performed over the decisions of the individual classifiers. In this study, we compare the performance of deep intermediate-level integration against shallow feature- and decision-level integration for the prediction of Alzheimer’s stages.
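For concreteness, the following minimal Python sketch illustrates the two shallow baselines in panels (a) and (c) using scikit-learn. Random arrays stand in for the real modality features (the dimensions mirror the paper's 21 brain regions, 1680 clinical features, and 500 SNPs); the array names and classifier choices are illustrative assumptions, not the paper's configuration.

```python
# Illustrative sketch only: random data stands in for the real features.
import numpy as np
from scipy.stats import mode
from sklearn.ensemble import RandomForestClassifier
from sklearn.svm import SVC

rng = np.random.default_rng(0)
n = 200
X_img = rng.normal(size=(n, 21))    # imaging region volumes
X_ehr = rng.normal(size=(n, 1680))  # clinical/EHR features
X_snp = rng.normal(size=(n, 500))   # SNP features
y = rng.integers(0, 3, size=n)      # 0 = CN, 1 = MCI, 2 = AD

# (a) Feature-level integration: concatenate raw features, one shallow classifier.
feature_level = RandomForestClassifier().fit(np.hstack([X_img, X_ehr, X_snp]), y)

# (c) Decision-level integration: one classifier per modality, then majority vote.
clfs = [SVC().fit(X, y) for X in (X_img, X_ehr, X_snp)]
votes = np.stack([clf.predict(X) for clf, X in zip(clfs, (X_img, X_ehr, X_snp))])
y_vote = mode(votes, axis=0).mode.ravel()
```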
Figure 3
Intermediate-feature-level combination deep models for multimodal data integration for clinical decision support. Data from diverse sources (imaging, EHR, and SNP) are combined using novel deep architectures. 3D convolutional neural network architectures are applied to 3D MR image regions to obtain intermediate imaging features, and deep stacked denoising autoencoders are used to obtain intermediate EHR and SNP features. The three types of intermediate features are passed into a classification layer for classification into Alzheimer’s stages (CN, MCI, and AD).
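A minimal PyTorch sketch of this intermediate-feature-level design follows. It is an illustration under stated assumptions, not the paper's implementation: layer widths, kernel sizes, and the intermediate feature dimension are invented, and the EHR/SNP branches stand in for the encoder halves of stacked denoising autoencoders that the paper pretrains with input corruption.

```python
# Sketch of Figure 3's architecture; all layer sizes are assumptions.
import torch
import torch.nn as nn

class MultimodalNet(nn.Module):
    def __init__(self, n_classes=3):
        super().__init__()
        # 3D CNN branch for MR image regions (input: 1-channel 3D volumes).
        self.img_branch = nn.Sequential(
            nn.Conv3d(1, 8, kernel_size=3), nn.ReLU(),
            nn.AdaptiveAvgPool3d(1), nn.Flatten(), nn.Linear(8, 16),
        )
        # Stand-ins for the encoder halves of the stacked denoising
        # autoencoders used for EHR and SNP data.
        self.ehr_branch = nn.Sequential(nn.Linear(1680, 200), nn.ReLU(), nn.Linear(200, 16))
        self.snp_branch = nn.Sequential(nn.Linear(500, 100), nn.ReLU(), nn.Linear(100, 16))
        # Classification layer over the concatenated intermediate features.
        self.classifier = nn.Linear(16 * 3, n_classes)

    def forward(self, img, ehr, snp):
        z = torch.cat([self.img_branch(img), self.ehr_branch(ehr),
                       self.snp_branch(snp)], dim=1)
        return self.classifier(z)  # logits for CN / MCI / AD
```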
Figure 4
Internal cross-validation results for each individual data modality in predicting Alzheimer’s stage. (a) Imaging results: deep learning predictions outperform shallow learning predictions. (b) EHR results: deep learning outperforms the shallow models kNN and SVM and is comparable to decision trees and random forests. (c) SNP results: deep learning outperforms the shallow models. kNN, SVM, RF, and decision trees are the shallow models. (kNN k-nearest neighbors, SVM support vector machines, RF random forests).
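To reproduce this style of comparison, one could run internal cross-validation with the paper's four metrics. The scikit-learn sketch below reuses the illustrative X_ehr and y arrays from the earlier sketch; the fold count and the choice of kNN are assumptions.

```python
# Cross-validated accuracy, precision, recall, and mean F1 for one model;
# fold count and classifier are illustrative choices.
from sklearn.model_selection import cross_validate
from sklearn.neighbors import KNeighborsClassifier

scoring = {"accuracy": "accuracy", "precision": "precision_macro",
           "recall": "recall_macro", "mean_f1": "f1_macro"}
cv_results = cross_validate(KNeighborsClassifier(), X_ehr, y, cv=10, scoring=scoring)
print({k: v.mean() for k, v in cv_results.items() if k.startswith("test_")})
```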
Figure 5
Internal cross-validation results for integration of data modalities to predict Alzheimer’s stage. (a) Imaging + EHR + SNP: deep learning predictions outperform shallow learning predictions. (b) EHR + SNP: deep learning predictions outperform shallow learning predictions. (c) Imaging + EHR: deep learning predictions outperform shallow learning predictions. (d) Imaging + SNP: shallow learning gave better predictions than deep learning due to small sample sizes. (kNN k-nearest neighbors, SVM support vector machines, RF random forests, SM shallow models, DL deep learning).
Figure 6
Feature extraction for deep model interpretation. In this novel feature interpretation methodology, features are masked one at a time and the effect on the classification is observed; the feature that produces the largest drop in accuracy is ranked highest. Once the features were ranked, we used cluster analysis to check whether the intermediate features picked up associations different from those in the raw data. The deep models show associations that differ from those of the shallow models, which accounts for their superior performance.
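A minimal sketch of this masking procedure, assuming a fitted scikit-learn-style classifier with a predict method; masking a feature by replacing it with its mean is an assumption, as the paper's exact perturbation scheme is not reproduced here.

```python
# Rank features by the accuracy drop caused by masking each one in turn.
import numpy as np
from sklearn.metrics import accuracy_score

def rank_features(model, X, y):
    base = accuracy_score(y, model.predict(X))
    drops = []
    for j in range(X.shape[1]):
        X_masked = X.copy()
        X_masked[:, j] = X[:, j].mean()  # neutralize feature j (assumed scheme)
        drops.append(base - accuracy_score(y, model.predict(X_masked)))
    return np.argsort(drops)[::-1]       # largest accuracy drop ranked first
```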
Figure 7
Data pre-processing pipeline for the three data modalities. (a) Imaging data are first skull-stripped and segmented into white matter, grey matter, and cerebrospinal fluid; the images are then registered to a standard space before 21 brain regions are extracted using anatomical automatic labeling (AAL) atlases. (b) Clinical data are normalized to, or encoded within, the range 1–2; features with > 70% missing values are then discarded, yielding 1680 features for 204 patients. (c) SNP data are first filtered and error-corrected; feature selection is then performed using known genes, followed by maximum-relevance (maxrel) based methods, to obtain 500 SNPs for 808 patients.
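The clinical steps in panel (b) could look roughly like the pandas sketch below; it is a minimal illustration assuming a numeric feature table, and the 1–2 encoding of categorical variables is omitted.

```python
# Drop features with > 70% missing values, then min-max scale into [1, 2].
import pandas as pd

def preprocess_clinical(df: pd.DataFrame) -> pd.DataFrame:
    kept = df.loc[:, df.isna().mean() <= 0.70]  # discard > 70% missing
    lo, hi = kept.min(), kept.max()
    return 1 + (kept - lo) / (hi - lo)          # scale each column to [1, 2]
```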
