Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Oct 30;12(11):1754.
doi: 10.3390/genes12111754.

Molecular Classification and Interpretation of Amyotrophic Lateral Sclerosis Using Deep Convolution Neural Networks and Shapley Values

Affiliations

Molecular Classification and Interpretation of Amyotrophic Lateral Sclerosis Using Deep Convolution Neural Networks and Shapley Values

Abdul Karim et al. Genes (Basel). .

Abstract

Amyotrophic lateral sclerosis (ALS) is a prototypical neurodegenerative disease characterized by progressive degeneration of motor neurons to severely effect the functionality to control voluntary muscle movement. Most of the non-additive genetic aberrations responsible for ALS make its molecular classification very challenging along with limited sample size, curse of dimensionality, class imbalance and noise in the data. Deep learning methods have been successful in many other related areas but have low minority class accuracy and suffer from the lack of explainability when used directly with RNA expression features for ALS molecular classification. In this paper, we propose a deep-learning-based molecular ALS classification and interpretation framework. Our framework is based on training a convolution neural network (CNN) on images obtained from converting RNA expression values into pixels based on DeepInsight similarity technique. Then, we employed Shapley additive explanations (SHAP) to extract pixels with higher relevance to ALS classifications. These pixels were mapped back to the genes which made them up. This enabled us to classify ALS samples with high accuracy for a minority class along with identifying genes that might be playing an important role in ALS molecular classifications. Taken together with RNA expression images classified with CNN, our preliminary analysis of the genes identified by SHAP interpretation demonstrate the value of utilizing Machine Learning to perform molecular classification of ALS and uncover disease-associated genes.

Keywords: ALS; classification; interpretation; machine learning; target identification.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
ALS and control sample images with 120 × 120 resolution obtained using DeepInsight for demonstration purposes.
Figure 2
Figure 2
DeepInsight pipeline. (a) An illustration of transformation from feature vector to feature matrix. (b) An illustration of the DeepInsight methodology to transform a feature vector to image pixels. Image taken from DeepInsight [23].
Figure 3
Figure 3
CNN architecture for classifying ALS and control images.
Figure 4
Figure 4
Left side is gray-scale image of an ALS sample. Right side shows highlighted pixels in the image with SHAP values.
Figure 5
Figure 5
12-fold cross-validation performance for creating images of various resolutions.

Similar articles

Cited by

References

    1. Phukan J., Pender N.P., Hardiman O. Cognitive impairment in amyotrophic lateral sclerosis. Lancet Neurol. 2007;6:994–1003. doi: 10.1016/S1474-4422(07)70265-X. - DOI - PubMed
    1. Yin B., Balvert M., van der Spek R.A., Dutilh B.E., Bohté S., Veldink J., Schönhuth A. Using the structure of genome data in the design of deep neural networks for predicting amyotrophic lateral sclerosis from genotype. Bioinformatics. 2019;35:i538–i547. doi: 10.1093/bioinformatics/btz369. - DOI - PMC - PubMed
    1. Amyotrophic Lateral Sclerosis (ALS) Fact Sheet | National Institute of Neurological Disorders and Stroke. [(accessed on 15 February 2021)]; Available online: https://www.ninds.nih.gov/Disorders/Patient-Caregiver-Education/Fact-She....
    1. Van Rheenen W., Shatunov A., Dekker A.M., McLaughlin R.L., Diekstra F.P., Pulit S.L., Van Der Spek R.A., Võsa U., De Jong S., Robinson M.R., et al. Genome-wide association analyses identify new risk variants and the genetic architecture of amyotrophic lateral sclerosis. Nat. Genet. 2016;48:1043–1048. doi: 10.1038/ng.3622. - DOI - PMC - PubMed
    1. Arloth J., Eraslan G., Andlauer T.F., Martins J., Iurato S., Kühnel B., Waldenberger M., Frank J., Gold R., Hemmer B., et al. DeepWAS: Multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning. PLoS Comput. Biol. 2020;16:e1007616. doi: 10.1371/journal.pcbi.1007616. - DOI - PMC - PubMed

MeSH terms