Identifying and predicting amyotrophic lateral sclerosis clinical subgroups: a population-based machine-learning study
- PMID: 35341712
- PMCID: PMC9038712
- DOI: 10.1016/S2589-7500(21)00274-0
Identifying and predicting amyotrophic lateral sclerosis clinical subgroups: a population-based machine-learning study
Abstract
Background: Amyotrophic lateral sclerosis (ALS) is known to represent a collection of overlapping syndromes. Various classification systems based on empirical observations have been proposed, but it is unclear to what extent they reflect ALS population substructures. We aimed to use machine-learning techniques to identify the number and nature of ALS subtypes to obtain a better understanding of this heterogeneity, enhance our understanding of the disease, and improve clinical care.
Methods: In this retrospective study, we applied unsupervised Uniform Manifold Approximation and Projection [UMAP]) modelling, semi-supervised (neural network UMAP) modelling, and supervised (ensemble learning based on LightGBM) modelling to a population-based discovery cohort of patients who were diagnosed with ALS while living in the Piedmont and Valle d'Aosta regions of Italy, for whom detailed clinical data, such as age at symptom onset, were available. We excluded patients with missing Revised ALS Functional Rating Scale (ALSFRS-R) feature values from the unsupervised and semi-supervised steps. We replicated our findings in an independent population-based cohort of patients who were diagnosed with ALS while living in the Emilia Romagna region of Italy.
Findings: Between Jan 1, 1995, and Dec 31, 2015, 2858 patients were entered in the discovery cohort. After excluding 497 (17%) patients with missing ALSFRS-R feature values, data for 42 clinical features across 2361 (83%) patients were available for the unsupervised and semi-supervised analysis. We found that semi-supervised machine learning produced the optimum clustering of the patients with ALS. These clusters roughly corresponded to the six clinical subtypes defined by the Chiò classification system (ie, bulbar, respiratory, flail arm, classical, pyramidal, and flail leg ALS). Between Jan 1, 2009, and March 1, 2018, 1097 patients were entered in the replication cohort. After excluding 108 (10%) patients with missing ALSFRS-R feature values, data for 42 clinical features across 989 patients were available for the unsupervised and semi-supervised analysis. All 1097 patients were included in the supervised analysis. The same clusters were identified in the replication cohort. By contrast, other ALS classification schemes, such as the El Escorial categories, Milano-Torino clinical staging, and King's clinical stages, did not adequately label the clusters. Supervised learning identified 11 clinical parameters that predicted ALS clinical subtypes with high accuracy (area under the curve 0·982 [95% CI 0·980-0·983]).
Interpretation: Our data-driven study provides insight into the ALS population substructure and confirms that the Chiò classification system successfully identifies ALS subtypes. Additional validation is required to determine the accuracy and clinical use of these algorithms in assigning clinical subtypes. Nevertheless, our algorithms offer a broad insight into the clinical heterogeneity of ALS and help to determine the actual subtypes of disease that exist within this fatal neurodegenerative syndrome. The systematic identification of ALS subtypes will improve clinical care and clinical trial design.
Funding: US National Institute on Aging, US National Institutes of Health, Italian Ministry of Health, European Commission, University of Torino Rita Levi Montalcini Department of Neurosciences, Emilia Romagna Regional Health Authority, and Italian Ministry of Education, University, and Research.
Translations: For the Italian and German translations of the abstract see Supplementary Materials section.
Copyright © 2022 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY-NC-ND 4.0 license. Published by Elsevier Ltd.. All rights reserved.
Conflict of interest statement
Declaration of interests BJT holds patents on the clinical testing and therapeutic intervention for the hexanucleotide repeat expansion of C9orf72 (patent numbers EP2751284A1, CA2846307A, and 20180187262); received research grants from the Myasthenia Gravis Foundation, ALS Association, US Center for Disease Control and Prevention, US Department of Veterans Affairs, MSD, and Cerevel Therapeutics; receives funding through the Intramural Research Program at the US National Institutes of Health (NIH), is on the scientific advisory committee of the American Neurological Association, is an associate editor of Brain, and is on the editorial boards of Journal of Neurology, Neurosurgery, and Psychiatry, Neurobiology of Aging, and eClinicalMedicine. JM received research grants from the Fondazione Italiana di Ricerca per la Sclerosi Laterale Amiotrofica, Agenzia Italiana del Farmaco, Italian Ministry of Health, Emilia Romagna Regional Health Authority, and Pfizer. ACh received research funding and honoraria for lectures from Biogen; sits on advisory boards for Mitsubishi Tanabe Pharma, Roche, Denali Therapeutics, Cytokinetics, Biogen, Amylyx Pharmaceuticals, and Sanofi; and participates in data safety monitoring boards for Lilly and AB Science. RV received research scholarship funding from the Rotary Club (global grant GG2094854). FF is employed by Data Tecnica International. MAN is employed by Data Tecnica International and is an adviser for Clover Therapeutics and Neuron23. AD is employed by Data Tecnica International. All other authors declare no competing interests.
Figures





Similar articles
-
Model-Based and Model-Free Techniques for Amyotrophic Lateral Sclerosis Diagnostic Prediction and Patient Clustering.Neuroinformatics. 2019 Jul;17(3):407-421. doi: 10.1007/s12021-018-9406-9. Neuroinformatics. 2019. PMID: 30460455 Free PMC article.
-
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217. Cochrane Database Syst Rev. 2022. PMID: 36321557 Free PMC article.
-
Amyotrophic lateral sclerosis: a comparison of two staging systems in a population-based study.Eur J Neurol. 2016 Sep;23(9):1426-32. doi: 10.1111/ene.13053. Epub 2016 May 30. Eur J Neurol. 2016. PMID: 27238551
-
Prognosis for patients with amyotrophic lateral sclerosis: development and validation of a personalised prediction model.Lancet Neurol. 2018 May;17(5):423-433. doi: 10.1016/S1474-4422(18)30089-9. Epub 2018 Mar 26. Lancet Neurol. 2018. PMID: 29598923
-
Cell-based therapies for amyotrophic lateral sclerosis/motor neuron disease.Cochrane Database Syst Rev. 2019 Dec 19;12(12):CD011742. doi: 10.1002/14651858.CD011742.pub3. Cochrane Database Syst Rev. 2019. PMID: 31853962 Free PMC article.
Cited by
-
Phenotypical Characterization of C9ALS Patients from the Emilia Romagna Registry of ALS: A Retrospective Case-Control Study.Genes (Basel). 2025 Mar 4;16(3):309. doi: 10.3390/genes16030309. Genes (Basel). 2025. PMID: 40149460 Free PMC article.
-
Safety and tolerability of bosutinib in patients with amyotrophic lateral sclerosis (iDReAM study): A multicentre, open-label, dose-escalation phase 1 trial.EClinicalMedicine. 2022 Oct 25;53:101707. doi: 10.1016/j.eclinm.2022.101707. eCollection 2022 Nov. EClinicalMedicine. 2022. PMID: 36467452 Free PMC article.
-
Deep learning methods to predict amyotrophic lateral sclerosis disease progression.Sci Rep. 2022 Aug 12;12(1):13738. doi: 10.1038/s41598-022-17805-9. Sci Rep. 2022. PMID: 35962027 Free PMC article.
-
Identification and prediction of Parkinson's disease subtypes and progression using machine learning in two cohorts.NPJ Parkinsons Dis. 2022 Dec 16;8(1):172. doi: 10.1038/s41531-022-00439-z. NPJ Parkinsons Dis. 2022. PMID: 36526647 Free PMC article.
-
Curating a knowledge base for patients with neurosyphilis: a study protocol of a DEep learning Framework for pErsonalized prediction of Adverse prognosTic events in NeuroSyphilis (DEFEAT-NS).BMJ Open. 2025 Jul 18;15(7):e092248. doi: 10.1136/bmjopen-2024-092248. BMJ Open. 2025. PMID: 40681191 Free PMC article.
References
-
- Hirtz D, Thurman DJ, Gwinn-Hardy K, Mohamed M, Chaudhuri AR, Zalutsky R. How common are the “common” neurologic disorders? Neurology 2007; 68: 326–37 - PubMed
-
- Byrne S, Bede P, Elamin M, et al. Proposed criteria for familial amyotrophic lateral sclerosis. Amyotroph Lateral Scler 2011; 12: 157–9 - PubMed
-
- de Carvalho M, Dengler R, Eisen A, et al. Electrodiagnostic criteria for diagnosis of ALS. Clin Neurophysiol 2008; 119: 497–503 - PubMed
-
- Brooks BR. El Escorial World Federation of Neurology criteria for the diagnosis of amyotrophic lateral sclerosis. Subcommittee on Motor Neuron Diseases/Amyotrophic Lateral Sclerosis of the World Federation of Neurology Research Group on Neuromuscular Diseases and the El Escorial “Clinical limits of amyotrophic lateral sclerosis” workshop contributors. J Neurol Sci 1994; 124 Suppl: 96–107 - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Medical
Research Materials
Miscellaneous