Mult Scler J Exp Transl Clin. 2022 Jun 22;8(2):20552173221108635.

doi: 10.1177/20552173221108635. eCollection 2022 Apr-Jun.

Validation of a machine learning approach to estimate expanded disability status scale scores for multiple sclerosis

Pedro Alves¹, Eric Green¹, Michelle Leavy², Haley Friedler², Gary Curhan², Carl Marci³, Costas Boussios¹

Affiliations

¹ Data Science, OM1, Inc., Boston, MA, USA.
² Research, OM1, Inc., Boston, MA, USA.
³ Mental Health and Neuroscience, OM1, Inc., Boston, MA, USA.

PMID: 35755008
PMCID: PMC9228644
DOI: 10.1177/20552173221108635

Pedro Alves et al. Mult Scler J Exp Transl Clin. 2022.

Abstract

Background: Disability assessment using the Expanded Disability Status Scale (EDSS) is important to inform treatment decisions and monitor the progression of multiple sclerosis. Yet, EDSS scores are documented infrequently in electronic medical records.

Objective: To validate a machine learning model to estimate EDSS scores for multiple sclerosis patients using clinical notes from neurologists.

Methods: A machine learning model was developed to estimate EDSS scores on specific encounter dates using clinical notes from neurologist visits. The OM1 MS Registry data were used to create a training cohort of 2632 encounters and a separate validation cohort of 857 encounters, all with clinician-recorded EDSS scores. Model performance was assessed using the area under the receiver-operating-characteristic curve (AUC), positive predictive value (PPV), and negative predictive value (NPV), calculated using a binarized version of the outcome. The Spearman R and Pearson R values were calculated. The model was then applied to encounters without clinician-recorded EDSS scores in the MS Registry.
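The binarized evaluation described in the Methods can be illustrated with a minimal sketch. It is not the authors' code. The threshold of 6 is taken from the Figure 1 caption; treating an estimated score of 6 or more as a predicted positive, and using the continuous estimated score as the ranking score for the AUC, are assumptions made here for illustration. The function names and the toy scores are hypothetical.

```python
from typing import Sequence

# From the Figure 1 caption: the positive class is EDSS >= 6.
AMBULATORY_AID_THRESHOLD = 6.0


def binarize(scores: Sequence[float],
             threshold: float = AMBULATORY_AID_THRESHOLD) -> list[int]:
    """Map EDSS scores to 1 (>= threshold) or 0 (< threshold)."""
    return [1 if s >= threshold else 0 for s in scores]


def ppv_npv(y_true: Sequence[int], y_pred: Sequence[int]) -> tuple[float, float]:
    """Return (PPV, NPV) computed from binary labels and binary predictions."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    ppv = tp / (tp + fp) if (tp + fp) else float("nan")
    npv = tn / (tn + fn) if (tn + fn) else float("nan")
    return ppv, npv


def auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outranks a random
    negative, counting ties as one half."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data, not values from the study.
recorded = [2.0, 3.5, 6.0, 6.5, 4.0, 7.0]    # clinician-recorded EDSS
estimated = [2.5, 6.0, 5.5, 6.5, 4.0, 7.5]   # model-estimated EDSS

y_true = binarize(recorded)
y_pred = binarize(estimated)
ppv, npv = ppv_npv(y_true, y_pred)
print(f"PPV={ppv:.2f} NPV={npv:.2f} AUC={auc(y_true, estimated):.2f}")
```

The pairwise AUC is quadratic in the number of encounters, which is acceptable for a cohort of a few hundred encounters; a rank-based formulation would scale better.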

Results: The model had a PPV of 0.85, NPV of 0.85, and AUC of 0.91. The model had a Spearman R value of 0.75 and Pearson R value of 0.74 when evaluating performance using the continuous estimated EDSS (eEDSS) and clinician-recorded EDSS scores. Application of the model to eligible encounters generated eEDSS scores for an additional 190,282 encounters from 13,249 patients.
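For readers unfamiliar with the two correlation measures reported here, the following sketch shows how they differ: Pearson R is computed on the raw scores, while Spearman R is Pearson R computed on their ranks, with tied values sharing an average rank. This is a generic illustration and not the authors' implementation; all function names and the example scores are hypothetical.

```python
import math
from typing import Sequence


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def average_ranks(values: Sequence[float]) -> list[float]:
    """1-based ranks; tied values receive the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman correlation: Pearson correlation of the average ranks."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Hypothetical toy scores, not values from the study.
recorded = [2.0, 3.5, 6.0, 6.5, 4.0, 7.0]
estimated = [2.5, 6.0, 5.5, 6.5, 4.0, 7.5]
print(f"Pearson R={pearson_r(recorded, estimated):.2f}")
print(f"Spearman R={spearman_r(recorded, estimated):.2f}")
```

Because EDSS is an ordinal scale with half-point steps, ties are common, which is why the ranking helper averages tied positions rather than breaking ties arbitrarily.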

Conclusion: EDSS scores can be estimated with very good performance using a machine learning model applied to clinical notes, thus increasing the utility of real-world data sources for research purposes.

Keywords: Multiple sclerosis; disability evaluation; health services research; machine learning; outcome assessment; registries.

Conflict of interest statement

Declaration of conflicting interests: The authors declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: The authors indicated are employees of OM1, which is involved in issues related to the topic of this manuscript.

Figures

**Figure 1.**
The area under the receiver-operating-characteristic curve (AUC). The AUC was calculated using a binarized version of the outcome in which the positive class is defined as those notes with scores greater than or equal to 6 (the threshold at which an EDSS score reflects the requirement for an ambulatory aid), and the negative class is defined as those notes with scores less than or equal to 5.5.

**Figure 2.**
Distribution of estimated and clinician-recorded EDSS scores in the validation cohort. The distribution of eEDSS scores was compared to the distribution of clinician-recorded EDSS scores.

**Figure 3.**
Confusion matrix showing agreement between estimated and clinician-recorded EDSS scores in the validation cohort. A confusion matrix was generated to further assess the agreement between the eEDSS scores and clinician-recorded EDSS scores.

**Figure 4.**
Distribution of estimated and clinician-recorded EDSS scores in the validation cohort. The distribution of eEDSS scores for eligible encounters in the MS Registry was compared to the distribution of clinician-recorded EDSS scores in the validation cohort, with the scores on the x-axis and the percentage of total encounters on the y-axis.

Cited by

Natural language processing systems for extracting information from electronic health records about activities of daily living. A systematic review.
Wieland-Jorna Y, van Kooten D, Verheij RA, de Man Y, Francke AL, Oosterveld-Vlug MG. JAMIA Open. 2024 May 24;7(2):ooae044. doi: 10.1093/jamiaopen/ooae044. eCollection 2024 Jul. PMID: 38798774. Free PMC article. Review.

Utilizing Aerobic Capacity Data for EDSS Score Estimation in Multiple Sclerosis: A Machine Learning Approach.
Tuncer SA, Danacı C, Bilek F, Demir CF, Tuncer T. Diagnostics (Basel). 2024 Jun 13;14(12):1249. doi: 10.3390/diagnostics14121249. PMID: 38928664. Free PMC article.

Detecting New Lesions Using a Large Language Model: Applications in Real-World Multiple Sclerosis Datasets.
Poole S, Sisodia N, Koshal K, Henderson K, Wijangco J, Paredes D, Chen C, Rowles W, Akula A, Wuerfel J, Sharma V; UCSF Multiple Sclerosis and Neuroinflammation Center clinicians; Rauschecker AM, Henry RG, Bove R. Ann Neurol. 2025 Aug;98(2):308-316. doi: 10.1002/ana.27251. Epub 2025 Apr 25. PMID: 40277428. Free PMC article.

Integrating large language models in care, research, and education in multiple sclerosis management.
Inojosa H, Voigt I, Wenk J, Ferber D, Wiest I, Antweiler D, Weicken E, Gilbert S, Kather JN, Akgün K, Ziemssen T. Mult Scler. 2024 Oct;30(11-12):1392-1401. doi: 10.1177/13524585241277376. Epub 2024 Sep 23. PMID: 39308156. Free PMC article. Review.

Reply to Letter to the Editor: Machine learning to deal with missing disability status.
Alves P, Leavy M, Curhan G, Marci C, Boussios C. Mult Scler J Exp Transl Clin. 2022 Oct 20;8(4):20552173221128875. doi: 10.1177/20552173221128875. eCollection 2022 Oct-Dec. PMID: 36311693. Free PMC article. No abstract available.
