Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jun 22;8(2):20552173221108635.
doi: 10.1177/20552173221108635. eCollection 2022 Apr-Jun.

Validation of a machine learning approach to estimate expanded disability status scale scores for multiple sclerosis

Affiliations

Validation of a machine learning approach to estimate expanded disability status scale scores for multiple sclerosis

Pedro Alves et al. Mult Scler J Exp Transl Clin. .

Abstract

Background: Disability assessment using the Expanded Disability Status Scale (EDSS) is important to inform treatment decisions and monitor the progression of multiple sclerosis. Yet, EDSS scores are documented infrequently in electronic medical records.

Objective: To validate a machine learning model to estimate EDSS scores for multiple sclerosis patients using clinical notes from neurologists.

Methods: A machine learning model was developed to estimate EDSS scores on specific encounter dates using clinical notes from neurologist visits. The OM1 MS Registry data were used to create a training cohort of 2632 encounters and a separate validation cohort of 857 encounters, all with clinician-recorded EDSS scores. Model performance was assessed using the area under the receiver-operating-characteristic curve (AUC), positive predictive value (PPV), and negative predictive value (NPV), calculated using a binarized version of the outcome. The Spearman R and Pearson R values were calculated. The model was then applied to encounters without clinician-recorded EDSS scores in the MS Registry.

Results: The model had a PPV of 0.85, NPV of 0.85, and AUC of 0.91. The model had a Spearman R value of 0.75 and Pearson R value of 0.74 when evaluating performance using the continuous estimated EDSS and clinician-recorded EDSS scores. Application of the model to eligible encounters resulted in the generation of eEDSS scores for an additional 190,282 encounters from 13,249 patients.

Conclusion: EDSS scores can be estimated with very good performance using a machine learning model applied to clinical notes, thus increasing the utility of real-world data sources for research purposes.

Keywords: Multiple sclerosis; disability evaluation; health services research; machine learning; outcome assessment; registries.

PubMed Disclaimer

Conflict of interest statement

Declaration of conflicting interests: The authors declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: The authors indicated are employees of OM1, which is involved in issues related to the topic of this manuscript.

Figures

Figure 1.
Figure 1.
The area under the receiver-operating-characteristic curve (AUC). The AUC was calculated using a binarized version of the outcome in which the positive class is defined as those notes with scores greater or equal to 6 (the threshold at which EDSS scores reflects the requirement for ambulatory aid), and the negative class is defined as those records with scores less than or equal to 5.5.
Figure 2.
Figure 2.
Distribution of estimated and clinician-recorded EDSS scores in the validation cohort. The distribution of eEDSS scores was compared to the distribution of clinician-recorded EDSS scores.
Figure 3.
Figure 3.
Confusion matrix showing agreement between estimated and clinician-recorded EDSS scores in the validation cohort. A confusion matrix was generated to further assess the agreement between the eEDSS scores and clinician-recorded EDSS scores.
Figure 4.
Figure 4.
Distribution of estimated and clinician-recorded EDSS scores in the validation cohort. The distribution of eEDSS scores for eligible encounters in the MS Registry was compared to the distribution of clinician-recorded EDSS scores in the validation cohort, with the scores on the x-axis and the percentage of total encounters on the y-axis.

Similar articles

Cited by

References

    1. Kurtzke JF. A new scale for evaluating disability in multiple sclerosis. Neurology 1955; 5: 580. - PubMed
    1. Meyer-Moock S, Feng Y-S, Maeurer M, et al.. Systematic literature review and validity evaluation of the expanded disability status scale (EDSS) and the multiple sclerosis functional composite (MSFC) in patients with multiple sclerosis. BMC Neurol 2014; 14: 58. - PMC - PubMed
    1. Baldassari LE, Salter AR, Longbrake EE, et al.. Streamlined EDSS for use in multiple sclerosis clinical practice: development and cross-sectional comparison to EDSS. Mult Scler 2018; 24: 1347–1355. - PubMed
    1. Cohen JA, Trojano M, Mowry EM, et al.. Leveraging real-world data to investigate multiple sclerosis disease behavior, prognosis, and treatment. Mult Scler 2020; 26: 23–37. - PMC - PubMed
    1. Alves P, Bandaria J, Leavy MB, et al.. Validation of a machine learning approach to estimate systemic lupus erythematosus disease activity index score categories and application in a real-world dataset. RMD Open 2021; 7: e001586. - PMC - PubMed

LinkOut - more resources