Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jan 12;10(1):e25157.
doi: 10.2196/25157.

Assessment of Natural Language Processing Methods for Ascertaining the Expanded Disability Status Scale Score From the Electronic Health Records of Patients With Multiple Sclerosis: Algorithm Development and Validation Study

Affiliations

Assessment of Natural Language Processing Methods for Ascertaining the Expanded Disability Status Scale Score From the Electronic Health Records of Patients With Multiple Sclerosis: Algorithm Development and Validation Study

Zhen Yang et al. JMIR Med Inform. .

Abstract

Background: The Expanded Disability Status Scale (EDSS) score is a widely used measure to monitor disability progression in people with multiple sclerosis (MS). However, extracting and deriving the EDSS score from unstructured electronic health records can be time-consuming.

Objective: We aimed to compare rule-based and deep learning natural language processing algorithms for detecting and predicting the total EDSS score and EDSS functional system subscores from the electronic health records of patients with MS.

Methods: We studied 17,452 electronic health records of 4906 MS patients followed at one of Canada's largest MS clinics between June 2015 and July 2019. We randomly divided the records into training (80%) and test (20%) data sets, and compared the performance characteristics of 3 natural language processing models. First, we applied a rule-based approach, extracting the EDSS score from sentences containing the keyword "EDSS." Next, we trained a convolutional neural network (CNN) model to predict the 19 half-step increments of the EDSS score. Finally, we used a combined rule-based-CNN model. For each approach, we determined the accuracy, precision, recall, and F-score compared with the reference standard, which was manually labeled EDSS scores in the clinic database.

Results: Overall, the combined keyword-CNN model demonstrated the best performance, with accuracy, precision, recall, and an F-score of 0.90, 0.83, 0.83, and 0.83 respectively. Respective figures for the rule-based and CNN models individually were 0.57, 0.91, 0.65, and 0.70, and 0.86, 0.70, 0.70, and 0.70. Because of missing data, the model performance for EDSS subscores was lower than that for the total EDSS score. Performance improved when considering notes with known values of the EDSS subscores.

Conclusions: A combined keyword-CNN natural language processing model can extract and accurately predict EDSS scores from patient records. This approach can be automated for efficient information extraction in clinical and research settings.

Keywords: machine learning; multiple sclerosis; natural language processing.

PubMed Disclaimer

Conflict of interest statement

Conflicts of Interest: JO reports grants from MS Society of Canada, The Barford and Love MS Fund of St. Michael’s Hospital Foundation, National MS Society, Brain Canada, Biogen-Idec, Roche, and EMD-Serono; and personal fees for consulting or speaking from Biogen-Idec, EMD-Serono, Roche, Sanofi-Genzyme, Novartis, and Celgene.

Figures

Figure 1
Figure 1
Convolutional neural network model structure. EDSS: Expanded Disability Status Scale.
Figure 2
Figure 2
Combined rule-based–CNN model. CNN: convolutional neural network.

Similar articles

Cited by

References

    1. Murray TJ. Diagnosis and treatment of multiple sclerosis. BMJ. 2006 Mar 04;332(7540):525–7. doi: 10.1136/bmj.332.7540.525. http://europepmc.org/abstract/MED/16513709 332/7540/525 - DOI - PMC - PubMed
    1. Compston A, Coles A. Multiple sclerosis. The Lancet. 2008 Oct;372(9648):1502–1517. doi: 10.1016/s0140-6736(08)61620-7. - DOI - PubMed
    1. Kurtzke JF. Rating neurologic impairment in multiple sclerosis: an expanded disability status scale (EDSS) Neurology. 1983 Nov 01;33(11):1444–52. doi: 10.1212/wnl.33.11.1444. - DOI - PubMed
    1. Meyer-Moock S, Feng Y, Maeurer M, Dippel F, Kohlmann T. Systematic literature review and validity evaluation of the Expanded Disability Status Scale (EDSS) and the Multiple Sclerosis Functional Composite (MSFC) in patients with multiple sclerosis. BMC Neurol. 2014 Mar 25;14(1):58. doi: 10.1186/1471-2377-14-58. https://bmcneurol.biomedcentral.com/articles/10.1186/1471-2377-14-58 1471-2377-14-58 - DOI - DOI - PMC - PubMed
    1. Uitdehaag BMJ. Disability Outcome Measures in Phase III Clinical Trials in Multiple Sclerosis. CNS Drugs. 2018 Jun 20;32(6):543–558. doi: 10.1007/s40263-018-0530-8. http://europepmc.org/abstract/MED/29926371 10.1007/s40263-018-0530-8 - DOI - PMC - PubMed

LinkOut - more resources