Machine learning and natural language processing methods to identify ischemic stroke, acuity and location from radiology reports

Charlene Jennifer Ong et al.

PLoS One. 2020 Jun 19;15(6):e0234908. doi: 10.1371/journal.pone.0234908. eCollection 2020.

Abstract

Accurate, automated extraction of clinical stroke information from unstructured text has several important applications. ICD-9/10 codes can misclassify ischemic stroke events and do not distinguish acuity or location. Expeditious, accurate data extraction could considerably improve stroke identification in large datasets, triage of critical clinical reports, and quality improvement efforts. In this study, we developed and report a comprehensive framework studying the performance of simple and complex stroke-specific Natural Language Processing (NLP) and Machine Learning (ML) methods to determine presence, location, and acuity of ischemic stroke from radiographic text. We collected 60,564 Computed Tomography and Magnetic Resonance Imaging radiology reports from 17,864 patients at two large academic medical centers. We used standard techniques to featurize unstructured text and developed neurovascular-specific GloVe word embeddings. We trained various binary classification algorithms to identify stroke presence, location, and acuity using 75% of 1,359 expert-labeled reports. We validated our methods internally on the remaining 25% of reports and externally on 500 radiology reports from an entirely separate academic institution. In our internal population, GloVe word embeddings paired with deep learning (Recurrent Neural Networks) had the best discrimination of all methods for our three tasks (AUCs of 0.96, 0.98, and 0.93, respectively). Simpler NLP approaches (Bag of Words) performed best with interpretable algorithms (Logistic Regression) for identifying ischemic stroke (AUC of 0.95), MCA location (AUC of 0.96), and acuity (AUC of 0.90). Similarly, GloVe and Recurrent Neural Networks (AUCs of 0.92, 0.89, and 0.93) generalized better in our external test set than Bag of Words and Logistic Regression (AUCs of 0.89, 0.86, and 0.80) for stroke presence, location, and acuity, respectively. Our study provides a comprehensive assessment of NLP techniques for unstructured radiographic text. Our findings suggest that NLP/ML methods can be used to discriminate stroke features in large data cohorts for both clinical and research-related investigations.
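As a concrete illustration of the simpler pipeline the abstract describes, the following is a minimal Python sketch pairing Bag of Words featurization with Logistic Regression via scikit-learn. The toy reports and labels are hypothetical stand-ins for the expert-labeled corpus, not the study's actual data.

    # Minimal sketch: Bag of Words + Logistic Regression for stroke presence.
    # The reports/labels below are hypothetical stand-ins for expert-labeled data.
    from sklearn.feature_extraction.text import CountVectorizer
    from sklearn.linear_model import LogisticRegression
    from sklearn.metrics import roc_auc_score

    train_reports = [
        "acute infarct in the left mca territory",
        "no acute intracranial abnormality",
        "subacute ischemic stroke in the right mca territory",
        "unremarkable noncontrast head ct",
    ]
    train_labels = [1, 0, 1, 0]  # 1 = ischemic stroke present
    test_reports = ["large acute mca territory infarct", "normal study"]
    test_labels = [1, 0]

    vectorizer = CountVectorizer()                     # Bag of Words featurization
    X_train = vectorizer.fit_transform(train_reports)  # token-count matrix
    clf = LogisticRegression(max_iter=1000).fit(X_train, train_labels)

    probs = clf.predict_proba(vectorizer.transform(test_reports))[:, 1]
    print("AUC:", roc_auc_score(test_labels, probs))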


Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1. Flow diagram of Natural Language Processing (NLP) methodology.
Text featurization with GloVe and binary classification lead to receiver operating characteristic (ROC) curves for stroke occurrence, MCA location, and stroke acuity. Representative ROC curves for each of the text featurization methods are displayed. RPDR = Research Patient Data Registry; CT = Computed Tomography; CTA = Computed Tomography Angiography; MRI = Magnetic Resonance Imaging; MRA = Magnetic Resonance Angiography; BOW = Bag of Words; tf-idf = Term Frequency-Inverse Document Frequency; GloVe = Global Vectors for Word Representation; CART = Classification and Regression Trees; OCT = Optimal Classification Trees; RF = Random Forests; RNN = Recurrent Neural Networks.
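To make the GloVe featurization step concrete, here is a minimal sketch that loads pretrained vectors from a standard GloVe-format text file and featurizes a report by averaging its word vectors. The file name and 100-dimension size are hypothetical, and averaging is one simple aggregation choice; the study's RNNs consume the embedded token sequence directly rather than an average.

    # Minimal sketch: average pretrained GloVe vectors to featurize a report.
    # "neuro_glove.100d.txt" and the 100-d size are hypothetical; standard
    # GloVe files store one "word v1 v2 ... vN" entry per line.
    import numpy as np

    def load_glove(path):
        vectors = {}
        with open(path, encoding="utf-8") as f:
            for line in f:
                parts = line.rstrip().split(" ")
                vectors[parts[0]] = np.asarray(parts[1:], dtype=np.float32)
        return vectors

    def featurize(report, vectors, dim=100):
        tokens = report.lower().split()
        hits = [vectors[t] for t in tokens if t in vectors]
        return np.mean(hits, axis=0) if hits else np.zeros(dim, dtype=np.float32)

    glove = load_glove("neuro_glove.100d.txt")  # hypothetical domain embeddings
    x = featurize("Acute infarct in the left MCA territory", glove)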
Fig 2. Receiver operating characteristic (ROC) curves for NLP classification.
A, stroke presence; B, MCA location; C, acuity. These curves represent different combinations of text featurization (BOW, tf-idf, GloVe) and binary classification algorithms (Logistic Regression, k-NN, CART, OCT, OCT-H, RF, RNN). GloVe and RNN achieved the highest AUC for all three tasks (>90%). BOW or tf-idf paired with Logistic Regression achieved similar results on the simpler tasks. The results shown are the mean sensitivity and specificity averaged over five random splits of the data. In an ROC curve, the true positive rate (sensitivity) is plotted as a function of the false positive rate (1 - specificity) for different cut-off points of a parameter. Each point on the ROC curve represents a sensitivity/specificity pair corresponding to a particular decision threshold. The area under the ROC curve (AUC) measures how well a parameter distinguishes between the two classes.
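The threshold sweep described in the caption can be reproduced directly from a classifier's predicted probabilities. Below is a minimal sketch using scikit-learn and matplotlib, with hypothetical labels and probabilities standing in for model output.

    # Minimal sketch: build an ROC curve from predicted probabilities.
    # y_test/probs are hypothetical stand-ins for classifier output.
    import matplotlib.pyplot as plt
    from sklearn.metrics import roc_curve, roc_auc_score

    y_test = [1, 0, 1, 0, 1, 0]
    probs = [0.91, 0.22, 0.68, 0.45, 0.83, 0.10]

    fpr, tpr, thresholds = roc_curve(y_test, probs)  # one point per threshold
    plt.plot(fpr, tpr, label=f"AUC = {roc_auc_score(y_test, probs):.2f}")
    plt.plot([0, 1], [0, 1], linestyle="--")         # chance diagonal
    plt.xlabel("1 - Specificity (false positive rate)")
    plt.ylabel("Sensitivity (true positive rate)")
    plt.legend()
    plt.show()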
Fig 3. Calibration curves for NLP classification.
A, stroke presence; B, MCA location; C, acuity. These curves represent different combinations of text featurization (BOW, tf-idf, GloVe) and binary classification algorithms (Logistic Regression, RF, RNN). We created plots showing the relation between the true class of the samples and the predicted probabilities. We binned the samples according to the class probabilities generated by the model, using the intervals [0,10%], (10,20%], (20,30%], … (90,100%], and then computed the event rate of each bin. For example, if 4 out of 5 samples falling into the last bin are actual events, the event rate for that bin would be 80%. The calibration plot displays the bin midpoints on the x-axis and the event rate on the y-axis. Ideally, the points fall on the 45° line, i.e., predicted probabilities match observed event rates. The results presented are aggregated over five different splits of the data. We show results of the three best-performing methods in each task.
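The binning procedure in the caption is straightforward to reproduce. Here is a minimal Python/NumPy sketch of it, with hypothetical labels and predicted probabilities standing in for model output.

    # Minimal sketch: calibration curve via probability binning.
    # y_true/probs are hypothetical stand-ins for model output.
    import numpy as np

    y_true = np.array([0, 1, 1, 0, 1, 0, 1, 1, 0, 1])
    probs = np.array([0.05, 0.92, 0.71, 0.15, 0.88, 0.33, 0.64, 0.97, 0.42, 0.79])

    edges = np.linspace(0.0, 1.0, 11)  # [0,10%], (10,20%], ..., (90,100%]
    idx = np.clip(np.digitize(probs, edges[1:], right=True), 0, 9)
    midpoints = (edges[:-1] + edges[1:]) / 2
    for b in range(10):
        mask = idx == b
        if mask.any():  # observed event rate per occupied bin
            print(f"midpoint {midpoints[b]:.2f}: event rate {y_true[mask].mean():.2f}")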

