Coding neuroradiology reports for the Northern Manhattan Stroke Study: a comparison of natural language processing and manual review

J S Elkins¹, C Friedman, B Boden-Albala, R L Sacco, G Hripcsak

Affiliations

PMID: 10772780
DOI: 10.1006/cbmr.1999.1535

Comparative Study

Coding neuroradiology reports for the Northern Manhattan Stroke Study: a comparison of natural language processing and manual review

J S Elkins et al. Comput Biomed Res. 2000 Feb.

. 2000 Feb;33(1):1-10.

doi: 10.1006/cbmr.1999.1535.

Authors

J S Elkins¹, C Friedman, B Boden-Albala, R L Sacco, G Hripcsak

Affiliation

¹ Department of Medicine, University of California, San Francisco, Medical Center, San Francisco, California, USA.

PMID: 10772780
DOI: 10.1006/cbmr.1999.1535

Abstract

Automated systems using natural language processing may greatly speed chart review tasks for clinical research, but their accuracy in this setting is unknown. The objective of this study was to compare the accuracy of automated and manual coding in the data acquisition tasks of an ongoing clinical research study, the Northern Manhattan Stroke Study(NOMASS). We identified 471 neuroradiology reports of brain images used in the NOMASS study. Using both automated and manual coding, we completed a standardized NOMASS imaging form with the information contained in these reports. We then generated ROC curves for both manual and automated coding by comparing our results to the original NOMASS data, where study in investigators directly coded their interpretations of brain images. The areas under the ROC curves for both manual and automated coding were the main outcome measure. The overall predictive value of the automated system (ROC area 0.85, 95% CI 0.84-0.87) was not statistically different from the predictive value of the manual coding (ROC area 0.87, 95% CI 0.83-0.91). Measured in terms of accuracy, the automated system performed slightly worse than manual coding. The overall accuracy of the automated system was 84% (CI 83-85%). The overall accuracy of manual coding was 86% (CI 84-88%). The difference in accuracy between the two methods was small but statistically significant (P = 0.026). Errors in manual coding appeared to be due to differences between neurologists' and nueroradiologists' interpretation, different use of detailed anatomic terms, and lack of clinical information. Automated systems can use natural language processing to rapidly perform complex data acquisition tasks. Although there is a small decrease in the accuracy of the data as compared to traditional methods, automated systems may greatly expand the power of chart review in clinical research design and implementation.

PubMed Disclaimer

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

etc

LinkOut - more resources

Full Text Sources
- Elsevier Science
Other Literature Sources
- The Lens - Patent Citations Database
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Coding neuroradiology reports for the Northern Manhattan Stroke Study: a comparison of natural language processing and manual review

Affiliation

Coding neuroradiology reports for the Northern Manhattan Stroke Study: a comparison of natural language processing and manual review

Authors

Affiliation

Abstract

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical