A natural language processing program effectively extracts key pathologic findings from radical prostatectomy reports

Brian J Kim¹, Madhur Merchant, Chengyi Zheng, Anil A Thomas, Richard Contreras, Steven J Jacobsen, Gary W Chien

Affiliations

PMID: 25211697
DOI: 10.1089/end.2014.0221

A natural language processing program effectively extracts key pathologic findings from radical prostatectomy reports

Brian J Kim et al. J Endourol. 2014 Dec.

. 2014 Dec;28(12):1474-8.

doi: 10.1089/end.2014.0221.

Authors

Brian J Kim¹, Madhur Merchant, Chengyi Zheng, Anil A Thomas, Richard Contreras, Steven J Jacobsen, Gary W Chien

Affiliation

¹ 1 Department of Urology, Kaiser Permanente Los Angeles Medical Center , Los Angeles, California.

PMID: 25211697
DOI: 10.1089/end.2014.0221

Abstract

Introduction and objective: Natural language processing (NLP) software programs have been widely developed to transform complex free text into simplified organized data. Potential applications in the field of medicine include automated report summaries, physician alerts, patient repositories, electronic medical record (EMR) billing, and quality metric reports. Despite these prospects and the recent widespread adoption of EMR, NLP has been relatively underutilized. The objective of this study was to evaluate the performance of an internally developed NLP program in extracting select pathologic findings from radical prostatectomy specimen reports in the EMR.

Methods: An NLP program was generated by a software engineer to extract key variables from prostatectomy reports in the EMR within our healthcare system, which included the TNM stage, Gleason grade, presence of a tertiary Gleason pattern, histologic subtype, size of dominant tumor nodule, seminal vesicle invasion (SVI), perineural invasion (PNI), angiolymphatic invasion (ALI), extracapsular extension (ECE), and surgical margin status (SMS). The program was validated by comparing NLP results to a gold standard compiled by two blinded manual reviewers for 100 random pathology reports.

Results: NLP demonstrated 100% accuracy for identifying the Gleason grade, presence of a tertiary Gleason pattern, SVI, ALI, and ECE. It also demonstrated near-perfect accuracy for extracting histologic subtype (99.0%), PNI (98.9%), TNM stage (98.0%), SMS (97.0%), and dominant tumor size (95.7%). The overall accuracy of NLP was 98.7%. NLP generated a result in <1 second, whereas the manual reviewers averaged 3.2 minutes per report.

Conclusions: This novel program demonstrated high accuracy and efficiency identifying key pathologic details from the prostatectomy report within an EMR system. NLP has the potential to assist urologists by summarizing and highlighting relevant information from verbose pathology reports. It may also facilitate future urologic research through the rapid and automated creation of large databases.

PubMed Disclaimer

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
- Atypon
Other Literature Sources
- scite Smart Citations
Medical
- MedlinePlus Health Information

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

A natural language processing program effectively extracts key pathologic findings from radical prostatectomy reports

Affiliation

A natural language processing program effectively extracts key pathologic findings from radical prostatectomy reports

Authors

Affiliation

Abstract

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources

Medical