Automated identification of postoperative complications within an electronic medical record using natural language processing
- PMID: 21862746
- DOI: 10.1001/jama.2011.1204
Automated identification of postoperative complications within an electronic medical record using natural language processing
Abstract
Context: Currently most automated methods to identify patient safety occurrences rely on administrative data codes; however, free-text searches of electronic medical records could represent an additional surveillance approach.
Objective: To evaluate a natural language processing search-approach to identify postoperative surgical complications within a comprehensive electronic medical record.
Design, setting, and patients: Cross-sectional study involving 2974 patients undergoing inpatient surgical procedures at 6 Veterans Health Administration (VHA) medical centers from 1999 to 2006.
Main outcome measures: Postoperative occurrences of acute renal failure requiring dialysis, deep vein thrombosis, pulmonary embolism, sepsis, pneumonia, or myocardial infarction identified through medical record review as part of the VA Surgical Quality Improvement Program. We determined the sensitivity and specificity of the natural language processing approach to identify these complications and compared its performance with patient safety indicators that use discharge coding information.
Results: The proportion of postoperative events for each sample was 2% (39 of 1924) for acute renal failure requiring dialysis, 0.7% (18 of 2327) for pulmonary embolism, 1% (29 of 2327) for deep vein thrombosis, 7% (61 of 866) for sepsis, 16% (222 of 1405) for pneumonia, and 2% (35 of 1822) for myocardial infarction. Natural language processing correctly identified 82% (95% confidence interval [CI], 67%-91%) of acute renal failure cases compared with 38% (95% CI, 25%-54%) for patient safety indicators. Similar results were obtained for venous thromboembolism (59%, 95% CI, 44%-72% vs 46%, 95% CI, 32%-60%), pneumonia (64%, 95% CI, 58%-70% vs 5%, 95% CI, 3%-9%), sepsis (89%, 95% CI, 78%-94% vs 34%, 95% CI, 24%-47%), and postoperative myocardial infarction (91%, 95% CI, 78%-97%) vs 89%, 95% CI, 74%-96%). Both natural language processing and patient safety indicators were highly specific for these diagnoses.
Conclusion: Among patients undergoing inpatient surgical procedures at VA medical centers, natural language processing analysis of electronic medical records to identify postoperative complications had higher sensitivity and lower specificity compared with patient safety indicators based on discharge coding.
Comment in
-
The promise of electronic records: around the corner or down the road?JAMA. 2011 Aug 24;306(8):880-1. doi: 10.1001/jama.2011.1219. JAMA. 2011. PMID: 21862751 No abstract available.
-
Natural language processing and electronic medical records.JAMA. 2011 Dec 7;306(21):2325; author reply 2325-6. doi: 10.1001/jama.2011.1780. JAMA. 2011. PMID: 22147375 No abstract available.
Similar articles
-
Exploring the frontier of electronic health record surveillance: the case of postoperative complications.Med Care. 2013 Jun;51(6):509-16. doi: 10.1097/MLR.0b013e31828d1210. Med Care. 2013. PMID: 23673394 Free PMC article.
-
Validity of selected Patient Safety Indicators: opportunities and concerns.J Am Coll Surg. 2011 Jun;212(6):924-34. doi: 10.1016/j.jamcollsurg.2010.07.007. Epub 2010 Dec 14. J Am Coll Surg. 2011. PMID: 20869268
-
Retrospective Derivation and Validation of an Automated Electronic Search Algorithm to Identify Post Operative Cardiovascular and Thromboembolic Complications.Appl Clin Inform. 2015 Sep 9;6(3):565-76. doi: 10.4338/ACI-2015-03-RA-0026. eCollection 2015. Appl Clin Inform. 2015. PMID: 26448798 Free PMC article.
-
Machine learning natural language processing for identifying venous thromboembolism: systematic review and meta-analysis.Blood Adv. 2024 Jun 25;8(12):2991-3000. doi: 10.1182/bloodadvances.2023012200. Blood Adv. 2024. PMID: 38522096 Free PMC article.
-
Natural Language Processing in Surgery: A Systematic Review and Meta-analysis.Ann Surg. 2021 May 1;273(5):900-908. doi: 10.1097/SLA.0000000000004419. Ann Surg. 2021. PMID: 33074901
Cited by
-
A deep database of medical abbreviations and acronyms for natural language processing.Sci Data. 2021 Jun 2;8(1):149. doi: 10.1038/s41597-021-00929-4. Sci Data. 2021. PMID: 34078918 Free PMC article.
-
Extracting data from electronic medical records: validation of a natural language processing program to assess prostate biopsy results.World J Urol. 2014 Feb;32(1):99-103. doi: 10.1007/s00345-013-1040-4. Epub 2013 Feb 17. World J Urol. 2014. PMID: 23417341
-
Defining Phenotypes from Clinical Data to Drive Genomic Research.Annu Rev Biomed Data Sci. 2018 Jul;1:69-92. doi: 10.1146/annurev-biodatasci-080917-013335. Epub 2018 Apr 25. Annu Rev Biomed Data Sci. 2018. PMID: 34109303 Free PMC article.
-
The Language of Delirium: Keywords for Identifying Delirium from Medical Records.J Gerontol Nurs. 2015 Aug;41(8):34-42. doi: 10.3928/00989134-20150723-01. J Gerontol Nurs. 2015. PMID: 26248142 Free PMC article.
-
Initial development of tools to identify child abuse and neglect in pediatric primary care.BMC Med Inform Decis Mak. 2023 Nov 17;23(1):266. doi: 10.1186/s12911-023-02361-7. BMC Med Inform Decis Mak. 2023. PMID: 37978498 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical