Machine learning natural language processing for identifying venous thromboembolism: systematic review and meta-analysis
- PMID: 38522096
- PMCID: PMC11215191
- DOI: 10.1182/bloodadvances.2023012200
Machine learning natural language processing for identifying venous thromboembolism: systematic review and meta-analysis
Abstract
Venous thromboembolism (VTE) is a leading cause of preventable in-hospital mortality. Monitoring VTE cases is limited by the challenges of manual medical record review and diagnosis code interpretation. Natural language processing (NLP) can automate the process. Rule-based NLP methods are effective but time consuming. Machine learning (ML)-NLP methods present a promising solution. We conducted a systematic review and meta-analysis of studies published before May 2023 that use ML-NLP to identify VTE diagnoses in the electronic health records. Four reviewers screened all manuscripts, excluding studies that only used a rule-based method. A meta-analysis evaluated the pooled performance of each study's best performing model that evaluated for pulmonary embolism and/or deep vein thrombosis. Pooled sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) with confidence interval (CI) were calculated by DerSimonian and Laird method using a random-effects model. Study quality was assessed using an adapted TRIPOD (Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis) tool. Thirteen studies were included in the systematic review and 8 had data available for meta-analysis. Pooled sensitivity was 0.931 (95% CI, 0.881-0.962), specificity 0.984 (95% CI, 0.967-0.992), PPV 0.910 (95% CI, 0.865-0.941) and NPV 0.985 (95% CI, 0.977-0.990). All studies met at least 13 of the 21 NLP-modified TRIPOD items, demonstrating fair quality. The highest performing models used vectorization rather than bag-of-words and deep-learning techniques such as convolutional neural networks. There was significant heterogeneity in the studies, and only 4 validated their model on an external data set. Further standardization of ML studies can help progress this novel technology toward real-world implementation.
© 2024 by The American Society of Hematology. Licensed under Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0), permitting only noncommercial, nonderivative use with attribution. All other rights reserved.
Conflict of interest statement
Conflict-of-interest disclosure: R.P. reports consultancy with Merck. Outside current work, J.I.Z. reports prior research funding from Incyte and Quercegen; consultancy for Sanofi, CSL Behring, and Calyx. Outside current work, S.M. has served as an adviser for Janssen Pharmaceuticals and is the principal owner of Daboia Consulting LLC. S.M. has a patent application pending and is developing a licensing agreement with Superbio.ai for NLP software not featured in this work. T.C. reports consultancy for Takeda, outside of current work. The remaining authors declare no competing financial interests.
Figures




Similar articles
-
The use of natural language processing on pediatric diagnostic radiology reports in the electronic health record to identify deep venous thrombosis in children.J Thromb Thrombolysis. 2017 Oct;44(3):281-290. doi: 10.1007/s11239-017-1532-y. J Thromb Thrombolysis. 2017. PMID: 28815363
-
Natural Language Processing in a Clinical Decision Support System for the Identification of Venous Thromboembolism: Algorithm Development and Validation.J Med Internet Res. 2023 Apr 24;25:e43153. doi: 10.2196/43153. J Med Internet Res. 2023. PMID: 37093636 Free PMC article.
-
Natural Language Processing Performance for the Identification of Venous Thromboembolism in an Integrated Healthcare System.Clin Appl Thromb Hemost. 2021 Jan-Dec;27:10760296211013108. doi: 10.1177/10760296211013108. Clin Appl Thromb Hemost. 2021. PMID: 33906470 Free PMC article.
-
Deep Learning for Natural Language Processing in Radiology-Fundamentals and a Systematic Review.J Am Coll Radiol. 2020 May;17(5):639-648. doi: 10.1016/j.jacr.2019.12.026. Epub 2020 Jan 28. J Am Coll Radiol. 2020. PMID: 32004480
-
Machine Learning-Based Predictive Models for Patients with Venous Thromboembolism: A Systematic Review.Thromb Haemost. 2024 Nov;124(11):1040-1052. doi: 10.1055/a-2299-4758. Epub 2024 Apr 4. Thromb Haemost. 2024. PMID: 38574756
Cited by
-
Comparing efficiency of an attention-based deep learning network with contemporary radiological workflow for pulmonary embolism detection on CTPA: A retrospective study.Eur J Radiol Open. 2025 May 9;14:100657. doi: 10.1016/j.ejro.2025.100657. eCollection 2025 Jun. Eur J Radiol Open. 2025. PMID: 40469717 Free PMC article.
-
Development and Validation of VTE-BERT Natural Language Processing Model for Venous Thromboembolism.J Thromb Haemost. 2025 Aug 1:S1538-7836(25)00484-2. doi: 10.1016/j.jtha.2025.07.021. Online ahead of print. J Thromb Haemost. 2025. PMID: 40754035 Free PMC article.
-
Using a transformer language model to curate a pulmonary embolism dataset from the Medical Information Mart for Intensive Care IV: MIMIC-IV-Ext-PE.Res Pract Thromb Haemost. 2025 May 21;9(4):102896. doi: 10.1016/j.rpth.2025.102896. eCollection 2025 May. Res Pract Thromb Haemost. 2025. PMID: 40606764 Free PMC article.
-
Large language models for chart review: how machine learning can accelerate hematology research.Blood Vessel Thromb Hemost. 2025 Jan 15;2(1):100052. doi: 10.1016/j.bvth.2025.100052. eCollection 2025 Feb. Blood Vessel Thromb Hemost. 2025. PMID: 40766872 Free PMC article. No abstract available.
-
Artificial intelligence in thrombosis: transformative potential and emerging challenges.Thromb J. 2025 Jan 16;23(1):2. doi: 10.1186/s12959-025-00690-3. Thromb J. 2025. PMID: 39825337 Free PMC article. Review.
References
-
- Cohen AT, Tapson VF, Bergmann JF, et al. Venous thromboembolism risk and prophylaxis in the acute hospital care setting (ENDORSE study): a multinational cross-sectional study. Lancet. 2008;371(9610):387–394. - PubMed
-
- Fanikos J, Piazza G, Zayaruzny M, Goldhaber SZ. Long-term complications of medical patients with hospital-acquired venous thromboembolism. Thromb Haemost. 2009;102(4):688–693. - PubMed
-
- Henke PK, Kahn SR, Pannucci CJ, et al. Call to action to prevent venous thromboembolism in hospitalized patients: a policy statement from the American Heart Association. Circulation. 2020;141(24):e914–e931. - PubMed
-
- Agency of Healthcare Research and Quality Chapter 4. Choose the Model to Assess VTE and Bleeding Risk. https://www.ahrq.gov/patient-safety/settings/hospital/vtguide/guide4.html
-
- The Joint Commission Venous Thromboembolism. https://www.jointcommission.org/measurement/measures/venous-thromboembol...
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous