Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2024 Nov;167(6):1198-1212.
doi: 10.1053/j.gastro.2024.06.030. Epub 2024 Jul 5.

Validation of an Electronic Health Record-Based Machine Learning Model Compared With Clinical Risk Scores for Gastrointestinal Bleeding

Affiliations
Comparative Study

Validation of an Electronic Health Record-Based Machine Learning Model Compared With Clinical Risk Scores for Gastrointestinal Bleeding

Dennis L Shung et al. Gastroenterology. 2024 Nov.

Abstract

Background & aims: Guidelines recommend use of risk stratification scores for patients presenting with gastrointestinal bleeding (GIB) to identify very-low-risk patients eligible for discharge from emergency departments. Machine learning models may outperform existing scores and can be integrated within the electronic health record (EHR) to provide real-time risk assessment without manual data entry. We present the first EHR-based machine learning model for GIB.

Methods: The training cohort comprised 2546 patients and internal validation of 850 patients presenting with overt GIB (ie, hematemesis, melena, and hematochezia) to emergency departments of 2 hospitals from 2014 to 2019. External validation was performed on 926 patients presenting to a different hospital with the same EHR from 2014 to 2019. The primary outcome was a composite of red blood cell transfusion, hemostatic intervention (ie, endoscopic, interventional radiologic, or surgical), and 30-day all-cause mortality. We used structured data fields in the EHR, available within 4 hours of presentation, and compared the performance of machine learning models with current guideline-recommended risk scores, Glasgow-Blatchford Score, and Oakland Score. Primary analysis was area under the receiver operating characteristic curve. Secondary analysis was specificity at 99% sensitivity to assess the proportion of patients correctly identified as very low risk.

Results: The machine learning model outperformed the Glasgow-Blatchford Score (area under the receiver operating characteristic curve, 0.92 vs 0.89; P < .001) and Oakland Score (area under the receiver operating characteristic curve, 0.92 vs 0.89; P < .001). At the very-low-risk threshold of 99% sensitivity, the machine learning model identified more very-low-risk patients: 37.9% vs 18.5% for Glasgow-Blatchford Score and 11.7% for Oakland Score (P < .001 for both comparisons).

Conclusions: An EHR-based machine learning model performs better than currently recommended clinical risk scores and identifies more very-low-risk patients eligible for discharge from the emergency department.

Keywords: Electronic Health Record; Gastrointestinal Hemorrhage; Machine Learning.

PubMed Disclaimer

Figures

Figure A4.1:
Figure A4.1:
Calibration of conditional probabilities predicted by the machine learning model.
Figure A4.2:
Figure A4.2:
Calibration of conditional probabilities predicted by the Glasgow-Blatchford Score.
Figure A4.3:
Figure A4.3:
Calibration of conditional probabilities predicted by the Oakland score.
Figure 9.1:
Figure 9.1:
Superimposed histograms displaying data distributions of 9 variables with differences that were significant by P<0.05 between the false positives and true negatives across the base variable characteristics reported above.
Figure 1:
Figure 1:
Artificial Intelligence-Clinical Decision Support System – an Interactive Interpretable Machine Learning Dashboard. The inputs on the left show demographics, vital signs, and other input categories such as laboratory values, medications, and past medical history. The Top pane displays a patient that is not very low risk (probability is above the pre-set threshold for the high sensitivity 99%) and Bottom pane a very-low-risk patient. This is defined by the model probability output less than 6.3%, which is the output that corresponds to the 99% sensitivity threshold for very low risk patients. The top three variables that affect risk are displayed in graphical form, with how the risk changes across the different thresholds of the variable (e.g. as hemoglobin increases, risk of hospital-based intervention decreases)
Figure 2:
Figure 2:
Flow diagram for the selected machine learning model (top left), deep learning model (top right), GBS (bottom left), and Oakland score (bottom right) with the respective predicted classifications, false positive, false negative, true positive, and true negative rates.

References

    1. Peery AF, Crockett SD, Murphy CC, et al. Burden and Cost of Gastrointestinal, Liver, and Pancreatic Diseases in the United States: Update 2021. Gastroenterology 2022;162:621–644. - PMC - PubMed
    1. Laine L Upper Gastrointestinal Bleeding Due to a Peptic Ulcer. N Engl J Med 2016;375:1198. - PubMed
    1. Gralnek IM, Neeman Z, Strate LL. Acute Lower Gastrointestinal Bleeding. New England Journal of Medicine 2017;376:1054–1063. - PubMed
    1. Laine L, Barkun AN, Saltzman JR, et al. ACG Clinical Guideline: Upper Gastrointestinal and Ulcer Bleeding. Am J Gastroenterol 2021;116:899–917. - PubMed
    1. Gralnek IM, Stanley AJ, Morris AJ, et al. Endoscopic diagnosis and management of nonvariceal upper gastrointestinal hemorrhage (NVUGIH): European Society of Gastrointestinal Endoscopy (ESGE) Guideline - Update 2021. Endoscopy 2021;53:300–332. - PubMed