Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Nov;6(6):e230296.
doi: 10.1148/ryai.230296.

Precise Image-level Localization of Intracranial Hemorrhage on Head CT Scans with Deep Learning Models Trained on Study-level Labels

Affiliations

Precise Image-level Localization of Intracranial Hemorrhage on Head CT Scans with Deep Learning Models Trained on Study-level Labels

Yunan Wu et al. Radiol Artif Intell. 2024 Nov.

Abstract

Purpose To develop a highly generalizable weakly supervised model to automatically detect and localize image-level intracranial hemorrhage (ICH) by using study-level labels. Materials and Methods In this retrospective study, the proposed model was pretrained on the image-level Radiological Society of North America dataset and fine-tuned on a local dataset by using attention-based bidirectional long short-term memory networks. This local training dataset included 10 699 noncontrast head CT scans in 7469 patients, with ICH study-level labels extracted from radiology reports. Model performance was compared with that of two senior neuroradiologists on 100 random test scans using the McNemar test, and its generalizability was evaluated on an external independent dataset. Results The model achieved a positive predictive value (PPV) of 85.7% (95% CI: 84.0, 87.4) and an area under the receiver operating characteristic curve of 0.96 (95% CI: 0.96, 0.97) on the held-out local test set (n = 7243, 3721 female) and 89.3% (95% CI: 87.8, 90.7) and 0.96 (95% CI: 0.96, 0.97), respectively, on the external test set (n = 491, 178 female). For 100 randomly selected samples, the model achieved performance on par with two neuroradiologists, but with a significantly faster (P < .05) diagnostic time of 5.04 seconds per scan (vs 86 seconds and 22.2 seconds for the two neuroradiologists, respectively). The model's attention weights and heatmaps visually aligned with neuroradiologists' interpretations. Conclusion The proposed model demonstrated high generalizability and high PPVs, offering a valuable tool for expedited ICH detection and prioritization while reducing false-positive interruptions in radiologists' workflows. Keywords: Computer-Aided Diagnosis (CAD), Brain/Brain Stem, Hemorrhage, Convolutional Neural Network (CNN), Transfer Learning Supplemental material is available for this article. © RSNA, 2024 See also the commentary by Akinci D'Antonoli and Rudie in this issue.

Keywords: Brain/Brain Stem; Computer-Aided Diagnosis (CAD); Convolutional Neural Network (CNN); Hemorrhage; Transfer Learning.

PubMed Disclaimer

Conflict of interest statement

Disclosures of conflicts of interest: Y.W. U.S. patent application 18/315,813, filed November 16, 2023, for “Precise slice-level localization of intracranial hemorrhage on head CTs with networks trained on scan-level labels.” M.I. No relevant relationships. S.B. No relevant relationships. J.Z. No relevant relationships. D.R.C. No relevant relationships. E.J.T. No relevant relationships. N.S. No relevant relationships. A.M.N. Support for the present article from a National Institutes of Health R01 NS119772 grant to author’s institution; royalties from Handbook of Neurocritical Care from Cambridge University Press (unrelated); payment or honoraria from Society of Critical Care Medicine for board review course; patent applied for “Detection of intracerebral hemorrhage with deep learning” by author’s institution (not issued); blind trust, stocks managed by a third party per spouse conflict of interest work policy. M.D. No relevant relationships. S.T.H. No relevant relationships. K.M.P. No relevant relationships. T.A.H. No relevant relationships. E.J.R. Payment for expert testimony for 10 cases related to medical malpractice allegations, none related to NM or NU/FSM; consultant to the editor for Radiology. S.L. No relevant relationships. A.A. No relevant relationships. T.B.P. U.S. patent application 18/315,813, filed November 16, 2023, for “Precise slice-level localization of intracranial hemorrhage on head CTs with networks trained on scan-level labels.” A.K.K. U.S. patent application 18/315,813, filed November 16, 2023, for “Precise slice-level localization of intracranial hemorrhage on head CTs with networks trained on scan-level labels.” V.B.H. Dixon Translational Research Grant, Northwestern University, in 2020 (funded the beginning of this study); Northwestern University has a preliminary patent for this work; member of the Radiological Society of North America (RSNA), the American Society of Neuroradiology (ASNR), and the American Society of Functional Neuroradiology; on the AI Committee for ASNR and the RadioGraphics Review Panel for RSNA; reviewer for multiple journals; potential research relationship with Neosoma, not related to this study; business dinner with the founder of Neosoma and a colleague who is also involved in the Neosoma project; U.S. patent application 18/315,813, filed November 16, 2023, for “Precise slice-level localization of intracranial hemorrhage on head CTs with networks trained on scan-level labels.”

Figures

None
Graphical abstract
Flowchart for all patients included in this study and the breakdown of pretraining, training, validation, and held-out and external test datasets. ICH = intracranial hemorrhage, NLP = natural language processing, RSNA = Radiological Society of North America.
Figure 1:
Flowchart for all patients included in this study and the breakdown of pretraining, training, validation, and held-out and external test datasets. ICH = intracranial hemorrhage, NLP = natural language processing, RSNA = Radiological Society of North America.
The end-to-end deep learning workflow of the model. Stage I pretrained the EfficientNet-B2 network on the RSNA dataset at the image level. Stage II utilized a bidirectional LSTM and an attention module to train the model on our institution’s dataset at the study level. Finally, in stage III, the complete model was fine-tuned into an end-to-end model on our institution’s dataset. Fully-conn represents a fully connected layer and Leakyrelu represents the Leaky Rectified Linear Unit activation function. ICH = intracranial hemorrhage, LSTM = long short-term memory, RSNA = Radiological Society of North America.
Figure 2:
The end-to-end deep learning workflow of the model. Stage I pretrained the EfficientNet-B2 network on the RSNA dataset at the image level. Stage II utilized a bidirectional LSTM and an attention module to train the model on our institution’s dataset at the study level. Finally, in stage III, the complete model was fine-tuned into an end-to-end model on our institution’s dataset. Fully-conn represents a fully connected layer and Leakyrelu represents the Leaky Rectified Linear Unit activation function. ICH = intracranial hemorrhage, LSTM = long short-term memory, RSNA = Radiological Society of North America.
Gradient-weighted class activation map (Grad-CAM)–generated heatmaps to highlight important features for predicting intracranial hemorrhage (ICH) and to visualize the possible regions of ICH. In each figure part, the raw CT image is on the left, and the heatmap generated by Grad-CAM is on the right. (A, C) Examples from our institution’s test set, not annotated by radiologists. The color map (right) indicates possible ICH regions. (B, D) Examples from our institution’s test set, manually annotated by radiologists, where the red mask (left) represents the ICH, and the color map (right) indicates possible ICH regions.
Figure 3:
Gradient-weighted class activation map (Grad-CAM)–generated heatmaps to highlight important features for predicting intracranial hemorrhage (ICH) and to visualize the possible regions of ICH. In each figure part, the raw CT image is on the left, and the heatmap generated by Grad-CAM is on the right. (A, C) Examples from our institution’s test set, not annotated by radiologists. The color map (right) indicates possible ICH regions. (B, D) Examples from our institution’s test set, manually annotated by radiologists, where the red mask (left) represents the ICH, and the color map (right) indicates possible ICH regions.
Illustration of the interpretability of our weakly supervised model for localizing ICH at the image and pixel level. The model was trained at the study level, but the attention weights indicate the probability, from 0 to 1, of each image showing ICH (column 5). The model generated heatmaps on predicted ICH images (designated by red arrows in column 5) to localize specific ICH regions (shown in column 4). The heatmaps were converted to a pixel-level mask and quantified by Dice scores (column 3), which were computed by comparison with the reference standard segments (column 2) manually annotated by radiologists (column 1). Grad-CAM = gradient-weighted class activation map, ICH = intracranial hemorrhage.
Figure 4:
Illustration of the interpretability of our weakly supervised model for localizing ICH at the image and pixel level. The model was trained at the study level, but the attention weights indicate the probability, from 0 to 1, of each image showing ICH (column 5). The model generated heatmaps on predicted ICH images (designated by red arrows in column 5) to localize specific ICH regions (shown in column 4). The heatmaps were converted to a pixel-level mask and quantified by Dice scores (column 3), which were computed by comparison with the reference standard segments (column 2) manually annotated by radiologists (column 1). Grad-CAM = gradient-weighted class activation map, ICH = intracranial hemorrhage.
(A–C) Confusion matrix plots for comparisons between the reference standard and predictions or diagnoses from (A) the proposed model, (B) neuroradiologist 1 (Neurorad-1), and (C) neuroradiologist 2 (Neurorad-2). (D) Bar chart demonstrates the percentage of agreement between the model and each neuroradiologist, as well as between the two neuroradiologists. ICH = intracranial hemorrhage.
Figure 5:
(A–C) Confusion matrix plots for comparisons between the reference standard and predictions or diagnoses from (A) the proposed model, (B) neuroradiologist 1 (Neurorad-1), and (C) neuroradiologist 2 (Neurorad-2). (D) Bar chart demonstrates the percentage of agreement between the model and each neuroradiologist, as well as between the two neuroradiologists. ICH = intracranial hemorrhage.

Comment in

References

    1. Caceres JA , Goldstein JN . Intracranial hemorrhage . Emerg Med Clin North Am 2012. ; 30 ( 3 ): 771 – 794 . - PMC - PubMed
    1. Aguilar MI , Brott TG . Update in intracerebral hemorrhage . Neurohospitalist 2011. ; 1 ( 3 ): 148 – 159 . - PMC - PubMed
    1. Brady AP . Error and discrepancy in radiology: inevitable or avoidable? Insights Imaging 2017. ; 8 ( 1 ): 171 – 182 . - PMC - PubMed
    1. RSNA Intracranial Hemorrhage Detection . https://kaggle.com/competitions/rsna-intracranial-hemorrhage-detection. Accessed October 26, 2021 .
    1. Salehinejad H , Kitamura J , Ditkofsky N , et al. . A real-world demonstration of machine learning generalizability in the detection of intracranial hemorrhage on head computerized tomography . Sci Rep 2021. ; 11 ( 1 ): 17051 . - PMC - PubMed

Publication types