. 2024 Feb;34(2):810-822.

doi: 10.1007/s00330-023-10074-8. Epub 2023 Aug 22.

Effects of a comprehensive brain computed tomography deep learning model on radiologist detection accuracy

Quinlan D Buchlak^{1

2

3}, Cyril H M Tang⁴, Jarrel C Y Seah^{4

5}, Andrew Johnson⁴, Xavier Holt⁴, Georgina M Bottrell⁴, Jeffrey B Wardman⁴, Gihan Samarasinghe⁴, Leonardo Dos Santos Pinheiro⁴, Hongze Xia⁴, Hassan K Ahmad⁴, Hung Pham^{4

6}, Jason I Chiang^{4

7

8}, Nalan Ektas⁴, Michael R Milne⁴, Christopher H Y Chiu⁴, Ben Hachey⁴, Melissa K Ryan⁴, Benjamin P Johnston⁴, Nazanin Esmaili^{9

10}, Christine Bennett⁹, Tony Goldschlager^{11

12}, Jonathan Hall^{4

13

14}, Duc Tan Vo⁶, Lauren Oakden-Rayner¹⁵, Jean-Christophe Leveque¹⁶, Farrokh Farrokhi¹⁶, Richard G Abramson⁴, Catherine M Jones^{4

17

18

19}, Simon Edelstein^{4

17

20}, Peter Brotchie^{4

13}

Affiliations

¹ Annalise.ai, Sydney, NSW, Australia. quinlan.buchlak1@my.nd.edu.au.
² School of Medicine, University of Notre Dame Australia, Sydney, NSW, Australia. quinlan.buchlak1@my.nd.edu.au.
³ Department of Neurosurgery, Monash Health, Clayton, VIC, Australia. quinlan.buchlak1@my.nd.edu.au.
⁴ Annalise.ai, Sydney, NSW, Australia.
⁵ Department of Radiology, Alfred Health, Melbourne, VIC, Australia.
⁶ Department of Radiology, University Medical Center, University of Medicine and Pharmacy, Ho Chi Minh City, Vietnam.
⁷ Department of General Practice, University of Melbourne, Melbourne, VIC, Australia.
⁸ Westmead Applied Research Centre, University of Sydney, Sydney, NSW, Australia.
⁹ School of Medicine, University of Notre Dame Australia, Sydney, NSW, Australia.
¹⁰ Faculty of Engineering and Information Technology, University of Technology Sydney, Sydney, NSW, Australia.
¹¹ Department of Neurosurgery, Monash Health, Clayton, VIC, Australia.
¹² Department of Surgery, Monash University, Clayton, VIC, Australia.
¹³ Department of Radiology, St Vincent's Health Australia, Melbourne, VIC, Australia.
¹⁴ Department of Radiology, Austin Hospital, Melbourne, VIC, Australia.
¹⁵ Australian Institute for Machine Learning, The University of Adelaide, Adelaide, SA, Australia.
¹⁶ Center for Neurosciences and Spine, Virginia Mason Franciscan Health, Seattle, WA, USA.
¹⁷ I-MED Radiology Network, Brisbane, QLD, Australia.
¹⁸ School of Public and Preventive Health, Monash University, Clayton, VIC, Australia.
¹⁹ Department of Clinical Imaging Science, University of Sydney, Sydney, NSW, Australia.
²⁰ Department of Radiology, Monash Health, Clayton, VIC, Australia.

PMID: 37606663
PMCID: PMC10853361
DOI: 10.1007/s00330-023-10074-8

Effects of a comprehensive brain computed tomography deep learning model on radiologist detection accuracy

Quinlan D Buchlak et al. Eur Radiol. 2024 Feb.

. 2024 Feb;34(2):810-822.

doi: 10.1007/s00330-023-10074-8. Epub 2023 Aug 22.

Authors

Affiliations

¹ Annalise.ai, Sydney, NSW, Australia. quinlan.buchlak1@my.nd.edu.au.
² School of Medicine, University of Notre Dame Australia, Sydney, NSW, Australia. quinlan.buchlak1@my.nd.edu.au.
³ Department of Neurosurgery, Monash Health, Clayton, VIC, Australia. quinlan.buchlak1@my.nd.edu.au.
⁴ Annalise.ai, Sydney, NSW, Australia.
⁵ Department of Radiology, Alfred Health, Melbourne, VIC, Australia.
⁶ Department of Radiology, University Medical Center, University of Medicine and Pharmacy, Ho Chi Minh City, Vietnam.
⁷ Department of General Practice, University of Melbourne, Melbourne, VIC, Australia.
⁸ Westmead Applied Research Centre, University of Sydney, Sydney, NSW, Australia.
⁹ School of Medicine, University of Notre Dame Australia, Sydney, NSW, Australia.
¹⁰ Faculty of Engineering and Information Technology, University of Technology Sydney, Sydney, NSW, Australia.
¹¹ Department of Neurosurgery, Monash Health, Clayton, VIC, Australia.
¹² Department of Surgery, Monash University, Clayton, VIC, Australia.
¹³ Department of Radiology, St Vincent's Health Australia, Melbourne, VIC, Australia.
¹⁴ Department of Radiology, Austin Hospital, Melbourne, VIC, Australia.
¹⁵ Australian Institute for Machine Learning, The University of Adelaide, Adelaide, SA, Australia.
¹⁶ Center for Neurosciences and Spine, Virginia Mason Franciscan Health, Seattle, WA, USA.
¹⁷ I-MED Radiology Network, Brisbane, QLD, Australia.
¹⁸ School of Public and Preventive Health, Monash University, Clayton, VIC, Australia.
¹⁹ Department of Clinical Imaging Science, University of Sydney, Sydney, NSW, Australia.
²⁰ Department of Radiology, Monash Health, Clayton, VIC, Australia.

PMID: 37606663
PMCID: PMC10853361
DOI: 10.1007/s00330-023-10074-8

Abstract

Objectives: Non-contrast computed tomography of the brain (NCCTB) is commonly used to detect intracranial pathology but is subject to interpretation errors. Machine learning can augment clinical decision-making and improve NCCTB scan interpretation. This retrospective detection accuracy study assessed the performance of radiologists assisted by a deep learning model and compared the standalone performance of the model with that of unassisted radiologists.

Methods: A deep learning model was trained on 212,484 NCCTB scans drawn from a private radiology group in Australia. Scans from inpatient, outpatient, and emergency settings were included. Scan inclusion criteria were age ≥ 18 years and series slice thickness ≤ 1.5 mm. Thirty-two radiologists reviewed 2848 scans with and without the assistance of the deep learning system and rated their confidence in the presence of each finding using a 7-point scale. Differences in AUC and Matthews correlation coefficient (MCC) were calculated using a ground-truth gold standard.

Results: The model demonstrated an average area under the receiver operating characteristic curve (AUC) of 0.93 across 144 NCCTB findings and significantly improved radiologist interpretation performance. Assisted and unassisted radiologists demonstrated an average AUC of 0.79 and 0.73 across 22 grouped parent findings and 0.72 and 0.68 across 189 child findings, respectively. When assisted by the model, radiologist AUC was significantly improved for 91 findings (158 findings were non-inferior), and reading time was significantly reduced.

Conclusions: The assistance of a comprehensive deep learning model significantly improved radiologist detection accuracy across a wide range of clinical findings and demonstrated the potential to improve NCCTB interpretation.

Clinical relevance statement: This study evaluated a comprehensive CT brain deep learning model, which performed strongly, improved the performance of radiologists, and reduced interpretation time. The model may reduce errors, improve efficiency, facilitate triage, and better enable the delivery of timely patient care.

Key points: • This study demonstrated that the use of a comprehensive deep learning system assisted radiologists in the detection of a wide range of abnormalities on non-contrast brain computed tomography scans. • The deep learning model demonstrated an average area under the receiver operating characteristic curve of 0.93 across 144 findings and significantly improved radiologist interpretation performance. • The assistance of the comprehensive deep learning model significantly reduced the time required for radiologists to interpret computed tomography scans of the brain.

Keywords: Artificial intelligence; Brain; Machine learning; Supervised machine learning; Tomography, x-ray computed.

PubMed Disclaimer

Conflict of interest statement

This study was funded by annalise.ai. QB, CT, JS, AJ, XH, GB, JW, GS, LDSP, HX, HA, HP, JC, NE, MM, CC, BH, MR, BJ, JH, CJ, SE, and PB were employed by or seconded to annalise.ai. CJ and SE were employed by I-MED. The authors listed above report personal fees from annalise.ai during the conduct of the study and personal fees from annalise.ai outside the submitted work. Remaining authors have no competing interests. The annalise.ai CTB deep learning model is available commercially.

Figures

**Fig. 1**
Change in AUC of parent findings when radiologists were assisted by the deep learning model. Mean AUCs of the model, unassisted, and assisted radiologists and change in (i.e. delta) AUC, along with adjusted 95% CIs, are shown for each parent finding. Findings were considered clinically significant where the lower limit of the 95% CI was greater than 0.05, and statistically significant where the lower limit of the 95% CI was greater than zero

**Fig. 2**
ROC curves for the parent findings demonstrating the performance of the model, and the mean performance of the assisted and unassisted radiologists

**Fig. 3**
Performance improvement using the deep learning model. Precision and recall (i.e. sensitivity) for the unassisted and assisted radiologists averaged across all findings, based on the chosen beta levels for each finding. Arrows indicate the shift in recall and precision of the radiologists when assisted by AI. On average, model assistance resulted in increased recall (sensitivity) with no decrement in precision

**Fig. 4**
A Non-contrast CT brain study of a 79-year-old female who presented with acute stroke symptoms. Subtle hypodensity in the right occipital lobe was missed by 30 of the 32 readers in the unassisted arm of the study, but detected by 26 readers when using the deep learning tool as an assistant. B Output of the model. The model accurately localized the large area of infarction within the right occipital lobe (purple shading). C DWI image clearly showing the area of acute infarction in this patient. D An example of small bilateral isodense subacute subdural haematomas. E The haematomas were characterized by the model as subacute subdural haematomas and localized with purple shading. F A CT scan performed 7 days later. The haematoma is more conspicuous on the later scan as it evolves to become hypodense. G Non-contrast CT brain study demonstrating a colloid cyst. H The same colloid cyst case along with an example of the model’s segmentation and high confidence

**Fig. 5**
Non-contrast CT brain study demonstrating an intraventricular haemorrhage, along with an example of the decision support system’s user interface

**Fig. 6**
A three-dimensional (3D) visualisation of a single case containing multiple clinical findings demonstrating the 3D functionality of the model. The findings predicted by the model are presented alongside the ground-truth

See this image and copyright information in PMC

Comment in

Ready for testing artificial intelligence in radiology clinical practice: We would do well to be in the front line leveraging their strengths but also highlighting today weaknesses.
Bender B. Bender B. Eur Radiol. 2024 Feb;34(2):808-809. doi: 10.1007/s00330-023-10240-y. Epub 2023 Sep 22. Eur Radiol. 2024. PMID: 37740089 Free PMC article. No abstract available.

References

1. Coles JP. Imaging after brain injury. Br J Anaesth. 2007;99:49–60. doi: 10.1093/bja/aem141. - DOI - PubMed
1. Powers WJ, Rabinstein AA, Ackerson T, et al. 2018 Guidelines for the Early Management of Patients With Acute Ischemic Stroke: A Guideline for Healthcare Professionals From the American Heart Association/American Stroke Association. Stroke. 2018;49:e46–e99. doi: 10.1161/STR.0000000000000158. - DOI - PubMed
1. Mettler FA, Mahesh M, Bhargavan-Chatfield M, et al. Patient Exposure from Radiologic and Nuclear Medicine Procedures in the United States: Procedure Volume and Effective Dose for the Period 2006–2016. Radiology. 2020;295:418–427. doi: 10.1148/radiol.2020192256. - DOI - PMC - PubMed
1. Donald JJ, Barnard SA. Common patterns in 558 diagnostic radiology errors. J Med Imaging Radiat Oncol. 2012;56:173–178. doi: 10.1111/j.1754-9485.2012.02348.x. - DOI - PubMed
1. Jordan MJ, Lightfoote JB, Jordan JE. Quality outcomes of reinterpretation of brain CT imaging studies by subspecialty experts in neuroradiology. J Natl Med Assoc. 2006;98:1326–1328. - PMC - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Effects of a comprehensive brain computed tomography deep learning model on radiologist detection accuracy

Affiliations

Effects of a comprehensive brain computed tomography deep learning model on radiologist detection accuracy

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Comment in

References

MeSH terms

LinkOut - more resources

Full Text Sources