Deep Learning Pitfall: Impact of Novel Ultrasound Equipment Introduction on Algorithm Performance and the Realities of Domain Adaptation
- PMID: 34133034
- DOI: 10.1002/jum.15765
Abstract
Objectives: To test how introducing novel ultrasound equipment into a clinical setting affects deep learning (DL) algorithm performance.
Methods: Researchers introduced prospectively obtained inferior vena cava (IVC) videos from a similar patient population, acquired with novel ultrasound equipment, to challenge a previously validated DL algorithm for assessing IVC collapse (trained on a common point of care ultrasound [POCUS] machine). Twenty-one new videos were obtained for each novel ultrasound machine. The videos were analyzed for complete collapse by the algorithm and by 2 blinded POCUS experts. Cohen's kappa was calculated for agreement between the 2 POCUS experts and the DL algorithm. Previous testing showed substantial agreement between algorithm and experts, with Cohen's kappa of 0.78 (95% CI 0.49-1.0) and 0.66 (95% CI 0.31-1.0) on new patient data using the same ultrasound equipment.
Results: Challenged with higher image quality (IQ) POCUS cart ultrasound videos, algorithm performance declined, with kappa values of 0.31 (95% CI 0.19-0.81) and 0.39 (95% CI 0.11-0.89), showing fair agreement. Algorithm performance plummeted on a lower-IQ smartphone device, with kappa values of -0.09 (95% CI -0.95 to 0.76) and 0.09 (95% CI -0.65 to 0.82), showing less agreement than would be expected by chance. The 2 POCUS experts had near-perfect agreement regarding IVC collapse, with a kappa value of 0.88 (95% CI 0.64-1.0).
Conclusions: Performance of this previously validated DL algorithm worsened when faced with ultrasound studies from 2 novel ultrasound machines. Performance was much worse on images from a lower IQ hand-held device than from a superior cart-based device.
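The agreement statistic used throughout the abstract can be illustrated with a minimal sketch. The code below is not the authors' analysis: it assumes binary per-video labels (1 = complete collapse, 0 = no complete collapse), uses made-up example ratings, and the function names cohens_kappa and landis_koch_label are hypothetical. The verbal bands follow the commonly used Landis and Koch scale, which matches the abstract's wording (fair, substantial, near perfect), though the abstract does not say which scale was applied.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("raters must label the same non-empty set of items")
    n = len(rater_a)
    # Observed agreement: fraction of items given the same label.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of each rater's marginal label frequencies.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    labels = set(counts_a) | set(counts_b)
    expected = sum(counts_a[c] * counts_b[c] for c in labels) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when both raters use one label")
    return (observed - expected) / (1 - expected)


def landis_koch_label(kappa):
    """Map a kappa value to the Landis and Koch verbal band (assumed scale)."""
    if kappa < 0:
        return "poor (less than chance)"
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


# Hypothetical ratings for 8 videos (not data from the study).
expert = [1, 1, 1, 1, 0, 0, 0, 0]
algorithm = [1, 1, 1, 0, 0, 0, 0, 1]

kappa = cohens_kappa(expert, algorithm)
print(kappa, landis_koch_label(kappa))  # prints: 0.5 moderate
```

In this toy example the raters agree on 6 of 8 videos (0.75) while chance agreement is 0.5, so kappa is (0.75 - 0.5) / (1 - 0.5) = 0.5. With only 21 videos per machine, as in the study, confidence intervals around such estimates are wide, which is why the reported intervals span several bands.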
Keywords: artificial intelligence; deep learning; domain shift; inferior vena cava; pediatrics; point of care ultrasound.
© 2021 American Institute of Ultrasound in Medicine.