Reproducibility of Deep Learning Algorithms Developed for Medical Imaging Analysis: A Systematic Review
- PMID: 37407841
- PMCID: PMC10501962
- DOI: 10.1007/s10278-023-00870-5
Abstract
Since 2000, there have been more than 8000 publications on radiology artificial intelligence (AI). AI breakthroughs allow complex tasks to be automated and even performed beyond human capabilities. However, a lack of detail about methods and algorithm code undercuts the scientific value of this work. Many scientific subfields have recently faced a reproducibility crisis, which has eroded trust in processes and results and contributed to a rise in retractions of scientific papers. Research in deep learning (DL) requires reproducibility for the same reasons. Although several valuable manuscript checklists for AI in medical imaging exist, they do not focus specifically on reproducibility. In this study, we conducted a systematic review of recently published DL papers to evaluate whether the description of their methodology would allow their findings to be reproduced. We focused on the Journal of Digital Imaging (JDI), a specialized journal that publishes papers on AI and medical imaging. We searched with the keyword "Deep Learning" and collected articles published between January 2020 and January 2022. We screened all articles and included those that reported the development of a DL tool in medical imaging. From each included article, we extracted the reported details about the dataset, data handling steps, data splitting, model details, and performance metrics. We found 148 articles, of which 80 were included after screening. Five studies made their code publicly available, and 35 used publicly available datasets. We provide figures showing the ratio and absolute count of reported items across the included studies. According to our cross-sectional study, authors of JDI publications on DL in medical imaging infrequently report the key elements needed to make their studies reproducible.
Keywords: Artificial intelligence; Deep learning; Machine learning; Medical imaging; Reproducibility.
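The tallying step described in the abstract (the ratio and absolute count of reported items across included studies) might be sketched as follows. This is a minimal illustration, not the authors' code: the item names are hypothetical labels taken from the categories listed in the abstract, and the records are invented toy data, not results from the review.

```python
from collections import Counter

# Hypothetical item labels, named after the categories listed in the abstract.
ITEMS = [
    "dataset",
    "data_handling",
    "data_splitting",
    "model_details",
    "performance_metrics",
]


def tally_reported(studies):
    """Return {item: (count, ratio)} for how many studies reported each item."""
    counts = Counter()
    for study in studies:
        for item in ITEMS:
            # A missing key is treated as "not reported".
            if study.get(item, False):
                counts[item] += 1
    n = len(studies)
    return {
        item: (counts[item], counts[item] / n if n else 0.0)
        for item in ITEMS
    }


# Toy records invented for illustration only; not data from the review.
toy_studies = [
    {"dataset": True, "data_handling": True, "data_splitting": False,
     "model_details": True, "performance_metrics": True},
    {"dataset": True, "data_handling": False, "data_splitting": False,
     "model_details": True, "performance_metrics": True},
    {"dataset": False, "data_handling": False, "data_splitting": True,
     "model_details": False, "performance_metrics": True},
    {"dataset": True, "data_handling": True, "data_splitting": False,
     "model_details": True, "performance_metrics": True},
]

summary = tally_reported(toy_studies)
for item, (count, ratio) in summary.items():
    print(f"{item}: {count}/{len(toy_studies)} ({ratio:.0%})")
```

Storing one boolean per item per study keeps the extraction sheet easy to audit, and the same table can feed both the absolute-count and the ratio view of the results.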
© 2023. The Author(s) under exclusive licence to Society for Imaging Informatics in Medicine.
Conflict of interest statement
The authors declare no competing interests.