The Low Rate of Adherence to Checklist for Artificial Intelligence in Medical Imaging Criteria Among Published Prostate MRI Artificial Intelligence Algorithms
- PMID: 35922018
- PMCID: PMC9887098
- DOI: 10.1016/j.jacr.2022.05.022
The Low Rate of Adherence to Checklist for Artificial Intelligence in Medical Imaging Criteria Among Published Prostate MRI Artificial Intelligence Algorithms
Abstract
Objective: To determine the rigor, generalizability, and reproducibility of published classification and detection artificial intelligence (AI) models for prostate cancer (PCa) on MRI using the Checklist for Artificial Intelligence in Medical Imaging (CLAIM) guidelines, a 42-item checklist that is considered a measure of best practice for presenting and reviewing medical imaging AI research.
Materials and methods: This review searched English literature for studies proposing PCa AI detection and classification models on MRI. Each study was evaluated with the CLAIM checklist. The additional outcomes for which data were sought included measures of AI model performance (eg, area under the curve [AUC], sensitivity, specificity, free-response operating characteristic curves), training and validation and testing group sample size, AI approach, detection versus classification AI, public data set utilization, MRI sequences used, and definition of gold standard for ground truth. The percentage of CLAIM checklist fulfillment was used to stratify studies into quartiles. Wilcoxon's rank-sum test was used for pair-wise comparisons.
Results: In all, 75 studies were identified, and 53 studies qualified for analysis. The original CLAIM items that most studies did not fulfill includes item 12 (77% no): de-identification methods; item 13 (68% no): handling missing data; item 15 (47% no): rationale for choosing ground truth reference standard; item 18 (55% no): measurements of inter- and intrareader variability; item 31 (60% no): inclusion of validated interpretability maps; item 37 (92% no): inclusion of failure analysis to elucidate AI model weaknesses. An AUC score versus percentage CLAIM fulfillment quartile revealed a significant difference of the mean AUC scores between quartile 1 versus quartile 2 (0.78 versus 0.86, P = .034) and quartile 1 versus quartile 4 (0.78 versus 0.89, P = .003) scores. Based on additional information and outcome metrics gathered in this study, additional measures of best practice are defined. These new items include disclosure of public dataset usage, ground truth definition in comparison to other referenced works in the defined task, and sample size power calculation.
Conclusion: A large proportion of AI studies do not fulfill key items in CLAIM guidelines within their methods and results sections. The percentage of CLAIM checklist fulfillment is weakly associated with improved AI model performance. Additions or supplementations to CLAIM are recommended to improve publishing standards and aid reviewers in determining study rigor.
Keywords: AI; CLAIM; classification; detection; prostate cancer; study rigor.
Published by Elsevier Inc.
Conflict of interest statement
Conflict of Interest
The authors declare no conflict of interest.
Figures





Similar articles
-
Reporting Quality of Research Studies on AI Applications in Medical Images According to the CLAIM Guidelines in a Radiology Journal With a Strong Prominence in Asia.Korean J Radiol. 2023 Dec;24(12):1179-1189. doi: 10.3348/kjr.2023.1027. Korean J Radiol. 2023. PMID: 38016678 Free PMC article. Review.
-
Adherence to the Checklist for Artificial Intelligence in Medical Imaging (CLAIM): an umbrella review with a comprehensive two-level analysis.Diagn Interv Radiol. 2025 Feb 10. doi: 10.4274/dir.2025.243182. Online ahead of print. Diagn Interv Radiol. 2025. PMID: 39937033
-
Artificial Intelligence in Magnetic Resonance Imaging-based Prostate Cancer Diagnosis: Where Do We Stand in 2021?Eur Urol Focus. 2022 Mar;8(2):409-417. doi: 10.1016/j.euf.2021.03.020. Epub 2021 Mar 25. Eur Urol Focus. 2022. PMID: 33773964 Review.
-
Assessment of artificial intelligence (AI) reporting methodology in glioma MRI studies using the Checklist for AI in Medical Imaging (CLAIM).Neuroradiology. 2023 May;65(5):907-913. doi: 10.1007/s00234-023-03126-9. Epub 2023 Feb 7. Neuroradiology. 2023. PMID: 36746792 Free PMC article.
-
Checklist for Artificial Intelligence in Medical Imaging Reporting Adherence in Peer-Reviewed and Preprint Manuscripts With the Highest Altmetric Attention Scores: A Meta-Research Study.Can Assoc Radiol J. 2023 May;74(2):334-342. doi: 10.1177/08465371221134056. Epub 2022 Oct 27. Can Assoc Radiol J. 2023. PMID: 36301600
Cited by
-
Reporting Quality of Research Studies on AI Applications in Medical Images According to the CLAIM Guidelines in a Radiology Journal With a Strong Prominence in Asia.Korean J Radiol. 2023 Dec;24(12):1179-1189. doi: 10.3348/kjr.2023.1027. Korean J Radiol. 2023. PMID: 38016678 Free PMC article. Review.
-
Deep learning algorithm performance in contouring head and neck organs at risk: a systematic review and single-arm meta-analysis.Biomed Eng Online. 2023 Nov 1;22(1):104. doi: 10.1186/s12938-023-01159-y. Biomed Eng Online. 2023. PMID: 37915046 Free PMC article.
-
The Evidence for Using Artificial Intelligence to Enhance Prostate Cancer MR Imaging.Curr Oncol Rep. 2023 Apr;25(4):243-250. doi: 10.1007/s11912-023-01371-y. Epub 2023 Feb 7. Curr Oncol Rep. 2023. PMID: 36749494 Review.
-
Artificial Intelligence Reporting Guidelines' Adherence in Nephrology for Improved Research and Clinical Outcomes.Biomedicines. 2024 Mar 7;12(3):606. doi: 10.3390/biomedicines12030606. Biomedicines. 2024. PMID: 38540219 Free PMC article. Review.
-
Mixed Supervision of Histopathology Improves Prostate Cancer Classification From MRI.IEEE Trans Med Imaging. 2024 Jul;43(7):2610-2622. doi: 10.1109/TMI.2024.3382909. Epub 2024 Jul 1. IEEE Trans Med Imaging. 2024. PMID: 38547000 Free PMC article.
References
-
- Key Statistics for Prostate Cancer | Prostate Cancer Facts [Internet]. Available from: https://www.cancer.org/cancer/prostate-cancer/about/key-statistics.html
-
- Wildeboer RR, van Sloun RJG, Wijkstra H, Mischi M. Artificial intelligence in multiparametric prostate cancer imaging with focus on deep-learning methods. Comput Methods Programs Biomed. 2020. Jun;189:105316. - PubMed
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources