Methods for Clinical Evaluation of Artificial Intelligence Algorithms for Medical Diagnosis
- PMID: 36346314
- DOI: 10.1148/radiol.220182
Methods for Clinical Evaluation of Artificial Intelligence Algorithms for Medical Diagnosis
Abstract
Adequate clinical evaluation of artificial intelligence (AI) algorithms before adoption in practice is critical. Clinical evaluation aims to confirm acceptable AI performance through adequate external testing and confirm the benefits of AI-assisted care compared with conventional care through appropriately designed and conducted studies, for which prospective studies are desirable. This article explains some of the fundamental methodological points that should be considered when designing and appraising the clinical evaluation of AI algorithms for medical diagnosis. The specific topics addressed include the following: (a) the importance of external testing of AI algorithms and strategies for conducting the external testing effectively, (b) the various metrics and graphical methods for evaluating the AI performance as well as essential methodological points to note in using and interpreting them, (c) paired study designs primarily for comparative performance evaluation of conventional and AI-assisted diagnoses, (d) parallel study designs primarily for evaluating the effect of AI intervention with an emphasis on randomized clinical trials, and (e) up-to-date guidelines for reporting clinical studies on AI, with an emphasis on guidelines registered in the EQUATOR Network library. Sound methodological knowledge of these topics will aid the design, execution, reporting, and appraisal of clinical evaluation of AI.
© RSNA, 2022.
Similar articles
-
Review of study reporting guidelines for clinical studies using artificial intelligence in healthcare.BMJ Health Care Inform. 2021 Aug;28(1):e100385. doi: 10.1136/bmjhci-2021-100385. BMJ Health Care Inform. 2021. PMID: 34426417 Free PMC article. Review.
-
Framework and metrics for the clinical use and implementation of artificial intelligence algorithms into endoscopy practice: recommendations from the American Society for Gastrointestinal Endoscopy Artificial Intelligence Task Force.Gastrointest Endosc. 2023 May;97(5):815-824.e1. doi: 10.1016/j.gie.2022.10.016. Epub 2023 Feb 8. Gastrointest Endosc. 2023. PMID: 36764886
-
Raising the Bar for Randomized Trials Involving Artificial Intelligence: The SPIRIT-Artificial Intelligence and CONSORT-Artificial Intelligence Guidelines.J Invest Dermatol. 2021 Sep;141(9):2109-2111. doi: 10.1016/j.jid.2021.02.744. Epub 2021 Mar 22. J Invest Dermatol. 2021. PMID: 33766511
-
A novel way to prospectively evaluate of AI-enhanced ECG algorithms.J Electrocardiol. 2024 Sep-Oct;86:153756. doi: 10.1016/j.jelectrocard.2024.06.046. Epub 2024 Jul 6. J Electrocardiol. 2024. PMID: 38997873 Review.
-
Characteristics of Artificial Intelligence Clinical Trials in the Field of Healthcare: A Cross-Sectional Study on ClinicalTrials.gov.Int J Environ Res Public Health. 2022 Oct 21;19(20):13691. doi: 10.3390/ijerph192013691. Int J Environ Res Public Health. 2022. PMID: 36294269 Free PMC article.
Cited by
-
Interpretable artificial intelligence-based app assists inexperienced radiologists in diagnosing biliary atresia from sonographic gallbladder images.BMC Med. 2024 Jan 25;22(1):29. doi: 10.1186/s12916-024-03247-9. BMC Med. 2024. PMID: 38267950 Free PMC article.
-
Empirical data drift detection experiments on real-world medical imaging data.Nat Commun. 2024 Feb 29;15(1):1887. doi: 10.1038/s41467-024-46142-w. Nat Commun. 2024. PMID: 38424096 Free PMC article.
-
Toward explainable AI in radiology: Ensemble-CAM for effective thoracic disease localization in chest X-ray images using weak supervised learning.Front Big Data. 2024 May 2;7:1366415. doi: 10.3389/fdata.2024.1366415. eCollection 2024. Front Big Data. 2024. PMID: 38756502 Free PMC article.
-
Opportunistic Identification of Vertebral Compression Fractures on CT Scans of the Chest and Abdomen, Using an AI Algorithm, in a Real-Life Setting.Calcif Tissue Int. 2024 May;114(5):468-479. doi: 10.1007/s00223-024-01196-2. Epub 2024 Mar 26. Calcif Tissue Int. 2024. PMID: 38530406 Free PMC article.
-
Improving the efficiency and accuracy of cardiovascular magnetic resonance with artificial intelligence-review of evidence and proposition of a roadmap to clinical translation.J Cardiovasc Magn Reson. 2024 Winter;26(2):101051. doi: 10.1016/j.jocmr.2024.101051. Epub 2024 Jun 22. J Cardiovasc Magn Reson. 2024. PMID: 38909656 Free PMC article. Review.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources