Are current clinical studies on artificial intelligence-based medical devices comprehensive enough to support a full health technology assessment? A systematic review
- PMID: 37210155
- DOI: 10.1016/j.artmed.2023.102547
Are current clinical studies on artificial intelligence-based medical devices comprehensive enough to support a full health technology assessment? A systematic review
Abstract
Introduction: Artificial Intelligence-based Medical Devices (AI-based MDs) are experiencing exponential growth in healthcare. This study aimed to investigate whether current studies assessing AI contain the information required for health technology assessment (HTA) by HTA bodies.
Methods: We conducted a systematic literature review based on the Preferred Reporting Items for Systematic Reviews and Meta-Analyses methodology to extract articles published between 2016 and 2021 related to the assessment of AI-based MDs. Data extraction focused on study characteristics, technology, algorithms, comparators, and results. AI quality assessment and HTA scores were calculated to evaluate whether the items present in the included studies were concordant with the HTA requirements. We performed a linear regression for the HTA and AI scores with the explanatory variables of the impact factor, publication date, and medical specialty. We conducted a univariate analysis of the HTA score and a multivariate analysis of the AI score with an alpha risk of 5 %.
Results: Of 5578 retrieved records, 56 were included. The mean AI quality assessment score was 67 %; 32 % of articles had an AI quality score ≥ 70 %, 50 % had a score between 50 % and 70 %, and 18 % had a score under 50 %. The highest quality scores were observed for the study design (82 %) and optimisation (69 %) categories, whereas the scores were lowest in the clinical practice category (23 %). The mean HTA score was 52 % for all seven domains. 100 % of the studies assessed clinical effectiveness, whereas only 9 % evaluated safety, and 20 % evaluated economic issues. There was a statistically significant relationship between the impact factor and the HTA and AI scores (both p = 0.046).
Discussion: Clinical studies on AI-based MDs have limitations and often lack adapted, robust, and complete evidence. High-quality datasets are also required because the output data can only be trusted if the inputs are reliable. The existing assessment frameworks are not specifically designed to assess AI-based MDs. From the perspective of regulatory authorities, we suggest that these frameworks should be adapted to assess the interpretability, explainability, cybersecurity, and safety of ongoing updates. From the perspective of HTA agencies, we highlight that transparency, professional and patient acceptance, ethical issues, and organizational changes are required for the implementation of these devices. Economic assessments of AI should rely on a robust methodology (business impact or health economic models) to provide decision-makers with more reliable evidence.
Conclusion: Currently, AI studies are insufficient to cover HTA prerequisites. HTA processes also need to be adapted because they do not consider the important specificities of AI-based MDs. Specific HTA workflows and accurate assessment tools should be designed to standardise evaluations, generate reliable evidence, and create confidence.
Keywords: Artificial intelligence; Artificial intelligence-based medical device; Clinical trial; Economic evaluation; Health technology assessment; Machine learning.
Copyright © 2023 The Author(s). Published by Elsevier B.V. All rights reserved.
Conflict of interest statement
Declaration of competing interest None.
Similar articles
-
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of topotecan for ovarian cancer.Health Technol Assess. 2001;5(28):1-110. doi: 10.3310/hta5280. Health Technol Assess. 2001. PMID: 11701100
-
Home treatment for mental health problems: a systematic review.Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150. Health Technol Assess. 2001. PMID: 11532236
-
Antidepressants for pain management in adults with chronic pain: a network meta-analysis.Health Technol Assess. 2024 Oct;28(62):1-155. doi: 10.3310/MKRT2948. Health Technol Assess. 2024. PMID: 39367772 Free PMC article.
-
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3. Cochrane Database Syst Rev. 2022. PMID: 35593186 Free PMC article.
-
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4. Cochrane Database Syst Rev. 2021. Update in: Cochrane Database Syst Rev. 2022 May 23;5:CD011535. doi: 10.1002/14651858.CD011535.pub5. PMID: 33871055 Free PMC article. Updated.
Cited by
-
Are the European reference networks for rare diseases ready to embrace machine learning? A mixed-methods study.Orphanet J Rare Dis. 2024 Jan 25;19(1):25. doi: 10.1186/s13023-024-03047-7. Orphanet J Rare Dis. 2024. PMID: 38273306 Free PMC article.
-
Ethical, Legal, and Social Assessment of AI-Based Technologies for Prevention and Diagnosis of Rare Diseases in Health Technology Assessment Processes.Healthcare (Basel). 2025 Apr 4;13(7):829. doi: 10.3390/healthcare13070829. Healthcare (Basel). 2025. PMID: 40218125 Free PMC article.
-
ESR Essentials: how to get to valuable radiology AI: the role of early health technology assessment-practice recommendations by the European Society of Medical Imaging Informatics.Eur Radiol. 2025 Jun;35(6):3432-3441. doi: 10.1007/s00330-024-11188-3. Epub 2024 Dec 5. Eur Radiol. 2025. PMID: 39636421 Free PMC article. Review.
-
International Market Access Strategies for Artificial Intelligence-Based Medical Devices: Can We Standardize the Process to Faster Patient Access?Mayo Clin Proc Digit Health. 2023 Aug 8;1(3):406-412. doi: 10.1016/j.mcpdig.2023.06.011. eCollection 2023 Sep. Mayo Clin Proc Digit Health. 2023. PMID: 40206615 Free PMC article.
-
AI for IMPACTS Framework for Evaluating the Long-Term Real-World Impacts of AI-Powered Clinician Tools: Systematic Review and Narrative Synthesis.J Med Internet Res. 2025 Feb 5;27:e67485. doi: 10.2196/67485. J Med Internet Res. 2025. PMID: 39909417 Free PMC article.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous