Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Jan;310(1):e223170.
doi: 10.1148/radiol.223170.

Strategies for Implementing Machine Learning Algorithms in the Clinical Practice of Radiology

Affiliations
Review

Strategies for Implementing Machine Learning Algorithms in the Clinical Practice of Radiology

Allison Chae et al. Radiology. 2024 Jan.

Abstract

Despite recent advancements in machine learning (ML) applications in health care, there have been few benefits and improvements to clinical medicine in the hospital setting. To facilitate clinical adaptation of methods in ML, this review proposes a standardized framework for the step-by-step implementation of artificial intelligence into the clinical practice of radiology that focuses on three key components: problem identification, stakeholder alignment, and pipeline integration. A review of the recent literature and empirical evidence in radiologic imaging applications justifies this approach and offers a discussion on structuring implementation efforts to help other hospital practices leverage ML to improve patient care. Clinical trial registration no. 04242667 © RSNA, 2024 Supplemental material is available for this article.

PubMed Disclaimer

Conflict of interest statement

Disclosures of conflicts of interest: A.C. No relevant relationships. M.S.Y. No relevant relationships. H.S. Scholar Grant from RSNA; Society of Radiologists in Ultrasound Early Career Award, Penn-Calico Collaborative grant (pending); patent planned in the future for aspects of an artificial intelligence pipeline; leadership role in MDPI Computers Special Issue editor, SIIM Clinical Data Informaticist Task Force. A.D.G. No relevant relationships. N.C. No relevant relationships. M.T.M. Research stipend from Sarnoff Cardiovascular Research Foundation; meeting and travel support from Sarnoff Cardiovascular Research Foundation. J.D. No relevant relationships. A.E. No relevant relationships. A.B. No relevant relationships. M.D.R. No relevant relationships. D.R. No relevant relationships. C.E.K. Consulting fees from Daccan Partners; travel support and honorarium from American Society of Gastrointestinal Endoscopy; Editor for Radiology: Artificial Intelligence. W.R.W. No relevant relationships. J.C.G. Grants to author’s institution from NIH, Siemens Healthineers; stock/stock options from Merck.

Figures

None
Graphical abstract
Overview of Penn Medicine BioBank (PMBB) imaging data. (A) Bar graph
shows the number of studies within the Penn Medicine BioBank by imaging
modality. The number of studies per Penn Medicine BioBank capita is the
average number of studies per patient within the Penn Medicine BioBank. (B)
Line graph shows the number of imaging studies acquired per year contained
within the Penn Medicine BioBank by imaging modality. (C) Line graph of
1–CDF, where CDF is the cumulative distribution function.
1–CDF corresponds to the proportion of patients (by modality)
according to number of examinations. (D) Histogram shows the time between
sequential repeat imaging studies by patient for the four most common
imaging modalities.
Figure 1:
Overview of Penn Medicine BioBank (PMBB) imaging data. (A) Bar graph shows the number of studies within the Penn Medicine BioBank by imaging modality. The number of studies per Penn Medicine BioBank capita is the average number of studies per patient within the Penn Medicine BioBank. (B) Line graph shows the number of imaging studies acquired per year contained within the Penn Medicine BioBank by imaging modality. (C) Line graph of 1–CDF, where CDF is the cumulative distribution function. 1–CDF corresponds to the proportion of patients (by modality) according to number of examinations. (D) Histogram shows the time between sequential repeat imaging studies by patient for the four most common imaging modalities.
Comparisons of principal component distributions of six artificial
intelligence–extracted image-derived phenotype (IDP) metrics (36),
calculated from abdominal CT in 1276 anonymized patients from the Penn
Medicine BioBank. These image-derived phenotypes included liver CT
attenuation, spleen CT attenuation, liver volume, spleen volume, visceral
fat volume, and subcutaneous fat volume. Using principal component analysis
(PCA), the principal component of these image-derived phenotypes was
extracted and its distribution was plotted as a histogram for patients
stratified by different clinical diagnoses. Bar graphs show different
image-derived phenotype principal component distributions in patients
without diagnoses (gray bars) versus in patients diagnosed with (A) obesity
(n = 91), (B) obstructive sleep apnea (n = 201), and (C) hypertension (n =
1082). Image-derived phenotype principal component distributions in patients
without diagnoses (gray bars) versus in patients diagnosed with (D)
nonalcoholic fatty liver disease (NAFLD; n = 429) and (E) diabetes (n =
790). (F) Genitourinary diseases (n = 1202), which are not clinically
associated with these image-derived phenotype metrics, were not associated
with a statistically significant different principal component distribution
compared with healthy patients (P = .12). P values were calculated with
statistical analyses comparing distributions of patients with and without
disease diagnoses, with a two-sample Kolmogorov-Smirnov test for goodness of
fit.
Figure 2:
Comparisons of principal component distributions of six artificial intelligence–extracted image-derived phenotype (IDP) metrics (36), calculated from abdominal CT in 1276 anonymized patients from the Penn Medicine BioBank. These image-derived phenotypes included liver CT attenuation, spleen CT attenuation, liver volume, spleen volume, visceral fat volume, and subcutaneous fat volume. Using principal component analysis (PCA), the principal component of these image-derived phenotypes was extracted and its distribution was plotted as a histogram for patients stratified by different clinical diagnoses. Bar graphs show different image-derived phenotype principal component distributions in patients without diagnoses (gray bars) versus in patients diagnosed with (A) obesity (n = 91), (B) obstructive sleep apnea (n = 201), and (C) hypertension (n = 1082). Image-derived phenotype principal component distributions in patients without diagnoses (gray bars) versus in patients diagnosed with (D) nonalcoholic fatty liver disease (NAFLD; n = 429) and (E) diabetes (n = 790). (F) Genitourinary diseases (n = 1202), which are not clinically associated with these image-derived phenotype metrics, were not associated with a statistically significant different principal component distribution compared with healthy patients (P = .12). P values were calculated with statistical analyses comparing distributions of patients with and without disease diagnoses, with a two-sample Kolmogorov-Smirnov test for goodness of fit.
Overview of common frameworks for integrating machine learning and
medical imaging. (A) Artificial intelligence (AI) models interface directly
with raw acquired data and are implemented on a per-scanner basis. (B)
Machine learning models have access to the processed image outputs from the
individual scanners before they are sent to the central picture archiving
and communication system (PACS) server. (C) Models communicate with the
picture archiving and communication system server to obtain relevant inputs
for AI interpretation.
Figure 3:
Overview of common frameworks for integrating machine learning and medical imaging. (A) Artificial intelligence (AI) models interface directly with raw acquired data and are implemented on a per-scanner basis. (B) Machine learning models have access to the processed image outputs from the individual scanners before they are sent to the central picture archiving and communication system (PACS) server. (C) Models communicate with the picture archiving and communication system server to obtain relevant inputs for AI interpretation.
Research directions for future work (93,99).
Figure 4:
Research directions for future work (93,99).

References

    1. Yan A , McAuley J , Lu X , et al. . RadBERT: Adapting transformer-based language models to radiology . Radiol Artif Intell 2022. ; 4 ( 4 ): e210258 . - PMC - PubMed
    1. Jaiswal A , Tang L , Ghosh M , Rousseau JF , Peng Y , Ding Y . RadBERT-CL: Factually-Aware Contrastive Learning For Radiology Report Classification . Proc Mach Learn Res 2021. ; 158 : 196 – 208 . - PMC - PubMed
    1. Lee JE , Choi SY , Hwang JA , et al. . The potential for reduced radiation dose from deep learning-based CT image reconstruction: A comparison with filtered back projection and hybrid iterative reconstruction using a phantom . Medicine (Baltimore) 2021. ; 100 ( 19 ): e25814 . - PMC - PubMed
    1. Sriram A , Zbontar J , Murrell T , et al. . End-to-End Variational Networks for Accelerated MRI Reconstruction . In: MICCAI 2020. . Springer International Publishing; , 2020. ; 64 – 73 .
    1. Yao MS , Hansen MS . A Path Towards Clinical Adaptation of Accelerated MRI . Proc Mach Learn Res 2022. ; 193 : 489 – 511 . - PMC - PubMed

LinkOut - more resources