Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Apr;63(4):500-510.
doi: 10.2967/jnumed.121.262567. Epub 2021 Nov 5.

Nuclear Medicine and Artificial Intelligence: Best Practices for Algorithm Development

Affiliations

Nuclear Medicine and Artificial Intelligence: Best Practices for Algorithm Development

Tyler J Bradshaw et al. J Nucl Med. 2022 Apr.

Abstract

The nuclear medicine field has seen a rapid expansion of academic and commercial interest in developing artificial intelligence (AI) algorithms. Users and developers can avoid some of the pitfalls of AI by recognizing and following best practices in AI algorithm development. In this article, recommendations on technical best practices for developing AI algorithms in nuclear medicine are provided, beginning with general recommendations and then continuing with descriptions of how one might practice these principles for specific topics within nuclear medicine. This report was produced by the AI Task Force of the Society of Nuclear Medicine and Molecular Imaging.

Keywords: algorithm; artificial intelligence; best practices; computer/PACS; research methods; statistics.

PubMed Disclaimer

Figures

FIGURE 1.
FIGURE 1.
Trend in publications on AI within nuclear medicine according to Scopus (Elsevier). Word cloud contains most commonly used terms in recent abstracts.
FIGURE 2.
FIGURE 2.
AI applications spanning the gamut of nuclear medicine subspecialties.
FIGURE 3.
FIGURE 3.
Pipeline for AI algorithm development together with key considerations of each stage of development.
FIGURE 4.
FIGURE 4.
Annotation quality as function of different labeling techniques for diagnostic applications. This hierarchy does not imply how useful annotation method is (e.g., expert labels are often more useful than simulations because of limited realism of simulated data).
FIGURE 5.
FIGURE 5.
Different approaches to cross validation, depending on dataset size and whether model selection is needed. Figure illustrates 5-fold cross validation without model selection/hyperparameter tuning (top), 5-fold cross validation with holdout test set (middle), and nested cross validation (5-fold outer loop, 4-fold inner loop) (bottom).

References

    1. Roberts M, Driggs D, Thorpe M, et al. . Common pitfalls and recommendations for using machine learning to detect and prognosticate for COVID-19 using chest radiographs and CT scans. Nat Mach Intell. 2021;3:199–217.
    1. Haibe-Kains B, Adam GA, Hosny A, et al. . Transparency and reproducibility in artificial intelligence. Nature. 2020;586:E14–E16. - PMC - PubMed
    1. Zech JR, Badgeley MA, Liu M, Costa AB, Titano JJ, Oermann EK. Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs: a cross-sectional study. PLoS Med. 2018;15:e1002683. - PMC - PubMed
    1. Buvat I, Orlhac F. The T.R.U.E. checklist for identifying impactful artificial intelligence-based findings in nuclear medicine: is it True? Is it Reproducible? Is it Useful? Is it Explainable? J Nucl Med. 2021;62:752–754. - PubMed
    1. Xue Y, Xu T, Zhang H, Long LR, Huang X. Segan: adversarial network with multi-scale L1 loss for medical image segmentation. Neuroinformatics. 2018;16:383–392. - PubMed

LinkOut - more resources