Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2025 Mar 20;138(6):651-663.
doi: 10.1097/CM9.0000000000003489. Epub 2025 Feb 26.

Artificial intelligence in medical imaging: From task-specific models to large-scale foundation models

Affiliations
Review

Artificial intelligence in medical imaging: From task-specific models to large-scale foundation models

Yueyan Bian et al. Chin Med J (Engl). .

Abstract

Artificial intelligence (AI), particularly deep learning, has demonstrated remarkable performance in medical imaging across a variety of modalities, including X-ray, computed tomography (CT), magnetic resonance imaging (MRI), ultrasound, positron emission tomography (PET), and pathological imaging. However, most existing state-of-the-art AI techniques are task-specific and focus on a limited range of imaging modalities. Compared to these task-specific models, emerging foundation models represent a significant milestone in AI development. These models can learn generalized representations of medical images and apply them to downstream tasks through zero-shot or few-shot fine-tuning. Foundation models have the potential to address the comprehensive and multifactorial challenges encountered in clinical practice. This article reviews the clinical applications of both task-specific and foundation models, highlighting their differences, complementarities, and clinical relevance. We also examine their future research directions and potential challenges. Unlike the replacement relationship seen between deep learning and traditional machine learning, task-specific and foundation models are complementary, despite inherent differences. While foundation models primarily focus on segmentation and classification, task-specific models are integrated into nearly all medical image analyses. However, with further advancements, foundation models could be applied to other clinical scenarios. In conclusion, all indications suggest that task-specific and foundation models, especially the latter, have the potential to drive breakthroughs in medical imaging, from image processing to clinical workflows.

PubMed Disclaimer

Conflict of interest statement

None.

Figures

Figure 1
Figure 1
Datasets cover various imaging modalities and multiple anatomical structures in medical imaging.
Figure 2
Figure 2
The typical architecture of an existing foundation model in medical imaging analysis. The image encoder extracts image features, and the prompt encoder integrates user interactions from different prompt modes, including points, bounding boxes, and masks. The mask decoder is used to predict segmentation outputs using image and prompt embeddings.

Similar articles

References

    1. Tiu E, Talius E, Patel P, Langlotz CP, Ng AY, Rajpurkar P. Expert-level detection of pathologies from unannotated chest X-ray images via self-supervised learning. Nat Biomed Eng 2022;6:1399–1406. doi: 10.1038/s41551-022-00936-9. - PMC - PubMed
    1. Gillebert CR, Humphreys GW, Mantini D. Automated delineation of stroke lesions using brain CT images. Neuroimage Clin 2014;4:540–548. doi: 10.1016/j.nicl.2014.03.009. - PMC - PubMed
    1. Deng M, Yu R, Wang L, Shi F, Yap PT, Shen D. Learning-based 3T brain MRI segmentation with guidance from 7T MRI labeling. Med Phys 2016;43:6588–6597. doi: 10.1118/1.4967487. - PubMed
    1. Jun TJ Kang SJ Lee JG Kweon J Na W Kang D, et al. . Automated detection of vulnerable plaque in intravascular ultrasound images. Med Biol Eng Comput 2019;57:863–876. doi: 10.1007/s11517-018-1925-x. - PubMed
    1. Pieszko K Shanbhag A Killekar A Miller RJH Lemley M Otaki Y, et al. . Deep learning of coronary calcium scores from PET/CT attenuation maps accurately predicts adverse cardiovascular events. JACC Cardiovasc Imaging 2023;16:675–687. doi: 10.1016/j.jcmg.2022.06.006. - PubMed