Nat Med. 2025 Feb;31(2):609-617.
doi: 10.1038/s41591-024-03359-y. Epub 2024 Dec 11.

Self-improving generative foundation model for synthetic medical image generation and clinical applications

Jinzhuo Wang et al. Nat Med. 2025 Feb.

Abstract

In many clinical and research settings, the scarcity of high-quality medical imaging datasets has hampered the potential of artificial intelligence (AI) clinical applications. This issue is particularly pronounced in less common conditions, underrepresented populations and emerging imaging modalities, where the availability of diverse and comprehensive datasets is often inadequate. To address this challenge, we introduce a unified medical image-text generative model called MINIM that is capable of synthesizing medical images of various organs across various imaging modalities based on textual instructions. Clinician evaluations and rigorous objective measurements validate the high quality of MINIM's synthetic images. MINIM exhibits an enhanced generative capability when presented with previously unseen data domains, demonstrating its potential as a generalist medical AI (GMAI). Our findings show that MINIM's synthetic images effectively augment existing datasets, boosting performance across multiple medical applications such as diagnostics, report generation and self-supervised learning. On average, MINIM enhances performance by 12% for ophthalmic, 15% for chest, 13% for brain and 17% for breast-related tasks. Furthermore, we demonstrate MINIM's potential clinical utility in the accurate prediction of HER2-positive breast cancer from MRI images. Using a large retrospective simulation analysis, we demonstrate MINIM's clinical potential by accurately identifying targeted therapy-sensitive EGFR mutations using lung cancer computed tomography images, which could potentially lead to improved 5-year survival rates. Although these results are promising, further validation and refinement in more diverse and prospective settings would greatly enhance the model's generalizability and robustness.

Conflict of interest statement

Competing interests: The authors declare no competing interests.
