Synthetic Scientific Image Generation with VAE, GAN, and Diffusion Model Architectures
- PMID: 40863462
- PMCID: PMC12387873
- DOI: 10.3390/jimaging11080252
Synthetic Scientific Image Generation with VAE, GAN, and Diffusion Model Architectures
Abstract
Generative AI (genAI) has emerged as a powerful tool for synthesizing diverse and complex image data, offering new possibilities for scientific imaging applications. This review presents a comprehensive comparative analysis of leading generative architectures, ranging from Variational Autoencoders (VAEs) to Generative Adversarial Networks (GANs) on through to Diffusion Models, in the context of scientific image synthesis. We examine each model's foundational principles, recent architectural advancements, and practical trade-offs. Our evaluation, conducted on domain-specific datasets including microCT scans of rocks and composite fibers, as well as high-resolution images of plant roots, integrates both quantitative metrics (SSIM, LPIPS, FID, CLIPScore) and expert-driven qualitative assessments. Results show that GANs, particularly StyleGAN, produce images with high perceptual quality and structural coherence. Diffusion-based models for inpainting and image variation, such as DALL-E 2, delivered high realism and semantic alignment but generally struggled in balancing visual fidelity with scientific accuracy. Importantly, our findings reveal limitations of standard quantitative metrics in capturing scientific relevance, underscoring the need for domain-expert validation. We conclude by discussing key challenges such as model interpretability, computational cost, and verification protocols, and discuss future directions where generative AI can drive innovation in data augmentation, simulation, and hypothesis generation in scientific research.
Keywords: Generative Adversarial Networks; diffusion; generative AI; image generation; synthetic data.
Conflict of interest statement
The authors declare no conflicts of interest.
Figures
References
-
- Sordo Z., Chagnon E., Ushizima D. A Review on Generative AI For Text-To-Image and Image-To-Image Generation and Implications To Scientific Images. arXiv. 2025cs.CV/2502.21151
-
- Foster D. Generative Deep Learning: Teaching Machines to Paint, Write, Compose, and Play. 2nd ed. O’Reilly Media; Sebastopol, CA, USA: 2023.
-
- Sun Y., Sheng D., Zhou Z., Wu Y. AI hallucination: Towards a comprehensive classification of distorted information in artificial intelligence-generated content. Humanit. Soc. Sci. Commun. 2024;11:1278. doi: 10.1057/s41599-024-03811-x. - DOI
-
- Lucas J.S., Maung B.M., Tabar M., McBride K., Lee D. The Longtail Impact of Generative AI on Disinformation: Harmonizing Dichotomous Perspectives. IEEE Intell. Syst. 2024;39:12–19. doi: 10.1109/MIS.2024.3439109. - DOI
LinkOut - more resources
Full Text Sources
