Diffusion Models in Vision: A Survey

Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu, Mubarak Shah

PMID: 37030794
DOI: 10.1109/TPAMI.2023.3261988

Diffusion Models in Vision: A Survey

Florinel-Alin Croitoru et al. IEEE Trans Pattern Anal Mach Intell. 2023 Sep.

. 2023 Sep;45(9):10850-10869.

doi: 10.1109/TPAMI.2023.3261988. Epub 2023 Aug 7.

Authors

Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu, Mubarak Shah

PMID: 37030794
DOI: 10.1109/TPAMI.2023.3261988

Abstract

Denoising diffusion models represent a recent emerging topic in computer vision, demonstrating remarkable results in the area of generative modeling. A diffusion model is a deep generative model that is based on two stages, a forward diffusion stage and a reverse diffusion stage. In the forward diffusion stage, the input data is gradually perturbed over several steps by adding Gaussian noise. In the reverse stage, a model is tasked at recovering the original input data by learning to gradually reverse the diffusion process, step by step. Diffusion models are widely appreciated for the quality and diversity of the generated samples, despite their known computational burdens, i.e., low speeds due to the high number of steps involved during sampling. In this survey, we provide a comprehensive review of articles on denoising diffusion models applied in vision, comprising both theoretical and practical contributions in the field. First, we identify and present three generic diffusion modeling frameworks, which are based on denoising diffusion probabilistic models, noise conditioned score networks, and stochastic differential equations. We further discuss the relations between diffusion models and other deep generative models, including variational auto-encoders, generative adversarial networks, energy-based models, autoregressive models and normalizing flows. Then, we introduce a multi-perspective categorization of diffusion models applied in computer vision. Finally, we illustrate the current limitations of diffusion models and envision some interesting directions for future research.

PubMed Disclaimer

Cited by

Accurate and efficient insulator maintenance: A DETR algorithm for drone imagery.
Tian Y, Ahmad RB, Abdullah NAB. Tian Y, et al. PLoS One. 2025 Feb 25;20(2):e0318225. doi: 10.1371/journal.pone.0318225. eCollection 2025. PLoS One. 2025. PMID: 39999207 Free PMC article.
Closing the Domain Gap: Can Pseudo-Labels from Synthetic UAV Data Enable Real-World Flood Segmentation?
Simantiris G, Bacharidis K, Panagiotakis C. Simantiris G, et al. Sensors (Basel). 2025 Jun 6;25(12):3586. doi: 10.3390/s25123586. Sensors (Basel). 2025. PMID: 40573473 Free PMC article.
Text-to-image models reveal specific color-emotion associations.
Alvarado J. Alvarado J. Front Psychol. 2025 Jun 13;16:1593928. doi: 10.3389/fpsyg.2025.1593928. eCollection 2025. Front Psychol. 2025. PMID: 40584075 Free PMC article.
A paired CT and MRI dataset for advanced medical imaging applications.
Siam ZS, Akon MY, Munmun IJ, Al-Amin A, Salam MA, Mamoon IA. Siam ZS, et al. Data Brief. 2025 Jun 10;61:111768. doi: 10.1016/j.dib.2025.111768. eCollection 2025 Aug. Data Brief. 2025. PMID: 40655994 Free PMC article.
Comprehensive Review: Machine and Deep Learning in Brain Stroke Diagnosis.
Fernandes JND, Cardoso VEM, Comesaña-Campos A, Pinheira A. Fernandes JND, et al. Sensors (Basel). 2024 Jul 4;24(13):4355. doi: 10.3390/s24134355. Sensors (Basel). 2024. PMID: 39001134 Free PMC article. Review.

See all "Cited by" articles

LinkOut - more resources

Full Text Sources
- IEEE Computer Society
- IEEE Engineering in Medicine and Biology Society
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Diffusion Models in Vision: A Survey

Diffusion Models in Vision: A Survey

Authors

Abstract

Similar articles

Cited by

LinkOut - more resources

Full Text Sources

Other Literature Sources