Dynamical regimes of diffusion models
- PMID: 39551866
- PMCID: PMC11570668
- DOI: 10.1038/s41467-024-54281-3
Dynamical regimes of diffusion models
Abstract
We study generative diffusion models in the regime where both the data dimension and the sample size are large, and the score function is trained optimally. Using statistical physics methods, we identify three distinct dynamical regimes during the generative diffusion process. The generative dynamics, starting from pure noise, first encounters a speciation transition, where the broad structure of the data emerges, akin to symmetry breaking in phase transitions. This is followed by a collapse phase, where the dynamics is attracted to a specific training point through a mechanism similar to condensation in a glass phase. The speciation time can be obtained from a spectral analysis of the data's correlation matrix, while the collapse time relates to an excess entropy measure, and reveals the existence of a curse of dimensionality for diffusion models. These theoretical findings are supported by analytical solutions for Gaussian mixtures and confirmed by numerical experiments on real datasets.
© 2024. The Author(s).
Conflict of interest statement
Figures





References
-
- Sohl-Dickstein, J., Weiss, E., Maheswaranathan, N. & Ganguli, S. Deep unsupervised learning using nonequilibrium thermodynamics. In Proc.International Conference on Machine Learning. (PMLR, 2015).
-
- Song, Y. & Ermon, S. Generative modeling by estimating gradients of the data distribution. In Proc.Advances in Neural Information Processing Systems. (Curran Associates Inc., 2019).
-
- Song, Y. et al. Score-based generative modeling through stochastic differential equations. In Proc.International Conference on Learning Representations (2021).
-
- Guth, F., Coste, S., De Bortoli, V. & Mallat, S. Wavelet score-based generative modeling. Adv. Neural Inf. Process. Syst.35, 478–491 (2022).
-
- Yang, L. et al. Diffusion models: a comprehensive survey of methods and applications. ACM Comput. Surv.56, 1–39 (2023).
Grants and funding
LinkOut - more resources
Full Text Sources