Abstract
Typical generative diffusion models rely on a Gaussian diffusion process for training the backward transformations, which can then be used to generate samples from Gaussian noise. However, real world data often takes place in discrete-state spaces, including many scientific applications. Here, we develop a theoretical formulation for arbitrary discrete-state Markov processes in the forward diffusion process using exact (as opposed to variational) analysis. We relate the theory to the existing continuous-state Gaussian diffusion as well as other approaches to discrete diffusion, and identify the corresponding reverse-time stochastic process and score function in the continuous-time setting, and the reverse-time mapping in the discrete-time setting. As an example of this framework, we introduce “Blackout Diffusion”, which learns to produce samples from an empty image instead of from noise. Numerical experiments on the CIFAR-10, Binarized MNIST, and CelebA datasets confirm the feasibility of our approach. Generalizing from specific (Gaussian) forward processes to discrete-state processes without a variational approximation sheds light on how to interpret diffusion models, which we discuss.
Original language | English |
---|---|
Pages (from-to) | 9034-9059 |
Number of pages | 26 |
Journal | Proceedings of Machine Learning Research |
Volume | 202 |
State | Published - 2023 |
Externally published | Yes |
Event | 40th International Conference on Machine Learning, ICML 2023 - Honolulu, United States Duration: Jul 23 2023 → Jul 29 2023 |
Funding
YTL is supported by the Laboratory Directed Research and Development (LDRD) Project “Uncertainty Quantification for Robust Machine Learning” (20210043DR). ZRF and JES are supported by the Center for the Nonlinear Studies through the LDRD. NL acknowledges the support of LDRD project 20230290ER. The authors acknowledge significant support from the Darwin test bed at Los Alamos National Laboratory (LANL), funded by the Computational Systems and Software Environments subprogram of LANL’s Advanced Simulation and Computing program. ZF notes that this manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. YTL dedicates this paper to the memory of his mentor Prof. Charles R. Doering, whose invaluable knowledge and techniques greatly influenced this work. Despite Charlie left this world prematurely, his teachings will continue to inspire future research.
Funders | Funder number |
---|---|
Center for the Nonlinear Studies | 20230290ER |
U.S. Department of Energy | |
Laboratory Directed Research and Development | 20210043DR |
Los Alamos National Laboratory | DE-AC05-00OR22725 |