Blackout Diffusion: Generative Diffusion Models in Discrete-State Spaces

Javier E. Santos, Zachary R. Fox, Nicholas Lubbers, Yen Ting Lin

Research output: Contribution to journalConference articlepeer-review

1 Scopus citations

Abstract

Typical generative diffusion models rely on a Gaussian diffusion process for training the backward transformations, which can then be used to generate samples from Gaussian noise. However, real world data often takes place in discrete-state spaces, including many scientific applications. Here, we develop a theoretical formulation for arbitrary discrete-state Markov processes in the forward diffusion process using exact (as opposed to variational) analysis. We relate the theory to the existing continuous-state Gaussian diffusion as well as other approaches to discrete diffusion, and identify the corresponding reverse-time stochastic process and score function in the continuous-time setting, and the reverse-time mapping in the discrete-time setting. As an example of this framework, we introduce “Blackout Diffusion”, which learns to produce samples from an empty image instead of from noise. Numerical experiments on the CIFAR-10, Binarized MNIST, and CelebA datasets confirm the feasibility of our approach. Generalizing from specific (Gaussian) forward processes to discrete-state processes without a variational approximation sheds light on how to interpret diffusion models, which we discuss.

Original languageEnglish
Pages (from-to)9034-9059
Number of pages26
JournalProceedings of Machine Learning Research
Volume202
StatePublished - 2023
Externally publishedYes
Event40th International Conference on Machine Learning, ICML 2023 - Honolulu, United States
Duration: Jul 23 2023Jul 29 2023

Funding

YTL is supported by the Laboratory Directed Research and Development (LDRD) Project “Uncertainty Quantification for Robust Machine Learning” (20210043DR). ZRF and JES are supported by the Center for the Nonlinear Studies through the LDRD. NL acknowledges the support of LDRD project 20230290ER. The authors acknowledge significant support from the Darwin test bed at Los Alamos National Laboratory (LANL), funded by the Computational Systems and Software Environments subprogram of LANL’s Advanced Simulation and Computing program. ZF notes that this manuscript has been authored by UT-Battelle, LLC under Contract No. DE-AC05-00OR22725 with the U.S. Department of Energy. YTL dedicates this paper to the memory of his mentor Prof. Charles R. Doering, whose invaluable knowledge and techniques greatly influenced this work. Despite Charlie left this world prematurely, his teachings will continue to inspire future research.

FundersFunder number
Center for the Nonlinear Studies20230290ER
U.S. Department of Energy
Laboratory Directed Research and Development20210043DR
Los Alamos National LaboratoryDE-AC05-00OR22725

    Fingerprint

    Dive into the research topics of 'Blackout Diffusion: Generative Diffusion Models in Discrete-State Spaces'. Together they form a unique fingerprint.

    Cite this