Skip to main navigation Skip to search Skip to main content

Modulated Diffusion: Accelerating Generative Modeling with Modulated Quantization

Research output: Contribution to journalConference articlepeer-review

Abstract

Diffusion models have emerged as powerful gen-erative models, but their high computational cost in iterative sampling remains a significant bot-tleneck. In this work, we present an in-depth and insightful study of state-of-the-art accelera-tion techniques for diffusion models, including caching and quantization, and reveal their limita-tions in computation error and generation quality. To break these limits, this work introduces Modu-lated Diffusion (MoDiff), an innovative, rigorous, and principled framework that accelerates gener-ative modeling through modulated quantization and error compensation. MoDiff not only inherits the advantages of existing caching and quantiza-tion methods but also serves as a general frame-work to accelerate all diffusion models. The ad-vantages of MoDiff are supported by solid theoret-ical insight and analysis. In addition, extensive ex-periments on CIFAR-10 and LSUN demonstrate that MoDiff significantly reduces activation quan-tization from 8 bits to 3 bits without performance degradation in post-training quantization (PTQ). Our code implementation is available at https://github.com/WeizhiGao/MoDiff.

Original languageEnglish
Pages (from-to)18337-18362
Number of pages26
JournalProceedings of Machine Learning Research
Volume267
StatePublished - 2025
Event42nd International Conference on Machine Learning, ICML 2025 - Vancouver, Canada
Duration: Jul 13 2025Jul 19 2025

Funding

Weizhi Gao, Zhichao Hou, and Dr. Xiaorui Liu are supported by the National Science Foundation (NSF) Award under grant number IIS-2443182, NSF National AI Research Resource Pilot Award, NCSU Data Science Academy Seed Grant Award, and NCSU Faculty Research and Professional Development Award.

Fingerprint

Dive into the research topics of 'Modulated Diffusion: Accelerating Generative Modeling with Modulated Quantization'. Together they form a unique fingerprint.

Cite this